Professional Documents
Culture Documents
MODELLING
IN SPORT
Stephen R. Clarke
BSc(Hons), DipEd, MA
Doctor of Philosophy
December 1997
ii
Acknowledgments
My thanks to the following for their contribution to this thesis.
John Norman for first arousing my interest in applications in sport, for his continued
encouragement, support and collaboration over many years, and for his advice and
assistance on this thesis.
Ray Stefani, a most knowledgeable person on sport, for his collaboration on football
tipping and home advantage.
David Noble, for the interest he has shown over the years in sporting applications, for
his collaboration on several projects, for conscientiously reading my drafts and offering
his considered and helpful advice, and for being such a considerate supervisor.
Many anonymous referees, editors and other colleagues, who with their comments and
advice have improved the original papers which form the bulk of this thesis.
Swinburne and in particular the School of Mathematical Sciences for providing
encouragement for my research and leave to write up this thesis.
My mother, Connie Clarke, for providing the initial impetus by often asking when I was
going to do a PhD, and who by taking her first parachute jump at age 83 demonstrated
it was never too late to achieve an ambition.
Finally my wife Kaye, for bearing the brunt of late nights and untidy home offices, for
listening to my half formed ideas, correcting my original drafts, helping to proof read
this thesis, and assisting me in many other ways.
iii
Candidates statement
This work has not previously been submitted by the candidate for any degree or similar
award in a tertiary institution.
Except where acknowledged in the text the work is my own.
Stephen R. Clarke
iv
Preface
This thesis is submitted under Swinburne University Policy for the Degree of Doctor of
Philosophy, Regulation 4.2, Admission to candidature for thesis by publication. This
allows for
"thesis by publication on the basis of research which has been carried out
prior to admission by candidature and which has been published, normally in
texts or refereed publications. Only those publications not previously
submitted by the applicant for a degree in any tertiary institution may be
included in support of the application for candidature and in the Candidates
thesis. In such cases the...Candidate must produce a significant body of
work on an integrated theme that will comprise the thesis. This thesis would
normally include a substantial introduction showing the relevance of the
publications to development of the theme, plus a series of publications and
any necessary linking commentary. All publications must be appropriately
identified and referenced and the contributions of the Candidate to each
publication must be clearly specified."
From my work in applications in sport over many years, I have selected for this thesis
some of the papers relating to performance measurement in football (including
Australian rules and English soccer), and cricket. Most of the football work relates to
measurement of home advantage and forecasting match results. The football work has
the advantage from an operational research viewpoint that it is heavily based on real
data, and has also been implemented to the degree that the results have been regularly
published in the popular media. The work on performance measurement and tactics in
cricket makes up the second part of the thesis and uses the traditional OR technique of
dynamic programming. I toyed with the idea of using my work on squash and
badminton, which also falls in the general theme, but thought the thesis already long
enough.
In this thesis each chapter usually comprises one publication. This has meant some
repetition, as there is some commonality in literature surveys and game descriptions in
various papers. However it has the advantage that each chapter is self contained and
may be read in isolation. The content of each publication is as it originally appeared,
but with style changes to create uniformity of presentation throughout the thesis. This
necessitated some superficial changes. All headings, tables, figures and equations have
been renumbered using the chapter number as a prefix. The style of referencing has
been standardised, and the references consolidated in the bibliography at the end of the
thesis. In a very few cases notation has been changed and tables and figures given a
more descriptive caption.
A commentary section which details later developments or further work has been added
at the end of some chapters.
The following lists the publications used in each chapter, states if they were refereed,
and specifies the candidates input to co-authored papers.
Chapter I. Written specifically for the thesis.
Chapter II. The early part of this chapter is based loosely on Stefani, R. T., & Clarke,
S. R. (1991), Australian rules football during the 1980s. ASOR Bulletin, 10(3), 1115. The paper evolved from a 50% contribution by Professor Stefani and a 50%
contribution from myself. In Chapter II, I have repeated all the analysis on 16 years
of data.
The latter part of the chapter is from Clarke, S. R. (1997a), Home ground advantage
in the Australian Football League, 1980-85, a paper presented at the APORS
conference 1997, Melbourne.
Chapter III. Clarke, S. R., & Norman, J. M. (1995), Home ground advantage of
individual clubs in English soccer. The Statistician, 44, 509-521. Refereed. The
conception of this paper resulted from joint collaboration. Professor Norman
assisted with the collection of data, which I computerised and analysed. I wrote the
first draft of the paper, but Professor Norman assisted with following drafts and
provided the necessary local knowledge.
The commentary material is based on Clarke, S. R. (1996b), Home advantages in
balanced competitions - English soccer 1990-1996. In N. de Mestre (Ed.),
Mathematics and Computers in Sport (pp. 111-116). Gold Coast, Qld.: Bond
University.
Chapter IV. Clarke, S. R. (1993), Computer forecasting of Australian rules football for
a daily newspaper. Journal of the Operational Research Society, 44(8), 753-759.
Refereed.
vi
vii
Chapter XI. Clarke, S. R. (1988), Dynamic Programming in one day cricket - Optimal
scoring rates. Journal of the Operational Research Society, 39(4), 331-337.
Refereed.
The commentary is based on Johnston, M. I., Clarke, S. R., & Noble, D. H. (1992),
An analysis of scoring policies in one day cricket. In N. de Mestre (Ed.),
Mathematics and Computers in Sport (pp. 71-80). Gold Coast, Qld: Bond
University. I wrote this paper based on joint development of my original idea by all
authors. Mr Johnston wrote all the necessary computer programs.
Chapter XII. Clarke, S. R., & Norman, J. M. (1998a), Dynamic programming in
cricket: Protecting the weaker batsman. Asia Pacific Journal of Operational
Research, in press. Refereed. I developed the models and paper with the continued
assistance of Professor Norman's help and advice.
Chapter XIII. Clarke, S. R., & Norman, J. M. (1997c), To run or not? Some Dynamic
Programming models in cricket. In R. L. Jenson & I. R. Johnson (Eds.),
Proceedings of the Twenty-Sixth Annual Meeting of the Western Decision Sciences
Institute (pp. 744-746). Hawaii: Decision Sciences Institute. The full paper, which
appears here, was refereed. The above is a shorter version published in the
proceedings as is the practice at this conference. I developed Models 1 and 2 along
with the necessary computer programs. Professor Norman developed Model 3
along with the necessary computer programs. I wrote the first draft of the paper.
Both authors provided assistance and advice to the other in the development of the
models and the paper.
Chapter XIV. Johnston, M. I., Clarke, S. R., & Noble, D. H. (1993), Assessing player
performance in one-day Cricket using dynamic programming. Asia-Pacific Journal
of Operational Research, 10, 45-55. Refereed. I wrote the paper based on joint
development of my original idea by all authors. Mr Johnston wrote all the
necessary computer programs.
Chapter XV.
viii
Table of Contents
Acknowledgments.............................................................................................................ii
Candidates statement........................................................................................................iii
Preface................... iv
Table of Contents......... ..................................................................................................viii
List of Illustrations....... ...................................................................................................xv
List of Tables..................................................................................................................xvi
Abstract..................... ......................................................................................................xx
CHAPTER I.
1.1
1.2
1.3
1.4
1.5
1.6
1.7
1.8
1.9
1.10
CHAPTER II.
2.0
2.1
2.2
2.3
2.4
2.5
2.6
2.7
2.8
ix
2.9
2.10
2.11
CHAPTER III.
3.0
3.1
3.2
3.3
3.4
3.5
3.6
3.7
3.8
Appendix 3.1
Appendix 3.2
Appendix 3.3
CHAPTER IV.
4.0
4.1
4.2
4.3
4.4
4.5
4.6
CHAPTER V.
5.0
5.1
5.2
5.3
5.4
5.5
5.6
CHAPTER VI.
6.0
6.1
6.2
6.3
6.4
6.5
6.6
Abstract ..........................................................................................91
Introduction ...................................................................................91
Australian rules football ................................................................93
Modelling game results .................................................................93
Home advantage ............................................................................94
Single h for all teams .....................................................................94
Distinct hi for each team ................................................................97
6.7
6.8
6.9
6.10
xi
CHAPTER VII.
7.0
7.1
7.2
7.3
7.4
7.5
7.6
7.7
7.8
7.9
7.10
7.11
CHAPTER VIII.
8.0
8.1
8.2
8.3
8.4
8.5
8.6
8.7
xii
CHAPTER IX.
9.1
9.2
9.3
9.4
9.5
9.6
9.7
9.8
CHAPTER X.
10.1
10.2
10.3
10.4
10.5
10.6
10.7
CHAPTER XI.
11.0
11.1
11.2
11.3
11.4
11.5
11.6
TEST STATISTICS
Introduction .................................................................................144
Distribution of scores ..................................................................148
Rating players ..............................................................................151
Tactics ..........................................................................................153
Umpiring decisions ......................................................................154
Rain interruption in one-day cricket ............................................155
Sundries .......................................................................................156
Conclusion ...................................................................................158
ANOTHER LOOK AT THE 1985/86 SHEFFIELD SHIELD
COMPETITION CRICKET RESULTS
Introduction .................................................................................159
First innings win - outright win weighting ..................................160
Home ground advantage ..............................................................160
Overcoming home ground advantage ..........................................163
Alternative method ......................................................................164
Conclusion ...................................................................................166
Commentary ................................................................................167
DYNAMIC PROGRAMMING IN ONE-DAY CRICKET OPTIMAL SCORING RATES
Abstract ........................................................................................168
Introduction .................................................................................168
The problem .................................................................................169
Run rate .......................................................................................169
First-innings formulation .............................................................170
11.4.1 Evaluation of dismissal probabilities .............................171
11.4.2 Computation and results ................................................171
11.4.3 Discussion ......................................................................172
Second-innings formulation ........................................................175
11.5.1 Computation and results ................................................175
11.5.2 Discussion ......................................................................176
Extensions ....................................................................................177
xiii
11.7
11.8
CHAPTER XII.
12.0
12.1
12.2
12.3
12.4
12.5
12.6
12.7
12.8
Appendix 12.1
CHAPTER XIII.
13.0
13.1
13.2
13.3
13.4
13.5
13.6
13.7
Appendix 13.1
Appendix 13.2
Appendix 13.3
Conclusion ...................................................................................178
Commentary. Testing of heuristic ..............................................179
DYNAMIC PROGRAMMING IN CRICKET PROTECTING THE WEAKER BATSMAN
Abstract ........................................................................................181
Introduction .................................................................................181
The model ....................................................................................183
Model solution .............................................................................186
Prior probabilities ........................................................................187
Extension .....................................................................................187
Discussion ....................................................................................187
12.6.1 Example .........................................................................189
An alternative criterion of optimality ..........................................190
Conclusion ...................................................................................192
......................................................................................................194
TO RUN OR NOT? SOME DYNAMIC PROGRAMMING
MODELS IN CRICKET
Abstract ........................................................................................197
Introduction .................................................................................197
Model 1 ........................................................................................198
Simplifying results .......................................................................201
Some analytic results ...................................................................202
Model 2 ........................................................................................204
13.5.1. Computer implementation .............................................205
Model 3 ........................................................................................207
Conclusion ...................................................................................209
Derivation of functional equations for Model 1 ..........................211
Proof of theorems for Model 1 ....................................................212
Some Analytic results for Model 1 .............................................214
xiv
CHAPTER XIV.
14.0
14.1
14.2
14.3
14.4
14.5
14.6
14.7
CHAPTER XV
BIBLIOGRAPHY ......................................................................................................236
xv
List of Illustrations
Figure 2.1
Figure 2.2
Figure 3.1
Figure 5.1
Figure 5.2
Figure 5.3
Figure 5.4
Figure 5.5
Figure 5.6
Figure 5.7
Figure 7.1a
Figure 7.1b
Figure 8.1
Figure 12.1
xvi
List of Tables
TABLE 2.1
TABLE 2.2
TABLE 2.3
Match results and home advantage in points ratio for the team
with the perceived home advantage for each year 1980-1995 .........26
TABLE 2.4
Match results and home advantage in points ratio for the team
with the perceived home advantage for each stage of season ..........27
TABLE 2.5
Mean paired HA for each team and number of pairs for the
years 1980 to 1995 ...........................................................................29
TABLE 2.6
Average of the paired HA for each team from 1980 to 1995 ...........30
TABLE 2.7
TABLE 2.8
TABLE 2.9
TABLE 2.10
TABLE 2.11
TABLE 3.1
TABLE 3.2
TABLE 3.3
TABLE 3.4
xvii
TABLE 3.5
TABLE 3.6
TABLE 3.7
TABLE 5.1
TABLE 5.2
TABLE 6.1
TABLE 6.2
TABLE 6.3
TABLE 6.4
TABLE 6.5
TABLE 7.1
TABLE 7.2
TABLE 7.3
TABLE 7.4
TABLE 7.5
TABLE 7.6
Percentage chance of pairs of teams playing in grand final Equal probability model ...................................................................117
TABLE 7.7
TABLE 7.8
TABLE 7.9
xviii
TABLE 7.10
TABLE 7.11
TABLE 7.12
Expected final ladder for 1995, with team ratings and HAs
shown ...............................................................................................130
TABLE 7.13
TABLE 8.1
TABLE 8.2
TABLE 8.3
TABLE 8.4
TABLE 9.1
TABLE 10.1
TABLE 10.2
TABLE 10.3
TABLE 10.4
TABLE 10.5
TABLE 10.6
TABLE 11.1
TABLE 11.2
TABLE 11.3
TABLE 11.4
xix
TABLE 12.1
TABLE 12.2
TABLE 13.1
TABLE 13.2
TABLE 13.3
TABLE 13.4
TABLE 13.5
Scoring profile for batsmen when blocking, stroking and hitting ....208
TABLE 13.6
TABLE 13.7
TABLE 14.1
TABLE 14.2
TABLE 14.3
TABLE 14.4
TABLE 14.5
xx
Abstract
This thesis investigates problems of performance modelling in sport. Mathematical
models are used to evaluate the performance of individuals, teams, and the competition
rules under which they compete. The thesis comprises a collection of papers on
applications of modelling to Australian rules football, soccer and cricket. Using
variations of the model wij = ui + hi - uj + eij where wij is the home team winning
margin when home team i plays away team j, ui is a team rating, hi is an individual
ground effect and eij is random error, the evaluation of team home ground advantage
effect (HA) is studied in detail. Data from the Australian Football League and English
Association Football for 1980 to 1995 are investigated. The necessity of individual
team HAs is demonstrated. The usual methods of calculating HA for competitions is
shown to be inappropriate for individual teams. The existence of a spurious HA when
home and away performances are compared is discussed. For a balanced competition,
fitting the above model by least squares is equivalent to a simple calculator method
using only data from the final ladder. A method of calculating HA by pairing matches
is demonstrated. Tables of HA and paired HA in terms of points/game for each year are
given. The resultant HAs for both Australian rules football and soccer are analysed.
Clearly there is an isolation effect, where teams that are isolated geographically have
large HAs. For English soccer, the paired HA is shown to be linearly related to the
distance between club grounds. As an application of these methods, the development
and implementation of a computer tipping program used to forecast Australian rules
football by rating teams is described. The need for ground effects for each team and
ground, and the use of heuristic methods to optimise the program is discussed. The
accuracy of the prediction model and its implementation by publication in the media is
discussed. International comparisons show prediction methods are limited by the data.
Methods for evaluating the fairness of the League draw and the finals systems are
given. The thesis also investigates the use of dynamic programming to optimise tactics
in football and cricket. The thesis develops tables giving the optimal run rate and the
expected score or probability of winning at any stage of a one-day cricket innings.
They show a common strategy in one-day cricket to be non-optimal, and a heuristic is
developed that is near optimal under a range of parameter variations. A range of
dynamic programming models are presented, allowing for batsmen of different abilities
and various objective functions. Their application to performance modelling are shown
by developing a radically different performance measure for one day cricket, and
applying it to a one-day series.