Nearest-neighbor matchup effects: accounting for team matchups for predicting March Madness
Recently, the surge of predictive analytics competitions has improved sports predictions by fostering data-driven inference and steering clear of human bias. This article details methods developed for Kaggle’s March Machine Learning Mania competition for the 2014 NCAA tournament. A submission to the competition consists of outcome probabilities for each potential matchup. Most predictive models are based entirely on measures of overall team strength, resulting in the unintended “transitive property.” These models are therefore unable to capture specific matchup tendencies. We introduce our novel nearest-neighbor matchup effects framework, which presents a flexible way to account for team characteristics above and beyond team strength that may influence game outcomes. In particular we develop a general framework that couples a model predicting a point spread with a clustering procedure that borrows strength from games similar to a current matchup. This results in a model capable of issuing predictions controlling for team strength and that capture specific matchup characteristics.
Year of publication: |
2015
|
---|---|
Authors: | Andrew, Hoegh ; Marcos, Carzolio ; Ian, Crandell ; Xinran, Hu ; Lucas, Roberts ; Yuhyun, Song ; Leman Scotland C. |
Published in: |
Journal of Quantitative Analysis in Sports. - De Gruyter, ISSN 1559-0410. - Vol. 11.2015, 1, p. 29-37
|
Publisher: |
De Gruyter |
Saved in:
Saved in favorites
Similar items by person
-
Defining the Performance Coefficient in Golf: A Case Study at the 2009 Masters
Andrew, Hoegh, (2011)
-
Life on the bubble: Who’s in and who’s out of March Madness?
Leman Scotland C., (2014)
- More ...