Nearest-neighbor matchup effects: accounting for team matchups for predicting March Madness

The recent surge of predictive analytics competitions has improved sports predictions by fostering data-driven inference and steering clear of human bias. This article details methods developed for Kaggle’s March Machine Learning Mania competition for the 2014 NCAA tournament. A submission to the competition consists of outcome probabilities for each potential matchup. Most predictive models are based entirely on measures of overall team strength, resulting in the unintended “transitive property.” These models are therefore unable to capture specific matchup tendencies. We introduce our novel nearest-neighbor matchup effects framework, which presents a flexible way to account for team characteristics above and beyond team strength that may influence game outcomes. In particular, we develop a general framework that couples a model predicting a point spread with a clustering procedure that borrows strength from games similar to a current matchup. This results in a model capable of issuing predictions that control for team strength and capture specific matchup characteristics.
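
The coupling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the team names, strength ratings, trait vectors, game history, the Euclidean distance, the choice of k, and the normal margin model with sigma = 11 points are all assumptions made for illustration. A strength-only spread is adjusted by the average residual of the k past games whose matchup characteristics are closest to the current one, and the adjusted spread is then converted to a win probability.

```python
import math

# All data below is hypothetical and exists only to make the sketch runnable.
strength = {"A": 8.0, "B": 5.0, "C": 2.0, "D": 0.0}   # overall team strength, in points
traits = {                                            # e.g. (tempo, three-point rate), assumed
    "A": (0.90, 0.20),
    "B": (0.30, 0.80),
    "C": (0.85, 0.25),
    "D": (0.35, 0.75),
}
# Past games as (team_1, team_2, margin for team_1).
history = [("C", "D", 6.0), ("A", "C", 5.0), ("B", "D", 4.0), ("A", "D", 9.0)]


def base_spread(a, b):
    """Strength-only point spread; this part alone is transitive."""
    return strength[a] - strength[b]


def matchup_features(a, b):
    """Describe a matchup by concatenating both teams' trait vectors."""
    return traits[a] + traits[b]


def nn_matchup_effect(a, b, k=2):
    """Average residual (actual margin minus strength-only spread)
    over the k past games most similar to the matchup a vs. b."""
    target = matchup_features(a, b)
    scored = []
    for t1, t2, margin in history:
        dist = math.dist(target, matchup_features(t1, t2))
        scored.append((dist, margin - base_spread(t1, t2)))
    scored.sort(key=lambda pair: pair[0])
    nearest = scored[:k]
    return sum(res for _, res in nearest) / len(nearest)


def win_probability(spread, sigma=11.0):
    """P(first team wins) under an assumed normal model for the margin."""
    return 0.5 * (1.0 + math.erf(spread / (sigma * math.sqrt(2.0))))


effect = nn_matchup_effect("A", "B")
adjusted = base_spread("A", "B") + effect
print(base_spread("A", "B"), effect, adjusted, win_probability(adjusted))
```

In this toy data the games most similar to A vs. B are those in which a team with A-like traits met a team with B-like traits, so their residuals shift the prediction away from what team strength alone would give.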

Year of publication:	2015
Authors:	Hoegh, Andrew ; Carzolio, Marcos ; Crandell, Ian ; Hu, Xinran ; Roberts, Lucas ; Song, Yuhyun ; Leman, Scotland C.
Published in:	Journal of Quantitative Analysis in Sports. - De Gruyter, ISSN 1559-0410. - Vol. 11.2015, 1, p. 29-37
Publisher:	De Gruyter

Extent:	text/html
Type of publication:	Article
Source:	RePEc - Research Papers in Economics

Persistent link: https://www.econbiz.de/10011200190