A Machine Learning Approach to March Madness∗
نویسندگان
چکیده
The aim of this experiment was to learn which learning model, feature selection technique, ordering process, and distance criteria provided the best classification accuracy in predicting the outcome of any NCAA Men’s Basketball Tournament match. Ninety-four features were selected from ESPN.com for each team accepted into the tournament for the last four years. Ordering processes were tested against the baseline and the random ordering increased classification accuracy from 0.61 to 0.63. Random ordering was used to test a variety of feature reduction techniques. Random forest feature reduction performed best and increased accuracy from 0.63 to 0.7322 when used in conjunction with kNN and limiting the number of features to five. Using Manahattan distances as opposed to Euclidean distance further increased accuracy from 0.7322 to 0.7362.
منابع مشابه
Making Sense of the Mayhem: Machine Learning and March Madness
The goal of our research was to be able to accurately predict the outcome of every matchup in a March Madness bracket. This is an extraordinarily difficult problem because of the high amount of variance in college basketball and the sheer number of games that are played in the tournament. In the history of March Madness, no one has ever created a perfect bracket. Last year, Warren Buffett agree...
متن کاملA Hybrid Machine Learning Method for Intrusion Detection
Data security is an important area of concern for every computer system owner. An intrusion detection system is a device or software application that monitors a network or systems for malicious activity or policy violations. Already various techniques of artificial intelligence have been used for intrusion detection. The main challenge in this area is the running speed of the available implemen...
متن کاملDebt Collection Industry: Machine Learning Approach
Businesses are increasingly interested in how big data, artificial intelligence, machine learning, and predictive analytics can be used to increase revenue, lower costs, and improve their business processes. In this paper, we describe how we have developed a data-driven machine learning method to optimize the collection process for a debt collection agency. Precisely speaking, we create a frame...
متن کاملGeoreferencing Semi-Structured Place-Based Web Resources Using Machine Learning
In recent years, the shared content on the web has had significant growth. A great part of these information are publicly available in the form of semi-strunctured data. Moreover, a significant amount of these information are related to place. Such types of information refer to a location on the earth, however, they do not contain any explicit coordinates. In this research, we tried to georefer...
متن کاملForward kinematic analysis of planar parallel robots using a neural network-based approach optimized by machine learning
The forward kinematic problem of parallel robots is always considered as a challenge in the field of parallel robots due to the obtained nonlinear system of equations. In this paper, the forward kinematic problem of planar parallel robots in their workspace is investigated using a neural network based approach. In order to increase the accuracy of this method, the workspace of the parallel robo...
متن کامل