A Machine Learning Approach to March Madness∗

نویسندگان

  • Jared Forsyth
  • Andrew Wilde
چکیده

The aim of this experiment was to learn which learning model, feature selection technique, ordering process, and distance criteria provided the best classification accuracy in predicting the outcome of any NCAA Men’s Basketball Tournament match. Ninety-four features were selected from ESPN.com for each team accepted into the tournament for the last four years. Ordering processes were tested against the baseline and the random ordering increased classification accuracy from 0.61 to 0.63. Random ordering was used to test a variety of feature reduction techniques. Random forest feature reduction performed best and increased accuracy from 0.63 to 0.7322 when used in conjunction with kNN and limiting the number of features to five. Using Manahattan distances as opposed to Euclidean distance further increased accuracy from 0.7322 to 0.7362.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Making Sense of the Mayhem: Machine Learning and March Madness

The goal of our research was to be able to accurately predict the outcome of every matchup in a March Madness bracket. This is an extraordinarily difficult problem because of the high amount of variance in college basketball and the sheer number of games that are played in the tournament. In the history of March Madness, no one has ever created a perfect bracket. Last year, Warren Buffett agree...

متن کامل

A Hybrid Machine Learning Method for Intrusion Detection

Data security is an important area of concern for every computer system owner. An intrusion detection system is a device or software application that monitors a network or systems for malicious activity or policy violations. Already various techniques of artificial intelligence have been used for intrusion detection. The main challenge in this area is the running speed of the available implemen...

متن کامل

Debt Collection Industry: Machine Learning Approach

Businesses are increasingly interested in how big data, artificial intelligence, machine learning, and predictive analytics can be used to increase revenue, lower costs, and improve their business processes. In this paper, we describe how we have developed a data-driven machine learning method to optimize the collection process for a debt collection agency. Precisely speaking, we create a frame...

متن کامل

Georeferencing Semi-Structured Place-Based Web Resources Using Machine Learning

In recent years, the shared content on the web has had significant growth. A great part of these information are publicly available in the form of semi-strunctured data. Moreover, a significant amount of these information are related to place. Such types of information refer to a location on the earth, however, they do not contain any explicit coordinates. In this research, we tried to georefer...

متن کامل

Forward kinematic analysis of planar parallel robots using a neural network-based approach optimized by machine learning

The forward kinematic problem of parallel robots is always considered as a challenge in the field of parallel robots due to the obtained nonlinear system of equations. In this paper, the forward kinematic problem of planar parallel robots in their workspace is investigated using a neural network based approach. In order to increase the accuracy of this method, the workspace of the parallel robo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014