Applying Machine Learning to Product Categorization

نویسندگان

  • Sushant Shankar
  • Irving Lin
چکیده

We present a method for classifying products into a set of known categories by using supervised learning. That is, given a product with accompanying informational details such as name and descriptions, we group the product into a particular category with similar products, e.g., ‘Electronics’ or ‘Automotive’. To do this, we analyze product catalog information from different distributors on Amazon.com to build features for a classifier. Our implementation results show significant improvement over baseline results. Taking into particular criteria, our implementation is potentially able to substantially increase automation of categorization of products. General Terms Supervised and Unsupervised Learning, E-Commerce

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying an existing machine learning algorithm to text categorization

The information retrieval community is becoming increasingly interested in machine learning techniques, of which text catego-rization is an application. This paper describes how we have applied an existing similarity-based learning algorithm, Charade, to the text cat-egorization problem and compares the results with those obtained using decision tree construction algorithms. From a machine lear...

متن کامل

A Machine Learning Approach for Product Matching and Categorization

Consumers today have the option to purchase products from thousands of e-shops. However, the completeness of the product specifications and the taxonomies used for organizing the products differ across different e-shops. To improve the consumer experience, e.g., by allowing for easily comparing offers by different vendors, approaches for product integration on the Web are needed. In this paper,...

متن کامل

A comparative study of two automatic document classification methods in a library setting

In current library practice, trained human experts usually carry out document cataloguing and indexing based on a manual approach. With the explosive growth in the number of electronic documents available on the Internet and digital libraries, it is increasingly difficult for library practitioners to categorize both electronic documents and traditional library materials using just a manual appr...

متن کامل

The Process of Applying Machine Learning Algorithms

In this paper we present a view of the overall process of application development for real-world classiication and regression problems. Speciically, we identify, categorize and discuss the various problem-speciic factors that innuence this process.

متن کامل

Multi-Feature Segmentation and Cluster based Approach for Product Feature Categorization

At a recent time, the web has become a valuable source of online consumer review however as the number of reviews is growing in high speed. It is infeasible for user to read all reviews to make a valuable or satisfying decision because the same features, people can write it contrary words or phrases. To produce a useful summary of domain synonyms words and phrase, need to be a group into same f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011