How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation
نویسندگان
چکیده
Recently a few systems for automatically solving math word problems have reported promising results. However, the datasets used for evaluation have limitations in both scale and diversity. In this paper, we build a large-scale dataset which is more than 9 times the size of previous ones, and contains many more problem types. Problems in the dataset are semiautomatically obtained from community question-answering (CQA) web pages. A ranking SVM model is trained to automatically extract problem answers from the answer text provided by CQA users, which significantly reduces human annotation cost. Experiments conducted on the new dataset lead to interesting and surprising results.
منابع مشابه
Deep Neural Solver for Math Word Problems
This paper presents a deep neural solver to automatically solve math word problems. In contrast to previous statistical learning approaches, we directly translate math word problems to equation templates using a recurrent neural network (RNN) model, without sophisticated feature engineering. We further design a hybrid model that combines the RNN model and a similarity-based retrieval model to a...
متن کاملLIM-G: Learner-initiating instruction model based on cognitive knowledge for geometry word problem comprehension
Computer-assisted instruction systems have been broadly applied to help students solve math word problem. The majority of such systems, which are based on an instructor-initiating instruction strategy, provide pre-designed problems for the learners. When learners are asked to solve a word problem, the system will instruct the learners what to do. However, systems employing an instructor-initiat...
متن کاملMAWPS: A Math Word Problem Repository
Recent work across several AI subdisciplines has focused on automatically solving math word problems. In this paper we introduce MAWPS, an online repository of Math Word Problems, to provide a unified testbed to evaluate different algorithms. MAWPS allows for the automatic construction of datasets with particular characteristics, providing tools for tuning the lexical and template overlap of a ...
متن کاملConstruction and Validation of a Questionnaire on Metacognitive Knowledge Needed in Solving Mathematical Word Problems to be Used in Interviews
To provide researchers with an instrument, valid and reliable enough, for measuring students’ metacognitive knowledge needed in solving mathematical word problems, based on the theoretical foundation and previous research, a set of 24 questions at three levels of metacognitive knowledge was constructed. The initial validity of these questions was confirmed by Psychology Professors and high scho...
متن کاملCONSTRAINED BIG BANG-BIG CRUNCH ALGORITHM FOR OPTIMAL SOLUTION OF LARGE SCALE RESERVOIR OPERATION PROBLEM
A constrained version of the Big Bang-Big Crunch algorithm for the efficient solution of the optimal reservoir operation problems is proposed in this paper. Big Bang-Big Crunch (BB-BC) algorithm is a new meta-heuristic population-based algorithm that relies on one of the theories of the evolution of universe namely, the Big Bang and Big Crunch theory. An improved formulation of the algorithm na...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016