Inferring Document Readability by Integrating Text and Eye Movement Features
نویسندگان
چکیده
Latest research has shown that the readability of documents plays an important role in information seeking and acquisition, especially for non-domain-expert users. Classical document readability measures are based on surface text features, independent of users. In this paper, we propose to integrate text features with the users’ eye movement features. The latter is expected to reflect a user’s reading level, thus can be used to measure document readability in a personalized way. Based on the eye tracking data collected from our preliminary user evaluation, we investigated the impacts of different features on document readability prediction. The results show that the combination of text and eye movement features has a higher correlation with human judgments than using either of them individually.
منابع مشابه
Inferring Document Readability by Integrating Eye Movement Features
Latest research has shown that the readability of documents plays an important role in information seeking and acquisition, especially for non-domain-expert users. Classical document readability measures are based on surface text features, independent of users. In this paper, we propose to integrate text features with the users’ eye movement features. The latter is expected to reflect a user’s ...
متن کاملReadability Assessment for Text Simplification: From Analyzing Documents to Identifying Sentential Simplifications
Readability assessment can play a role in the evaluation of a simplification algorithm as well as in the identification of what to simplify. While some previous research used traditional readability formulas to evaluate text simplification, there is little research into the utility of readability assessment for identifying and analyzing sentence level targets for text simplification. We explore...
متن کاملAuthor gender identification from text using Bayesian Random Forest
Nowadays high usage of users from virtual environments and their connection via social networks like Facebook, Instagram, and Twitter shows the necessity of finding out shared subjects in this environment more than before. There are several applications that benefit from reliable methods for inferring age and gender of users in social media. Such applications exist across a wide area of fields,...
متن کاملREADABILITY STUDIES: How TECHNOCENTRISM CAN COMPROMISE RESEARCH AND LEGAL DETERMINATIONS
One way to determine whether consumers understand a document is to use a readability formula to assign it a score.' These formulas calculate readability by counting such variables as the number of words and syllables in a passage or document. The idea of readability formulas has been defined as "an equation which combines those text features that best predict text difficulty. The equation is us...
متن کاملPredicting Text Relevance from Sequential Reading Behavior
In this paper we show that it is possible to make good predictions of text relevance, from only features of conscious eye movements during reading. We pay special attention to the order in which the lines of text are read, and compute simple features of this sequence. Artificial neural networks are applied to classify the relevance of the read lines. The use of ensemble techniques creates stabl...
متن کامل