Sentence Correction Based on Large-scale Language Modelling
نویسنده
چکیده
With the further development of informatization, more and more data is stored in the form of text. There are some loss of text during their generation and transmission. The paper aims to establish a language model based on the large-scale corpus to complete the restoration of missing text. In this paper, we introduce a novel measurement to find the missing words, and a way of establishing a comprehensive candidate lexicon to insert the correct choice of words. The paper also introduces some effective optimization methods, which largely improve the efficiency of the text restoration and shorten the time of dealing with 1000 sentences into 3.6 seconds.
منابع مشابه
The Role of Emotioncy in Cognitive Load and Sentence Comprehension of Language Learners
Emotion and cognition are both considered influential factors in language learning. In this study, the role of "emotioncy" (which is a combination of emotion and frequency) in the cognitive load and sentence comprehension of a group of language learners was examined. Emotioncy includes emotions that are evoked by the senses. To this aim, 200 English as a foreign language (EFL) learners were ask...
متن کاملSpeech and Language Resources for LVCSR of Russian
A syllable-based language model reduces the lexicon size by hundreds of times. It is especially beneficial in case of highly inflective languages like Russian due to the abundance of word forms according to various grammatical categories. However, the main arising challenge is the concatenation of recognised syllables into the originally spoken sentence or phrase, particularly in the presence o...
متن کامل3D Modelling of Under Ground Burried Objects Based on Ground Penetration Radar
There is a growing demand for mapping and 3D modelling of buried objects such as pipelines, agricultural hetitage, landmines and other buried objects. Usually, large scale and high resolution maps from these objects are needed. Manually map generation and modeling of these objects are cost and time consuming and is dependent on lots of resources. Therefore, automating the subsurface mapping and...
متن کاملThe Effect of Sentence-Writing Practice on Iranian low-intermediate EFL Learners’ L2 Grammatical Accuracy
This study aimed to investigate the effect of sentence writing practice on male and female low-intermediate students’ English grammatical accuracy. The question this study tried to answer does English grammatical accuracy can be affected by sentence writing practice. To find the answer to the question, 15 low intermediate level students from Kish away institute were selected. They were both mal...
متن کاملSpelling Correction Using Context * Mohammad
This paper describes a spelling correction system that functions as part of an intelligent tutor that carries on a natural language dialogue with its users. The process that searches the lexicon is adaptive as is the system filter, to speed up the process. The basis of our approach is the interaction between the parser and the spelling corrector. Alternative correction targets are fed back to t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1709.07777 شماره
صفحات -
تاریخ انتشار 2017