Contextual Urdu Lemmatization Using Recurrent Neural Network Models
نویسندگان
چکیده
In the field of natural language processing, machine translation is a colossally developing research area that helps humans communicate more effectively by bridging linguistic gap. translation, normalization and morphological analyses are first perhaps most important modules for information retrieval (IR). To build analyzer, or to complete process, it extract correct root out different words. Stemming lemmatization techniques commonly used find words in language. However, few studies on IR systems Urdu have shown effective than stemming due infixes found This paper presents algorithm based recurrent neural network models resource-scarce languages such as not very common. The proposed model trained tested two datasets, namely, Monolingual Corpus (UMC) Universal Dependencies (UDU). datasets lemmatized with help models. Word2Vec edit trees generate semantic syntactic embedding. Bidirectional long short-term memory (BiLSTM), bidirectional gated unit (BiGRU), (BiGRNN), attention-free encoder–decoder (AFED) under defined hyperparameters. Experimental results show encoder-decoder achieves an accuracy, precision, recall, F-score 0.96, 0.95, respectively, outperforms existing
منابع مشابه
Estimation of Reference Evapotranspiration Using Artificial Neural Network Models and the Hybrid Wavelet Neural Network
Estimation of evapotranspiration is essential for planning, designing and managing irrigation and drainage schemes, as well as water resources management. In this research, artificial neural networks, neural network wavelet model, multivariate regression and Hargreaves' empirical method were used to estimate reference evapotranspiration in order to determine the best model in terms of efficienc...
متن کاملDaily Pan Evaporation Estimation Using Artificial Neural Network-based Models
Accurate estimation of evaporation is important for design, planning and operation of water systems. In arid zones where water resources are scarce, the estimation of this loss becomes more interesting in the planning and management of irrigation practices. This paper investigates the ability of artificial neural networks (ANNs) technique to improve the accuracy of daily evaporation estimation....
متن کاملInvestigating how well contextual features are captured by bi-directional recurrent neural network models
Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined appropriate contextual features. On the other hand, neural network models learn these features automatically and have been successfully applied for several NLP tasks. Such models only consider vector representation of words and thus do not require efforts for manual feature engineering. This m...
متن کاملOCR For Printed Urdu Script Using Feed Forward Neural Network
This paper deals with an Optical Character Recognition system for printed Urdu, a popular Pakistani/Indian script and is the third largest understandable language in the world, especially in the subcontinent but fewer efforts are made to make it understandable to computers. Lot of work has been done in the field of literature and Islamic studies in Urdu, which has to be computerized. In the pro...
متن کاملProsodically-enhanced recurrent neural network language models
Recurrent neural network language models have been shown to consistently reduce the word error rates (WERs) of large vocabulary speech recognition tasks. In this work we propose to enhance the RNNLMs with prosodic features computed using the context of the current word. Since it is plausible to compute the prosody features at the word and syllable level we have trained the models on prosody fea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematics
سال: 2023
ISSN: ['2227-7390']
DOI: https://doi.org/10.3390/math11020435