A Joint Domain-Specific Pre-Training Method Based on Data Enhancement
نویسندگان
چکیده
State-of-the-art performances for natural language processing tasks are achieved by supervised learning, specifically, fine-tuning pre-trained models such as BERT (Bidirectional Encoder Representation from Transformers). With increasingly accurate models, the size of fine-tuned pre-training corpus is becoming larger and larger. However, very few studies have explored selection corpus. Therefore, this paper proposes a data enhancement-based domain method. At first, task downstream jointly trained to alleviate catastrophic forgetting problem generated existing classical methods. Then, based on hard-to-classify texts identified tasks’ feedback, can be reconstructed selecting similar it. The learning deepen model’s understanding undeterminable text expressions, thus enhancing feature extraction ability texts. Without any pre-processing corpus, experiments conducted two tasks, named entity recognition (NER) classification (CLS). results show that selected proposed method supplement domain-specific information improve performance basic model achieve best compared with other benchmark
منابع مشابه
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملDomain-specific enhancement of metacognitive ability following meditation training.
Contemplative mental practices aim to enable individuals to develop greater awareness of their own cognitive and affective states through repeated examination of first-person experience. Recent cross-sectional studies of long-term meditation practitioners suggest that the subjective reports of such individuals are better calibrated with objective indices; however, the impact of mental training ...
متن کاملA HMM-based pre-training approach for sequential data
Much recent research highlighted the critical role of unsupervised pre-training to improve the performance of neural network models. However, extensions of those architectures to the temporal domain introduce additional issues, which often prevent to obtain good performance in a reasonable time. We propose a novel approach to pre-train sequential neural networks in which a simpler, approximate ...
متن کاملDiagnosis of diabetes by using a data mining method based on native data
Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis will result in irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...
متن کاملA Probabilistic Three-Phase Time Domain Electric Arc Furnace Model based on analytical method
An electric arc furnace (EAF) is known as nonlinear and time variant load that causes power quality (PQ) problems such as, current, voltage and current harmonics, voltage flicker, frequency changes in power system. One of the most important problems to study the EAF behavior is the choice of a suitable model for this load. Hence, in this paper, a probabilistic three-phase model is proposed base...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13074115