Automatic Chinese Catchword Extraction Based on Time Series Analysis
نویسندگان
چکیده
Catchwords refer to those popular words or phrases in a time period. In this paper, we propose a novel approach for automatic extraction of Chinese catchwords. By analyzing features of catchwords, we define three aspects to describe Popular Degree of catchwords. Then we use curve fitting in Time Series Analysis to build Popular Degree Curves of the extracted terms. Finally we give a formula that can calculate Popular Degree values of catchwords and get a ranking list of catchword candidates. Experiments show that the method is effective.
منابع مشابه
A Research on Automatic Chinese Catchword Extraction
Catchwords refer to popular words or phrases within certain area in certain period of time. In this paper, we propose a novel approach for automatic Chinese catchwords extraction. At the beginning, we discuss the linguistic definition of catchwords and analyze the features of catchwords by manual evaluation. According to those features of catchwords, we define three aspects to describe Popular ...
متن کاملA Comparative Study of the Effect of Word Segmentation On Chinese Terminology Extraction
Automatic term extraction is the first step towards automatic or semi-automatic update of existing domain knowledge base. Most of the researches applied word segmentation as a preprocessing step to Chinese term extraction. However, segmentation ambiguity is unavoidable, especially in identifying unknown words for Chinese. In this paper, we discuss the effect and limitations of segmentation to C...
متن کاملAutomatic Lane Extraction in Hemoglobin and Serum Protein Electrophoresis Using Image Processing
Image analysis is an image processing technique that aims to extract features or information from images. Image analysis in medicine has a special place because is a basis for disease diagnosis for physicians. Electrophoresis is a laboratory separating technique. Electrophoresis images are created during the electrophoresis process. Serum protein and hemoglobin electrophoresis test are the ...
متن کاملAutomatic Lane Extraction in Hemoglobin and Serum Protein Electrophoresis Using Image Processing
Image analysis is an image processing technique that aims to extract features or information from images. Image analysis in medicine has a special place because is a basis for disease diagnosis for physicians. Electrophoresis is a laboratory separating technique. Electrophoresis images are created during the electrophoresis process. Serum protein and hemoglobin electrophoresis test are the ...
متن کاملAutomated cropping intensity extraction from isolines of wavelet spectra
Timely and accurate monitoring of cropping intensity (CI) is essential to help us understand changes in food production. This paper aims to develop an automatic Cropping Intensity extraction method based on the Isolines of Wavelet Spectra (CIIWS) with consideration of intra-class variability. The CIIWS method involves the following procedures: (1) characterizing vegetation dynamics from time–fr...
متن کامل