Keyword Extraction from Scientific Research Projects Based on SRP-TF-IDF

Abstract

Keyword extraction by term frequency-inverse document frequency (TF-IDF) is used for text information retrieval and mining in many domains, such as news text, social contact text, and medical text. However, keyword extraction in special domains still needs to be improved and optimized, particularly in the scientific research field. The traditional TF-IDF algorithm considers only word frequency in documents, but not domain characteristics. Therefore, we propose the Scientific Research Project TF-IDF (SRP-TF-IDF) model, which combines TF-IDF with a weight balance designed to recalculate the weights of candidate keywords. We have implemented the SRP-TF-IDF model and verified that our method has better precision, recall, and F1 score than TextRank methods. In addition, we investigated the parameter of the model to find an optimal value from scientific research projects.
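
For readers unfamiliar with the baseline, the following is a minimal sketch of plain TF-IDF keyword ranking and of the precision, recall, and F1 measures named in the abstract. It is not the SRP-TF-IDF model: the abstract does not specify the weight balance step, so it is left out here. The function names `tfidf_keywords` and `precision_recall_f1`, the whitespace tokenizer, and the toy documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_keywords(docs, doc_index, top_k=3):
    """Rank the words of docs[doc_index] by plain TF-IDF (hypothetical helper)."""
    # Assumption: lowercase whitespace tokenization, no stop-word filtering.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each word.
    df = Counter(word for tokens in tokenized for word in set(tokens))
    tokens = tokenized[doc_index]
    tf = Counter(tokens)
    scores = {
        word: (count / len(tokens)) * math.log(n_docs / df[word])
        for word, count in tf.items()
    }
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


def precision_recall_f1(predicted, gold):
    """Compare extracted keywords against a reference keyword set."""
    predicted, gold = set(predicted), set(gold)
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


# Toy corpus (illustrative only, not data from the paper).
docs = [
    "keyword extraction research project text keyword",
    "news text retrieval and text mining",
    "medical text mining",
]
keywords = tfidf_keywords(docs, 0, top_k=2)
print(keywords)
print(precision_recall_f1(keywords, ["keyword", "project"]))
```

In this sketch a word that occurs in every document, such as "text", receives an IDF of zero and falls to the bottom of the ranking. That illustrates why frequency alone says nothing about domain characteristics, which is the gap the abstract says SRP-TF-IDF targets.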

Similar resources

Keyword Extraction From Chinese Text Based On Multidimensional Weighted Features

This paper proposed to solve the problems of incomplete coverage and low accuracy in keyword extraction of Chinese text based on intrinsic feature of the Chinese language and an extraction method of multidimensional information weighted eigenvalues. This method combined theoretical analysis and experimental calculation to study the parts of speech, word position, word length, semantic similarit...

Keyword Extraction Based on Implicit Feedback

To improve the results from search engines and make them more personalized for the user, we need to find out about the interests of a particular user. Many of the search personalization methods analyse documents visited by the user and from these documents infer the user’s interests. However, this approach is not accurate, because the user is rarely interested in the whole document; he might be...

Method Mention Extraction from Scientific Research Papers

Scientific publications contain many references to method terminologies used during scientific experiments. New terms are constantly created within the research community, especially in the biomedical domain where thousands of papers are published each week. In this study we report our attempt to automatically extract such method terminologies from scientific research papers, using rule-based a...

Citation Analysis and Keyword Mining based on Fulltext Extraction of Scientific Literature

Citation analysis as a meaningful research tool has been studied for a long time for domain information visualization, information retrieval, and bibliometric analysis. This paper proposes three steps of mining keyword relationships using citation graph analysis based on the fulltext of scientific literature in the scientific publication database. First, the method Citation Probability Distribu...

Journal

Journal title: Chinese Journal of Electronics

Year: 2021

ISSN: 1022-4653, 2075-5597

DOI: https://doi.org/10.1049/cje.2021.05.007