نتایج جستجو برای: term frequency and inverse document frequency tf idf

تعداد نتایج: 16977020  

Journal: :Khazanah informatika 2023

Literature review is the first step in starting research for a deep understanding of interest. However, finding literature relevant to interests difficult and takes time. Skyline query method that can be used filtering. An object p said dominate q if equals on all its attributes, at least better than one attribute. Categorical Data Search (CDSS) an algorithm filter skyline objects categorical d...

2004
Mark P. Sinka David W. Corne

Web document analysis, and its associated research, underpins much of what is referred to as web intelligence and the envisaged ‘semantic web’. A key issue in this field is how to encode a web document from the raft of potential document “features” without losing salient information. Current research almost always uses word-based feature vectors such as term frequency of specific words (TF) and...

2013
Weimao Ke

We propose a new theory that quantifies information in probability distributions and derive a new document representation model for text clustering. By extending Shannon entropy to accommodate a non-linear relation between information and uncertainty, the proposed Least Information theory (LIT) provides insight into how terms can be weighted based on their probability distributions in documents...

Journal: :Computers, materials & continua 2021

Social networking services (SNSs) provide massive data that can be a very influential source of information during pandemic outbreaks. This study shows social media analysis used as crisis detector (e.g., understanding the sentiment users regarding various outbreaks). The novel Coronavirus Disease-19 (COVID-19), commonly known coronavirus, has affected everyone worldwide in 2020. Streaming Twit...

Journal: :Open Journal for Information Technology 2022

Text summarization plays an important role in the area of natural language processing. The need for information all over world to solve specific problems keeps on increasing daily. This poses a greater challenge as data stored internet has gradually increased exponentially time. Finding out relevant and manually summarizing it short time is challenging tedious task human being. Summarization ai...

Journal: :ACM Transactions on Asian and Low-Resource Language Information Processing 2023

Phishing involves malicious activity whereby phishers, in the disguise of legitimate entities, obtain illegitimate access to victims’ personal and private information, usually through emails. Currently, phishing attacks threats are being handled effectively use latest email detection solutions. Most current systems assume be English, though other languages growing. In particular, Arabic is a wi...

Journal: :International Journal of Electrical and Computer Engineering 2022

<p>The main focus of this research is to find the reasons behind fresh cases COVID-19 from public’s perception for data specific India. The analysis done using machine learning approaches and validating inferences with medical professionals. processing accomplished in three steps. First, dimensionality vector space model (VSM) reduced improvised feature engineering (FE) process by a weigh...

2005
Fabius Klemm Karl Aberer

There has been an increasing research interest in developing full-text retrieval based on peer-to-peer (P2P) technology. So far, these research efforts have largely concentrated on efficiently distributing an index. However, ranking of the results retrieved from the index is a crucial part in information retrieval. To determine the relevance of a document to a query, ranking algorithms use coll...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید