Text Mining and Site Outlining Projects
نویسندگان
چکیده
2 Knowledge discovery from a large amount of unstructured or semi-structured text (KDT) has been quickly forming a major research trend. In particular, it has become extremely important for customer relationship management (CRM) and business intelligence (BI) applications since KDT will be able to go beyond conventional demographic and stochastic analysis of databases, and focus on textual information as a source of rich “context” for facts and entities. In this paper, we introduce two such projects – text mining and site outlining – conducted at the Tokyo Research Laboratory, IBM Research.
منابع مشابه
Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...
متن کاملAntecedents of open source software defects: A data mining approach to model formulation, validation and testing
This paper develops tests and validates a model for the antecedents of open source software (OSS) defects, using Data and Text Mining. The public archives of OSS projects are used to access historical data on over 5,000 active and mature OSS projects. Using domain knowledge and exploratory analysis, a wide range of variables is identified from the process, product, resource, and end-user charac...
متن کاملارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متنکاوی در حوزه یادگیری الکترونیکی
As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...
متن کاملA Text Mining Approach to Tracking Elements of Decision Making: a pilot study
Understanding rework, the causes of rework, and the relationship between issues, decisions and the associated actions, is crucial in minimizing the fundamental industrial problems in system engineering projects. The aim of our research is to apply text mining techniques to track elements of decision making and extract semantic associations between decisions, actions and rework. Text mining is s...
متن کاملA review of text mining approaches and their function in discovering and extracting a topic
Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling. Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...
متن کامل