نتایج جستجو برای: after pre cleaning

تعداد نتایج: 1917410  

2014
Harmit kaur Hardeep singh

The amount of web applications are increasing in large amount and users of web applications are also increasing rapidly with high speed. By increasing number of users the size of log file also increases .The information which stores in log files cannot be directly used for analysis. Therefore preprocessing of log files is necessary to improve the quality of web usage mining process. Preprocessi...

Journal: :J. Informetrics 2011
Marek Gagolewski

In this paper CITAN, the CITation ANalysis package for R statistical computing environment, is introduced. The main aim of the software is to support bibliometricians with a tool for preprocessing and cleaning bibliographic data retrieved from SciVerse Scopus and for calculating the most popular indices of scientific impact. To show the practical usability of the package, an exemplary assessmen...

2004
Filipe Pinto Manuel Filipe Santos Paulo Cortez Hélder Quintela

To increase effectiveness in their marketing and Customer Relationship Manager activities, many organizations are adopting strategies of Database Marketing (DBM). Nowadays, DBM faces new challenges in business knowledge since current strategies are mainly approached by classical statistical inference, which may fail when complex, multi-dimensional and incomplete data is available. An alternativ...

1999
Sherry Dixon Ellen Tohn Ron Rupp Scott Clark

This study evaluated and compared two procedures to clean lead dust and debris after lead hazard control activities were completed in housing with lead-based paint hazards. 1995 Federal guidelines prepared by the U.S. Department of Housing and Urban Development for the control of lead-based paint hazards in housing strongly recommend that after lead hazard control interventions all walls, ceili...

2011
Araly Barrera Rakesh M. Verma Ryan Vincent

This work presents, SemQuest, a questionanswering system used in the TAC 2011 guided summarization task based on semantics and extensions of a previous-developed single-document extractor. Our overall methodology includes: a data cleaning step, linguistic preprocessing among category articles, and a sentence extraction phase. A maximal marginal relevance technique, proposed by Carbonell et al.,...

2002
J. Pelt S. Refsdal R. Stabell

We present a short re–evaluation of a recently published time delay estimate for the gravitational lens system HE 1104–1805 with emphasis on important methodological aspects: bias of the statistics, inconsistency of the methods and use of the purposeful selection of data points (or so–called " cleaning ") at the preprocessing stage. We show how the inadequate use of simple analysis methods can ...

2012
Geetali Banerji Kanak Saxena

Classification rule mining aims to discover a small set of rules in the database that forms an accurate classifier. In classification rule mining there is one and only one predetermined target. In this paper, we proposed an algorithm, which performs preprocessing and cleaning prior to traditional classification. Experimental results show that the classifier built this way is, in general, more a...

Journal: :CoRR 2011
C. Ramya K. S. Shreedhara G. Kavitha

Web log data is usually diverse and voluminous. This data must be assembled into a consistent, integrated and comprehensive view, in order to be used for pattern discovery. Without properly cleaning, transforming and structuring the data prior to the analysis, one cannot expect to find meaningful patterns. As in most data mining applications, data preprocessing involves removing and filtering r...

2006
Wilson Wong Wei Liu Mohammed Bennamoun

An increasing number of language and speech applications are gearing towards the use of texts from online sources as input. Despite such rise, not much work can be found in the aspect of integrated approaches for cleaning dirty texts from online sources. This paper presents a mechanism of Integrated Scoring for Spelling error correction, Abbreviation expansion and Case restoration (ISSAC). The ...

2002
Osmar R. Zaïane Maria-Luiza Antonie Alexandru Coman

This paper proposes a new classification method based on association rule mining. This association rule-based classifier is experimented on a real dataset; a database of medical images. The system we propose consists of: a preprocessing phase, a phase for mining the resulted transactional database, and a final phase to organize the resulted association rules in a classification model. The exper...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید