An information-pattern-based approach to novelty detection

نویسندگان

  • Xiaoyan Li
  • W. Bruce Croft
چکیده

In this paper, a new novelty detection approach based on the identification of sentence level information patterns is proposed. First, ‘‘novelty’’ is redefined based on the proposed information patterns, and several different types of information patterns are given corresponding to different types of users’ information needs. Second, a thorough analysis of sentence level information patterns is elaborated using data from the TREC novelty tracks, including sentence lengths, named entities (NEs), and sentence level opinion patterns. Finally, a unified information-pattern-based approach to novelty detection (ip-BAND) is presented for both specific NE topics and more general topics. Experiments on novelty detection on data from the TREC 2002, 2003 and 2004 novelty tracks show that the proposed approach significantly improves the performance of novelty detection in terms of precision at top ranks. Future research directions are suggested. 2007 Elsevier Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sentence Level Information Patterns for Novelty Detection

SENTENCE LEVEL INFORMATION PATTERNS FOR NOVELTY DETECTION JULY 2006 XIAOYAN LI, B.E. TSINGHUA UNIVERSITY M.E., TSINGHUA UNIVERSITY Ph.D. UNIVERSITY OF MASSACHUSETTS AT AMHERST Directed by: Professor W. Bruce Croft The detection of new information in a document stream is an important component of many potential applications. In this thesis, a new novelty detection approach based on the identific...

متن کامل

Using the author’s comments for knowledge discovery

We present an approach to knowledge discovery based on the author’s comments on pieces of information he conveys in his text. To carry out this task we propose a pattern-matching framework based on syntactic analysis that is able to represent a large variety of expressions that convey the same comment type. Our method has been applied effectively in novelty detection and risk detection tasks. W...

متن کامل

Information theoretic novelty detection

We present a novel approach to online change detection problems when the training sample size is small. The proposed approach is based on estimating the expected information content of a new data point and allows an accurate control of the false positive rate even for small data sets. In the case of the Gaussian distribution, our approach is analytically tractable and closely related to classic...

متن کامل

Novelty detection in wildlife scenes through semantic context modelling

Novelty detection is an important functionality that has found many applications in information retrieval and processing. In this paper we propose a novel framework that deals with novelty detection in multiple-scene image sets. Working with wildlife image data, the framework starts with image segmentation, followed by feature extraction and classification of the image blocks extracted from ima...

متن کامل

Intrusion Detection based on a Novel Hybrid Learning Approach

Information security and Intrusion Detection System (IDS) plays a critical role in the Internet. IDS is an essential tool for detecting different kinds of attacks in a network and maintaining data integrity, confidentiality and system availability against possible threats. In this paper, a hybrid approach towards achieving high performance is proposed. In fact, the important goal of this paper ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Process. Manage.

دوره 44  شماره 

صفحات  -

تاریخ انتشار 2008