Automatic Creation of Domain Templates

نویسندگان

  • Elena Filatova
  • Vasileios Hatzivassiloglou
  • Kathleen McKeown
چکیده

Recently, many Natural Language Processing (NLP) applications have improved the quality of their output by using various machine learning techniques to mine Information Extraction (IE) patterns for capturing information from the input text. Currently, to mine IE patterns one should know in advance the type of the information that should be captured by these patterns. In this work we propose a novel methodology for corpus analysis based on cross-examination of several document collections representing different instances of the same domain. We show that this methodology can be used for automatic domain template creation. As the problem of automatic domain template creation is rather new, there is no well-defined procedure for the evaluation of the domain template quality. Thus, we propose a methodology for identifying what information should be present in the template. Using this information we evaluate the automatically created domain templates through the text snippets retrieved according to the created templates.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Creation of Domain Templates

Recently, many Natural Language Processing applications have improved the quality of their output by using various machine learning techniques to mine Information Extraction patterns for capturing information from the input text. Currently, to mine IE patterns one should know in advance the type of the information which should be captured by these patterns. In this work we propose a novel metho...

متن کامل

Providing a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance

Introduction: Psychological problems are the result of a person's interaction with the environment and include behaviors that cause social conflicts, dissatisfaction and individual unhappiness. The present study aimed to provide a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance....

متن کامل

Automatic Creation of CV Templates for Formant Type Speech Synthesis Based on HMM-Based Segmentation and Syllable Boundary Detection

An automatic method to create CV forrnantsource templates from continuous speech corpus is proposedfor speechsynthesis,where the boundaries of the CV templates are decided on the basis of the Mahalanobis distance. The synthetic experiments have proved the methodto be useful.

متن کامل

Automatic Extraction of Briefing Templates

An approach to solving the problem of automatic briefing generation from non-textual events can be segmenting the task into two major steps, namely, extraction of briefing templates and learning aggregators that collate information from events and automatically fill up the templates. In this paper, we describe two novel unsupervised approaches for extracting briefing templates from human writte...

متن کامل

Automatic Modulation Recognition using the Discrete Wavelet Transform

An Automatic Modulation Recognition (AMR) process using the Discrete Wavelet Transform (DWT) is presented in this work. The AMR algorithm involves the use of wavelet domain signal templates derived from digitally modulated signals that are used to transmit binary data. The signal templates, locally stored in a receiver, are cross-correlated with the incoming noisy, received signal after it has ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006