Harvesting Related Entities with a Search Engine

نویسندگان

  • Shu-Qi Sun
  • Shiqi Zhao
  • Muyun Yang
  • Haifeng Wang
  • Sheng Li
چکیده

This paper addresses the problem of related entity extraction and focuses on extracting related persons as a case study. The proposed method builds on a search engine. Specifically, we mine candidate related persons for a query person q using q’s search results and the query logs containing q. The acquired candidates are then automatically rated and ranked using a SVM regression model that investigates multiple features. Experimental results on a set of 200 randomly sampled query persons show that the precision of the extracted top-1, 5, and 10 related persons exceeds 91%, 90%, and 84%, respectively, which significantly outperforms a state-ofthe-art baseline.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore

Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...

متن کامل

Explaining relationships between entities

Modern search engines are increasingly aiming to understand users’ intent in order to answer information needs more effectively by providing richer information than the traditional “ten blue links”. This information might include context about the entities present in the query, direct answers to questions that concern entities and more. A recent trend when answering queries that refer to a sing...

متن کامل

Identifying the Names of Complex Search Tasks with Task-Related Entities

Conventional search engines usually consider a search query corresponding only to a simple task. Nevertheless, due to the explosive growth of web usage in recent years, more and more queries are driven by complex tasks. A complex task may consist of multiple sub-tasks. To accomplish a complex task, users may need to obtain information of various task-related entities corresponding to the sub-ta...

متن کامل

Measuring the Weight of Relations Between Entities

Extracting relations among entities is an active research area of Semantic Web studies related to semantic research and information inference. Although many studies have proposed extraction of large-scale relational data, how to weight each relation has not been well studied. Intuitively, a relation between two entities might be more important than relations between other entities. Therefore, t...

متن کامل

Entity-oriented Search Engine Result Pages

Modern search engine result pages often contain a mixture of results from structured and unstructured sources. Where such mixtures of structured and unstructured information are called for, the state-of-the-art is to organize complex search engine result pages around entities. Generating such a mixture of entity-oriented results in response to a traditional keyword query raises a number of inte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011