TALP at WePS-3 2010

نویسندگان

  • Daniel Ferrés
  • Horacio Rodríguez
چکیده

In this paper we present our system and experiments at the Third Web People Search Workshop (WePS-3) task for clustering web people search documents in English. In our experiments we used a simple approach with three algorithms: Lingo, Hierachical Agglomerative Clustering (HAC), and a 2-step HAC algorithm. We also present the results and initial conclusions in the context of the WePS-3 Task 1 for clustering. We obtained best results with HAC and 2-step HAC algorithms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-document Coreference for WePS

A good clustering performance depends on the quality of the distance function used to asses similarity. In this paper we propose a pairwise document coreference model to improve performance over a wordvector similarity approach for the WePS 3 clustering task. We identify a simple criterion which discriminates between highly ambiguous queries, i.e. many small clusters, and balanced queries, i.e....

متن کامل

WePS-3 Evaluation Campaign: Overview of the Web People Search Clustering and Attribute Extraction Tasks

The third WePS (Web People Search) Evaluation campaign took place in 2009-2010 and attracted the participation of 13 research groups from Europe, Asia and North America. Given the top web search results for a person name, two tasks were addressed: a clustering task, which consists of grouping together web pages referring to the same person, and an extraction task, which consists of extracting s...

متن کامل

SINAI at WePS-3: Online Reputation Management

The online reputation management systems help to the consumers to make buying decisions looking for opinions in the web about many products offered by companies, also interested in the same opinions. This paper presents the system developed by the SINAI research group at the WEPS-3 task, called Online Reputation Management. Given a Twitter entry and a company name, the goal is to decide if the ...

متن کامل

An exploratory analysis of alkaline phosphatase, lactate dehydrogenase, and prostate-specific antigen dynamics in the phase 3 ALSYMPCA trial with radium-223

Background Baseline clinical variables are prognostic for overall survival (OS) in patients with castration-resistant prostate cancer (CRPC). Their prognostic and predictive value with agents targeting bone metastases, such as radium-223, is not established. Patients and methods The radium-223 ALSYMPCA trial enrolled patients with CRPC and symptomatic bone metastases. Prognostic potential of ...

متن کامل

Wild edible plant knowledge, distribution and transmission: a case study of the Achí Mayans of Guatemala

BACKGROUND Knowledge about wild edible plants (WEPs) has a high direct-use value. Yet, little is known about factors shaping the distribution and transfer of knowledge of WEPs at global level and there is concern that use of and knowledge about WEPs is decreasing. This study aimed to investigate the distribution, transmission and loss of traditional ecological knowledge (TEK) concerning WEPs us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010