6 . 863 Final Project Writeup

نویسنده

  • Mark Richardson
چکیده

The goal of this system is to provide a measure of semantic similarity between any two verbs. The VerbOcean dataset contains various semantic relationships for a set of 3477 unique verbs. The pairwise relationships between verbs contained in VerbOcean were used to create a graph of semantic relations between many verbs. A tree traversal algorithm was implemented so that, given a starting word and a target word, various semantic paths from the starting word to the target word could be found. The VerbOcean dataset was generated by searching for the number of web pages which contained pairs of verbs in various simple syntaxes. The relationships incorporated into VerbOcean are similarity, strength, antonymy, enablement, and precedence. Similarity between two verbs was found by searching for instances of ‘to X and Y’ or ‘Xed and Yed’, while precedence was found by searching for ‘to X and then Y’ or ‘Xed and then Yed’. The number of results found (using Google) corresponded to the strength of a particular relationship between two verbs. The score of the shortest semantic path between two verbs intuitively gives a measure of how closely related those two verbs should be.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CS380L Project Writeup: Distributed Completion Service

Task parallelism is difficult to implement in a distributed setting due to machine unreliability and communication latency. HTCondor, an existing distributed computation framework, is insufficient for addressing these shortcomings. In this report, we present a high level abstraction built on top of HTCondor called the Distributed Completion Service (DCS). The DCS uses multiple different methods...

متن کامل

Process and Pitfalls in Writing Information Visualization Research Papers

The goal of this paper is to help authors recognize and avoid a set of pitfalls that recur in many rejected information visualization papers, using a chronological model of the research process. Selecting a target paper type in the initial stage can avert an inappropriate choice of validation methods. Pitfalls involving the design of a visual encoding may occur during the middle stages of a pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012