Context Sensitive Verb Similarity Dataset for Legal Information Extraction

Abstract

Existing literature demonstrates that verbs are pivotal in legal information extraction tasks due to their semantic and argumentative properties. However, granting computers the ability to interpret the meaning of a verb and its semantic properties in relation to a given context can be considered a challenging task, mainly due to the polysemic and domain-specific behaviours of verbs. Therefore, developing mechanisms to identify the behaviours of verbs, and to evaluate how well artificial models detect those polysemic and domain-specific behaviours, is a task of significant importance. In this regard, a comprehensive dataset that can be used as an evaluation resource, as well as a training data set, is a major requirement. In this paper, we introduce LeCoVe, which is a verb similarity dataset intended to facilitate the process of identifying verbs with similar meanings in a legal domain-specific context. Using the dataset, we evaluated both domain-specific and generic embedding models, which were developed using state-of-the-art word representation and language modelling techniques. As part of the experiments carried out using the announced dataset, Sense2Vec and BERT models were trained using a corpus of legal opinion texts in order to capture domain-specific behaviours. In addition to LeCoVe, we demonstrate that a neural network model, which was developed by combining semantic, syntactic, and contextual features that can be obtained from the outputs of embedding models, can perform comparatively well, even in a low resource scenario.
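As an illustration of how a verb similarity dataset of this kind is commonly used to evaluate an embedding model, the sketch below scores verb pairs by cosine similarity and compares the resulting ranking with gold similarity scores using Spearman rank correlation. This is a minimal sketch under stated assumptions, not the evaluation protocol of the paper: the abstract does not name the metric, and the verbs, vectors, scores, and function names (`toy_vectors`, `toy_pairs`, `evaluate`) are hypothetical and are not taken from LeCoVe.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    rx, ry = ranks(xs), ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Hypothetical toy embeddings; a real model would supply these vectors.
toy_vectors = {
    "affirm": [0.9, 0.1, 0.0],
    "uphold": [0.8, 0.2, 0.1],
    "reverse": [-0.7, 0.3, 0.2],
    "remand": [-0.2, 0.9, 0.1],
}

# Hypothetical (verb, verb, gold similarity) triples, not LeCoVe entries.
toy_pairs = [
    ("affirm", "uphold", 0.9),
    ("affirm", "reverse", 0.1),
    ("reverse", "remand", 0.5),
    ("uphold", "remand", 0.3),
]


def evaluate(vectors, pairs):
    """Correlate model cosine similarities with gold similarity scores."""
    predicted = [cosine(vectors[a], vectors[b]) for a, b, _ in pairs]
    gold = [score for _, _, score in pairs]
    return spearman(predicted, gold)


if __name__ == "__main__":
    print(f"Spearman correlation: {evaluate(toy_vectors, toy_pairs):.3f}")
```

A rank-based measure is used here because it only asks whether the model orders the verb pairs the way the annotators did, without requiring cosine values and gold scores to share a scale. Context-sensitive models such as BERT would need one vector per verb occurrence rather than one per verb type, which this sketch does not cover.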

Similar resources

Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations

We present a new large dataset of 12403 context-sensitive verb relations manually annotated via crowdsourcing. These relations capture fine-grained semantic information between verb-centric propositions, such as temporal or entailment relations. We propose a novel semantic verb relation scheme and design a multi-step annotation approach for scaling-up the annotations using crowdsourcing. We emp...

Bootstrapping a Verb Lexicon for Biomedical Information Extraction

The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resou...

Temporal information extraction from legal documents

The aim of this paper is to analyze what kinds of temporal information can be found in different types of legal documents. In particular, it provides a comparison of different legal document types (case law, statute or transactional document) and how one can do further reasoning with the extracted temporal information.

Concept and Context in Legal Information Retrieval

There exist two broad approaches to information retrieval (IR) in the legal domain: those based on manual knowledge engineering (KE) and those based on natural language processing (NLP). The KE approach is grounded in artificial intelligence (AI) and case-based reasoning (CBR), whilst the NLP approach is associated with open domain statistical retrieval. We provide some original arguments regar...

Towards context sensitive information inference

Humans can make hasty, but generally robust judgements about what a text fragment is, or is not, about. Such judgements are termed information inference. By drawing on theories from non-classical logic and applied cognition, an information inference mechanism is proposed which makes inferences via computations of information flow through a high dimensional conceptual space. Within a conceptual ...

Journal

Journal title: Data

Year: 2022

ISSN: 2306-5729

DOI: https://doi.org/10.3390/data7070087