Combining NLP Approaches for Rule Extraction from Legal Documents
نویسندگان
چکیده
Legal texts express conditions in natural language describing what is permitted, forbidden or mandatory in the context they regulate. Despite the numerous approaches tackling the problem of moving from a natural language legal text to the respective set of machine-readable conditions, results are still unsatisfiable and it remains a major open challenge. In this paper, we propose a preliminary approach which combines different Natural Language Processing techniques towards the extraction of rules from legal documents. More precisely, we combine the linguistic information provided by WordNet together with a syntax-based extraction of rules from legal texts, and a logic-based extraction of dependencies between chunks of such texts. Such a combined approach leads to a powerful solution towards the extraction of machine-readable rules from legal documents. We evaluate the proposed approach over the Australian “Telecommunications consumer protections code”.
منابع مشابه
A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model
Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...
متن کاملInformation Extraction Methods and Extraction Techniques in the Chemical Document's Contents: Survey
The volume of electronic documents has rapidly increased and the scientific literature has increased too. These huge documents contain considerable information, but it has to be retrieved and managed in a constructive and useful way. Information Extraction (IE) is the field of extracting useful information using different methods and approaches by means of Natural Language Processing (NLP). Res...
متن کاملNLP-based Ontology Learning from Legal Texts. A Case Study
The paper reports on the methodology and preliminary results of a case study in automatically extracting ontological knowledge from Italian legislative texts in the environmental domain. We use a fully–implemented ontology learning system (T2K) that includes a battery of tools for Natural Language Processing (NLP), statistical text analysis and machine language learning. Tools are dynamically i...
متن کامل(LP ): Rule Induction for Information Extraction Using Linguistic Constraints
Machine learning has been widely used in information extraction from texts in the last years. Two directions of research can be identified: wrapper induction (WI) and NLP-based methodologies. WI techniques have historically made scarce use of linguistic information and their application is mainly limited to rigidly structured documents. NLP-based methodologies tend to be brittle when linguistic...
متن کاملDeveloping NLP Tools for Genome Informatics: An Information Extraction Perspective.
Huge quantities of on-line medical texts such as Medline are available, and we would hope to extract useful information from these resources, as much as possible, hopefully in an automatic way, with the aid of computer technologies. Especially, recent advances in Natural Language Processing (NLP) techniques raise new challenges and opportunities for tackling genome-related on-line text; combini...
متن کامل