Improving Logical Structure Analysis of Visually Structured Documents with Textual Features
Authors
Abstract
Similar resources
Logical Structure Analysis and Generation for Structured Documents: A Syntactic Approach
This paper presents a syntactic method for sophisticated logical structure analysis that transforms document images with multiple pages and hierarchical structure into an electronic document based on SGML/XML. To produce a logical structure more accurately and quickly than previous work, whose basic units are text lines, the proposed parsing method takes text regions with hierarchical st...
Grammatical Approach for the Physical and Logical Structure of Documents Analysis; Application to Summary Documents
This paper deals with the use of a grammatical formalism to recognize the physical and logical structures of a composite document. We propose a new system for document recognition and analysis. The aim of our research is to create a document structuring system by using a two-level grammar. A two-level grammar constitutes a high-level formalism of expression and structuration in document analys...
Document Structure Analysis Based on Layout and Textual Features
Document image processing is a crucial process in office automation and begins at the 'OCR' phase, with the difficulty of document 'analysis' and 'understanding'. This paper presents a hybrid and comprehensive approach to document structure analysis. It is hybrid in the sense that it makes use of layout (geometrical) as well as textual features of a given document. These features are the base f...
Recognising Textual Entailment with Robust Logical Inference
We use logical inference techniques for recognising textual entailment, with theorem proving operating on deep semantic interpretations as the backbone of our system. However, the performance of theorem proving on its own turns out to be highly dependent on a wide range of background knowledge, which is not necessarily included in publicly available knowledge sources. Therefore, we achieve ro...
Summarisation of the logical structure of XML documents
Summarisation is traditionally used to produce summaries of the textual contents of documents. In this paper, it is argued that summarisation methods can also be applied to the logical structure of XML documents. Structure summarisation selects the most important elements of the logical structure and ensures that the user’s attention is focused towards sections, subsections, etc. that are belie...
Journal
Journal title: Computer Science and Information Systems (FedCSIS), 2019 Federated Conference on
Year: 2022
ISSN: 2300-5963
DOI: https://doi.org/10.15439/2022r26