Extraction of Logical Structure from Articles in Mathematics
نویسندگان
چکیده
We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this method in INFTY which is an integrated OCR system for mathematical documents. In order to show the effectiveness of the method we made a correct database from an existing mathematical OCR database, and made an experiment.
منابع مشابه
An Investigation of the Generic Features of Research Articles Published in the Bulletin of Iranian Mathematical Society
In light of the understanding that the analysis of the generic features of different academic genres can enhance the ability of non-native members of academic discourse communities to understand, and where appropriate, to produce them, the present study aimed at investigating the dominant generic structure of research articles in mathematics. To start with a relatively narrow focus, a corpus of...
متن کاملA duality between LM-fuzzy possibility computations and their logical semantics
Let X be a dcpo and let L be a complete lattice. The family σL(X) of all Scott continuous mappings from X to L is a complete lattice under pointwise order, we call it the L-fuzzy Scott structure on X. Let E be a dcpo. A mapping g : σL(E) −> M is called an LM-fuzzy possibility valuation of E if it preserves arbitrary unions. Denote by πLM(E) the set of all LM-fuzzy possibility valuations of E. T...
متن کاملText Type Structure And Logical Document Structure
Most research on automated categorization of documents has concentrated on the assignment of one or many categories to a whole text. However, new applications, e.g. in the area of the Semantic Web, require a richer and more fine-grained annotation of documents, such as detailed thematic information about the parts of a document. Hence we investigate the automatic categorization of text segments...
متن کاملروش جدید متنکاوی برای استخراج اطلاعات زمینه کاربر بهمنظور بهبود رتبهبندی نتایج موتور جستجو
Today, the importance of text processing and its usages is well known among researchers and students. The amount of textual, documental materials increase day by day. So we need useful ways to save them and retrieve information from these materials. For example, search engines such as Google, Yahoo, Bing and etc. need to read so many web documents and retrieve the most similar ones to the user ...
متن کاملA new method for ranking of Z-numbers
In this paper we propose a new method for ranking Z- numbers and generalizations. This method is based on the internal structure of the artificial neural network, which suggests that the structure of this network consists of inputs weights and the transfer function linear, nonlinear and sometimes linear and nonlinear. It is shown that the proposed method while possessing the ranking properties ...
متن کامل