Research and Realization about Conversion Algorithm of PDF Format into PS Format
نویسندگان
چکیده
This paper firstly introduces the characteristics of PostScript document and PDF document as the basis, and proposes the necessity and the feasibility of the conversion from the PDF document format to the PostScript language program. Secondly, it studies the main algorithm and technology of the conversion process and realizes the information extraction for PDF document lastly, with achieving the software algorithm for the conversion from PDF document format into PS format on the basis of the
منابع مشابه
PDF2XML: Converting PDF to XML
XML is a markup language for documents containing structured information. It is designed to make it easy to interchange structured documents over the Internet and further integrate them with management database system. PDF is a document format intended to electronically reproduce the look of a page. There is a huge demand of converting existing PDF documents into XML documents, so that they wil...
متن کاملFrom Legacy Documents to XML: A Conversion Framework
We present an integrated framework for the document conversion from legacy formats to XML format. We describe the LegDoC project, aimed at automating the conversion of layout annotations layout-oriented formats like PDF, PS and HTML to semantic-oriented annotations. A toolkit of different components covers complementary techniques the logical document analysis and semantic annotations with the ...
متن کاملReverse Engineering of Network Software Binary Codes for Identification of Syntax and Semantics of Protocol Messages
Reverse engineering of network applications especially from the security point of view is of high importance and interest. Many network applications use proprietary protocols which specifications are not publicly available. Reverse engineering of such applications could provide us with vital information to understand their embedded unknown protocols. This could facilitate many tasks including d...
متن کاملPresentable Document Format: Improved On-demand PDF to HTML Conversion
Search engines such as Google and MSN Search crawl and index files in Adobe’s Portable Document Format (PDF) alongside material in HTML. Google furthermore offers a View as HTML option for PDF that includes query term highlighting. The visual appearance of these HTML files converted from PDF is very poor. In this paper we claim that significant improvements to the quality of on-demand PDF to HT...
متن کاملConversion of TEX fonts into Type 1 format
This paper analyses the problem of converting TEX fonts to Type 1 fonts, describes TEXtrace, a new free conversion program, and compares it to other possible methods and existing utilities. TEXtrace works by rendering the font in high resolution and then tracing (vectorizing) it. keywords: PDF, font conversion, Type1, METAFONT, vector, outline, raster, bitmap,
متن کامل