summary:In most cases the current on-line journals in mathematics are supplied in the form of PDF with print images of papers in the front and OCR’ed hidden texts behind to provide with search facilily using key words. The embedded hidden texts usually does not include good information about mathematical formulae in the papers. We can say that, for the future development of DML, it is desirable to include, in the digitised journals, more structured information of the content of mathematical papers, e.g. tag information to indicate logical structure of papers such as headings of sections, definitions, theorems, lemmas, etc., together with mathematical formulae structures included. In the talk, I will present the current stage of our technolo...
We present an approach to extracting mathematical formulae directly from PDF documents. We exploit b...
summary:Earlier work has examined the frequency of symbol and expression use in mathematical documen...
Paper discusses the needs for data normalization in a Digital Mathematics Library (DML). Specificall...
summary:In most cases the current on-line journals in mathematics are supplied in the form of PDF wi...
Many approaches have been proposed for the recognition of mathematical formulae, traditionally using...
summary:We report on a new project to design a semantic ground truth set for mathematical document a...
Full-text indexing of documents containing mathematics cannot be considered a complete success unles...
summary:The quality of digital mathematical library depends on the formats and quality of data it of...
While a number of techniques have been developed for table recognition in ordinary text documents, w...
summary:Experience in setting up a workflow from scanned images of mathematical papers into a fully ...
summary:As more and more scientific documents become available in PDF format, their automatic analys...
summary:We present a progress report on our ongoing project of reverse engineering scientific PDF do...
We describe the design of document analysis procedures to separate mathematics from ordinary text on...
AbstractMathematical texts can be computerized in many ways that capture differing amounts of the ma...
Abstract. Many approaches have been proposed over the years for the recognition of mathematical form...
We present an approach to extracting mathematical formulae directly from PDF documents. We exploit b...
summary:Earlier work has examined the frequency of symbol and expression use in mathematical documen...
Paper discusses the needs for data normalization in a Digital Mathematics Library (DML). Specificall...
summary:In most cases the current on-line journals in mathematics are supplied in the form of PDF wi...
Many approaches have been proposed for the recognition of mathematical formulae, traditionally using...
summary:We report on a new project to design a semantic ground truth set for mathematical document a...
Full-text indexing of documents containing mathematics cannot be considered a complete success unles...
summary:The quality of digital mathematical library depends on the formats and quality of data it of...
While a number of techniques have been developed for table recognition in ordinary text documents, w...
summary:Experience in setting up a workflow from scanned images of mathematical papers into a fully ...
summary:As more and more scientific documents become available in PDF format, their automatic analys...
summary:We present a progress report on our ongoing project of reverse engineering scientific PDF do...
We describe the design of document analysis procedures to separate mathematics from ordinary text on...
AbstractMathematical texts can be computerized in many ways that capture differing amounts of the ma...
Abstract. Many approaches have been proposed over the years for the recognition of mathematical form...
We present an approach to extracting mathematical formulae directly from PDF documents. We exploit b...
summary:Earlier work has examined the frequency of symbol and expression use in mathematical documen...
Paper discusses the needs for data normalization in a Digital Mathematics Library (DML). Specificall...