Tables are among the most informative components of documents, because they are exploited to compactly and intuitively represent data, typically for understandability purposes. The needs are to identify and extract tables from documents, and, on the other hand, to be able to extract the data they contain. The latter task involves the understanding of a table structure. Due to the variability in style, size, and aims of tables, algorithmic approaches to this task can be insufficient, and the exploitation of machine learning systems may represent an effective solution. This paper proposes the exploitation of a first-order logic representation, that is able to capture the complex spatial relationships involved in a table structure, and of a le...
Tables in scientific papers contain a wealth of valuable knowledge for the scientific enterprise. To...
Discovering significant meta-information from document collections is a critical factor for knowledg...
Tables in documents are a widely-available and rich source of information, but not yet well-utilised...
Tables are among the most informative components of documents, because they are exploited to compact...
Table structure recognition (TSR) aims at extracting tables in images into machine-understandable fo...
The paper presents a novel learning-based framework to identify tables from scanned document images....
The current spread of digital documents raised the need of effective content-based retrieval techni...
We are developing a system to perform table image understanding. The recognition problem is to locat...
A fundamental task of document image understanding is to recognize semantically relevant components ...
Document image understanding denotes the recognition of semantically relevant components in the layo...
This paper plans an end-to-end method for extracting information from tables embedded in documents; ...
this paper we describe a system that can analyze a wide variety of printed table formats. The adapta...
Document image understanding refers to logical and semantic analysis of document images in order to ...
We present a methodology for document processing that exploits logic-based machine learning techniqu...
Spreadsheet applications are one of the most used tools for content generation and presentation in i...
Tables in scientific papers contain a wealth of valuable knowledge for the scientific enterprise. To...
Discovering significant meta-information from document collections is a critical factor for knowledg...
Tables in documents are a widely-available and rich source of information, but not yet well-utilised...
Tables are among the most informative components of documents, because they are exploited to compact...
Table structure recognition (TSR) aims at extracting tables in images into machine-understandable fo...
The paper presents a novel learning-based framework to identify tables from scanned document images....
The current spread of digital documents raised the need of effective content-based retrieval techni...
We are developing a system to perform table image understanding. The recognition problem is to locat...
A fundamental task of document image understanding is to recognize semantically relevant components ...
Document image understanding denotes the recognition of semantically relevant components in the layo...
This paper plans an end-to-end method for extracting information from tables embedded in documents; ...
this paper we describe a system that can analyze a wide variety of printed table formats. The adapta...
Document image understanding refers to logical and semantic analysis of document images in order to ...
We present a methodology for document processing that exploits logic-based machine learning techniqu...
Spreadsheet applications are one of the most used tools for content generation and presentation in i...
Tables in scientific papers contain a wealth of valuable knowledge for the scientific enterprise. To...
Discovering significant meta-information from document collections is a critical factor for knowledg...
Tables in documents are a widely-available and rich source of information, but not yet well-utilised...