The structure of a document contains rich information such as logical relations in context, hierarchy, affiliation, dependence, and applicability. It will greatly affect the accuracy of document information processing, particularly of legal documents and business contracts. Therefore, intelligent document structural analysis is important to information extraction and data mining. However, unlike the well-studied field of text semantic analysis, current work in document structural analysis is still scarce. In this paper, we propose an intelligent document structural analysis framework through data pre-processing, feature engineering, and structural classification with a dynamic sample weighting algorithm. As a typical application, we collect...
Document classification has been involved in a variety of applications, such as phishing and fraud d...
Abstract. The paper describes possible representation models and ways of weighting text documents, w...
Document Structure Analysis and Performance Evaluation by Jisheng Liang Chair of Supervisory Committ...
[[abstract]]Querying a database for document retrieval is often a process close to querying an answe...
During the last decade national archives, libraries, muse-ums and companies started to make their re...
During the last decade national archives, libraries, muse-ums and companies started to make their re...
Analysis of large text data sets is gaining popularity providing the users some insights into their ...
Human beings can extract meaningful information from single documents and can even summarize them de...
A paradigm for the deep content analysis of documents in restricted domains is proposed, along with ...
Abstract—Electronic documents on the Internet are always generated with many kinds of side informati...
Discovering significant meta-information from document collections is a critical factor for knowledg...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In this paper, we present a new ranking algorithm and an intelligent Web search system using data mi...
This paper presents a new research theme at our institute in the field of document engineering; it d...
Nowadays PDF documents have become a dominating knowledge repository for both the academia and indus...
Document classification has been involved in a variety of applications, such as phishing and fraud d...
Abstract. The paper describes possible representation models and ways of weighting text documents, w...
Document Structure Analysis and Performance Evaluation by Jisheng Liang Chair of Supervisory Committ...
[[abstract]]Querying a database for document retrieval is often a process close to querying an answe...
During the last decade national archives, libraries, muse-ums and companies started to make their re...
During the last decade national archives, libraries, muse-ums and companies started to make their re...
Analysis of large text data sets is gaining popularity providing the users some insights into their ...
Human beings can extract meaningful information from single documents and can even summarize them de...
A paradigm for the deep content analysis of documents in restricted domains is proposed, along with ...
Abstract—Electronic documents on the Internet are always generated with many kinds of side informati...
Discovering significant meta-information from document collections is a critical factor for knowledg...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In this paper, we present a new ranking algorithm and an intelligent Web search system using data mi...
This paper presents a new research theme at our institute in the field of document engineering; it d...
Nowadays PDF documents have become a dominating knowledge repository for both the academia and indus...
Document classification has been involved in a variety of applications, such as phishing and fraud d...
Abstract. The paper describes possible representation models and ways of weighting text documents, w...
Document Structure Analysis and Performance Evaluation by Jisheng Liang Chair of Supervisory Committ...