Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2004. Includes bibliographical references (p. 49-51). Text documents generally contain two forms of structure: logical structures and physical structures. Loosely speaking, logical structures are sections of text that are both visually and semantically distinct. For example, a document may have an "introduction", a "body", and a "conclusion" as its logical structures. These structures are so named because each section has a distinct purpose in conveying the document's logical arguments or intentions. Perfect machine recognition of logical structures in large collections of documents is an unsolved problem in computational linguistics. This thesis presents evidence that ...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
Identification of a document's discourse structure - what each part contributes to the ideas pres...
Background. In recent years, libraries and archives led important digitisation...
The automated discovery of logical structure in text documents is an important problem that has rece...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
An important aspect of document understanding is document logical structure derivation, which involv...
This thesis is meant as a basis for a semantic annotation manual. The annotation will translate the ...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
Modeling natural human behavior in understanding written language is crucial for developing true art...
Most of the electronic documents available from today's huge number of electronic information sources...
Statistical methods have been widely employed in recent years to grasp many language properties. The...
We present a fully implemented system based on generic document knowledge for detecting the logical ...
This study examines what kind of cues and constraints for discourse interpretation can be derived fr...
This study outlines a two-layered system of text-structure representation for factual English texts....
ISBN: 0-8186-7898-4. In this paper, we present an approach for the logica...