Managing large document databases has become an important task. The ability to automatically compare document layouts, and to classify and search documents by their visual appearance, is desirable in many applications. We propose a new algorithm that calculates a similarity function between documents based on their visual appearance alone, without taking their content into consideration. A user may wish to search a database for documents that are similar to a query in terms of stylistic features, or may want to browse the whole database. In these tasks, clustering similar documents and organizing the document database with respect to the clusters is prefe...
In this work, four major components of an image database have been examined: image similarity, search-b...
document repurposing, hierarchical metrics and structure, typography. From the least to most prominen...
Figure 1 (panels: query; top-5 stylistically similar infographics): Infographics combine text, charts and image...
This paper describes the development of a new document ranking system based on layout similarity. Th...
Abstract—In this paper, we describe issues related to the measurement of structural similarity betwe...
In the present work we study visual comparison of texts, especially by finding similarity in text doc...
As a fundamental task, document similarity measurement has a broad impact on document-based classification...
The objective of document clustering techniques is to assemble similar documents and segregate dissi...
Determining the reading order for layout components extracted from a document image can be a crucial...
Searching in a large heterogeneous collection of scanned document images often produce...
Abstract: Clustering is a technique of collecting data into subsets in such a manner that identical ...
Determining the similarity of document images is an important first step for several document retrie...
Abstract. The mathematical concept of document resemblance captures well the informal notion of syn...
Accurately measuring document similarity is important for many text applications, e.g. document simi...