Finding suitable, less space consuming views for a document s main content is crucial to provide convenient access to large document collections on display devices of different size. We present a novel compact visualization which represents the document s key semantic as a mixture of images and important key terms, similar to cards in a top trumps game. The key terms are extracted using an advanced text mining approach based on a fully automatic document structure extraction. The images and their captions are extracted using a graphical heuristic and the captions are used for a semi-semantic image weighting. Furthermore, we use the image color histogram for classification and show at least one representative from each non-empty image class....
OCEAN is a tool for a posteriori visual data mining that uses the output of a text miner to help use...
Document image classification is an important step in document image analysis. Based on classificati...
No existing document image understanding technology, whether experimental or commercially available,...
Documents appear to us regularly in daily life in various designs and lengths to serve different pur...
International audienceDocument image classification is an important step in document image analysis....
International audienceDocument image classification is an important step in document image analysis....
Abstract. Document image classification is an important step in document image analysis. Based on cl...
Identifying and extracting figures and tables along with their captions from scholarly articles is i...
Knowledge extraction from detected document image is a complex problem in the field of information t...
Even though the digital processing of documents is increasingly widespread in industry, printed docu...
We present an analysis and visualization method for computing what distinguishes a given document co...
The paper introduces a descriptive data mining method to discover knowledge for the task of automati...
A vast amount of digital document material is continuously being produced as part of major digitizat...
International audienceDocuments exist in different formats. When we have document images, in order t...
In the last years, the spread of computers and the Internet have caused a significant amount of docu...
OCEAN is a tool for a posteriori visual data mining that uses the output of a text miner to help use...
Document image classification is an important step in document image analysis. Based on classificati...
No existing document image understanding technology, whether experimental or commercially available,...
Documents appear to us regularly in daily life in various designs and lengths to serve different pur...
International audienceDocument image classification is an important step in document image analysis....
International audienceDocument image classification is an important step in document image analysis....
Abstract. Document image classification is an important step in document image analysis. Based on cl...
Identifying and extracting figures and tables along with their captions from scholarly articles is i...
Knowledge extraction from detected document image is a complex problem in the field of information t...
Even though the digital processing of documents is increasingly widespread in industry, printed docu...
We present an analysis and visualization method for computing what distinguishes a given document co...
The paper introduces a descriptive data mining method to discover knowledge for the task of automati...
A vast amount of digital document material is continuously being produced as part of major digitizat...
International audienceDocuments exist in different formats. When we have document images, in order t...
In the last years, the spread of computers and the Internet have caused a significant amount of docu...
OCEAN is a tool for a posteriori visual data mining that uses the output of a text miner to help use...
Document image classification is an important step in document image analysis. Based on classificati...
No existing document image understanding technology, whether experimental or commercially available,...