Genre provides a characterization of a document with respect to its form or functional trait. Genre is orthogonal to topic, rendering genre information a powerful filter technology for information seekers in digital libraries. However, an efficient means for genre classification is an open and controversially discussed issue. This paper gives an overview and presents new results related to automatic genre classification of text documents. We present a comprehensive survey which contrasts the genre retrieval models that have been developed for Web and non-Web corpora. With the concept of genre-specific core vocabularies the paper provides an original contribution related to computational aspects and classification performance of genre retrie...
The categorization of documents is traditionally topic-based. This paper presents a complementary an...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...
Abstract. Genre provides a characterization of a document with respect to its form or functional tra...
This paper examines automated genre classification of text documents and its role in enabling the ef...
Genre characterizes text differently than the usual subject or prepositional content that has been t...
This paper examines automated genre classification of text documents and its role in enabling the ef...
We discuss the issues of resolving the information-retrieval problem in large digital collections th...
Retrieving relevant documents over the Web is an over-whelming task when search engines return thous...
Genre classification (e.g. whether a document is a scientific article or magazine article) is closel...
This thesis treats the sociotechnical notion of genre as a conflation of a communicative situation a...
Abstract. The massive amount of textual data on the Web raises nu-merous classification problems. Al...
Master of ScienceDepartment of Computing and Information SciencesWilliam HsuThis thesis examines aut...
This paper describes the KRYS I corpus, consisting of documents classified into 70 genre classes. It...
This thesis aims at examining to what extent a few, algorithmically very easily extractable document...
The categorization of documents is traditionally topic-based. This paper presents a complementary an...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...
Abstract. Genre provides a characterization of a document with respect to its form or functional tra...
This paper examines automated genre classification of text documents and its role in enabling the ef...
Genre characterizes text differently than the usual subject or prepositional content that has been t...
This paper examines automated genre classification of text documents and its role in enabling the ef...
We discuss the issues of resolving the information-retrieval problem in large digital collections th...
Retrieving relevant documents over the Web is an over-whelming task when search engines return thous...
Genre classification (e.g. whether a document is a scientific article or magazine article) is closel...
This thesis treats the sociotechnical notion of genre as a conflation of a communicative situation a...
Abstract. The massive amount of textual data on the Web raises nu-merous classification problems. Al...
Master of ScienceDepartment of Computing and Information SciencesWilliam HsuThis thesis examines aut...
This paper describes the KRYS I corpus, consisting of documents classified into 70 genre classes. It...
This thesis aims at examining to what extent a few, algorithmically very easily extractable document...
The categorization of documents is traditionally topic-based. This paper presents a complementary an...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...
We report on our ongoing study of using the genre of Web pages to facilitate information exploration...