AbstractRecurring sequences of words have long been considered as a signifier of different genres and registers by corpus linguists. The previous research mainly focused on lexical n-grams. Nevertheless, n-grams of other linguistic features, such as part-of-speech, have been less studied. The current study is expected to examine whether n-grams of part-of-speech tags extracted from a large corpus can be a discriminator of different genres. The results show that a strong correlation exists between the information about n-grams of part-of-speech tags and the genre of the text
Statistical n-gram language modeling is used in many domains like speech recognition, language ident...
This paper examines automated genre classification of text documents and its role in enabling the ef...
In this paper we describe some explorations of the potential of genre-revealing features on automati...
AbstractRecurring sequences of words have long been considered as a signifier of different genres an...
Human communicative practices are organized in terms of genres, and people are highly skilled at rec...
International audienceIn this chapter, it is shown how we can develop a new type of learner’s or stu...
In this paper, we study the effect of using n-grams (sequences of words of length n) for text catego...
Text genre classification is the process of identifying functional characteristics of text documents...
Recent linguistic studies have shown that proper use of multi-word expressions, or n-grams, is quite...
Quantitative analysis of literary style has heretofore utilized semantic elements-word counts. This...
The article discusses the theoretical and practical problems related to the study of speech genres o...
Psycholinguistics has traditionally been defined as the study of how we process units of language su...
ABSTRACT This paper describes a method of comparing routine language use in different corpora, and p...
This chapter deals with a study of personal pronouns in context and aims at revealing trends in popu...
Multiword expressions (MWEs) are words that co-occur so often that they are perceived as a linguisti...
Statistical n-gram language modeling is used in many domains like speech recognition, language ident...
This paper examines automated genre classification of text documents and its role in enabling the ef...
In this paper we describe some explorations of the potential of genre-revealing features on automati...
AbstractRecurring sequences of words have long been considered as a signifier of different genres an...
Human communicative practices are organized in terms of genres, and people are highly skilled at rec...
International audienceIn this chapter, it is shown how we can develop a new type of learner’s or stu...
In this paper, we study the effect of using n-grams (sequences of words of length n) for text catego...
Text genre classification is the process of identifying functional characteristics of text documents...
Recent linguistic studies have shown that proper use of multi-word expressions, or n-grams, is quite...
Quantitative analysis of literary style has heretofore utilized semantic elements-word counts. This...
The article discusses the theoretical and practical problems related to the study of speech genres o...
Psycholinguistics has traditionally been defined as the study of how we process units of language su...
ABSTRACT This paper describes a method of comparing routine language use in different corpora, and p...
This chapter deals with a study of personal pronouns in context and aims at revealing trends in popu...
Multiword expressions (MWEs) are words that co-occur so often that they are perceived as a linguisti...
Statistical n-gram language modeling is used in many domains like speech recognition, language ident...
This paper examines automated genre classification of text documents and its role in enabling the ef...
In this paper we describe some explorations of the potential of genre-revealing features on automati...