This dissertation explores data-driven methodology of finding recurrent structure withinand between languages. The goal is to develop a method that is able to account for variation in the language data more accurately, as well as detect subtle regularities that are difficult to detect by traditional means. The dissertation specifically deals with clause linkageconstructions as a case study, since this is a particularly complex area of grammar whichis closely tied to discourse patterns. The proposed method is to annotate language corporafor form and meaning structures and subsequently to explore the emerging correlationsusing a custom data mining algorithm. Particular attention is given to elaboration of theformal models used to annotate mea...
This is the author accepted manuscript. The final version is available from Springer Nature via the ...
This research applies an association rule mining technique to purely syntactic dialect data. The pap...
This paper reports experiments in the automatic discovery of linguistically significant regularities...
This dissertation explores data-driven methodology of finding recurrent structure withinand between ...
In this paper, we propose a novel approach to address the newly defined needs of linguistic typology...
We develop an aggregate measure of syntactic difference for automatically finding common syntactic d...
In this work, we discuss the benefits of using automatically parsed corpora to study language variat...
International audienceIn this paper, we present a method based on data mining techniques to automati...
We develop an aggregate measure of syn-tactic difference for automatically finding typical syntactic...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
Corpus-based studies have become increasingly common in linguistic typology over recent years, amoun...
This dissertation centers around the question whether syntactic differences between languages can be...
We analyze the linguistic evolution of selected scientific disciplines over a 30-year time span (197...
International audienceIn this paper, we study the use of data mining techniques for stylistic analys...
Corpora, i.e. collections of linguistic data (texts or conversations), are a fundamental asset of di...
This is the author accepted manuscript. The final version is available from Springer Nature via the ...
This research applies an association rule mining technique to purely syntactic dialect data. The pap...
This paper reports experiments in the automatic discovery of linguistically significant regularities...
This dissertation explores data-driven methodology of finding recurrent structure withinand between ...
In this paper, we propose a novel approach to address the newly defined needs of linguistic typology...
We develop an aggregate measure of syntactic difference for automatically finding common syntactic d...
In this work, we discuss the benefits of using automatically parsed corpora to study language variat...
International audienceIn this paper, we present a method based on data mining techniques to automati...
We develop an aggregate measure of syn-tactic difference for automatically finding typical syntactic...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
Corpus-based studies have become increasingly common in linguistic typology over recent years, amoun...
This dissertation centers around the question whether syntactic differences between languages can be...
We analyze the linguistic evolution of selected scientific disciplines over a 30-year time span (197...
International audienceIn this paper, we study the use of data mining techniques for stylistic analys...
Corpora, i.e. collections of linguistic data (texts or conversations), are a fundamental asset of di...
This is the author accepted manuscript. The final version is available from Springer Nature via the ...
This research applies an association rule mining technique to purely syntactic dialect data. The pap...
This paper reports experiments in the automatic discovery of linguistically significant regularities...