This thesis investigates an approach to exploiting long context using information about distance and occurrence. By modeling distance and occurrence as a joint event, the approach attempts to incorporate their inter-dependencies into the model, so that information captured from the long context can be used more effectively. It thereby addresses a problem with conventional language modeling approaches, which tend to neglect these inter-dependencies. Based on the proposed approach, a novel language model, referred to as the term-distance term-occurrence (TDTO) model, is formulated. The TDTO model estimates probabilities based on the events of term-distance (TD) and term-occurrence (TO) that correspond to the distances a...
Statistical language modeling is one of the fundamental problems in natural language processing. In ...
We describe an extension to the use of Latent Semantic Analysis (LSA) for language modeling. This te...
Natural Language Processing (NLP) is a sub-field of Artificial Intelligence (AI) that allows machine...
The increasingly widespread adoption of large language models has highlighted the need for improving...
Conference with proceedings and peer review. This paper deals with the use of a stochastic language mode...
This paper presents an extensive empirical study on two language modeling techniques, linguistically...
Recent works on word representations mostly rely on predictive models. Distributed word representati...
Natural language is rich and varied, but also highly structured. The rules of grammar are a primary...
Virtually any modern speech recognition system relies on count-based language models. In this thesis...
Copyright © 1998 by The Association for Computational Linguistics. The paper presents a language mode...
N-gram modeling is a simple approach to language modeling and has been widely used in many applications. However...
A new language model for speech recognition inspired by linguistic analysis is presented. The model ...
International audience. This study examines how to take advantage, in an original way, of distant information ...
This paper describes ongoing work on a new approach for language modeling for large vocabul...