We present two novel models of document coherence and their application to information retrieval (IR). Both models approximate document coherence using discourse entities, e.g. the subject or object of a sentence. Our first model views text as a Markov process generating sequences of discourse entities (entity n-grams); we use the entropy of these entity n-grams to approximate the rate at which new information appears in text, reasoning that as more new words appear, the topic increasingly drifts and text coherence decreases. Our second model extends the work of Guinaudeau & Strube [28] that represents text as a graph of discourse entities, linked by different relations, such as their distance or adjacency in text. We use several graph ...
Discourse cohesion facilitates text comprehension and helps the reader form a coherent narrative. In...
This paper introduces an objective metric for assessing the effectiveness of a parsing scheme. Infor...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
We present two novel models of document coherence and their application to information retrieval (IR...
International audienceDocument coherence describes how much sense text makes in terms of its logical...
In this paper we argue that coherence relations between discourse units are ultimately based on ment...
Coherence that ties sentences of a text into a meaningfully connected structure is of great importan...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, Febru...
Discourse coherence is an important aspect of text quality that refers to the way different textual ...
This paper describes how discursive knowledge, given by the discursive theories RST (Rhetorical Stru...
This paper presents research that connects the cohesion structure of a text to the derivation of its...
halle.de Quantifying the coherence of a set of statements is a long standing problem with many poten...
Abstract—In this paper, we propose an algorithm called coher-ence hidden Markov model (HMM) to extra...
Topic models arise from the need of understanding and exploring large text document collections and...
International audienceText generation applications such as machine translation and automatic summari...
Discourse cohesion facilitates text comprehension and helps the reader form a coherent narrative. In...
This paper introduces an objective metric for assessing the effectiveness of a parsing scheme. Infor...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
We present two novel models of document coherence and their application to information retrieval (IR...
International audienceDocument coherence describes how much sense text makes in terms of its logical...
In this paper we argue that coherence relations between discourse units are ultimately based on ment...
Coherence that ties sentences of a text into a meaningfully connected structure is of great importan...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, Febru...
Discourse coherence is an important aspect of text quality that refers to the way different textual ...
This paper describes how discursive knowledge, given by the discursive theories RST (Rhetorical Stru...
This paper presents research that connects the cohesion structure of a text to the derivation of its...
halle.de Quantifying the coherence of a set of statements is a long standing problem with many poten...
Abstract—In this paper, we propose an algorithm called coher-ence hidden Markov model (HMM) to extra...
Topic models arise from the need of understanding and exploring large text document collections and...
International audienceText generation applications such as machine translation and automatic summari...
Discourse cohesion facilitates text comprehension and helps the reader form a coherent narrative. In...
This paper introduces an objective metric for assessing the effectiveness of a parsing scheme. Infor...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...