In Information Retrieval (IR), the Dirichlet Priors have been applied to the smoothing technique of the language modeling approach. In this paper, we apply the Dirichlet Priors to the term frequency normalisation of the classical BM25 probabilistic model and the Divergence from Randomness PL2 model. The contributions of this paper are twofold. First, through extensive experiments on four TREC collections, we show that the newly generated models, to which the Dirichlet Priors normalisation is applied, provide robust and effective performance. Second, we propose a novel theoretically-driven approach to the automatic parameter tuning of the Dirichlet Priors normalisation. Experiments show that this tuning approach optimises the retrieval perfo...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...
In Information Retrieval (IR), the Dirichlet Priors have been applied to the smoothing technique of ...
We introduce and create a framework for deriving probabilistic models of Information Retrieval. The ...
The term frequency normalisation parameter tuning is a crucial issue in information retrieval (IR), ...
This thesis devises a novel methodology based on probability theory, suitable for the construction o...
Open access funding provided by Austrian Science Fund (FWF). This research was partly supported by t...
This thesis devises a novel methodology based on probability theory, suitable for the construction o...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
In this thesis, I propose the relative term frequency to be integrated into traditional probabilisti...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...
In Information Retrieval (IR), the Dirichlet Priors have been applied to the smoothing technique of ...
We introduce and create a framework for deriving probabilistic models of Information Retrieval. The ...
The term frequency normalisation parameter tuning is a crucial issue in information retrieval (IR), ...
This thesis devises a novel methodology based on probability theory, suitable for the construction o...
Open access funding provided by Austrian Science Fund (FWF). This research was partly supported by t...
This thesis devises a novel methodology based on probability theory, suitable for the construction o...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
In this thesis, I propose the relative term frequency to be integrated into traditional probabilisti...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Unigram Language modeling is a successful probabilistic framework for Information Retrieval (IR) tha...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...
Implementations of topic models typically use symmetric Dirichlet priors with fixed concentration pa...