We address the problem of identifying key domain concepts automatically from an unannotated corpus of goal-oriented human-human conversations. We examine two clustering algorithms, one based on mutual information and another one based on Kullback-Liebler distance. In order to compare the results from both techniques quantitatively, we evaluate the outcome clusters against reference concept labels using precision and recall metrics adopted from the evaluation of topic identification task. However, since our system allows more than one cluster to associate with each concept an additional metric, a singularity score, is added to better capture cluster quality. Based on the proposed quality metrics, the results show that Kullback-Liebler-based ...
We present a method based on clustering techniques to detect possible/probable novel concepts or co...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
Abstract. In this paper, we initiate a theoretical study of the problem of clustering data under int...
We address the problem of identifying key domain concepts automatically from an unannotated corpus o...
We address the problem of identifying key domain conceptsautomatically from an unannotated corpus of...
A speech understanding system typically includes a natural lan-guage understanding module that defin...
WordNet are extremely useful. However, they often include many rare senses while missing domain-sp...
Abstract: "The world wide web represents vast stores of information. However, the sheer amount of su...
The problem of identifying clusters arising in the context of topic models and related approaches is...
Abstract. This study proposes a concept extraction and clustering method, which improves Topic Keywo...
Topic detection in dialogue datasets has become a significant challenge for unsupervised and unlabel...
Many businesses in a wide range of industries are collecting and using large sets of text documents...
An automated method for clustering terms/concepts from a set of documents on the same topic was deve...
Clustering is used in information retrieval systems to enhance the efficiency and effectiveness of t...
The rapidly growing market demand for dialogue agents capable of goal-oriented behavior has caused m...
We present a method based on clustering techniques to detect possible/probable novel concepts or co...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
Abstract. In this paper, we initiate a theoretical study of the problem of clustering data under int...
We address the problem of identifying key domain concepts automatically from an unannotated corpus o...
We address the problem of identifying key domain conceptsautomatically from an unannotated corpus of...
A speech understanding system typically includes a natural lan-guage understanding module that defin...
WordNet are extremely useful. However, they often include many rare senses while missing domain-sp...
Abstract: "The world wide web represents vast stores of information. However, the sheer amount of su...
The problem of identifying clusters arising in the context of topic models and related approaches is...
Abstract. This study proposes a concept extraction and clustering method, which improves Topic Keywo...
Topic detection in dialogue datasets has become a significant challenge for unsupervised and unlabel...
Many businesses in a wide range of industries are collecting and using large sets of text documents...
An automated method for clustering terms/concepts from a set of documents on the same topic was deve...
Clustering is used in information retrieval systems to enhance the efficiency and effectiveness of t...
The rapidly growing market demand for dialogue agents capable of goal-oriented behavior has caused m...
We present a method based on clustering techniques to detect possible/probable novel concepts or co...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
Abstract. In this paper, we initiate a theoretical study of the problem of clustering data under int...