Conceptualization seeks to map a short text (i.e., a word or a phrase) to a set of concepts as a mecha-nism of understanding text. Most of prior research in conceptualization uses human-crafted knowl-edge bases that map instances to concepts. Such approaches to conceptualization have the limitation that the mappings are not context sensitive. To overcome this limitation, we propose a framework in which we harness the power of a probabilis-tic topic model which inherently captures the se-mantic relations between words. By combining la-tent Dirichlet allocation, a widely used topic model with Probase, a large-scale probabilistic knowledge base, we develop a corpus-based framework for context-dependent conceptualization. Through this simple bu...
This paper studies how to incorporate the ex-ternal word correlation knowledge to improve the cohere...
Measuring the similarity of short written contexts is a fundamental problem in Natural Language Proc...
Published by De Gruyter Open under the Creative Commons Attribution 3.0 License.We investigate the r...
Statistical topic models provide a general data-driven framework for automated discovery of high-lev...
Topics in semantic representation 1 Topics in semantic representation 2 Accounts of language process...
Recent theories of cognition recognize the importance of context in cognitive tasks. However, many t...
We develop latent Dirichlet allocation with WORDNET (LDAWN), an unsupervised probabilistic topic mod...
textComputer SciencesIn order to respond to increasing demand for natural language interfaces---and ...
Abstract In the era of Internet of “everything”, the natural language text is still the undiscussed ...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
It is proposed that concepts contain two types of properties. Context-independent properties are act...
This thesis presents work on learning representations of text and Knowledge Bases by taking into con...
We address two challenges in topic models: (1) Context information around words helps in determining...
Understanding verbs is essential for many natural language tasks. Tothis end, large-scale lexical re...
Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both ...
This paper studies how to incorporate the ex-ternal word correlation knowledge to improve the cohere...
Measuring the similarity of short written contexts is a fundamental problem in Natural Language Proc...
Published by De Gruyter Open under the Creative Commons Attribution 3.0 License.We investigate the r...
Statistical topic models provide a general data-driven framework for automated discovery of high-lev...
Topics in semantic representation 1 Topics in semantic representation 2 Accounts of language process...
Recent theories of cognition recognize the importance of context in cognitive tasks. However, many t...
We develop latent Dirichlet allocation with WORDNET (LDAWN), an unsupervised probabilistic topic mod...
textComputer SciencesIn order to respond to increasing demand for natural language interfaces---and ...
Abstract In the era of Internet of “everything”, the natural language text is still the undiscussed ...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
It is proposed that concepts contain two types of properties. Context-independent properties are act...
This thesis presents work on learning representations of text and Knowledge Bases by taking into con...
We address two challenges in topic models: (1) Context information around words helps in determining...
Understanding verbs is essential for many natural language tasks. Tothis end, large-scale lexical re...
Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both ...
This paper studies how to incorporate the ex-ternal word correlation knowledge to improve the cohere...
Measuring the similarity of short written contexts is a fundamental problem in Natural Language Proc...
Published by De Gruyter Open under the Creative Commons Attribution 3.0 License.We investigate the r...