Traditional information retrieval typically represents data using a bag of words; data mining typically uses a highly structured database ontology. This paper explores the a middle ground we term entity models, in which questions about structured data may be posed and answered, but the complexities and task-specific restrictions of ontologies are avoided. An entity model is a language model or word distribution associated with an entity, such as a person, place or organization. Using these per-entity language models, entities may be clustered, links may be detected or described with a short summary, entities may be collectively classified, and question answering may be performed. On a corpus of entities extracted from newswire and the Web, ...
International audienceIn recent years, several knowledge bases have been built to enable large-scale...
Representing information is a key challenge for all applications that process and organize documents...
Fang, HuiIn the past decade, the prosperity of the World Wide Web has led to fast explosion of info...
Traditional information retrieval typically represents data using a bag of words; data mining typica...
Thesis (Ph.D.)--University of Washington, 2019Real world entities such as people, organizations and ...
The “big data” era is characterized by an explosion of information in the form of digital data colle...
The exponential growth of digital information available in companies and on the web creates the need...
Abstract—Topic models, which factor each document into different topics and represent each topic as ...
Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is n...
The “big data ” era is characterized by an explosion of infor-mation in the form of digital data col...
Entity Recognition (ER) can be used as a method for extracting information about socio-technical sys...
Entity retrieval is the problem of finding information about a given real-world entity (e.g., direct...
Thesis (Ph.D.)--University of Washington, 2015-12With the advent of the Web, textual information has...
A data model, called the entity-relationship model, is proposed. This model incorporates some of the...
A central aspect of natural language understanding consists of linking information across multiple s...
International audienceIn recent years, several knowledge bases have been built to enable large-scale...
Representing information is a key challenge for all applications that process and organize documents...
Fang, HuiIn the past decade, the prosperity of the World Wide Web has led to fast explosion of info...
Traditional information retrieval typically represents data using a bag of words; data mining typica...
Thesis (Ph.D.)--University of Washington, 2019Real world entities such as people, organizations and ...
The “big data” era is characterized by an explosion of information in the form of digital data colle...
The exponential growth of digital information available in companies and on the web creates the need...
Abstract—Topic models, which factor each document into different topics and represent each topic as ...
Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is n...
The “big data ” era is characterized by an explosion of infor-mation in the form of digital data col...
Entity Recognition (ER) can be used as a method for extracting information about socio-technical sys...
Entity retrieval is the problem of finding information about a given real-world entity (e.g., direct...
Thesis (Ph.D.)--University of Washington, 2015-12With the advent of the Web, textual information has...
A data model, called the entity-relationship model, is proposed. This model incorporates some of the...
A central aspect of natural language understanding consists of linking information across multiple s...
International audienceIn recent years, several knowledge bases have been built to enable large-scale...
Representing information is a key challenge for all applications that process and organize documents...
Fang, HuiIn the past decade, the prosperity of the World Wide Web has led to fast explosion of info...