We consider the following spelling variants clustering problem: Given a list of distinct words, called lexicon, compute (possibly overlapping) clusters of words which are spelling variants of each other. This problem naturally arises in the context of error-tolerant full-text search of the following kind: For a given query, return not only documents matching the query words exactly but also those matching their spelling variants. This is the inverse of the well-known "Did you mean: ... ?" web search engine feature, where the error tolerance is on the side of the query, and not on the side of the documents. We combine various ideas from the large body of literature on approximate string searching and spelling correction techniques to a new a...
Abstract. We introduce a new approach to spellchecking for languages with extreme phonetic irregular...
The paper contains a new text searching method representing modification of the Boyer-Moore algorith...
There are several NLP systems whose ac- curacy depends crucially on finding mis- spellings fast. How...
We consider the following spelling variants clustering problem: Given a list of distinct words, ca...
In this thesis, the following spelling variants clustering problem is considered: Given a list of di...
This paper proposes a new method for approximate string search, specifically candidate generation in...
This paper discusses the issues involved in an information retrieval system when spelling errors are...
In this paper, we study the problem of online spelling correction for query completions. Misspelling...
This paper presents a novel approach to spell checking using dictionary clustering. The main goal is...
Searching for a pattern in a text file is a very common operation in many applications ranging from ...
In computing, spell checking is the process of detecting and sometimes providing spelling suggestion...
Traditional research on spelling correction in natural language processing and infor-mation retrieva...
This paper accounts for a new technique of correcting isolated words in typed texts. A language-depe...
Fast similarity search is important for time-sensitive applications. Those include both enterprise a...
We present TISC, a language-independent and context-sensitive spelling checking and correction syste...
Abstract. We introduce a new approach to spellchecking for languages with extreme phonetic irregular...
The paper contains a new text searching method representing modification of the Boyer-Moore algorith...
There are several NLP systems whose ac- curacy depends crucially on finding mis- spellings fast. How...
We consider the following spelling variants clustering problem: Given a list of distinct words, ca...
In this thesis, the following spelling variants clustering problem is considered: Given a list of di...
This paper proposes a new method for approximate string search, specifically candidate generation in...
This paper discusses the issues involved in an information retrieval system when spelling errors are...
In this paper, we study the problem of online spelling correction for query completions. Misspelling...
This paper presents a novel approach to spell checking using dictionary clustering. The main goal is...
Searching for a pattern in a text file is a very common operation in many applications ranging from ...
In computing, spell checking is the process of detecting and sometimes providing spelling suggestion...
Traditional research on spelling correction in natural language processing and infor-mation retrieva...
This paper accounts for a new technique of correcting isolated words in typed texts. A language-depe...
Fast similarity search is important for time-sensitive applications. Those include both enterprise a...
We present TISC, a language-independent and context-sensitive spelling checking and correction syste...
Abstract. We introduce a new approach to spellchecking for languages with extreme phonetic irregular...
The paper contains a new text searching method representing modification of the Boyer-Moore algorith...
There are several NLP systems whose ac- curacy depends crucially on finding mis- spellings fast. How...