When performing queries for text analytics on unstructured text data, a large amount of the processing time is spent on regular expressions and dictionary matching. In this paper we present a compilable architecture for token-bound pat-tern matching with support for token pattern sequence detec-tion. The architecture presented is capable of detecting sev-eral hundreds of dictionaries, each containing thousands of elements at high throughput. A programmable state machine is used as pattern detection engine to achieve deterministic performance while maintaining low storage requirements. For the detection of token sequences, a dedicated circuitry is compiled based on a non-deterministic automaton. A cas-caded result lookup ensures efficient st...
Search schemes enable the efficient identification of all approximate occurrences of a search patter...
<div><p>Exact pattern matching algorithms are popular and used widely in several applications, such ...
Abstract. We consider the online multiple-pattern matching problem for streams of XML documents, whe...
Advanced text analytics systems combine regular expres-sion (regex) matching, dictionary processing,...
Abstract — The pattern sequence is an expression that is a statement in a language designed specific...
2011-11-28Large-scale pattern matching has many applications ranging from text processing to deep pa...
The standard string matching problem involves finding all occurrences of a single pattern in a singl...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
Abstract—Accelerating multi-pattern matching is a critical is-sue in building high-performance deep ...
This article describes some common problems faced in natural language processing. The main problem c...
In this paper we focus on the problem of compressed pattern matching for the text compression using...
An important subtask of the pattern discovery process is pattern matching, where the pattern sought ...
Abstract – Semantic analysis often uses a pipeline of Natural Language Processing (NLP) tools such a...
Abstract—Modern network devices need to perform deep packet inspection at high speed for security an...
Search schemes enable the efficient identification of all approximate occurrences of a search patter...
<div><p>Exact pattern matching algorithms are popular and used widely in several applications, such ...
Abstract. We consider the online multiple-pattern matching problem for streams of XML documents, whe...
Advanced text analytics systems combine regular expres-sion (regex) matching, dictionary processing,...
Abstract — The pattern sequence is an expression that is a statement in a language designed specific...
2011-11-28Large-scale pattern matching has many applications ranging from text processing to deep pa...
The standard string matching problem involves finding all occurrences of a single pattern in a singl...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
Abstract—Accelerating multi-pattern matching is a critical is-sue in building high-performance deep ...
This article describes some common problems faced in natural language processing. The main problem c...
In this paper we focus on the problem of compressed pattern matching for the text compression using...
An important subtask of the pattern discovery process is pattern matching, where the pattern sought ...
Abstract – Semantic analysis often uses a pipeline of Natural Language Processing (NLP) tools such a...
Abstract—Modern network devices need to perform deep packet inspection at high speed for security an...
Search schemes enable the efficient identification of all approximate occurrences of a search patter...
<div><p>Exact pattern matching algorithms are popular and used widely in several applications, such ...
Abstract. We consider the online multiple-pattern matching problem for streams of XML documents, whe...