This thesis in text algorithmics studies the compression, indexation and querying on a labeled text}. A labeled text is a text to which we add information. As an example, in a V(D)J recombination, a marker for lymphocytes, the text is a DNA sequence and the labels are the genes' names. A person's immune system can be represented with a set of V(D)J recombinations. With high-throughput sequencing, we have access to millions of V(D)J recombinations which are stored and need to be recovered and compared quickly.The first contribution of this thesis is a compression method for a labeled text which uses the concept of storage by references. The text is divided into sections which point to pre-established labeled sequences. The second contributio...
In this Ph. D. Thesis we investigate several data compression methods on text in natural language. O...
URN: urn:nbn:de:0030-drops-104955URL: http://drops.dagstuhl.de/opus/volltexte/2019/10495/ISBN ={978-...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
This thesis in text algorithmics studies the compression, indexation and querying on a labeled text}...
Cette thèse en algorithmique du texte étudie la compression, l'indexation et les requêtes sur un tex...
International audienceBackground: Labels are a way to add some information on a text, such as functi...
This thesis deals with space-efficient algorithms to compress and index texts. The aim of compressio...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
Much of the DNA and RNA sequencing data available is in the form of high-throughput sequencing (HTS)...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
We study a method by Ferragina and Manzini for creating an index of a text. This index allows us to ...
We study a method by Ferragina and Manzini for creating an index of a text. This index allows us to ...
An indexed sequence of strings is a data structure for storing a string sequence that supports rando...
Les volumes des données générées par les technologies de séquençage haut débit augmentent exponentie...
The world is drowning in data. The recent explosion of web publishing, XML data, bioinformation, sci...
In this Ph. D. Thesis we investigate several data compression methods on text in natural language. O...
URN: urn:nbn:de:0030-drops-104955URL: http://drops.dagstuhl.de/opus/volltexte/2019/10495/ISBN ={978-...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
This thesis in text algorithmics studies the compression, indexation and querying on a labeled text}...
Cette thèse en algorithmique du texte étudie la compression, l'indexation et les requêtes sur un tex...
International audienceBackground: Labels are a way to add some information on a text, such as functi...
This thesis deals with space-efficient algorithms to compress and index texts. The aim of compressio...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
Much of the DNA and RNA sequencing data available is in the form of high-throughput sequencing (HTS)...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
We study a method by Ferragina and Manzini for creating an index of a text. This index allows us to ...
We study a method by Ferragina and Manzini for creating an index of a text. This index allows us to ...
An indexed sequence of strings is a data structure for storing a string sequence that supports rando...
Les volumes des données générées par les technologies de séquençage haut débit augmentent exponentie...
The world is drowning in data. The recent explosion of web publishing, XML data, bioinformation, sci...
In this Ph. D. Thesis we investigate several data compression methods on text in natural language. O...
URN: urn:nbn:de:0030-drops-104955URL: http://drops.dagstuhl.de/opus/volltexte/2019/10495/ISBN ={978-...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...