The emergence of the World Wide Web, social media, and mobile devices has made large and rich quantities of text data available. Such vast datasets have led to leaps in the performance of many statistically based methods. Given the magnitude of text data available, however, it is computationally prohibitive to train many complex Natural Language Processing (NLP) models on it all. This motivates the hypothesis that simple models trained on big data can outperform more complex models trained on small data. My dissertation provides a solution for effectively and efficiently exploiting large data in many NLP applications. Datasets are growing at an exponential rate, much faster than memory capacity. To provide a memory-efficient solution ...
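The abstract's memory-efficient solution is left unspecified by the truncation, but a standard instance of the general technique it points to is sketch-based counting. Below is a minimal, illustrative Count-Min sketch for approximate word frequencies over a text stream; it is a hypothetical sketch of the approach, not the dissertation's actual method, and the class name, hash choice, and dimensions are all assumptions.

```python
import hashlib

class CountMinSketch:
    """Approximate frequency counts in O(width * depth) memory,
    independent of vocabulary size. Estimates never undercount."""

    def __init__(self, width=2048, depth=4):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _hash(self, item, row):
        # One independent-ish hash per row, derived from a salted digest.
        h = hashlib.md5(f"{row}:{item}".encode()).hexdigest()
        return int(h, 16) % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.table[row][self._hash(item, row)] += count

    def estimate(self, item):
        # True count <= estimate; hash collisions can only inflate cells.
        return min(self.table[row][self._hash(item, row)]
                   for row in range(self.depth))

# Stream words without ever materializing the full vocabulary.
sketch = CountMinSketch()
for word in "the quick brown fox jumps over the lazy dog the".split():
    sketch.add(word)
print(sketch.estimate("the"))  # 3 (exact here; an upper bound in general)
```

The point of the structure is that its memory footprint is fixed up front, so counting scales to corpora far larger than RAM at the cost of a bounded overestimate.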
As the size of data available for processing increases, new models of computation are needed. This ...
For an extended version of this article that contains additional references and more in-depth discus...
Recent developments in large language models (LLMs) have shown promise in enhancing the capabilities...
Computational power needs have grown dramatically in recent years. This is also the case in many lan...
Data streams have emerged as a natural computational model for numerous applications of big data pro...
Substantial progress has been made in the field of natural language processing (NLP) due to the adve...
The field of streaming algorithms has enjoyed a great deal of attention from the theoretical computer science ...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The desired output in many machine learning tasks is a structured object, such as a tree, a clustering, ...
In light of widespread digitization endeavors and ever-growing textual data generation, developing e...
Today, computer systems need to cope with the explosive growth of data in the world. For instance, i...
In contrast to the traditional random access memory computational model where the entire input is av...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
Sometimes data is generated unboundedly and at such a fast pace that it is no longer possible to sto...
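When the stream truly cannot be stored, as this snippet describes, a classic way to retain a representative summary in constant memory is reservoir sampling. The sketch below is illustrative only and is not drawn from the cited work; the function name and fixed seed are assumptions for reproducibility.

```python
import random

def reservoir_sample(stream, k, rng=random.Random(0)):
    """Keep a uniform random sample of k items from a stream of
    unknown, possibly unbounded length, using O(k) memory."""
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            # Item i replaces a stored item with probability k / (i + 1),
            # which keeps every item equally likely to survive.
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir

# Works on any iterable, including one far too large to hold in memory.
print(reservoir_sample(range(10**6), k=5))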
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
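Processing one update at a time with sublinear memory is exactly the regime of heavy-hitter algorithms such as Misra-Gries. As a concrete illustration (hypothetical, not taken from any of the works excerpted above):

```python
def misra_gries(stream, k):
    """Find candidate heavy hitters (items with frequency > n/k)
    in one pass over the stream using at most k-1 counters."""
    counters = {}
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < k - 1:
            counters[item] = 1
        else:
            # Decrement all counters; drop any that reach zero.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters

tokens = "a b a c a b a d a".split()
print(misra_gries(tokens, k=3))  # {'a': 3}: 'a' dominates the stream
```

Every item occurring more than n/k times is guaranteed to survive in the counter set, which is why a single pass and k-1 counters suffice; a second pass can verify exact counts if needed.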