We consider the problem of selecting non-zero entries of a matrix A in order to produce a sparse sketch of it, B, that minimizes}A´B}2. For large mˆn matri-ces, such that n " m (for example, representing n observations over m attributes) we give sampling distributions that exhibit four important properties. First, they have closed forms computable from minimal information regarding A. Second, they allow sketching of matrices whose non-zeros are presented to the algorithm in arbitrary order as a stream, with Op1q computation per non-zero. Third, the resulting sketch matrices are not only sparse, but their non-zero entries are highly compressible. Lastly, and most importantly, under mild assumptions, our distri-butions are provably compe...
Abstract — We introduce a new method of data collection for flow size estimation, the optimized flow...
Editor: To be assigned. We compute the singular values of an m × n sparse matrix A in a distributed ...
Summarization: Streaming sketching algorithms are data-processing algorithms for the summarization o...
We consider the problem of selecting non-zero entries of a matrix A in order to produce a sparse ske...
This study presents methods called TSGPR- and SAC-SketchyCoreSVD to improve the protocol of subsampl...
We initiate a systematic study of linear sketching over F_2. For a given Boolean function treated as...
We study the streaming model for approximate matrix multiplication (AMM). We are interested in the s...
© Sampath Kannan, Elchanan Mossel, Swagato Sanyal, and Grigory Yaroslavtsev; licensed under Creative...
We study three fundamental problems of Linear Algebra, lying in the heart of various Machine Learnin...
We consider the problem of reconstructing a sparse signal x0 ∈ Rn from a limited number of linear me...
We consider the approximate sparse recovery problem, where the goal is to (approximately) recover a ...
Given a matrix, the seriation problem consists in permuting its rows in such way that all its column...
We observe a N × M matrix of independent, identically distributed Gaussian random variables which ar...
Low rank matrix factorization is an important step in many high dimensional machine learning algorit...
This paper focuses on the low rank plus sparse matrix decomposition problem in big data settings. Co...
Abstract — We introduce a new method of data collection for flow size estimation, the optimized flow...
Editor: To be assigned. We compute the singular values of an m × n sparse matrix A in a distributed ...
Summarization: Streaming sketching algorithms are data-processing algorithms for the summarization o...
We consider the problem of selecting non-zero entries of a matrix A in order to produce a sparse ske...
This study presents methods called TSGPR- and SAC-SketchyCoreSVD to improve the protocol of subsampl...
We initiate a systematic study of linear sketching over F_2. For a given Boolean function treated as...
We study the streaming model for approximate matrix multiplication (AMM). We are interested in the s...
© Sampath Kannan, Elchanan Mossel, Swagato Sanyal, and Grigory Yaroslavtsev; licensed under Creative...
We study three fundamental problems of Linear Algebra, lying in the heart of various Machine Learnin...
We consider the problem of reconstructing a sparse signal x0 ∈ Rn from a limited number of linear me...
We consider the approximate sparse recovery problem, where the goal is to (approximately) recover a ...
Given a matrix, the seriation problem consists in permuting its rows in such way that all its column...
We observe a N × M matrix of independent, identically distributed Gaussian random variables which ar...
Low rank matrix factorization is an important step in many high dimensional machine learning algorit...
This paper focuses on the low rank plus sparse matrix decomposition problem in big data settings. Co...
Abstract — We introduce a new method of data collection for flow size estimation, the optimized flow...
Editor: To be assigned. We compute the singular values of an m × n sparse matrix A in a distributed ...
Summarization: Streaming sketching algorithms are data-processing algorithms for the summarization o...