We present a matrix-factorization algorithm that scales to input matrices with huge numbers of both rows and columns. Learned factors may be sparse or dense and/or non-negative, which makes our algorithm suitable for dictionary learning, sparse component analysis, and non-negative matrix factorization. Our algorithm streams matrix columns while subsampling them to iteratively learn the matrix factors. At each iteration, the row dimension of a new sample is reduced by subsampling, resulting in lower time complexity than a simple streaming algorithm. Our method is guaranteed to converge to a stationary point of the matrix-factorization problem. We demonstrate its efficiency on massive functional Magne...
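The idea of streaming columns while subsampling their rows can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names (`stream_factorize`, `ridge_codes`, `relative_error`), the ridge-regression code step, the damped gradient update, and all parameter values are assumptions chosen for brevity, and the sketch carries no convergence guarantee and enforces no sparsity or non-negativity.

```python
import numpy as np

def ridge_codes(D, X, lam):
    """Codes A minimizing ||X - D A||^2 + lam * ||A||^2 (ridge regression)."""
    k = D.shape[1]
    return np.linalg.solve(D.T @ D + lam * np.eye(k), D.T @ X)

def relative_error(X, D, lam):
    """Relative Frobenius reconstruction error using full-row ridge codes."""
    A = ridge_codes(D, X, lam)
    return np.linalg.norm(X - D @ A) / np.linalg.norm(X)

def stream_factorize(X, D0, ratio=0.5, lam=0.1, lr=0.5, n_epochs=10, seed=0):
    """Hypothetical sketch: stream the columns of X and, for each column,
    use only a random subset of its rows to compute the code and to update
    the matching rows of the dictionary D."""
    rng = np.random.default_rng(seed)
    p, n = X.shape
    k = D0.shape[1]
    m = min(p, max(k, int(ratio * p)))  # number of rows kept per sample
    D = D0.copy()
    for _ in range(n_epochs):
        for j in rng.permutation(n):                     # stream columns
            rows = rng.choice(p, size=m, replace=False)  # subsample rows
            Ds, xs = D[rows], X[rows, j]
            a = ridge_codes(Ds, xs, lam)                 # code from m rows only
            resid = Ds @ a - xs
            # Damped gradient step on the sampled rows; the (1 + a.a)
            # denominator keeps the step bounded when the code is small.
            D[rows] -= lr * np.outer(resid, a) / (1.0 + a @ a)
    return D
```

In this sketch each iteration touches only `m` of the `p` rows, so the per-sample cost of the code and dictionary steps depends on `m` rather than `p`, which is the source of the speed-up described above. Calling `relative_error(X, D, lam)` before and after training gives a rough check that the learned factor fits the data better than the initial one.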
We present ‘Factorbird’, a prototype of a parameter server approach for factorizing large matrices ...
Matrix factorization methods are among the most common techniques for detecting latent components in...
To appear. Many modern tools in machine learning and signal processing, such as ...
We present a matrix factorization algorithm that scales to input matrices that...
Sparse matrix factorization is a popular tool to obtain interpretable data dec...
As Web 2.0 and enterprise-cloud applications have proliferated, data mining algorithms increasingly ...
Revised version. Sparse coding, that is, modelling data vectors as sparse line...
Low rank matrix factorization is an important step in many high dimensional machine learning algorit...
Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typ...
We present a very fast algorithm for general matrix factorization of a data matrix for use in the st...
The computational cost of many signal processing and machine learning techniqu...
Fully observed large binary matrices appear in a wide variety of contexts. To model them, probabilis...
This work introduces Divide-Factor-Combine (DFC), a parallel divide-and-conquer framework for noisy ...