Boolean matrix factorization (BMF) is a popular and powerful technique for inferring knowledge from data. The mining result is the Boolean product of two matrices, approximating the input dataset. The Boolean product is a disjunction of rank-1 binary matrices, each describing a feature-relation, called pattern, for a group of samples. Yet, there are no guarantees that any of the returned patterns do not actually arise from noise, i.e., are false discoveries. In this paper, we propose and discuss the usage of the false discovery rate in the unsupervised BMF setting. We prove two bounds on the probability that a found pattern is constituted of random Bernoulli-distributed noise. Each bound exploits a specific property of the factorization w...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Is it possible to meaningfully analyze the structure of a Boolean matrix for which 99% data is missi...
Matrix factorizations—where a given data matrix is approximated by a prod- uct of two or more factor...
Finding patterns in binary data is a classical problem in data mining, dating back to at least frequ...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Matrix factorizations---where a given data matrix is approximated by a product of two or more factor...
Matrix factorizations—where a given data matrix is approximated by a prod-uct of two or more factor ...
Identifying discrete patterns in binary data is an important dimensionality reduction tool in machin...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Boolean Matrix Factorization (BMF) aims to find an approximation of a given binary matrix as the Boo...
Matrix factorizations---where a given data matrix is approximated by a product of two or more facto...
Boolean matrix has been used to represent digital information in many fields, including bank transac...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Matrix factorizations—where a given data matrix is approximated by a product of two or more factor m...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Is it possible to meaningfully analyze the structure of a Boolean matrix for which 99% data is missi...
Matrix factorizations—where a given data matrix is approximated by a prod- uct of two or more factor...
Finding patterns in binary data is a classical problem in data mining, dating back to at least frequ...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Matrix factorizations---where a given data matrix is approximated by a product of two or more factor...
Matrix factorizations—where a given data matrix is approximated by a prod-uct of two or more factor ...
Identifying discrete patterns in binary data is an important dimensionality reduction tool in machin...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Boolean Matrix Factorization (BMF) aims to find an approximation of a given binary matrix as the Boo...
Matrix factorizations---where a given data matrix is approximated by a product of two or more facto...
Boolean matrix has been used to represent digital information in many fields, including bank transac...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Matrix factorizations—where a given data matrix is approximated by a product of two or more factor m...
Finding patterns from binary data is a classical problem in data mining, dating back to at least fre...
Is it possible to meaningfully analyze the structure of a Boolean matrix for which 99% data is missi...
Matrix factorizations—where a given data matrix is approximated by a prod- uct of two or more factor...