Hierarchical Bayesian Models and Matrix factorization methods provide an unsupervised way to learn latent components of data from the grouped or sequence data. For example, in document data, latent component corn-responds to topic with each topic as a distribution over a note vocabulary of words. For many applications, there exist sparse relationships between the domain entities and the latent components of the data. Traditional approaches for topic modelling do not take into account these sparsity considerations. Modelling these sparse relationships helps in extracting relevant information leading to improvements in topic accuracy and scalable solution. In our thesis, we explore these sparsity relationships for di errant applications such ...
Unsupervised probabilistic Bayesian models are powerful tools for statistical analysis, especially i...
This work concentrates on mining textual data. In particular, I apply Statistical Machine Learning t...
With the development of information technology, electronic publications become popular. However, it ...
University of Technology Sydney. Faculty of Engineering and Information Technology.Text mining has g...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
Since the exponential growth of available Data (Big data), dimensional reduction techniques became e...
Many applications involve dyadic data, where associations between one pair of domain entities, such ...
Given the overwhelming quantities of data generated every day, there is a pressing need for tools th...
The proliferation of large electronic document archives requires new techniques for automatically an...
We present a nonparametric hierarchical Bayesian model of document collections that decouples sparsi...
One desirable property of machine learning algorithms is the ability to balance the number of p...
In the era of "big data", scalable statistical inference is necessary to learn from new and growing ...
Probabilistic mixture model is a powerful tool to provide a low-dimensional representation of count ...
Abstract—We introduce the four-parameter IBP compound Dirichlet process (ICDP), a stochastic process...
Learning low dimensional representations from a large number of short corpora has a profound practic...
Unsupervised probabilistic Bayesian models are powerful tools for statistical analysis, especially i...
This work concentrates on mining textual data. In particular, I apply Statistical Machine Learning t...
With the development of information technology, electronic publications become popular. However, it ...
University of Technology Sydney. Faculty of Engineering and Information Technology.Text mining has g...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
Since the exponential growth of available Data (Big data), dimensional reduction techniques became e...
Many applications involve dyadic data, where associations between one pair of domain entities, such ...
Given the overwhelming quantities of data generated every day, there is a pressing need for tools th...
The proliferation of large electronic document archives requires new techniques for automatically an...
We present a nonparametric hierarchical Bayesian model of document collections that decouples sparsi...
One desirable property of machine learning algorithms is the ability to balance the number of p...
In the era of "big data", scalable statistical inference is necessary to learn from new and growing ...
Probabilistic mixture model is a powerful tool to provide a low-dimensional representation of count ...
Abstract—We introduce the four-parameter IBP compound Dirichlet process (ICDP), a stochastic process...
Learning low dimensional representations from a large number of short corpora has a profound practic...
Unsupervised probabilistic Bayesian models are powerful tools for statistical analysis, especially i...
This work concentrates on mining textual data. In particular, I apply Statistical Machine Learning t...
With the development of information technology, electronic publications become popular. However, it ...