The domain of data mining and machine learning has expanded rapidly in recent years to include both large-scale distributed and streaming computation. Although many open-source and cloud-based frameworks are available for these tasks, many of which are used in-production by industry, this is a rapidly-evolving technology landscape, and the gap between the academic role of algorithm development and discovery and code available for use with real-world data has grown. In addition, although there is a rich history of mathematical models for streaming data on continuous vector spaces, there has been significantly less work on streaming discrete spaces. However, much if not most of the data available online is composed of high-dimensional spar...
<p>Constant technology advances have caused data explosion in recent years. Accord- ingly modern sta...
Large quantifies of online user activity data, such as weekly web search volumes, which co-evolve wi...
With the increasing availability of large datasets machine learning techniques are becoming an incr...
The domain of data mining and machine learning has expanded rapidly in recent years to include both ...
We propose a semi-parametric and dynamic rank factor model for topic model-ing, capable of (i) disco...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
A current challenge for data management systems is to support the construction and maintenance of ma...
Streaming is an important paradigm for handling high-speed data sets that are too large to fit in ma...
This thesis is concerned with the statistical learning of probabilistic models for graph-structured ...
Dynamic Bayesian Networks (DBNs) are temporal probabilistic models for reasoning over time. They oft...
A new framework for topic modeling is devel-oped, based on deep graphical models, where interactions...
University of Minnesota Ph.D. dissertation. May 2016. Major: Electrical Engineering. Advisor: Georgi...
Many real-world systems can be represented as networks driven by discrete {em events}, each event id...
<p>Constant technology advances have caused data explosion in recent years. Accord- ingly modern sta...
Large quantifies of online user activity data, such as weekly web search volumes, which co-evolve wi...
With the increasing availability of large datasets machine learning techniques are becoming an incr...
The domain of data mining and machine learning has expanded rapidly in recent years to include both ...
We propose a semi-parametric and dynamic rank factor model for topic model-ing, capable of (i) disco...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
The proliferation of online communities has attracted much attention to modelling user behaviour in ...
A current challenge for data management systems is to support the construction and maintenance of ma...
Streaming is an important paradigm for handling high-speed data sets that are too large to fit in ma...
This thesis is concerned with the statistical learning of probabilistic models for graph-structured ...
Dynamic Bayesian Networks (DBNs) are temporal probabilistic models for reasoning over time. They oft...
A new framework for topic modeling is devel-oped, based on deep graphical models, where interactions...
University of Minnesota Ph.D. dissertation. May 2016. Major: Electrical Engineering. Advisor: Georgi...
Many real-world systems can be represented as networks driven by discrete {em events}, each event id...
<p>Constant technology advances have caused data explosion in recent years. Accord- ingly modern sta...
Large quantifies of online user activity data, such as weekly web search volumes, which co-evolve wi...
With the increasing availability of large datasets machine learning techniques are becoming an incr...