Scientific explanation often requires inferring maximally predictive features from a given data set. Unfortunately, the collection of minimal maximally predictive features for most stochastic processes is uncountably infinite. In such cases, one compromises and instead seeks nearly maximally predictive features. Here, we derive upper bounds on the rates at which the number and the coding cost of nearly maximally predictive features scale with desired predictive power. The rates are determined by the fractal dimensions of a process' mixed-state distribution. These results, in turn, show how widely used finite-order Markov models can fail as predictors and that mixed-state predictive features can offer a substantial improvement.United States....
We study the following learning problem with dependent data: Observing a trajectory of length $n$ fr...
Minimum Description Length (MDL) is an important principle for induction and prediction, with stron...
Many complexity measures are defined as the size of a minimal representation in a specific model cla...
Scientific explanation often requires inferring maximally predictive features from a given data set....
Even simply-defined, finite-state generators produce stochastic processes that require tracking an u...
Even simply defined, finite-state generators produce stochastic processes that require tracking an u...
The world around us is awash with structure and pattern. We observe it in thecycles of the seasons, ...
We are studying long term sequence prediction (forecasting). We approach this by investigating crit...
The ε-machine is a stochastic process's optimal model-maximally predictive and minimal in size. It o...
The $\epsilon$-machine is a stochastic process' optimal model -- maximally predictive and minimal in...
In this paper, we introduce Max Markov Chain (MMC), a novel representation for a useful subset of Hi...
AbstractGiven a set X of sequences over a finite alphabet, we investigate the following three quanti...
Predictive rate-distortion analysis suffers from the curse of dimensionality: clustering arbitrarily...
Predictive rate-distortion analysis suffers from the curse of dimensionality: clustering arbitrarily...
Hidden Markov chains are widely applied statistical models of stochastic processes, from fundamental...
We study the following learning problem with dependent data: Observing a trajectory of length $n$ fr...
Minimum Description Length (MDL) is an important principle for induction and prediction, with stron...
Many complexity measures are defined as the size of a minimal representation in a specific model cla...
Scientific explanation often requires inferring maximally predictive features from a given data set....
Even simply-defined, finite-state generators produce stochastic processes that require tracking an u...
Even simply defined, finite-state generators produce stochastic processes that require tracking an u...
The world around us is awash with structure and pattern. We observe it in thecycles of the seasons, ...
We are studying long term sequence prediction (forecasting). We approach this by investigating crit...
The ε-machine is a stochastic process's optimal model-maximally predictive and minimal in size. It o...
The $\epsilon$-machine is a stochastic process' optimal model -- maximally predictive and minimal in...
In this paper, we introduce Max Markov Chain (MMC), a novel representation for a useful subset of Hi...
AbstractGiven a set X of sequences over a finite alphabet, we investigate the following three quanti...
Predictive rate-distortion analysis suffers from the curse of dimensionality: clustering arbitrarily...
Predictive rate-distortion analysis suffers from the curse of dimensionality: clustering arbitrarily...
Hidden Markov chains are widely applied statistical models of stochastic processes, from fundamental...
We study the following learning problem with dependent data: Observing a trajectory of length $n$ fr...
Minimum Description Length (MDL) is an important principle for induction and prediction, with stron...
Many complexity measures are defined as the size of a minimal representation in a specific model cla...