This thesis comprises three papers under the title "On the design of R-based scalable frameworks for data science applications". We discuss the design of conceptual and computational frameworks for R, the language for statistical computing and graphics, and build software artifacts for two typical data science use cases: solving optimization problems and large-scale text analysis. Each part follows a design science approach. We verify the software frameworks introduced by evaluating prototypical instantiations of the designed artifacts on real-world applications in mixed integer optimization (consensus journal ranking) and text mining (culturomics). The first paper introduces an extensible object-oriented R ...
The recent explosion in size and complexity of datasets and the increased availability of computatio...
R (R Core Team 2014) provides a powerful and flexible system for statistical computations. It has a ...
University of Minnesota Ph.D. dissertation. December 2014. Major: Computer Science. Advisor: Arindam...
Optimization plays an important role in many methods routinely used in statistics, machine learning ...
This paper presents two complementary statistical computing frameworks that address challenges in pa...
R has gained explicit text mining support with the tm package enabling statisticians to answer many ...
Computing complex statistics on large amounts of data is no longer a corner case, but a daily challe...
The exponential increase in the generation and collection of data has led us into a new era o...
It's tough to argue with R as a high-quality, cross-platform, open source statistical software produ...
R is a mature open-source programming language for statistical computing and graphics. Many areas of...
Due to R's popularity as a data-mining tool, many distributed systems expose an R-base...
Huge data sets containing millions of training examples with a large number of attributes are relati...
The Data Science domain has expanded monumentally in both research and industry communities during t...