Real-time data collection and analytics is a desirable but challenging feature to provide in data-intensive software systems. To provide highly concurrent and efficient real-time analytics on streaming data at interactive speeds requires a well-designed software architecture that makes use of a carefully selected set of software frameworks. In this paper, we report on the design and implementation of the Incremental Data Collection & Analytics Platform (IDCAP). The IDCAP provides incremental data collection and indexing in real-time of social media data; support for real-time analytics at interactive speeds; highly concurrent batch data processing supported by a novel data model; and a front-end web client that allows an analyst to manage I...
This work addresses the need for stateful dataflow programs that can rapidly sift through huge, evol...
The ‘Big Data’ of yesterday is the ‘data’ of today. As technology progresses, new challenges arise a...
In many fields, there is a need for quick analysis of data. As the number of devices connected to th...
Real-time data collection and analytics is a desirable but challenging feature to provide in data-in...
Thesis (Ph.D.) - Indiana University, Computer Sciences, 2015As Big Data processing problems evolve, ...
There is high demand for techniques and tools to process and analyze large sets of streaming data in...
A significant amount of physiological data is generated from bedside monitors and sensors in neonata...
Today, the ability to process big data has become crucial to the information needs of many enterpr...
In the quest for valuable information, modern big data applications continuously monitor streams of ...
International audienceThe current cloud landscape is getting populated with many applications that a...
Twitter is an online social networking service with more than 300 million users, generating a huge a...
Devices and sensors generate streams of data across a diversity of locations and protocols. That dat...
The clickstream analysis focuses on the records generated while a user clicks on a web page. This fi...
A huge variety of social applications, such as Twitter and Instagram, have been developed over the l...
The advent of Web 2.0 technologies which supports the creation and publishing of various social medi...
This work addresses the need for stateful dataflow programs that can rapidly sift through huge, evol...
The ‘Big Data’ of yesterday is the ‘data’ of today. As technology progresses, new challenges arise a...
In many fields, there is a need for quick analysis of data. As the number of devices connected to th...
Real-time data collection and analytics is a desirable but challenging feature to provide in data-in...
Thesis (Ph.D.) - Indiana University, Computer Sciences, 2015As Big Data processing problems evolve, ...
There is high demand for techniques and tools to process and analyze large sets of streaming data in...
A significant amount of physiological data is generated from bedside monitors and sensors in neonata...
Today, the ability to process big data has become crucial to the information needs of many enterpr...
In the quest for valuable information, modern big data applications continuously monitor streams of ...
International audienceThe current cloud landscape is getting populated with many applications that a...
Twitter is an online social networking service with more than 300 million users, generating a huge a...
Devices and sensors generate streams of data across a diversity of locations and protocols. That dat...
The clickstream analysis focuses on the records generated while a user clicks on a web page. This fi...
A huge variety of social applications, such as Twitter and Instagram, have been developed over the l...
The advent of Web 2.0 technologies which supports the creation and publishing of various social medi...
This work addresses the need for stateful dataflow programs that can rapidly sift through huge, evol...
The ‘Big Data’ of yesterday is the ‘data’ of today. As technology progresses, new challenges arise a...
In many fields, there is a need for quick analysis of data. As the number of devices connected to th...