Today, many organisations and enterprises use data from several sources, either for strategic decision making or for other business goals such as data integration. Data quality problems are a persistent hindrance to the effective and efficient utilization of such data. Tools have been built to clean and standardize data; however, there is a need to pre-process this data by applying techniques and processes from statistical semantics, NLP, and lexical analysis. Data profiling employed these techniques to discover and reveal commonalities and differences in the inherent data structures, present ideas for the creation of a unified data model, and provide metrics for data standardization and verification. The IBM WebSphere tool was used to pre-process dataset/re...
The explosion of high throughput genomic data in recent years has already altered our view of the ex...
Text messages are essential these days; however, spam texts have contributed negatively to the succe...
This research paper examines cognitive computing relative to how businesses in healthcare may use co...
Linked Open Data (LOD) provides access to large amounts of data on the Web. These data sets range from ...
This study aims at assessing data monetization as a potential profit source for carmakers through qualit...
Internship Report presented as the partial requirement for obtaining a Master's degree in Data Scien...
Building an accurate and reliable model for prediction for different application domains is one of ...
Benchmarking is a common method in evaluating and choosing a NoSQL database. There are already lots ...
Personalized Internet Services have become an important topic of research and study. The concept of ...
The fast capacity growth of cheap storage devices presents an ever-growing problem of scale for digi...
Surveys are an important tool for researchers. Survey attributes are typically discrete data measure...
In the past few years, there has been a keen interest in mining frequent itemsets in large data repo...
State highway agencies invest a large amount of resources in collecting, storing and managing variou...
The efficiency and generalizability of a deep learning model is based on the amount and diversity of...
The rapidly changing nature of information and use of information systems within organisations has s...