This is the final report of a one-year Laboratory Directed Research and Development (LDRD) project at the Los Alamos National Laboratory (LANL). The objective of this project was to develop and implement data mining technology suited to the analysis of large collections of unstructured data. The result is a software tool, PADMA (Parallel Data Mining Agents), which incorporates parallel data access, scalable parallel hierarchical clustering algorithms, and a web-based user interface for submitting Structured Query Language (SQL) queries and for interactive data visualization. The authors have demonstrated the viability and scalability of PADMA by applying it to an unstructured text database of 25,000 documents running on an IBM ...
Recently data mining has become more popular in the information industry. It is due to the availabil...
This whitepaper briefly describes a new, aggressive effort in large-scale data Livermore National L...
The present situation in biological and medical sciences is characterized by the availability of mas...
This paper describes an experimental parallel/distributed data mining system PADMA (PArallel Data Mi...
Scalability determines the potential in distributing both data and computation in data mi...
Recent years have shown the need for an automated process to discover interesting and hidden...
This paper introduces PADMA (PArallel Data Mining Agents), a parallel agent based system for scalabl...
Advances in hardware and software technology enable us to collect, store and distribute large quanti...
The advent of computing technology has significantly influenced our lives and two major impacts of t...
Managing and efficiently analysing the vast amounts of data produced by a huge variety of data sourc...
Recent advances in data capture, data transmission and data storage technologies have resulted in a ...
Data mining is the semi-automatic discovery of patterns, associations, changes, anomalies, and stati...
On-Line Analytical Processing techniques are used for data analysis and decision support systems. Th...
Many scientific datasets (e.g. earth sciences, medical sciences, etc.) increase with respect to thei...
Data mining is the application of specific algorithms for extracting patterns from data. B...