Efficient layered density-based clustering of categorical data

Andreopoulos, Bill
An, Aijun
Wang, Xiaogang
Labudde, Dirk

Publication date

April 2009

DOI

10.1016/j.jbi.2008.11.004

Publisher

Elsevier Inc.

ISSN

1532-0464

Abstract

A challenge involved in applying density-based clustering to categorical biomedical data is that the “cube” of attribute values has no ordering defined, making the search for dense subspaces slow. We propose the HIERDENC algorithm for hierarchical density-based clustering of categorical data, and a complementary index for searching for dense subspaces efficiently. The HIERDENC index is updated when new objects are introduced, such that clustering does not need to be repeated on all objects. The updating and cluster retrieval are efficient. Comparisons with several other clustering algorithms showed that on large datasets HIERDENC achieved better runtime scalability on the number of objects, as well as cluster quality. By fast collap...
