HPDBSCAN – Highly Parallel DBSCAN - Prof. Dr.

HPDBSCAN – Highly Parallel DBSCAN

Goetz, M., Bodenstein, C., Riedel, M.: HPDBSCAN – Highly Parallel DBSCAN, in conference proceedings of ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis (SC 2015), Machine Learning in HPC Environments (MLHPC 2015) Workshop, November 15-20, 2015, Austin, Texas, USA
[ EVENT ] [ DOI ] [ JUSER ] [ RESEARCHGATE ]

HPDBSCAN Highly Parallel DBSCAN

Abstract:
Clustering algorithms in the field of data-mining are used to aggregate similar objects into common groups. One of the best-known of these algorithms is called DBSCAN. Its distinct design enables the search for an apriori unknown number of arbitrarily shaped clusters, and at the same time allows to filter out noise. Due to its sequential formulation, the parallelization of DBSCAN renders a challenge. In this paper we present a new parallel approach which we call HPDBSCAN. It employs three major techniques in order to break the sequentiality, empower workload-balancing as well as speed up neighborhood searches in distributed parallel processing environments i) a computation split heuristic for domain decomposition, ii) a data index preprocessing step and iii) a rule-based cluster merging scheme. As a proof-of-concept we implemented HPDBSCAN as an OpenMP/MPI hybrid application. Using real-world data sets, such as a point cloud from the old town of Bremen, Germany, we demonstrate that our implementation is able to achieve a significant speed-up and scale-up in common HPC setups. Moreover, we compare our approach with previous attempts to parallelize DBSCAN showing an order of magnitude improvement in terms of computation time and memory consumption.

Social Media

@ResearchGate: Well Done Morris! Paper reached 1500 reads: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands @helmholtz_ai
.
full text: https://t.co/iUBVwodfHB
. pic.twitter.com/xupDfUbooH

— Morris Riedel (@MorrisRiedel) August 18, 2020

ResearchGate: Well Done Morris! Paper reached 1500 reads: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable…

Posted by Morris Riedel on Tuesday, August 18, 2020

Sieh dir diesen Beitrag auf Instagram an

ResearchGate: Well Done Morris! Paper reached 1500 reads: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining #DEEPprojects @forschungszentrum_juelich #julichsupercomputingcenter @haskoli_islands @helmholtz.ai @helmholtz_de @von.hi . full text: buff.ly/2Qa70Ol .

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Aug 18, 2020 um 1:28 PDT

ResearchGate:Good Job Morris!Paper reached 1000 reads: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands @helmholtz_ai
.
see full text: https://t.co/iUBVwodfHB
. pic.twitter.com/CsPHNOLOAq

— Morris Riedel (@MorrisRiedel) January 5, 2020

Sieh dir diesen Beitrag auf Instagram an

ResearchGate:Good Job Morris!Paper reached 1000 reads: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining DEEP Projects #julichsupercomputingcenter @forschungszentrum_juelich @haskoli_islands @von_hi @the_effective_communicators @helmholtz_de #AI . see full text: buff.ly/2Qa70Ol .

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Jan 6, 2020 um 5:23 PST

ResearchGate:Way to go Morris! Paper reached 20 citations: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands @helmholtz_ai
.
Full text: https://t.co/iUBVwodfHB
. pic.twitter.com/rbsyhKo0Gl

— Morris Riedel (@MorrisRiedel) December 21, 2019

Sieh dir diesen Beitrag auf Instagram an

ResearchGate:Way to go Morris! Paper reached 20 citations: HPDBSCAN Highly #Parallel #DBSCAN #HPC #AI scalable #clustering #algorithm #DataScience #MachineLearning #datamining DEEPprojects @forschungszentrum_juelich #julichsupercomputingcenter @haskoli_islands @helmholtz_de #ai . Full text: buff.ly/2Qa70Ol .

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Dez 21, 2019 um 4:13 PST

Researchgate: Good job Morris! Your paper reached 800 reads: HPDBSCAN Highly #Parallel #DBSCAN fast & scalable #clustering #algorithm #DataScience #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands @helmholtz_ai
.
Full text: https://t.co/iUBVwodfHB
. pic.twitter.com/cuVFzainNZ

— Morris Riedel (@MorrisRiedel) November 1, 2019

Sieh dir diesen Beitrag auf Instagram an

Researchgate: Good job Morris! Your paper reached 800 reads: HPDBSCAN Highly #Parallel #DBSCAN fast & scalable #clustering #algorithm #DataScience #MachineLearning #datamining DEEPprojects @von_hi @forschungszentrum_juelich @haskoli_islands @the_effective_communicators #julichsupercomputingcenter @helmholtz_de #artificialintelligence . Full text: buff.ly/2Qa70Ol . #unsupervisedlearning #iceland #hpc #highperformancecomputing

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Nov 1, 2019 um 1:17 PDT

Researchgate: Great Work, Morris! Your paper reached 700 reads: HPDBSCAN Highly #Parallel #DBSCAN fast & scalable #clustering #algorithm #DataScience #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands @helmholtz_ai

Full text: https://t.co/iUBVwodfHB pic.twitter.com/QAOv9T0FhC

— Morris Riedel (@MorrisRiedel) September 15, 2019

Sieh dir diesen Beitrag auf Instagram an

Researchgate: Great Work, Morris! Your paper reached 700 reads: HPDBSCAN Highly #Parallel #DBSCAN fast & scalable #clustering #algorithm #DataScience #MachineLearning #datamining DEEP Projects @forschungszentrum_juelich #julichsupercomputingcenter @haskoli_islands @helmholtz_de Artificial Intelligence Cooperation Unit (HAICU) Full text: buff.ly/2Qa70Ol

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Sep 15, 2019 um 9:43 PDT

ResearchGate: Nice work, Morris! Your research items reached 6000 reads: most read paper ~600 reads is HPDBSCAN: highly parallel DBSCAN @ SC2015 & used in #MachineLearning #datamining @DEEPprojects @fzj_jsc @fz_juelich @Haskoli_Islands #SMITH

full text: https://t.co/72Ek7SPDK5 pic.twitter.com/TY8L2zuPXj

— Morris Riedel (@MorrisRiedel) July 5, 2019

Sieh dir diesen Beitrag auf Instagram an

ResearchGate: Nice work, Morris! Your research items reached 6000 reads: most read paper ~600 reads is HPDBSCAN: highly parallel DBSCAN @ SC2015 & used in #MachineLearning #datamining @ DEEP Projects #julichsupercomputingcenter @forschungszentrum_juelich @haskoli_islands #SMITH full text: https://www.researchgate.net/publication/301463871_HPDBSCAN_highly_parallel_DBSCAN

Ein Beitrag geteilt von Morris Riedel (@morrisriedel) am Jul 5, 2019 um 4:31 PDT

HPDBSCAN – Highly Parallel DBSCAN

Social Media

Categories

Contact