An open API service for producing an overview of a list of open source projects.

JOSS: https://github.com/scikit-learn-contrib/hdbscan

cluster-analysis clustering clustering-algorithm clustering-evaluation machine-learning machine-learning-algorithms

Score: 33.45731119080637

Last synced: about 7 hours ago
JSON representation

Repository metadata:

A high performance implementation of HDBSCAN clustering.


Owner metadata:


GitHub Events

Total
Last Year

Committers metadata

Last synced: 3 days ago

Total Commits: 925
Total Committers: 103
Avg Commits per committer: 8.981
Development Distribution Score (DDS): 0.31

Commits in past year: 37
Committers in past year: 10
Avg Commits per committer in past year: 3.7
Development Distribution Score (DDS) in past year: 0.595

Name Email Commits
Leland McInnes l****s@g****m 638
Jelmer Bot j****t@g****m 22
sebastianbreguel b****n@g****m 15
gclendenning 6****g 13
neontty n****y@g****m 11
Guillaume Lemaitre g****e@v****u 10
John Healy j****l@g****m 10
Steve Astels s****s@g****m 10
Greg Demand 6****d 9
luis261 l****r@g****e 8
jc-healy j****y@g****m 8
Guillaume Ansanay-Alex g****y@g****m 7
Dicksonchin93 c****n@d****m 7
João Matias j****s@t****m 7
gclen g****g@u****t 7
Bruno Alano b****o@n****r 7
Matthew Carrigan r****1@g****m 6
Sebastian Berg s****b@n****m 5
Rhaedonius l****o@a****n 5
cmalzer c****r@g****e 5
Nathaniel Saul n****t@s****m 4
m-dz m****c@g****m 4
Ryan Helinski r****i@g****m 4
the null 4****e 4
Lukas Großberger c****e@g****z 4
Adam Lugowski a****i@g****m 3
Andrija Jovanovic 1****8 3
areeh a****t@i****m 3
cmalzer c****r@g****e 3
gr g****y@g****m 3
and 73 more...

Issue and Pull Request metadata

Last synced: 3 days ago

Total issues: 161
Total pull requests: 89
Average time to close issues: 6 months
Average time to close pull requests: 2 months
Total issue authors: 153
Total pull request authors: 38
Average comments per issue: 4.0
Average comments per pull request: 1.11
Merged pull request: 59
Bot issues: 0
Bot pull requests: 0

Past year issues: 9
Past year pull requests: 25
Past year average time to close issues: N/A
Past year average time to close pull requests: 3 days
Past year issue authors: 9
Past year pull request authors: 7
Past year average comments per issue: 0.56
Past year average comments per pull request: 0.0
Past year merged pull request: 16
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/scikit-learn-contrib/hdbscan

Top Issue Authors

  • notluquis (3)
  • lucetka (2)
  • katzurik (2)
  • divya-agrawal3103 (2)
  • ivan-marroquin (2)
  • KukumavMozolo (2)
  • jakirkham (2)
  • don-hulsey (1)
  • mdagost (1)
  • xavgit (1)
  • OlgaGKononova (1)
  • SergioG-M (1)
  • PezAmaury (1)
  • jgonggrijp (1)
  • mczerny (1)

Top Pull Request Authors

  • sebastianbreguel (16)
  • JelmerBot (13)
  • gclendenning (7)
  • lmcinnes (5)
  • chenxinye (4)
  • Antobiotics (2)
  • divyegala (2)
  • seberg (2)
  • joaquindas (2)
  • Rhaedonius (2)
  • prodrigues-tdx (2)
  • cearlefraym (2)
  • smartIU (2)
  • axiak (2)
  • NicoSantamaria (2)

Top Issue Labels

  • new feature (1)
  • help wanted (1)
  • bug (1)

Top Pull Request Labels


Package metadata

pypi.org: hdbscan

Clustering based on density with variable density clusters

  • Homepage: http://github.com/scikit-learn-contrib/hdbscan
  • Documentation: https://hdbscan.readthedocs.io/
  • Licenses: BSD
  • Latest release: 0.8.44 (published 27 days ago)
  • Last Synced: 2026-06-25T19:30:20.332Z (3 days ago)
  • Versions: 61
  • Dependent Packages: 120
  • Dependent Repositories: 483
  • Downloads: 2,954,986 Last month
  • Docker Downloads: 934,700,138
  • Rankings:
    • Dependent packages count: 0.173%
    • Docker downloads count: 0.295%
    • Downloads: 0.477%
    • Dependent repos count: 0.648%
    • Average: 0.918%
    • Stargazers count: 1.451%
    • Forks count: 2.463%
  • Maintainers (3)
conda-forge.org: hdbscan

  • Homepage: http://github.com/scikit-learn-contrib/hdbscan
  • Licenses: BSD-3-Clause
  • Latest release: 0.8.29 (published over 3 years ago)
  • Last Synced: 2026-03-23T09:14:27.342Z (3 months ago)
  • Versions: 28
  • Dependent Packages: 11
  • Dependent Repositories: 39
  • Downloads: 3,245,412 Total
  • Rankings:
    • Dependent packages count: 5.484%
    • Dependent repos count: 5.809%
    • Average: 6.743%
    • Forks count: 7.62%
    • Stargazers count: 8.059%
spack.io: py-hdbscan

HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. This allows HDBSCAN to find clusters of varying densities (unlike DBSCAN), and be more robust to parameter selection. In practice this means that HDBSCAN returns a good clustering straight away with little or no parameter tuning -- and the primary parameter, minimum cluster size, is intuitive and easy to select. HDBSCAN is ideal for exploratory data analysis; it's a fast and robust algorithm that you can trust to return meaningful clusters (if there are any).

  • Homepage: https://github.com/scikit-learn-contrib/hdbscan
  • Licenses: []
  • Latest release: 0.8.29 (published about 3 years ago)
  • Last Synced: 2026-06-25T01:01:37.126Z (3 days ago)
  • Versions: 2
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Rankings:
    • Dependent repos count: 0.0%
    • Forks count: 4.321%
    • Stargazers count: 4.599%
    • Average: 9.247%
    • Dependent packages count: 28.067%
  • Maintainers (1)
pypi.org: bf44-hdbscan-0844

Unofficial pinned rebuild of hdbscan 0.8.44 with additional wheels (incl. linux-aarch64). Imports as `hdbscan`. See https://github.com/scikit-learn-contrib/hdbscan

  • Homepage: http://github.com/scikit-learn-contrib/hdbscan
  • Documentation: https://bf44-hdbscan-0844.readthedocs.io/
  • Licenses: BSD
  • Latest release: 1.0.0 (published 14 days ago)
  • Last Synced: 2026-06-25T01:01:34.798Z (3 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Dependent packages count: 7.034%
    • Average: 23.414%
    • Dependent repos count: 39.793%
  • Maintainers (1)
pypi.org: hdbscan-314

Clustering based on density with variable density clusters

  • Homepage: http://github.com/scikit-learn-contrib/hdbscan
  • Documentation: https://hdbscan-314.readthedocs.io/
  • Licenses: BSD
  • Latest release: 0.8.43 (published 19 days ago)
  • Last Synced: 2026-06-10T18:04:09.391Z (18 days ago)
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 901 Last month
  • Rankings:
    • Dependent packages count: 7.256%
    • Average: 24.15%
    • Dependent repos count: 41.044%
  • Maintainers (1)
nixpkgs-23.11: python311Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-unstable: python314Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-23.05: python310Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-23.11: python310Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-24.05: python311Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

guix: python-hdbscan

High performance implementation of HDBSCAN clustering

nixpkgs-23.05: python311Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-unstable: python313Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-24.11: python311Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-24.05: python312Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API

nixpkgs-24.11: python312Packages.hdbscan

Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API


Dependencies

.github/workflows/pythonpublish.yml actions
  • actions/checkout v1 composite
  • actions/setup-python v1 composite
.github/workflows/pythonpublish_windows.yml actions
  • actions/checkout v1 composite
  • actions/setup-python v1 composite
requirements.txt pypi
  • cython >=0.27
  • joblib >=1.0
  • numpy >=1.20
  • scikit-learn >=0.20
  • scipy >=1.0
.github/workflows/pythonpublish_wheel.yml actions
  • actions/checkout v1 composite
  • actions/setup-python v1 composite
pyproject.toml pypi
setup.py pypi
environment.yml conda
  • hdbscan >=0.8.11
  • matplotlib >=2.0
  • python >=3.5
  • scikit-learn >=0.19
  • seaborn >=0.8
docs/docs_requirements.txt pypi
  • sphinx_rtd_theme *