JOSS: https://github.com/scikit-learn-contrib/hdbscan
cluster-analysis clustering clustering-algorithm clustering-evaluation machine-learning machine-learning-algorithms
Score: 33.392823364786125
Last synced: about 18 hours ago
JSON representation
Repository metadata:
A high performance implementation of HDBSCAN clustering.
- Host: GitHub
- URL: https://github.com/scikit-learn-contrib/hdbscan
- Owner: scikit-learn-contrib
- License: bsd-3-clause
- Created: 2015-04-22T13:32:37.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2025-12-24T20:27:20.000Z (3 months ago)
- Last Synced: 2026-01-17T05:19:54.890Z (2 months ago)
- Topics: cluster-analysis, clustering, clustering-algorithm, clustering-evaluation, machine-learning, machine-learning-algorithms
- Language: Jupyter Notebook
- Homepage: http://hdbscan.readthedocs.io/en/latest/
- Size: 27.8 MB
- Stars: 3,049
- Watchers: 54
- Forks: 526
- Open Issues: 381
-
Metadata Files:
- Readme: README.rst
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Owner metadata:
- Name: scikit-learn-contrib
- Login: scikit-learn-contrib
- Email:
- Kind: organization
- Description: scikit-learn compatible projects
- Website: http://contrib.scikit-learn.org
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/17349883?v=4
- Repositories: 27
- Last Synced at: 2024-03-25T20:03:09.812Z
- Profile URL: https://github.com/scikit-learn-contrib
GitHub Events
Total
- Create event: 1
- Fork event: 24
- Issue comment event: 49
- Issues event: 19
- Member event: 1
- Pull request event: 25
- Pull request review comment event: 3
- Pull request review event: 6
- Push event: 8
- Release event: 2
- Watch event: 197
- Total: 335
Last Year
- Fork event: 11
- Issue comment event: 21
- Issues event: 11
- Pull request event: 11
- Pull request review event: 1
- Push event: 2
- Release event: 1
- Watch event: 107
- Total: 165
Committers metadata
Last synced: 23 days ago
Total Commits: 898
Total Committers: 99
Avg Commits per committer: 9.071
Development Distribution Score (DDS): 0.293
Commits in past year: 15
Committers in past year: 7
Avg Commits per committer in past year: 2.143
Development Distribution Score (DDS) in past year: 0.667
| Name | Commits | |
|---|---|---|
| Leland McInnes | l****s@g****m | 635 |
| Jelmer Bot | j****t@g****m | 22 |
| neontty | n****y@g****m | 11 |
| Guillaume Lemaitre | g****e@v****u | 10 |
| John Healy | j****l@g****m | 10 |
| Steve Astels | s****s@g****m | 10 |
| Greg Demand | 6****d | 9 |
| luis261 | l****r@g****e | 8 |
| gclendenning | 6****g | 8 |
| jc-healy | j****y@g****m | 8 |
| Guillaume Ansanay-Alex | g****y@g****m | 7 |
| Dicksonchin93 | c****n@d****m | 7 |
| João Matias | j****s@t****m | 7 |
| gclen | g****g@u****t | 7 |
| Bruno Alano | b****o@n****r | 7 |
| Matthew Carrigan | r****1@g****m | 6 |
| Rhaedonius | l****o@a****n | 5 |
| cmalzer | c****r@g****e | 5 |
| Sebastian Berg | s****b@n****m | 5 |
| Ryan Helinski | r****i@g****m | 4 |
| the null | 4****e | 4 |
| m-dz | m****c@g****m | 4 |
| Nathaniel Saul | n****t@s****m | 4 |
| Lukas Großberger | c****e@g****z | 4 |
| Adam Lugowski | a****i@g****m | 3 |
| Andrija Jovanovic | 1****8 | 3 |
| areeh | a****t@i****m | 3 |
| cmalzer | c****r@g****e | 3 |
| gr | g****y@g****m | 3 |
| K.-Michael Aye | m****e | 2 |
| and 69 more... | ||
Issue and Pull Request metadata
Last synced: 8 days ago
Total issues: 159
Total pull requests: 70
Average time to close issues: 6 months
Average time to close pull requests: 3 months
Total issue authors: 152
Total pull request authors: 36
Average comments per issue: 4.04
Average comments per pull request: 1.41
Merged pull request: 44
Bot issues: 0
Bot pull requests: 0
Past year issues: 12
Past year pull requests: 11
Past year average time to close issues: 24 days
Past year average time to close pull requests: about 2 months
Past year issue authors: 11
Past year pull request authors: 7
Past year average comments per issue: 0.58
Past year average comments per pull request: 0.36
Past year merged pull request: 5
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- notluquis (3)
- lucetka (2)
- katzurik (2)
- divya-agrawal3103 (2)
- KukumavMozolo (2)
- jakirkham (2)
- aleereza (1)
- mdagost (1)
- xavgit (1)
- OlgaGKononova (1)
- SergioG-M (1)
- PezAmaury (1)
- jgonggrijp (1)
- mczerny (1)
- huijiawu0 (1)
Top Pull Request Authors
- JelmerBot (13)
- lmcinnes (5)
- gclendenning (5)
- chenxinye (4)
- NicoSantamaria (2)
- smartIU (2)
- prodrigues-tdx (2)
- Rhaedonius (2)
- Antobiotics (2)
- divyegala (2)
- axiak (2)
- seberg (2)
- meshari343 (2)
- joaquindas (2)
- cearlefraym (2)
Top Issue Labels
- help wanted (1)
- new feature (1)
- bug (1)
Top Pull Request Labels
Package metadata
- Total packages: 14
-
Total downloads:
- pypi: 1,544,356 last-month
- Total docker downloads: 934,700,138
- Total dependent packages: 133 (may contain duplicates)
- Total dependent repositories: 522 (may contain duplicates)
- Total versions: 101
- Total maintainers: 5
pypi.org: hdbscan
Clustering based on density with variable density clusters
- Homepage: http://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://hdbscan.readthedocs.io/
- Licenses: BSD
- Latest release: 0.8.41 (published 3 months ago)
- Last Synced: 2026-03-20T17:31:13.088Z (6 days ago)
- Versions: 58
- Dependent Packages: 120
- Dependent Repositories: 483
- Downloads: 1,544,356 Last month
- Docker Downloads: 934,700,138
-
Rankings:
- Dependent packages count: 0.173%
- Docker downloads count: 0.295%
- Downloads: 0.477%
- Dependent repos count: 0.648%
- Average: 0.918%
- Stargazers count: 1.451%
- Forks count: 2.463%
- Maintainers (3)
conda-forge.org: hdbscan
- Homepage: http://github.com/scikit-learn-contrib/hdbscan
- Licenses: BSD-3-Clause
- Latest release: 0.8.29 (published over 3 years ago)
- Last Synced: 2026-02-23T09:03:31.226Z (about 1 month ago)
- Versions: 28
- Dependent Packages: 11
- Dependent Repositories: 39
-
Rankings:
- Dependent packages count: 5.484%
- Dependent repos count: 5.809%
- Average: 6.743%
- Forks count: 7.62%
- Stargazers count: 8.059%
spack.io: py-hdbscan
HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. This allows HDBSCAN to find clusters of varying densities (unlike DBSCAN), and be more robust to parameter selection. In practice this means that HDBSCAN returns a good clustering straight away with little or no parameter tuning -- and the primary parameter, minimum cluster size, is intuitive and easy to select. HDBSCAN is ideal for exploratory data analysis; it's a fast and robust algorithm that you can trust to return meaningful clusters (if there are any).
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Licenses: []
- Latest release: 0.8.29 (published almost 3 years ago)
- Last Synced: 2026-03-08T06:32:00.942Z (19 days ago)
- Versions: 2
- Dependent Packages: 2
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Forks count: 4.321%
- Stargazers count: 4.599%
- Average: 9.247%
- Dependent packages count: 28.067%
- Maintainers (1)
nixpkgs-unstable: python314Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/hdbscan/default.nix#L79
- Licenses: BSD-3-Clause
- Latest release: 0.8.41 (published 23 days ago)
- Last Synced: 2026-03-04T09:16:51.740Z (23 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-23.11: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.11/pkgs/development/python-modules/hdbscan/default.nix#L47
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published about 2 months ago)
- Last Synced: 2026-03-04T04:48:23.294Z (23 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
guix: python-hdbscan
High performance implementation of HDBSCAN clustering
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://git.savannah.gnu.org/cgit/guix.git/tree/gnu/packages/machine-learning.scm#n2641
- Licenses: bsd-3
- Latest release: 0.8.41 (published 24 days ago)
- Last Synced: 2026-03-02T20:42:06.147Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-23.05: python310Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.05/pkgs/development/python-modules/hdbscan/default.nix#L59
- Licenses: BSD-3-Clause
- Latest release: 0.8.28 (published about 2 months ago)
- Last Synced: 2026-03-03T05:24:40.894Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-23.05: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.05/pkgs/development/python-modules/hdbscan/default.nix#L59
- Licenses: BSD-3-Clause
- Latest release: 0.8.28 (published about 2 months ago)
- Last Synced: 2026-03-03T08:22:50.096Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-unstable: python313Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/hdbscan/default.nix#L79
- Licenses: BSD-3-Clause
- Latest release: 0.8.41 (published 24 days ago)
- Last Synced: 2026-03-03T09:14:06.700Z (24 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-24.05: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.05/pkgs/development/python-modules/hdbscan/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published about 2 months ago)
- Last Synced: 2026-03-09T02:08:44.564Z (18 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-24.11: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.11/pkgs/development/python-modules/hdbscan/default.nix#L55
- Licenses: BSD-3-Clause
- Latest release: 0.8.38.post1 (published about 2 months ago)
- Last Synced: 2026-03-03T19:15:51.578Z (23 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-23.11: python310Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.11/pkgs/development/python-modules/hdbscan/default.nix#L47
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published about 2 months ago)
- Last Synced: 2026-03-03T22:32:57.030Z (23 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-24.05: python312Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.05/pkgs/development/python-modules/hdbscan/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published about 2 months ago)
- Last Synced: 2026-03-03T23:40:27.712Z (23 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-24.11: python312Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.11/pkgs/development/python-modules/hdbscan/default.nix#L55
- Licenses: BSD-3-Clause
- Latest release: 0.8.38.post1 (published about 2 months ago)
- Last Synced: 2026-03-08T06:18:56.071Z (19 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
Dependencies
- actions/checkout v1 composite
- actions/setup-python v1 composite
- actions/checkout v1 composite
- actions/setup-python v1 composite
- cython >=0.27
- joblib >=1.0
- numpy >=1.20
- scikit-learn >=0.20
- scipy >=1.0
- actions/checkout v1 composite
- actions/setup-python v1 composite
- hdbscan >=0.8.11
- matplotlib >=2.0
- python >=3.5
- scikit-learn >=0.19
- seaborn >=0.8
- sphinx_rtd_theme *