JOSS: https://github.com/scikit-learn-contrib/hdbscan
cluster-analysis clustering clustering-algorithm clustering-evaluation machine-learning machine-learning-algorithms
Score: 33.420115461398865
Last synced: about 20 hours ago
JSON representation
Repository metadata:
A high performance implementation of HDBSCAN clustering.
- Host: GitHub
- URL: https://github.com/scikit-learn-contrib/hdbscan
- Owner: scikit-learn-contrib
- License: bsd-3-clause
- Created: 2015-04-22T13:32:37.000Z (about 11 years ago)
- Default Branch: master
- Last Pushed: 2026-03-27T18:59:08.000Z (about 2 months ago)
- Last Synced: 2026-04-06T10:13:15.984Z (about 1 month ago)
- Topics: cluster-analysis, clustering, clustering-algorithm, clustering-evaluation, machine-learning, machine-learning-algorithms
- Language: Jupyter Notebook
- Homepage: http://hdbscan.readthedocs.io/en/latest/
- Size: 27.8 MB
- Stars: 3,097
- Watchers: 54
- Forks: 532
- Open Issues: 380
-
Metadata Files:
- Readme: README.rst
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Owner metadata:
- Name: scikit-learn-contrib
- Login: scikit-learn-contrib
- Email:
- Kind: organization
- Description: scikit-learn compatible projects
- Website: http://contrib.scikit-learn.org
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/17349883?v=4
- Repositories: 27
- Last Synced at: 2024-03-25T20:03:09.812Z
- Profile URL: https://github.com/scikit-learn-contrib
GitHub Events
Total
- Create event: 1
- Fork event: 25
- Issue comment event: 49
- Issues event: 19
- Member event: 1
- Pull request event: 25
- Pull request review comment event: 3
- Pull request review event: 6
- Push event: 10
- Release event: 2
- Watch event: 198
- Total: 339
Last Year
- Fork event: 10
- Issue comment event: 18
- Issues event: 9
- Pull request event: 10
- Pull request review event: 1
- Push event: 4
- Release event: 1
- Watch event: 102
- Total: 155
Committers metadata
Last synced: about 1 month ago
Total Commits: 901
Total Committers: 100
Avg Commits per committer: 9.01
Development Distribution Score (DDS): 0.294
Commits in past year: 17
Committers in past year: 7
Avg Commits per committer in past year: 2.429
Development Distribution Score (DDS) in past year: 0.706
| Name | Commits | |
|---|---|---|
| Leland McInnes | l****s@g****m | 636 |
| Jelmer Bot | j****t@g****m | 22 |
| neontty | n****y@g****m | 11 |
| Guillaume Lemaitre | g****e@v****u | 10 |
| John Healy | j****l@g****m | 10 |
| Steve Astels | s****s@g****m | 10 |
| gclendenning | 6****g | 9 |
| Greg Demand | 6****d | 9 |
| luis261 | l****r@g****e | 8 |
| jc-healy | j****y@g****m | 8 |
| Guillaume Ansanay-Alex | g****y@g****m | 7 |
| Dicksonchin93 | c****n@d****m | 7 |
| João Matias | j****s@t****m | 7 |
| gclen | g****g@u****t | 7 |
| Bruno Alano | b****o@n****r | 7 |
| Matthew Carrigan | r****1@g****m | 6 |
| Rhaedonius | l****o@a****n | 5 |
| cmalzer | c****r@g****e | 5 |
| Sebastian Berg | s****b@n****m | 5 |
| Ryan Helinski | r****i@g****m | 4 |
| the null | 4****e | 4 |
| m-dz | m****c@g****m | 4 |
| Nathaniel Saul | n****t@s****m | 4 |
| Lukas Großberger | c****e@g****z | 4 |
| Adam Lugowski | a****i@g****m | 3 |
| Andrija Jovanovic | 1****8 | 3 |
| areeh | a****t@i****m | 3 |
| cmalzer | c****r@g****e | 3 |
| gr | g****y@g****m | 3 |
| K.-Michael Aye | m****e | 2 |
| and 70 more... | ||
Issue and Pull Request metadata
Last synced: about 2 months ago
Total issues: 159
Total pull requests: 71
Average time to close issues: 6 months
Average time to close pull requests: 3 months
Total issue authors: 152
Total pull request authors: 36
Average comments per issue: 4.04
Average comments per pull request: 1.39
Merged pull request: 45
Bot issues: 0
Bot pull requests: 0
Past year issues: 11
Past year pull requests: 12
Past year average time to close issues: N/A
Past year average time to close pull requests: about 2 months
Past year issue authors: 11
Past year pull request authors: 7
Past year average comments per issue: 0.55
Past year average comments per pull request: 0.33
Past year merged pull request: 6
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- notluquis (3)
- lucetka (2)
- katzurik (2)
- divya-agrawal3103 (2)
- KukumavMozolo (2)
- jakirkham (2)
- aleereza (1)
- mdagost (1)
- xavgit (1)
- OlgaGKononova (1)
- SergioG-M (1)
- PezAmaury (1)
- jgonggrijp (1)
- mczerny (1)
- huijiawu0 (1)
Top Pull Request Authors
- JelmerBot (13)
- gclendenning (6)
- lmcinnes (5)
- chenxinye (4)
- NicoSantamaria (2)
- smartIU (2)
- prodrigues-tdx (2)
- Rhaedonius (2)
- Antobiotics (2)
- divyegala (2)
- axiak (2)
- seberg (2)
- meshari343 (2)
- joaquindas (2)
- cearlefraym (2)
Top Issue Labels
- new feature (1)
- help wanted (1)
- bug (1)
Top Pull Request Labels
Package metadata
- Total packages: 14
-
Total downloads:
- conda: 3,245,412 total
- pypi: 1,705,728 last-month
- Total docker downloads: 934,700,138
- Total dependent packages: 133 (may contain duplicates)
- Total dependent repositories: 522 (may contain duplicates)
- Total versions: 102
- Total maintainers: 5
pypi.org: hdbscan
Clustering based on density with variable density clusters
- Homepage: http://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://hdbscan.readthedocs.io/
- Licenses: BSD
- Latest release: 0.8.42 (published about 2 months ago)
- Last Synced: 2026-04-08T00:30:58.915Z (about 1 month ago)
- Versions: 59
- Dependent Packages: 120
- Dependent Repositories: 483
- Downloads: 1,705,728 Last month
- Docker Downloads: 934,700,138
-
Rankings:
- Dependent packages count: 0.173%
- Docker downloads count: 0.295%
- Downloads: 0.477%
- Dependent repos count: 0.648%
- Average: 0.918%
- Stargazers count: 1.451%
- Forks count: 2.463%
- Maintainers (3)
conda-forge.org: hdbscan
- Homepage: http://github.com/scikit-learn-contrib/hdbscan
- Licenses: BSD-3-Clause
- Latest release: 0.8.29 (published over 3 years ago)
- Last Synced: 2026-03-23T09:14:27.342Z (about 2 months ago)
- Versions: 28
- Dependent Packages: 11
- Dependent Repositories: 39
- Downloads: 3,245,412 Total
-
Rankings:
- Dependent packages count: 5.484%
- Dependent repos count: 5.809%
- Average: 6.743%
- Forks count: 7.62%
- Stargazers count: 8.059%
spack.io: py-hdbscan
HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. This allows HDBSCAN to find clusters of varying densities (unlike DBSCAN), and be more robust to parameter selection. In practice this means that HDBSCAN returns a good clustering straight away with little or no parameter tuning -- and the primary parameter, minimum cluster size, is intuitive and easy to select. HDBSCAN is ideal for exploratory data analysis; it's a fast and robust algorithm that you can trust to return meaningful clusters (if there are any).
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Licenses: []
- Latest release: 0.8.29 (published almost 3 years ago)
- Last Synced: 2026-03-08T06:32:00.942Z (2 months ago)
- Versions: 2
- Dependent Packages: 2
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Forks count: 4.321%
- Stargazers count: 4.599%
- Average: 9.247%
- Dependent packages count: 28.067%
- Maintainers (1)
nixpkgs-unstable: python314Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/hdbscan/default.nix#L79
- Licenses: BSD-3-Clause
- Latest release: 0.8.41 (published 2 months ago)
- Last Synced: 2026-03-04T09:16:51.740Z (2 months ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-23.11: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.11/pkgs/development/python-modules/hdbscan/default.nix#L47
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published 3 months ago)
- Last Synced: 2026-03-04T04:48:23.294Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-23.05: python310Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.05/pkgs/development/python-modules/hdbscan/default.nix#L59
- Licenses: BSD-3-Clause
- Latest release: 0.8.28 (published 4 months ago)
- Last Synced: 2026-04-03T06:09:06.239Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-24.05: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.05/pkgs/development/python-modules/hdbscan/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published 3 months ago)
- Last Synced: 2026-03-09T02:08:44.564Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
guix: python-hdbscan
High performance implementation of HDBSCAN clustering
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://git.savannah.gnu.org/cgit/guix.git/tree/gnu/packages/machine-learning.scm#n2669
- Licenses: bsd-3
- Latest release: 0.8.41 (published 2 months ago)
- Last Synced: 2026-04-03T07:18:30.127Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-23.05: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.05/pkgs/development/python-modules/hdbscan/default.nix#L59
- Licenses: BSD-3-Clause
- Latest release: 0.8.28 (published 4 months ago)
- Last Synced: 2026-04-03T09:20:52.922Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-unstable: python313Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/hdbscan/default.nix#L79
- Licenses: BSD-3-Clause
- Latest release: 0.8.41 (published 2 months ago)
- Last Synced: 2026-04-03T10:08:29.858Z (about 1 month ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
nixpkgs-24.11: python311Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.11/pkgs/development/python-modules/hdbscan/default.nix#L55
- Licenses: BSD-3-Clause
- Latest release: 0.8.38.post1 (published 3 months ago)
- Last Synced: 2026-03-03T19:15:51.578Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-23.11: python310Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-23.11/pkgs/development/python-modules/hdbscan/default.nix#L47
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published 3 months ago)
- Last Synced: 2026-03-03T22:32:57.030Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-24.05: python312Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.05/pkgs/development/python-modules/hdbscan/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 0.8.33 (published 3 months ago)
- Last Synced: 2026-03-03T23:40:27.712Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
nixpkgs-24.11: python312Packages.hdbscan
Hierarchical Density-Based Spatial Clustering of Applications with Noise, a clustering algorithm with a scikit-learn compatible API
- Homepage: https://github.com/scikit-learn-contrib/hdbscan
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-24.11/pkgs/development/python-modules/hdbscan/default.nix#L55
- Licenses: BSD-3-Clause
- Latest release: 0.8.38.post1 (published 3 months ago)
- Last Synced: 2026-03-08T06:18:56.071Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
Dependencies
- actions/checkout v1 composite
- actions/setup-python v1 composite
- actions/checkout v1 composite
- actions/setup-python v1 composite
- cython >=0.27
- joblib >=1.0
- numpy >=1.20
- scikit-learn >=0.20
- scipy >=1.0
- actions/checkout v1 composite
- actions/setup-python v1 composite
- hdbscan >=0.8.11
- matplotlib >=2.0
- python >=3.5
- scikit-learn >=0.19
- seaborn >=0.8
- sphinx_rtd_theme *