JOSS: https://github.com/scikit-learn-contrib/category_encoders
Score: 25.12641731058571
Last synced: 1 day ago
JSON representation
Repository metadata:
A library of sklearn compatible categorical variable encoders
- Host: GitHub
- URL: https://github.com/scikit-learn-contrib/category_encoders
- Owner: scikit-learn-contrib
- License: bsd-3-clause
- Created: 2015-11-29T19:32:37.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2026-03-01T12:51:08.000Z (2 months ago)
- Last Synced: 2026-04-14T14:04:34.766Z (30 days ago)
- Language: Python
- Homepage: http://contrib.scikit-learn.org/category_encoders/
- Size: 43.3 MB
- Stars: 2,488
- Watchers: 35
- Forks: 409
- Open Issues: 41
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.md
- Code of conduct: CODE_OF_CONDUCT.md
Owner metadata:
- Name: scikit-learn-contrib
- Login: scikit-learn-contrib
- Email:
- Kind: organization
- Description: scikit-learn compatible projects
- Website: http://contrib.scikit-learn.org
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/17349883?v=4
- Repositories: 27
- Last Synced at: 2024-03-25T20:03:09.812Z
- Profile URL: https://github.com/scikit-learn-contrib
GitHub Events
Total
- Create event: 7
- Delete event: 2
- Fork event: 13
- Issue comment event: 19
- Issues event: 18
- Pull request event: 18
- Pull request review comment event: 7
- Pull request review event: 2
- Push event: 11
- Release event: 4
- Watch event: 71
- Total: 172
Last Year
- Create event: 3
- Delete event: 2
- Fork event: 6
- Issue comment event: 4
- Issues event: 1
- Pull request event: 7
- Pull request review comment event: 7
- Pull request review event: 2
- Push event: 2
- Release event: 1
- Watch event: 30
- Total: 65
Committers metadata
Last synced: 28 days ago
Total Commits: 825
Total Committers: 71
Avg Commits per committer: 11.62
Development Distribution Score (DDS): 0.816
Commits in past year: 9
Committers in past year: 4
Avg Commits per committer in past year: 2.25
Development Distribution Score (DDS) in past year: 0.667
| Name | Commits | |
|---|---|---|
| Jan Motl | j****n@m****s | 152 |
| jcastaldo08 | j****8@g****m | 100 |
| Will McGinnis | w****l@p****m | 96 |
| paul | p****r@w****e | 63 |
| SLLiu | s****6@1****m | 60 |
| florian | d****n@a****h | 34 |
| Will McGinnis | w****l@p****m | 32 |
| Carlos Mougan | c****n@g****m | 26 |
| Lisa | l****l@g****m | 24 |
| Ben Reiniger | 4****r | 20 |
| PaulWestenthanner | p****l@w****v | 19 |
| florian | c****t@a****h | 16 |
| joshua.dunn | j****n@e****m | 15 |
| makrobios | b****h@g****m | 15 |
| Will McGinnis | w****6 | 13 |
| JaimeArboleda | j****a@g****m | 10 |
| bkhant1 | b****n@g****m | 9 |
| Gleb Levitski | 3****v | 7 |
| anjum | a****8@g****m | 7 |
| hhy | h****y@1****m | 7 |
| Nicholas Bollweg | n****g@g****m | 6 |
| david26694 | d****4@g****m | 6 |
| taowenwu | 7****9@q****m | 6 |
| Chapman Siu | c****u@g****m | 5 |
| Mavs | m****7@g****m | 5 |
| Rishoban | r****7@g****m | 5 |
| Cameron Davison | c****n@n****m | 4 |
| John Hopfensperger | 4****h | 4 |
| Gijsbers | p****s@t****l | 4 |
| Dennis O'Brien | d****o@z****m | 3 |
| and 41 more... | ||
Issue and Pull Request metadata
Last synced: about 1 month ago
Total issues: 101
Total pull requests: 77
Average time to close issues: over 1 year
Average time to close pull requests: 4 months
Total issue authors: 84
Total pull request authors: 31
Average comments per issue: 3.43
Average comments per pull request: 1.74
Merged pull request: 55
Bot issues: 0
Bot pull requests: 4
Past year issues: 3
Past year pull requests: 12
Past year average time to close issues: 7 months
Past year average time to close pull requests: 2 months
Past year issue authors: 3
Past year pull request authors: 7
Past year average comments per issue: 1.67
Past year average comments per pull request: 0.92
Past year merged pull request: 4
Past year bot issues: 0
Past year bot pull requests: 2
Top Issue Authors
- PaulWestenthanner (8)
- janmotl (4)
- bmreiniger (3)
- wdm0006 (2)
- eddietaylor (2)
- JoshuaC3 (2)
- willsthompson (2)
- tvdboom (2)
- DZIMDZEM (1)
- CoteDave (1)
- PraveshKoirala (1)
- TobiasSackmannDacoso (1)
- euisuk-chung (1)
- nexusme (1)
- iuiu34 (1)
Top Pull Request Authors
- PaulWestenthanner (26)
- dependabot[bot] (4)
- bmreiniger (4)
- wdm0006 (4)
- glevv (4)
- fullflu (2)
- marekschneider (2)
- dennisobrien (2)
- bkhant1 (2)
- nercisla (2)
- Jordanbarker (2)
- s-banach (2)
- tvdboom (2)
- bollwyvl (2)
- aadimaxi (1)
Top Issue Labels
- enhancement (22)
- good first issue (8)
- bug (8)
- help wanted (7)
- non-reproducible (6)
- question (3)
- discussion (3)
- documentation (2)
- wontfix (2)
Top Pull Request Labels
- dependencies (4)
- python (4)
Package metadata
- Total packages: 5
-
Total downloads:
- conda: 455,000 total
- pypi: 12 last-month
- Total dependent packages: 9 (may contain duplicates)
- Total dependent repositories: 25 (may contain duplicates)
- Total versions: 27
- Total maintainers: 2
conda-forge.org: category_encoders
A set of scikit-learn-style transformers for encoding categorical variables into numeric with different techniques. While ordinal, one-hot, and hashing encoders have similar equivalents in the existing scikit-learn version, the transformers in this library all share a few useful properties: - First-class support for pandas dataframes as an input (and optionally as output) - Can explicitly configure which columns in the data are encoded by name or index, or infer non-numeric columns regardless of input type - Can drop any columns with very low variance based on training set optionally - Portability: train a transformer on data, pickle it, reuse it later and get the same thing out. - Full compatibility with sklearn pipelines, input an array-like dataset like any other transformer
- Homepage: https://github.com/scikit-learn-contrib/category_encoders
- Licenses: BSD-3-Clause
- Latest release: 2.5.0 (published almost 4 years ago)
- Last Synced: 2026-03-23T03:13:08.074Z (about 2 months ago)
- Versions: 16
- Dependent Packages: 7
- Dependent Repositories: 12
- Downloads: 444,805 Total
-
Rankings:
- Dependent packages count: 8.011%
- Forks count: 8.505%
- Stargazers count: 8.535%
- Average: 8.798%
- Dependent repos count: 10.142%
pypi.org: category-encoders-dev
A collection sklearn transformers to encode categorical variables as numeric
- Homepage: https://github.com/scikit-learn-contrib/category_encoders
- Documentation: https://category-encoders-dev.readthedocs.io/
- Licenses: BSD
- Latest release: 2.2.2.post2021 (published over 4 years ago)
- Last Synced: 2026-04-14T14:01:45.619Z (30 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 1
- Downloads: 12 Last month
-
Rankings:
- Stargazers count: 1.473%
- Forks count: 2.661%
- Dependent packages count: 7.306%
- Average: 11.524%
- Dependent repos count: 22.077%
- Downloads: 24.101%
- Maintainers (1)
anaconda.org: category_encoders
A set of scikit-learn-style transformers for encoding categorical variables into numeric with different techniques. While ordinal, one-hot, and hashing encoders have similar equivalents in the existing scikit-learn version, the transformers in this library all share a few useful properties: - First-class support for pandas dataframes as an input (and optionally as output) - Can explicitly configure which columns in the data are encoded by name or index, or infer non-numeric columns regardless of input type - Can drop any columns with very low variance based on training set optionally - Portability: train a transformer on data, pickle it, reuse it later and get the same thing out. - Full compatibility with sklearn pipelines, input an array-like dataset like any other transformer
- Homepage: https://github.com/scikit-learn-contrib/category_encoders
- Licenses: BSD-3-Clause
- Latest release: 2.8.1 (published 10 months ago)
- Last Synced: 2026-03-23T03:13:19.415Z (about 2 months ago)
- Versions: 8
- Dependent Packages: 2
- Dependent Repositories: 12
- Downloads: 10,195 Total
-
Rankings:
- Stargazers count: 17.059%
- Forks count: 17.059%
- Dependent packages count: 20.447%
- Average: 22.82%
- Dependent repos count: 36.715%
nixpkgs-unstable: python314Packages.category-encoders
Library for sklearn compatible categorical variable encoders
- Homepage: https://github.com/scikit-learn-contrib/category_encoders
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/category-encoders/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 2.9.0 (published 4 months ago)
- Last Synced: 2026-04-10T19:01:52.489Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-unstable: python313Packages.category-encoders
Library for sklearn compatible categorical variable encoders
- Homepage: https://github.com/scikit-learn-contrib/category_encoders
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/category-encoders/default.nix#L58
- Licenses: BSD-3-Clause
- Latest release: 2.9.0 (published 4 months ago)
- Last Synced: 2026-03-07T00:31:36.400Z (2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Maintainers (1)
Dependencies
- actions/checkout v2 composite
- ammaraskar/sphinx-action master composite
- peaceiris/actions-gh-pages v3 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- pypa/gh-action-pypi-publish master composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- ammaraskar/sphinx-action master composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- numpy >=1.14.0
- numpydoc *
- pandas >=0.21.1
- patsy >=0.5.1
- scikit-learn >=0.20.0
- scipy >=1.0.0
- sphinx >=3.0
- sphinx_rtd_theme *
- statsmodels >=0.9.0
- unittest2 *
- numpydoc * development
- pytest * development
- sphinx * development
- sphinx_rtd_theme * development
- numpy >=1.14.0
- pandas >=1.0.5
- patsy >=0.5.1
- scikit-learn >=1.0.0
- scipy >=1.0.0
- statsmodels >=0.9.0
- unittest2 *
- numpy >=1.14.0
- pandas >=1.0.5
- patsy >=0.5.1
- scikit-learn >=0.20.0
- scipy >=1.0.0
- statsmodels >=0.9.0