https://github.com/nltk/nltk
machine-learning natural-language-processing nlp nltk python
Score: 36.555979766676984
Last synced: about 12 hours ago
Repository metadata:
NLTK Source
- Host: GitHub
- URL: https://github.com/nltk/nltk
- Owner: nltk
- License: apache-2.0
- Created: 2009-09-07T10:53:58.000Z (over 16 years ago)
- Default Branch: develop
- Last Pushed: 2026-03-24T06:10:45.000Z (5 days ago)
- Last Synced: 2026-03-27T05:59:03.854Z (2 days ago)
- Topics: machine-learning, natural-language-processing, nlp, nltk, python
- Language: Python
- Homepage: https://www.nltk.org
- Size: 339 MB
- Stars: 14,575
- Watchers: 442
- Forks: 3,003
- Open Issues: 277
Metadata Files:
- Readme: README.md
- Changelog: ChangeLog
- Contributing: CONTRIBUTING.md
- License: LICENSE.txt
- Citation: CITATION.cff
- Security: SECURITY.md
- Authors: AUTHORS.md
Owner metadata:
- Name: Natural Language Toolkit
- Login: nltk
- Email:
- Kind: organization
- Description:
- Website: http://nltk.org
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/124114?v=4
- Repositories: 10
- Last Synced at: 2024-03-25T20:23:21.452Z
- Profile URL: https://github.com/nltk
GitHub Events
Total
- Commit comment event: 7
- Create event: 4
- Delete event: 3
- Fork event: 91
- Gollum event: 4
- Issue comment event: 353
- Issues event: 105
- Pull request event: 103
- Pull request review comment event: 64
- Pull request review event: 89
- Push event: 43
- Watch event: 827
- Total: 1693
Last Year
- Commit comment event: 7
- Create event: 4
- Delete event: 3
- Fork event: 53
- Gollum event: 4
- Issue comment event: 270
- Issues event: 73
- Pull request event: 92
- Pull request review comment event: 64
- Pull request review event: 86
- Push event: 42
- Watch event: 540
- Total: 1238
Committers metadata
Last synced: 1 day ago
Total Commits: 13,576
Total Committers: 490
Avg Commits per committer: 27.706
Development Distribution Score (DDS): 0.693
Commits in past year: 263
Committers in past year: 22
Avg Commits per committer in past year: 11.955
Development Distribution Score (DDS) in past year: 0.643
| Name | Email | Commits |
|---|---|---|
| Steven Bird | s****1@g****m | 4165 |
| Edward Loper | e****r@l****u | 2188 |
| Ewan Klein | e****n@g****m | 1477 |
| alvations | a****s@g****m | 789 |
| Dan Garrette | d****e@g****m | 357 |
| Mikhail Korobov | k****4@g****m | 276 |
| Pierpaolo Pantone | 2****o@g****m | 257 |
| Eric Kafe | k****c@g****m | 205 |
| Steven Xu | | 193 |
| Ilia Kurenkov | i****v@g****m | 178 |
| Tom Aarsen | C****v@g****m | 175 |
| Will Roberts | w****s@r****e | 154 |
| Peter Ljunglöf | p****f@h****e | 106 |
| Paul Bone | p****e@c****u | 94 |
| Sumukh Ghodke | s****e@g****m | 91 |
| Dmitrijs Milajevs | d****t@g****m | 91 |
| hoontw | h****w@g****m | 80 |
| Joel Nothman | j****n@s****u | 79 |
| Joseph Frazee | j****e@g****m | 72 |
| nschneid | n****t@g****m | 72 |
| Marcus Uneson | m****n@g****m | 69 |
| purificant | p****t | 66 |
| Mike Recachinas | m****p@v****u | 65 |
| Haejoong Lee | h****g@l****u | 65 |
| Trevor Cohn | t****n@c****u | 64 |
| xim | x****t@a****o | 63 |
| lrnzcig | l****g@g****m | 62 |
| Long Duong | l****9@g****m | 59 |
| Rob Speer | r****r@m****u | 56 |
| Greg Aumann | g****n@g****m | 53 |
| and 460 more... | | |
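The per-committer averages and the all-time Development Distribution Score (DDS) in the committers metadata can be reproduced from the commit counts. The sketch below assumes DDS is 1 minus the top committer's share of all commits; this page does not state that definition, but it is consistent with the figures shown. The function names are illustrative, not part of any tool.

```python
def avg_commits_per_committer(total_commits: int, total_committers: int) -> float:
    """Mean number of commits per committer."""
    return total_commits / total_committers


def development_distribution_score(top_committer_commits: int, total_commits: int) -> float:
    """Assumed definition: 1 minus the share of commits made by the top committer."""
    return 1 - top_committer_commits / total_commits


# Figures taken from the committers metadata and table.
TOTAL_COMMITS = 13_576
TOTAL_COMMITTERS = 490
TOP_COMMITTER_COMMITS = 4_165  # Steven Bird
PAST_YEAR_COMMITS = 263
PAST_YEAR_COMMITTERS = 22

avg_all_time = round(avg_commits_per_committer(TOTAL_COMMITS, TOTAL_COMMITTERS), 3)
avg_past_year = round(avg_commits_per_committer(PAST_YEAR_COMMITS, PAST_YEAR_COMMITTERS), 3)
dds_all_time = round(development_distribution_score(TOP_COMMITTER_COMMITS, TOTAL_COMMITS), 3)

print(avg_all_time, avg_past_year, dds_all_time)
```

The three values round to 27.706, 11.955, and 0.693, matching the listed figures. The past-year DDS (0.643) cannot be rechecked this way because the table does not give a per-committer breakdown for the past year.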
Issue and Pull Request metadata
Last synced: 1 day ago
Total issues: 290
Total pull requests: 286
Average time to close issues: 11 months
Average time to close pull requests: 4 months
Total issue authors: 250
Total pull request authors: 103
Average comments per issue: 3.73
Average comments per pull request: 3.02
Merged pull request: 155
Bot issues: 0
Bot pull requests: 1
Past year issues: 33
Past year pull requests: 79
Past year average time to close issues: 18 days
Past year average time to close pull requests: 30 days
Past year issue authors: 26
Past year pull request authors: 32
Past year average comments per issue: 1.3
Past year average comments per pull request: 3.73
Past year merged pull request: 28
Past year bot issues: 0
Past year bot pull requests: 1
Top Issue Authors
- ekaf (13)
- alvations (12)
- BLKSerene (3)
- mcepl (2)
- hiDevman (2)
- purificant (2)
- TomerYS (2)
- DavidNemeskey (2)
- tomaarsen (2)
- kloczek (2)
- Killpit (2)
- shavetakhepra (2)
- ExplodingCabbage (2)
- asrelo (2)
- LordTT (2)
Top Pull Request Authors
- ekaf (68)
- purificant (24)
- alvations (14)
- tomaarsen (12)
- HyperPS (10)
- Mike014 (6)
- Shazid08 (4)
- WilliamPLaCroix (4)
- Copilot (3)
- smithct2 (3)
- trevorjwood (2)
- antoniomika (2)
- emmanuel-ferdman (2)
- pdeblanc (2)
- GeneralPoxter (2)
Top Issue Labels
- enhancement (11)
- wordnet (9)
- bug (9)
- nltk_data (8)
- corpus (8)
- good first issue (7)
- tokenizer (7)
- SMT (6)
- inactive (5)
- critical (5)
- resolved (5)
- invalid (4)
- tagger (4)
- CI (4)
- nice idea (3)
- pythonic (3)
- documentation (3)
- installation (3)
- tests (2)
- metrics (2)
- pleaseverify (2)
- admin (2)
- internals (2)
- windows related (2)
- parsing (2)
- stanford api (2)
- stem/lemma (2)
- multithread / multiprocessing (2)
- classifier (1)
- language-model (1)
Top Pull Request Labels
- corpus (43)
- tokenizer (34)
- CI (29)
- parsing (19)
- tagger (19)
- metrics (18)
- GUI (13)
- classifier (12)
- stem/lemma (12)
- critical (8)
- enhancement (7)
- admin (6)
- bug (6)
- flag-to-close (5)
- LGTM (5)
- language-model (4)
- internals (4)
- sentiment (4)
- tests (3)
- translate (2)
- nice idea (2)
- awesome-contribution (2)
- documentation (2)
- cluster (2)
- twitter (2)
- wordnet (2)
- cli (1)
- pythonic (1)
- inactive (1)
- dependencies (1)
Package metadata
- Total packages: 19
- Total downloads:
- conda: 380,905 total
- pypi: 57,536,953 last-month
- Total docker downloads: 974,969,708
- Total dependent packages: 1,491 (may contain duplicates)
- Total dependent repositories: 59,006 (may contain duplicates)
- Total versions: 131
- Total maintainers: 7
- Total advisories: 10
pypi.org: nltk
Natural Language Toolkit
- Homepage: https://www.nltk.org/
- Documentation: https://www.nltk.org/
- Licenses: Apache License, Version 2.0
- Latest release: 3.9.4 (published 5 days ago)
- Last Synced: 2026-03-28T03:00:24.516Z (1 day ago)
- Versions: 66
- Dependent Packages: 1,440
- Dependent Repositories: 57,572
- Downloads: 57,536,953 Last month
- Docker Downloads: 974,969,708
- Rankings:
- Dependent packages count: 0.021%
- Dependent repos count: 0.024%
- Downloads: 0.071%
- Average: 0.199%
- Docker downloads count: 0.276%
- Forks count: 0.394%
- Stargazers count: 0.406%
- Maintainers (5)
- Advisories:
- NLTK has a Downloader Path Traversal Vulnerability (AFO) - Arbitrary File Overwrite
- Unauthenticated remote shutdown in nltk.app.wordnet_app
- Improper Neutralization of Input During Web Page Generation ('Cross-site Scripting') in nltk
- Natural Language Toolkit (NLTK) has unbounded recursion in JSONTaggedDecoder.decode_obj() may cause DoS
- NLTK has a Zip Slip Vulnerability
- nltk unsafe deserialization vulnerability
- NLTK Vulnerable to REDoS
- Inefficient Regular Expression Complexity in nltk (word_tokenize, sent_tokenize)
- NLTK Vulnerable to REDoS
- NLTK Vulnerable To Path Traversal
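The "Average" value in a package's rankings appears to be the arithmetic mean of the other ranking percentages listed for that package. This is an inference from the numbers, not something the page states, and the helper name below is illustrative. A minimal check against the pypi.org entry and the alpine-edge py3-nltk entry:

```python
from statistics import fmean


def average_ranking(percentiles: list[float]) -> float:
    """Arithmetic mean of ranking percentages (assumed meaning of 'Average')."""
    return fmean(percentiles)


# Ranking percentages copied from the pypi.org: nltk entry.
pypi_rankings = {
    "dependent_packages": 0.021,
    "dependent_repos": 0.024,
    "downloads": 0.071,
    "docker_downloads": 0.276,
    "forks": 0.394,
    "stargazers": 0.406,
}

# Ranking percentages copied from the alpine-edge: py3-nltk entry.
alpine_edge_rankings = {
    "dependent_repos": 0.0,
    "forks": 0.732,
    "stargazers": 1.331,
    "dependent_packages": 3.393,
}

pypi_average = round(average_ranking(list(pypi_rankings.values())), 3)
alpine_edge_average = round(average_ranking(list(alpine_edge_rankings.values())), 3)

print(pypi_average, alpine_edge_average)
```

These come out to 0.199 and 1.364, matching the listed averages. Entries that list only two 0.0% rankings yet show "Average: 100%" do not follow from the visible values, so some components are presumably not displayed there.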
alpine-v3.18: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r1 (published almost 3 years ago)
- Last Synced: 2026-03-18T15:37:55.204Z (11 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 0.539%
- Forks count: 0.828%
- Stargazers count: 1.327%
- Maintainers (1)
alpine-v3.18: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r1 (published almost 3 years ago)
- Last Synced: 2026-03-08T20:46:37.846Z (20 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 0.539%
- Forks count: 0.828%
- Stargazers count: 1.327%
- Maintainers (1)
alpine-edge: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.3-r0 (published about 1 month ago)
- Last Synced: 2026-03-25T19:08:48.280Z (4 days ago)
- Versions: 10
- Dependent Packages: 2
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Forks count: 0.732%
- Stargazers count: 1.331%
- Average: 1.364%
- Dependent packages count: 3.393%
- Maintainers (1)
conda-forge.org: nltk
- Homepage: http://nltk.org/
- Licenses: Apache-2.0
- Latest release: 3.6.7 (published about 4 years ago)
- Last Synced: 2026-03-01T15:23:54.176Z (28 days ago)
- Versions: 15
- Dependent Packages: 43
- Dependent Repositories: 717
- Rankings:
- Dependent repos count: 0.948%
- Dependent packages count: 1.632%
- Average: 1.789%
- Forks count: 2.106%
- Stargazers count: 2.471%
alpine-edge: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.3-r0 (published about 1 month ago)
- Last Synced: 2026-03-25T19:07:45.034Z (4 days ago)
- Versions: 8
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Forks count: 0.802%
- Stargazers count: 1.369%
- Average: 4.076%
- Dependent packages count: 14.133%
- Maintainers (1)
alpine-v3.17: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.7-r1 (published over 3 years ago)
- Last Synced: 2026-03-03T13:52:43.944Z (26 days ago)
- Versions: 1
- Dependent Packages: 2
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Forks count: 0.803%
- Stargazers count: 1.336%
- Average: 5.273%
- Dependent packages count: 18.951%
- Maintainers (1)
proxy.golang.org: github.com/nltk/nltk
- Homepage:
- Documentation: https://pkg.go.dev/github.com/nltk/nltk#section-documentation
- Licenses: apache-2.0
- Latest release: v3.9.2+incompatible (published 6 months ago)
- Last Synced: 2026-03-16T19:42:27.683Z (13 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent packages count: 5.15%
- Average: 5.323%
- Dependent repos count: 5.496%
anaconda.org: nltk
NLTK has been called a wonderful tool for teaching and working in computational linguistics using Python and an amazing library to play with natural language.
- Homepage: https://www.nltk.org
- Licenses: Apache-2.0
- Latest release: 3.9.3 (published 26 days ago)
- Last Synced: 2026-03-03T00:05:25.183Z (26 days ago)
- Versions: 18
- Dependent Packages: 4
- Dependent Repositories: 717
- Downloads: 380,905 Total
- Rankings:
- Dependent repos count: 5.713%
- Forks count: 6.081%
- Stargazers count: 6.787%
- Average: 10.037%
- Dependent packages count: 21.567%
alpine-v3.22: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.1-r0 (published over 1 year ago)
- Last Synced: 2026-03-27T14:08:48.591Z (1 day ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.23: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.2-r0 (published 6 months ago)
- Last Synced: 2026-03-04T02:17:18.026Z (25 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.20: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r3 (published almost 2 years ago)
- Last Synced: 2026-03-03T13:44:10.577Z (26 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.23: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.2-r0 (published 6 months ago)
- Last Synced: 2026-03-03T17:52:33.530Z (26 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.19: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r2 (published over 2 years ago)
- Last Synced: 2026-03-03T13:49:33.444Z (26 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.21: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.1-r0 (published over 1 year ago)
- Last Synced: 2026-03-27T14:07:32.254Z (1 day ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.22: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.1-r0 (published over 1 year ago)
- Last Synced: 2026-03-01T01:13:32.520Z (28 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.20: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r3 (published almost 2 years ago)
- Last Synced: 2026-03-03T13:43:42.756Z (26 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.19: py3-nltk-pyc
Precompiled Python bytecode for py3-nltk
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.8.1-r2 (published over 2 years ago)
- Last Synced: 2026-03-02T15:28:05.844Z (27 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
alpine-v3.21: py3-nltk
Natural Language Toolkit
- Homepage: https://github.com/nltk/nltk
- Licenses: Apache-2.0
- Latest release: 3.9.1-r0 (published over 1 year ago)
- Last Synced: 2026-03-01T01:19:13.687Z (28 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
Dependencies
- actions/checkout v3 composite
- citation-file-format/cffconvert-github-action 2.0.0 composite
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/setup-java v3 composite
- actions/setup-python v3 composite
- pre-commit/action v2.0.3 composite
- actions/labeler v4 composite
- click *
- gensim >=4.0.0
- markdown-it-py *
- matplotlib *
- mdit-plain *
- mdit-py-plugins *
- pytest *
- pytest-mock *
- pytest-xdist *
- pyyaml *
- regex *
- scikit-learn *
- tqdm *
- twython *
- pylint * test
- pytest >=6.0.1 test
- pytest-cov >=2.10.1 test
- pytest-mock * test
- tox * test
- click *
- joblib *
- regex >=2021.8.3
- tqdm *