https://github.com/apache/tika
content extraction java metadata tika
Score: 36.53298337360196
Last synced: about 8 hours ago
JSON representation
Repository metadata:
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
- Host: GitHub
- URL: https://github.com/apache/tika
- Owner: apache
- License: apache-2.0
- Created: 2009-05-21T02:12:11.000Z (about 17 years ago)
- Default Branch: main
- Last Pushed: 2026-06-06T00:27:52.000Z (14 days ago)
- Last Synced: 2026-06-06T01:03:32.611Z (14 days ago)
- Topics: content, extraction, java, metadata, tika
- Language: Java
- Homepage: https://tika.apache.org/
- Size: 360 MB
- Stars: 3,793
- Watchers: 88
- Forks: 935
- Open Issues: 57
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGES.txt
- Contributing: CONTRIBUTING.md
- License: LICENSE.txt
- Security: SECURITY.md
- Notice: NOTICE.txt
Owner metadata:
- Name: The Apache Software Foundation
- Login: apache
- Email:
- Kind: organization
- Description:
- Website: https://www.apache.org/
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/47359?v=4
- Repositories: 2832
- Last Synced at: 2025-12-08T20:34:27.907Z
- Profile URL: https://github.com/apache
GitHub Events
Total
- Commit comment event: 2
- Create event: 289
- Delete event: 286
- Fork event: 100
- Issue comment event: 91
- Pull request event: 578
- Pull request review comment event: 10
- Pull request review event: 19
- Push event: 1099
- Watch event: 716
- Total: 3190
Last Year
- Commit comment event: 2
- Create event: 289
- Delete event: 286
- Fork event: 100
- Issue comment event: 91
- Pull request event: 578
- Pull request review comment event: 10
- Pull request review event: 19
- Push event: 1099
- Watch event: 716
- Total: 3190
Committers metadata
Last synced: about 1 month ago
Total Commits: 8,944
Total Committers: 182
Avg Commits per committer: 49.143
Development Distribution Score (DDS): 0.695
Commits in past year: 943
Committers in past year: 23
Avg Commits per committer in past year: 41.0
Development Distribution Score (DDS) in past year: 0.6
| Name | Commits | |
|---|---|---|
| tallison | t****n@a****g | 2726 |
| dependabot[bot] | 4****] | 1532 |
| Tilman Hausherr | t****n@a****g | 966 |
| Jukka Zitting | j****a@a****g | 960 |
| Nick Burch | n****k@a****g | 932 |
| Chris Mattmann | m****n@a****g | 437 |
| David Meikle | d****e@a****g | 137 |
| Tyler Palsulich | t****h@a****g | 121 |
| Michael McCandless | m****d@a****g | 98 |
| Maxim Valyanskiy | m****m@a****g | 67 |
| ThejanW | t****4@c****k | 63 |
| Konstantin Gribov | g****s@g****m | 62 |
| Nicholas DiPiazza | n****a@l****m | 51 |
| Lee | 5****e | 49 |
| Thamme Gowda | t****a@a****g | 41 |
| Kenneth William Krugler | k****r@a****g | 35 |
| Ray Gauss II | r****s@a****g | 34 |
| Hong-Thai Nguyen | t****4@a****g | 30 |
| Lewis John McGibbney | l****y@g****m | 25 |
| Luis Nassif | l****f@g****m | 23 |
| manali | m****1@g****m | 22 |
| Kranthi Kiran GV | k****v@g****m | 20 |
| Rohan Surana | r****0@g****m | 19 |
| bitsgalore | j****f@k****l | 18 |
| Bob Paulin | b****b@b****m | 17 |
| Dmitry Kryukov | d****k | 16 |
| Madhav Sharan | g****v@g****m | 16 |
| Zarana Parekh | z****7@g****m | 15 |
| nprate2 | 4****2 | 14 |
| ashankbehara | a****2@i****u | 14 |
| and 152 more... | ||
Issue and Pull Request metadata
Last synced: 5 months ago
Total issues: 7
Total pull requests: 1,475
Average time to close issues: about 1 hour
Average time to close pull requests: 16 days
Total issue authors: 2
Total pull request authors: 57
Average comments per issue: 0.0
Average comments per pull request: 0.31
Merged pull request: 1,276
Bot issues: 5
Bot pull requests: 1,076
Past year issues: 2
Past year pull requests: 370
Past year average time to close issues: 24 minutes
Past year average time to close pull requests: 1 day
Past year issue authors: 2
Past year pull request authors: 22
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.21
Past year merged pull request: 304
Past year bot issues: 1
Past year bot pull requests: 244
Top Issue Authors
- dependabot[bot] (5)
- tballison (2)
Top Pull Request Authors
- dependabot[bot] (1,076)
- tballison (249)
- dk2k (29)
- nddipiazza (21)
- alexey-pelykh (8)
- bartek (5)
- subbudvk (5)
- jogerh (4)
- ldh5574 (4)
- gastaldi (3)
- lsliwko (3)
- ruwi-next (3)
- rob975 (2)
- Lonzak (2)
- sunluman (2)
Top Issue Labels
- dependencies (5)
- java (1)
Top Pull Request Labels
- dependencies (1,076)
- java (156)
Package metadata
- Total packages: 100
- Total downloads: unknown
- Total docker downloads: 10,484,318,569
- Total dependent packages: 1,511 (may contain duplicates)
- Total dependent repositories: 12,449 (may contain duplicates)
- Total versions: 2,826
- Total advisories: 18
repo1.maven.org: org.apache.tika:tika-core
This is the core Apache Tika™ toolkit library from which all other modules inherit functionality. It also includes the core facades for the Tika API.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-core/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-19T18:14:51.928Z (about 9 hours ago)
- Versions: 72
- Dependent Packages: 766
- Dependent Repositories: 6,036
- Docker Downloads: 1,586,845,805
-
Rankings:
- Docker downloads count: 0.074%
- Dependent packages count: 0.098%
- Dependent repos count: 0.117%
- Average: 3.246%
- Forks count: 6.812%
- Stargazers count: 9.132%
-
Advisories:
- Apache Tika has XXE vulnerability
- Regular expression denial of service in apache tika
- Regular expression denial of service in apache tika
- Allocation of Resources Without Limits or Throttling in Apache Tika
- Allocation of Resources Without Limits or Throttling in Apache Tika
- Moderate severity vulnerability that affects org.apache.tika:tika-core
- Comparison errorr in org.apache.tika:tika-core
- Moderate severity vulnerability that affects org.apache.tika:tika-core
- High severity vulnerability that affects org.apache.tika:tika-core
- Apache Tika allows Java code execution for serialized objects embedded in MATLAB files
- Apache Tika does not properly initialize the XML parser or choose handlers
- Command injection in org.apache.tika:tika-core
- Apache Tika is vulnerable to entity expansions which can lead to a denial of service attack
repo1.maven.org: org.apache.tika:tika-parsers
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-18T21:13:22.190Z (1 day ago)
- Versions: 72
- Dependent Packages: 310
- Dependent Repositories: 4,443
- Docker Downloads: 168,682,978
-
Rankings:
- Dependent repos count: 0.142%
- Dependent packages count: 0.254%
- Docker downloads count: 0.389%
- Average: 3.346%
- Forks count: 6.812%
- Stargazers count: 9.132%
- Advisories:
repo1.maven.org: org.apache.tika:tika-parsers-standard-package
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-package/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-05T09:51:23.677Z (15 days ago)
- Versions: 26
- Dependent Packages: 54
- Dependent Repositories: 234
- Docker Downloads: 10,733,392
-
Rankings:
- Docker downloads count: 0.963%
- Dependent repos count: 1.036%
- Dependent packages count: 1.381%
- Average: 3.869%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-xmp
Converts Tika metadata to XMP
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-xmp/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-03T11:14:37.093Z (17 days ago)
- Versions: 61
- Dependent Packages: 24
- Dependent Repositories: 72
- Docker Downloads: 23,917,677
-
Rankings:
- Docker downloads count: 0.795%
- Dependent repos count: 2.377%
- Dependent packages count: 2.646%
- Average: 4.352%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-serialization
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-serialization/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-01T22:45:47.576Z (18 days ago)
- Versions: 57
- Dependent Packages: 23
- Dependent Repositories: 57
- Docker Downloads: 20,868,736
-
Rankings:
- Docker downloads count: 0.825%
- Dependent packages count: 2.752%
- Dependent repos count: 2.761%
- Average: 4.456%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-java7
Java-7 reliant components, including FileTypeDetector implementations
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-java7/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-14T14:45:55.903Z (about 1 month ago)
- Versions: 59
- Dependent Packages: 26
- Dependent Repositories: 59
- Docker Downloads: 2,325,584
-
Rankings:
- Docker downloads count: 1.496%
- Dependent packages count: 2.554%
- Dependent repos count: 2.7%
- Average: 4.539%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-pdf-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-pdf-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-18T21:13:24.242Z (1 day ago)
- Versions: 29
- Dependent Packages: 13
- Dependent Repositories: 113
- Docker Downloads: 1,011,074,006
-
Rankings:
- Docker downloads count: 0.099%
- Dependent repos count: 1.742%
- Average: 4.573%
- Dependent packages count: 5.06%
- Forks count: 6.822%
- Stargazers count: 9.144%
- Advisories:
repo1.maven.org: org.apache.tika:tika-app
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-app/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-28T07:18:49.617Z (23 days ago)
- Versions: 69
- Dependent Packages: 17
- Dependent Repositories: 160
- Docker Downloads: 665,943
-
Rankings:
- Dependent repos count: 1.342%
- Docker downloads count: 2.024%
- Dependent packages count: 3.655%
- Average: 4.593%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-langdetect
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-30T20:15:41.281Z (20 days ago)
- Versions: 50
- Dependent Packages: 21
- Dependent Repositories: 126
- Docker Downloads: 17,208
-
Rankings:
- Dependent repos count: 1.604%
- Docker downloads count: 2.961%
- Dependent packages count: 2.996%
- Average: 4.701%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-microsoft-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-microsoft-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-28T06:04:00.085Z (23 days ago)
- Versions: 27
- Dependent Packages: 13
- Dependent Repositories: 94
- Docker Downloads: 1,011,074,872
-
Rankings:
- Docker downloads count: 0.098%
- Dependent repos count: 1.981%
- Average: 4.704%
- Dependent packages count: 5.473%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-html-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-18T04:04:42.486Z (about 1 month ago)
- Versions: 27
- Dependent Packages: 12
- Dependent Repositories: 91
- Docker Downloads: 1,011,074,933
-
Rankings:
- Docker downloads count: 0.098%
- Dependent repos count: 2.022%
- Average: 4.712%
- Dependent packages count: 5.473%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-xml-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-xml-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-12T04:01:58.897Z (8 days ago)
- Versions: 27
- Dependent Packages: 10
- Dependent Repositories: 90
- Docker Downloads: 1,011,074,872
-
Rankings:
- Docker downloads count: 0.098%
- Dependent repos count: 2.038%
- Average: 4.932%
- Dependent packages count: 6.583%
- Forks count: 6.812%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-zip-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-zip-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-09T17:01:29.604Z (10 days ago)
- Versions: 27
- Dependent Packages: 10
- Dependent Repositories: 53
- Docker Downloads: 1,011,074,872
-
Rankings:
- Docker downloads count: 0.098%
- Dependent repos count: 2.883%
- Average: 4.986%
- Dependent packages count: 5.981%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-image-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-image-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T10:15:41.299Z (26 days ago)
- Versions: 28
- Dependent Packages: 12
- Dependent Repositories: 48
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.078%
- Average: 5.017%
- Dependent packages count: 5.473%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-langdetect-optimaize
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-optimaize/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-18T19:31:33.566Z (1 day ago)
- Versions: 27
- Dependent Packages: 11
- Dependent Repositories: 67
- Docker Downloads: 20,767,540
-
Rankings:
- Docker downloads count: 0.825%
- Dependent repos count: 2.485%
- Average: 5.052%
- Dependent packages count: 5.981%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-xmp-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-xmp-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-30T04:31:28.185Z (21 days ago)
- Versions: 27
- Dependent Packages: 8
- Dependent Repositories: 53
- Docker Downloads: 1,011,074,892
-
Rankings:
- Docker downloads count: 0.098%
- Dependent repos count: 2.883%
- Average: 5.258%
- Forks count: 6.822%
- Dependent packages count: 7.341%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-pkg-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Status: removed
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-pkg-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-04T12:39:56.578Z (about 2 months ago)
- Versions: 27
- Dependent Packages: 10
- Dependent Repositories: 44
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.242%
- Average: 5.272%
- Dependent packages count: 6.583%
- Forks count: 6.822%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-apple-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-apple-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-26T04:03:13.581Z (25 days ago)
- Versions: 27
- Dependent Packages: 8
- Dependent Repositories: 90
- Docker Downloads: 1,011,073,986
-
Rankings:
- Docker downloads count: 0.099%
- Dependent repos count: 2.038%
- Average: 5.273%
- Forks count: 6.822%
- Dependent packages count: 8.262%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-audiovideo-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-audiovideo-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-26T06:17:37.401Z (25 days ago)
- Versions: 27
- Dependent Packages: 9
- Dependent Repositories: 48
- Docker Downloads: 64,151,040
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.078%
- Average: 5.386%
- Forks count: 6.812%
- Dependent packages count: 7.341%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-font-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-font-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-17T00:19:02.483Z (3 days ago)
- Versions: 26
- Dependent Packages: 8
- Dependent Repositories: 44
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.242%
- Average: 5.603%
- Forks count: 6.812%
- Dependent packages count: 8.262%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-crypto-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-crypto-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-19T12:41:52.708Z (about 1 month ago)
- Versions: 27
- Dependent Packages: 8
- Dependent Repositories: 44
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.242%
- Average: 5.603%
- Forks count: 6.812%
- Dependent packages count: 8.262%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-code-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-code-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-10T00:16:42.241Z (10 days ago)
- Versions: 27
- Dependent Packages: 8
- Dependent Repositories: 44
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.242%
- Average: 5.603%
- Forks count: 6.812%
- Dependent packages count: 8.262%
- Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-news-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-news-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-10T13:15:44.646Z (10 days ago)
- Versions: 27
- Dependent Packages: 8
- Dependent Repositories: 44
- Docker Downloads: 64,144,938
-
Rankings:
- Docker downloads count: 0.568%
- Dependent repos count: 3.242%
- Average: 5.608%
- Forks count: 6.822%
- Dependent packages count: 8.262%
- Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-translate
This is the translate Apache Tika™ toolkit. Translator implementations may depend on web services.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-translate/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-19T05:30:34.526Z (about 1 month ago)
- Versions: 57
- Dependent Packages: 5
- Dependent Repositories: 32
- Docker Downloads: 20,251,923
-
Rankings:
- Docker downloads count: 0.827%
- Dependent repos count: 3.996%
- Average: 6.418%
- Forks count: 6.812%
- Stargazers count: 9.132%
- Dependent packages count: 11.323%
repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-11T13:51:46.522Z (9 days ago)
- Versions: 28
- Dependent Packages: 3
- Dependent Repositories: 43
- Docker Downloads: 32,367,042
-
Rankings:
- Docker downloads count: 0.669%
- Dependent repos count: 3.306%
- Forks count: 6.822%
- Average: 7.455%
- Stargazers count: 9.144%
- Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-parser-sqlite3-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/tika-parsers/tika-parsers-extended/tika-parser-sqlite3-module/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-module/
- Licenses: Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-05-29T12:47:38.792Z (22 days ago)
- Versions: 26
- Dependent Packages: 5
- Dependent Repositories: 49
-
Rankings:
- Dependent repos count: 3.038%
- Forks count: 6.822%
- Average: 7.582%
- Stargazers count: 9.144%
- Dependent packages count: 11.323%
repo1.maven.org: org.apache.tika:tika-parser-html-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.9.4 (published about 1 year ago)
- Last Synced: 2026-05-22T04:04:07.588Z (29 days ago)
- Versions: 18
- Dependent Packages: 14
- Dependent Repositories: 2
- Docker Downloads: 30,276,116
-
Rankings:
- Docker downloads count: 0.711%
- Dependent packages count: 5.473%
- Forks count: 6.822%
- Average: 7.629%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-batch
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-batch/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-07T07:16:09.948Z (13 days ago)
- Versions: 55
- Dependent Packages: 4
- Dependent Repositories: 12
- Docker Downloads: 616,888
-
Rankings:
- Docker downloads count: 2.039%
- Forks count: 6.822%
- Dependent repos count: 6.963%
- Average: 7.744%
- Stargazers count: 9.144%
- Dependent packages count: 13.753%
repo1.maven.org: org.apache.tika:tika-parser-digest-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-digest-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-12T04:00:34.053Z (8 days ago)
- Versions: 26
- Dependent Packages: 8
- Dependent Repositories: 2
- Docker Downloads: 31,777,847
-
Rankings:
- Docker downloads count: 0.71%
- Forks count: 6.822%
- Dependent packages count: 7.341%
- Average: 8.002%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-emitter-fs
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-fs/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-03T19:31:33.709Z (16 days ago)
- Versions: 26
- Dependent Packages: 8
- Dependent Repositories: 2
- Docker Downloads: 20,240,607
-
Rankings:
- Docker downloads count: 0.828%
- Forks count: 6.822%
- Dependent packages count: 7.341%
- Average: 8.026%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-server-core
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-core/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-12T04:00:30.986Z (8 days ago)
- Versions: 26
- Dependent Packages: 4
- Dependent Repositories: 3
- Docker Downloads: 20,238,617
-
Rankings:
- Docker downloads count: 0.828%
- Forks count: 6.812%
- Average: 8.838%
- Stargazers count: 9.132%
- Dependent repos count: 13.663%
- Dependent packages count: 13.753%
repo1.maven.org: org.apache.tika:tika-langdetect-tika
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-tika/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-10T03:33:20.729Z (10 days ago)
- Versions: 26
- Dependent Packages: 1
- Dependent Repositories: 53
- Docker Downloads: 946,929,633
-
Rankings:
- Docker downloads count: 0.11%
- Dependent repos count: 2.883%
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 10.339%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-httpclient-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-httpclient-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-27T08:36:19.267Z (24 days ago)
- Versions: 26
- Dependent Packages: 7
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Dependent packages count: 8.262%
- Stargazers count: 9.132%
- Average: 11.213%
- Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-eval-core
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-core/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-02T21:16:24.445Z (17 days ago)
- Versions: 26
- Dependent Packages: 4
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 11.428%
- Dependent packages count: 13.753%
- Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-langdetect-test-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-test-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-12T05:02:12.777Z (8 days ago)
- Versions: 25
- Dependent Packages: 6
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent packages count: 9.56%
- Average: 11.543%
- Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-parent
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parent/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T05:18:22.941Z (26 days ago)
- Versions: 71
- Dependent Packages: 2
- Dependent Repositories: 7
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 9.173%
- Average: 12.014%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-fetcher-s3
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-s3/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-14T08:23:34.699Z (6 days ago)
- Versions: 25
- Dependent Packages: 3
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 12.323%
- Dependent repos count: 15.993%
- Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-s3
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-s3/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-18T04:46:19.481Z (2 days ago)
- Versions: 25
- Dependent Packages: 3
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 12.323%
- Dependent repos count: 15.993%
- Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-emitter-s3
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-s3/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-04T16:46:25.403Z (15 days ago)
- Versions: 27
- Dependent Packages: 3
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 12.323%
- Dependent repos count: 15.993%
- Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-parser-nlp-module
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-nlp-module/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-06T02:16:14.299Z (14 days ago)
- Versions: 27
- Dependent Packages: 2
- Dependent Repositories: 4
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 12.011%
- Average: 12.723%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-parser-sqlite3-package
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-package/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-06T19:31:20.729Z (13 days ago)
- Versions: 24
- Dependent Packages: 2
- Dependent Repositories: 3
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 13.131%
- Dependent repos count: 13.663%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-solr
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-solr/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-12T15:20:55.633Z (7 days ago)
- Versions: 25
- Dependent Packages: 3
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 13.486%
- Dependent packages count: 17.332%
- Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-emitter-solr
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-solr/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-02T07:00:17.957Z (18 days ago)
- Versions: 26
- Dependent Packages: 3
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 13.486%
- Dependent packages count: 17.332%
- Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-parser-scientific-package
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-scientific-package/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-31T01:16:29.744Z (20 days ago)
- Versions: 24
- Dependent Packages: 1
- Dependent Repositories: 8
-
Rankings:
- Forks count: 6.822%
- Dependent repos count: 8.595%
- Stargazers count: 9.144%
- Average: 14.324%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-eval
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-16T16:31:26.581Z (3 days ago)
- Versions: 49
- Dependent Packages: 1
- Dependent Repositories: 6
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Dependent repos count: 9.917%
- Average: 14.648%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-langdetect-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-05-22T21:02:53.741Z (28 days ago)
- Versions: 1
- Dependent Packages: 4
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Dependent packages count: 13.421%
- Average: 14.685%
- Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-transcribe-aws
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-transcribe-aws/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-02T16:04:57.098Z (17 days ago)
- Versions: 25
- Dependent Packages: 2
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 14.876%
- Dependent repos count: 20.645%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-parser-jdbc-commons
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-jdbc-commons/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-16T20:46:41.378Z (3 days ago)
- Versions: 27
- Dependent Packages: 2
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 14.876%
- Dependent repos count: 20.645%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-emitter-opensearch
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-opensearch/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T03:45:35.399Z (26 days ago)
- Versions: 25
- Dependent Packages: 2
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 14.882%
- Dependent repos count: 20.645%
- Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-dl
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://maven.apache.org
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-dl/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-17T22:32:11.422Z (2 days ago)
- Versions: 47
- Dependent Packages: 1
- Dependent Repositories: 4
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 12.011%
- Average: 15.178%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-18T11:02:11.204Z (1 day ago)
- Versions: 63
- Dependent Packages: 0
- Dependent Repositories: 8
- Docker Downloads: 13,306
-
Rankings:
- Docker downloads count: 3.067%
- Forks count: 6.822%
- Dependent repos count: 8.595%
- Stargazers count: 9.144%
- Average: 15.503%
- Dependent packages count: 49.885%
- Advisories:
repo1.maven.org: org.apache.tika:tika-fuzzing
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fuzzing/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-29T20:31:58.785Z (21 days ago)
- Versions: 36
- Dependent Packages: 1
- Dependent Repositories: 3
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Dependent repos count: 13.663%
- Average: 15.585%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-jdbc
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-jdbc/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-10T15:30:35.886Z (9 days ago)
- Versions: 26
- Dependent Packages: 1
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
- Average: 16.173%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-fetcher-http
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-http/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T15:58:11.735Z (25 days ago)
- Versions: 26
- Dependent Packages: 1
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
- Average: 16.173%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server-standard
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-standard/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-01T17:15:27.697Z (18 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 2
- Docker Downloads: 20,238,617
-
Rankings:
- Docker downloads count: 0.828%
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
- Average: 16.534%
- Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-async-cli
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-async-cli/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-03T01:47:29.495Z (17 days ago)
- Versions: 17
- Dependent Packages: 2
- Dependent Repositories: 0
- Docker Downloads: 9
-
Rankings:
- Forks count: 5.227%
- Stargazers count: 7.527%
- Average: 16.774%
- Dependent packages count: 22.361%
- Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-kafka
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-kafka/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-12T14:12:11.540Z (8 days ago)
- Versions: 18
- Dependent Packages: 2
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.628%
- Stargazers count: 7.48%
- Average: 16.862%
- Dependent packages count: 22.361%
- Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-langdetect-lingo24
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-lingo24/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-24T14:31:14.296Z (27 days ago)
- Versions: 27
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 17.33%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-jdbc
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-jdbc/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-18T14:45:38.270Z (1 day ago)
- Versions: 17
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 17.33%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-gcs
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Status: removed
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-gcs/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-05T19:06:07.485Z (about 2 months ago)
- Versions: 24
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Average: 17.33%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-gcs
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-gcs/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-21T16:18:07.583Z (29 days ago)
- Versions: 24
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-age-recogniser
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://maven.apache.org
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-age-recogniser/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-06T23:45:35.542Z (13 days ago)
- Versions: 27
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-langdetect-mitll-text
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-mitll-text/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-04T16:32:03.433Z (15 days ago)
- Versions: 29
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server-client
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-client/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-13T11:49:22.762Z (7 days ago)
- Versions: 27
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-bundle-standard
OSGi bundle that contains the tika-parsers-standard component and all its upstream dependencies that aren't OSGI bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-standard/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-19T11:48:51.741Z (about 1 month ago)
- Versions: 26
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-fetcher-gcs
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-gcs/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-17T13:48:37.456Z (3 days ago)
- Versions: 24
- Dependent Packages: 1
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.336%
- Dependent repos count: 20.645%
- Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-example
This module contains examples of how to use Apache Tika.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-example/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-13T20:03:19.287Z (about 1 month ago)
- Versions: 58
- Dependent Packages: 0
- Dependent Repositories: 20
-
Rankings:
- Dependent repos count: 5.315%
- Forks count: 6.822%
- Stargazers count: 9.144%
- Average: 17.792%
- Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-detectors
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detectors/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-26T18:20:40.080Z (24 days ago)
- Versions: 17
- Dependent Packages: 1
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.227%
- Stargazers count: 7.527%
- Average: 19.183%
- Dependent repos count: 31.98%
- Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-bom
Apache Tika Bill of Materials
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bom/
- Licenses: Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-08T12:01:13.508Z (about 1 month ago)
- Versions: 21
- Dependent Packages: 0
- Dependent Repositories: 2
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 15.993%
- Average: 20.461%
- Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-eval-app
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Status: removed
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-app/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-05-04T22:02:27.421Z (about 2 months ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.812%
- Stargazers count: 9.132%
- Dependent repos count: 20.645%
- Average: 21.618%
- Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-nlp
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://maven.apache.org
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-nlp/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 1.28.5 (published almost 4 years ago)
- Last Synced: 2026-05-11T00:16:32.726Z (about 1 month ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 1
-
Rankings:
- Forks count: 6.822%
- Stargazers count: 9.144%
- Dependent repos count: 20.645%
- Average: 21.624%
- Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-parsers-standard-modules
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-modules/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-02T17:17:35.349Z (17 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-emitters
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitters/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-22T01:31:38.207Z (29 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-package
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-package/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-10T10:31:17.965Z (10 days ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic-modules
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic-modules/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-06-11T16:46:37.423Z (8 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-05T11:13:44.539Z (15 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-04T15:31:07.428Z (15 days ago)
- Versions: 29
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-classic
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-classic/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-05-22T08:45:42.187Z (29 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-eval
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-eval/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-21T16:50:36.496Z (29 days ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundle-classic
OSGi bundle that contains the tika-parsers-classic component and all its upstream dependencies that aren't OSGI bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-classic/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-05-17T21:32:57.674Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-06-09T19:31:30.121Z (10 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-opensearch-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-opensearch-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-05-03T03:31:07.393Z (about 2 months ago)
- Versions: 24
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-15T20:46:40.769Z (4 days ago)
- Versions: 27
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-emitter-az-blob
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-az-blob/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-22T01:48:56.632Z (29 days ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-az-blob
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-az-blob/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-15T13:46:46.198Z (about 1 month ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-ml
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-ml/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.1 (published about 1 month ago)
- Last Synced: 2026-06-08T07:56:18.243Z (12 days ago)
- Versions: 28
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-advanced
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: http://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-advanced/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 2.0.0-ALPHA (published over 5 years ago)
- Last Synced: 2026-06-13T10:42:07.513Z (7 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-s3-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-s3-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-02T11:16:42.184Z (18 days ago)
- Versions: 25
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundles
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundles/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-20T04:03:27.628Z (about 1 month ago)
- Versions: 27
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-fetcher-az-blob
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-az-blob/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-08T00:31:25.149Z (about 1 month ago)
- Versions: 20
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-solr-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-solr-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-18T02:15:25.334Z (2 days ago)
- Versions: 25
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterators
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterators/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T21:02:41.573Z (25 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-27T07:32:52.885Z (24 days ago)
- Versions: 25
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-standard
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-25T15:35:04.577Z (25 days ago)
- Versions: 27
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.513%
- Stargazers count: 7.827%
- Average: 23.545%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-kafka-integration-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-kafka-integration-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-20T00:19:31.710Z (about 1 month ago)
- Versions: 18
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.58%
- Stargazers count: 8.488%
- Average: 23.727%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporter-fs-status
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporter-fs-status/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-11T17:08:13.077Z (about 1 month ago)
- Versions: 18
- Dependent Packages: 1
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.58%
- Stargazers count: 8.488%
- Average: 23.727%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporters
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporters/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-06-01T15:01:34.449Z (19 days ago)
- Versions: 18
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.58%
- Stargazers count: 8.488%
- Average: 23.727%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-resource-loading-tests
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-resource-loading-tests/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.2.3 (published 9 months ago)
- Last Synced: 2026-06-15T13:59:16.680Z (5 days ago)
- Versions: 18
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Forks count: 5.58%
- Stargazers count: 8.488%
- Average: 23.727%
- Dependent repos count: 31.98%
- Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-detector-siegfried
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detector-siegfried/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-30T14:32:00.263Z (21 days ago)
- Versions: 17
- Dependent Packages: 1
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 31.98%
- Average: 31.989%
- Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-fetchers
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
- Homepage: https://tika.apache.org/
- Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetchers/
- Licenses: Apache-2.0,Apache-2.0
- Latest release: 3.3.0 (published 3 months ago)
- Last Synced: 2026-05-26T06:30:57.655Z (25 days ago)
- Versions: 26
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 31.98%
- Average: 40.42%
- Dependent packages count: 48.86%
Dependencies
- org.apache.logging.log4j:log4j-core ${log4j2.version}
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
- org.apache.tika:tika-batch 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
- org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
- org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
- org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
- org.slf4j:jcl-over-slf4j
- org.apache.tika:tika-age-recogniser 2.4.2-SNAPSHOT
- org.apache.tika:tika-bundle-standard 2.4.2-SNAPSHOT
- org.apache.tika:tika-core 2.4.2-SNAPSHOT
- org.apache.tika:tika-dl 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-gcs 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-opensearch 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-s3 2.4.2-SNAPSHOT
- org.apache.tika:tika-emitter-solr 2.4.2-SNAPSHOT
- org.apache.tika:tika-eval-core 2.4.2-SNAPSHOT
- org.apache.tika:tika-fetcher-gcs 2.4.2-SNAPSHOT
- org.apache.tika:tika-fetcher-http 2.4.2-SNAPSHOT
- org.apache.tika:tika-fetcher-s3 2.4.2-SNAPSHOT
- org.apache.tika:tika-fuzzing 2.4.2-SNAPSHOT
- org.apache.tika:tika-httpclient-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-java7 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-lingo24 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-mitll-text 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-opennlp 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-test-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-langdetect-tika 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-advancedmedia-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-apple-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-audiovideo-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-cad-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-code-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-crypto-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-digest-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-font-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-html-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-html-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-image-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-jdbc-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-mail-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-mail-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-microsoft-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-miscoffice-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-news-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-nlp-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-ocr-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-pdf-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-pkg-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-scientific-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-scientific-package 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-sqlite3-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-sqlite3-package 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-text-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-xml-module 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-xmp-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parser-zip-commons 2.4.2-SNAPSHOT
- org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
- org.apache.tika:tika-pipes-iterator-csv 2.4.2-SNAPSHOT
- org.apache.tika:tika-pipes-iterator-gcs 2.4.2-SNAPSHOT
- org.apache.tika:tika-pipes-iterator-jdbc 2.4.2-SNAPSHOT
- org.apache.tika:tika-pipes-iterator-s3 2.4.2-SNAPSHOT
- org.apache.tika:tika-pipes-iterator-solr 2.4.2-SNAPSHOT
- org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
- org.apache.tika:tika-server-client 2.4.2-SNAPSHOT
- org.apache.tika:tika-server-core 2.4.2-SNAPSHOT
- org.apache.tika:tika-transcribe-aws 2.4.2-SNAPSHOT
- org.apache.tika:tika-translate 2.4.2-SNAPSHOT
- org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
- org.osgi:org.osgi.compendium ${osgi.compendium.version}
- ${project.groupId}:tika-core 2.4.2-SNAPSHOT
- ${project.groupId}:tika-parsers-standard-package 2.4.2-SNAPSHOT
- com.sun.activation:javax.activation 1.2.0
- org.apache.logging.log4j:log4j-api
- com.sun.xml.fastinfoset:FastInfoset 2.1.0 test
- javax.inject:javax.inject 1 test
- org.apache.felix:org.apache.felix.framework 7.0.5 test
- org.glassfish.jaxb:jaxb-runtime ${jaxb.version} test
- org.ops4j.pax.exam:pax-exam-container-native ${pax.exam.version} test
- org.ops4j.pax.exam:pax-exam-junit4 ${pax.exam.version} test
- org.ops4j.pax.exam:pax-exam-link-assembly ${pax.exam.version} test
- org.ops4j.pax.url:pax-url-aether 2.6.1 test
- org.osgi:org.osgi.core ${osgi.core.version} test
- org.slf4j:slf4j-simple ${slf4j.version} test
- biz.aQute.bnd:biz.aQute.bndlib provided
- org.osgi:org.osgi.compendium ${osgi.compendium.version} provided
- org.osgi:org.osgi.core ${osgi.core.version} provided
- commons-io:commons-io ${commons.io.version}
- org.slf4j:slf4j-api
- com.google.guava:guava ${guava.version} test
- com.martensigwart:fakeload ${fakeload.version} test
- org.apache.logging.log4j:log4j-core ${log4j2.version} test
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version} test
- org.junit.jupiter:junit-jupiter-api ${junit5.version} test
- org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
- com.h2database:h2 ${h2.version}
- org.apache.logging.log4j:log4j-core ${log4j2.version}
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
- org.apache.poi:poi-ooxml ${poi.version}
- org.apache.tika:tika-batch ${project.version}
- org.apache.tika:tika-eval-core ${project.version}
- org.apache.tika:tika-batch ${project.version} test
- org.apache.tika:tika-core ${project.version} test
- ${project.groupId}:tika-core ${project.version}
- ${project.groupId}:tika-langdetect-opennlp ${project.version}
- ${project.groupId}:tika-serialization ${project.version}
- com.fasterxml.jackson.core:jackson-databind
- commons-codec:commons-codec ${commons.codec.version}
- org.apache.commons:commons-lang3 ${commons.lang3.version}
- org.apache.commons:commons-math3 ${commons.math3.version}
- org.apache.lucene:lucene-analyzers-common ${lucene.version}
- org.apache.lucene:lucene-analyzers-icu ${lucene.version}
- org.apache.lucene:lucene-core ${lucene.version}
- org.ccil.cowan.tagsoup:tagsoup 1.2.1
- org.apache.lucene:lucene-memory ${lucene.version} test
- org.apache.tika:tika-core ${project.version} test
- ${project.groupId}:tika-langdetect-optimaize ${project.version}
- javax.jcr:jcr ${javax.jcr.version}
- org.apache.jackrabbit:jackrabbit-core ${jackrabbit.version}
- org.apache.jackrabbit:jackrabbit-jcr-server ${jackrabbit.version}
- org.apache.lucene:lucene-core ${lucene.version}
- org.apache.tika:tika-app ${project.version}
- org.apache.tika:tika-eval-core ${project.version}
- org.apache.tika:tika-serialization ${project.version}
- org.apache.tika:tika-transcribe-aws ${project.version}
- org.apache.tika:tika-translate ${project.version}
- org.osgi:org.osgi.compendium ${osgi.compendium.version}
- org.springframework:spring-context ${spring.version}
- org.apache.tika:tika-core ${project.version} test
- ${project.groupId}:tika-core ${project.version} provided
- commons-cli:commons-cli ${commons.cli.version}
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
- org.apache.tika:tika-parser-digest-commons ${project.version}
- org.apache.tika:tika-parser-pdf-module ${project.version}
- org.apache.tika:tika-parser-pkg-module ${project.version}
- org.slf4j:jcl-over-slf4j
- ${project.groupId}:tika-core ${project.version} test
- ${project.groupId}:tika-core ${project.version} test
- ${project.groupId}:tika-serialization ${project.version} test
- org.junit.vintage:junit-vintage-engine ${junit5.version} test
- ${project.groupId}:tika-app ${project.version} test
- ${project.groupId}:tika-emitter-opensearch ${project.version} test
- net.java.dev.jna:jna ${jna.version} test
- org.testcontainers:testcontainers ${test.containers.version} test
- ${project.groupId}:tika-core ${project.version} test
- ${project.groupId}:tika-emitter-s3 ${project.version} test
- ${project.groupId}:tika-fetcher-s3 ${project.version} test
- ${project.groupId}:tika-pipes-iterator-s3 ${project.version} test
- ${project.groupId}:tika-app ${project.version} test
- ${project.groupId}:tika-emitter-solr ${project.version} test
- ${project.groupId}:tika-pipes-iterator-solr ${project.version} test
- org.apache.solr:solr-solrj ${solrj.version} test
- org.testcontainers:testcontainers ${test.containers.version} test
- org.apache.tika:tika-core 2.4.2-SNAPSHOT test
- biz.aQute.bnd:biz.aQute.bndlib provided
- org.apache.tika:tika-core 2.4.2-SNAPSHOT
- org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
- ${project.groupId}:tika-core ${project.version} provided
- org.junit.jupiter:junit-jupiter-api ${junit5.version} test
- org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
- com.fasterxml.jackson.core:jackson-databind
- org.apache.cxf:cxf-rt-rs-client ${cxf.version}
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
- org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
- ${project.groupId}:tika-langdetect-test-commons ${project.version} test
- com.fasterxml.jackson.core:jackson-databind
- org.apache.cxf:cxf-rt-rs-client ${cxf.version}
- org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
- org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
- ${project.groupId}:tika-langdetect-test-commons ${project.version} test
- org.apache.opennlp:opennlp-tools ${opennlp.version}
- ${project.groupId}:tika-langdetect-test-commons ${project.version} test
- actions/checkout v4 composite
- actions/setup-java v1 composite
- actions/checkout v4 composite
- actions/setup-java v1 composite
- quay.io/minio/minio latest
- ${project.groupId}:tika-core ${project.version} provided
- com.fasterxml.jackson.core:jackson-databind
- ${project.groupId}:tika-core ${project.version} test
- ${project.groupId}:tika-parsers-standard-package ${project.version} test
- org.apache.logging.log4j:log4j-core test
- org.apache.logging.log4j:log4j-slf4j2-impl test
- ${project.groupId}:tika-core ${project.version} provided
- de.l3s.boilerpipe:boilerpipe 1.1.0
- ${project.groupId}:tika-app ${project.version} test
- ${project.groupId}:tika-core ${project.version} test
- ${project.groupId}:tika-emitter-kafka ${project.version} test
- ${project.groupId}:tika-pipes-iterator-kafka ${project.version} test
- org.apache.logging.log4j:log4j-slf4j2-impl test
- org.testcontainers:junit-jupiter test
- org.testcontainers:kafka test
- org.testcontainers:testcontainers test
- org.apache.tomcat:annotations-api 6.0.53 provided
- com.beust:jcommander
- com.fasterxml.jackson.module:jackson-module-jsonSchema
- com.google.guava:guava
- com.google.j2objc:j2objc-annotations 3.0.0
- com.google.protobuf:protobuf-java-util ${protobuf.version}
- io.grpc:grpc-netty-shaded
- io.grpc:grpc-protobuf
- io.grpc:grpc-services
- io.grpc:grpc-stub
- org.apache.logging.log4j:log4j-core
- org.apache.logging.log4j:log4j-slf4j2-impl
- org.apache.tika:tika-async-cli 4.0.0-SNAPSHOT
- org.apache.tika:tika-core 4.0.0-SNAPSHOT
- org.apache.tika:tika-fetcher-http 4.0.0-SNAPSHOT
- org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT
- org.slf4j:jcl-over-slf4j
- com.asarkar.grpc:grpc-test 1.2.2 test
- io.grpc:grpc-testing test
- org.awaitility:awaitility 4.2.2 test
- org.eclipse.jetty:jetty-server test
- org.mockito:mockito-core test
- ${project.groupId}:tika-core ${project.version} provided
- com.fasterxml.jackson.core:jackson-databind
- ${project.groupId}:tika-core ${project.version} test
- org.apache.logging.log4j:log4j-core test
- org.apache.logging.log4j:log4j-slf4j2-impl test
- com.optimaize.languagedetector:language-detector ${optimaize.version}
- org.jetbrains:annotations 26.0.2
- ${project.groupId}:tika-langdetect-test-commons ${project.version} test
- com.optimaize.languagedetector:language-detector ${optimaize.version}
- ${project.groupId}:tika-langdetect-test-commons ${project.version} test
- org.junit.jupiter:junit-jupiter-api 5.13.0-M3 test
- org.junit.jupiter:junit-jupiter-engine 5.13.0-M3 test
- org.apache.tika:tika-core ${project.version} test
- org.junit.jupiter:junit-jupiter-api test
- org.junit.jupiter:junit-jupiter-engine test
- ${project.groupId}:tika-core ${project.version} provided
- ${project.groupId}:tika-parser-text-module ${project.version}
- edu.ucar:grib ${netcdf-java.version}
- edu.ucar:netcdf4 ${netcdf-java.version}
- javax.measure:unit-api
- net.jcip:jcip-annotations 1.0
- org.apache.commons:commons-csv
- org.apache.sis.core:sis-metadata
- org.apache.sis.core:sis-utility
- org.apache.sis.storage:sis-netcdf
- org.glassfish.jaxb:jaxb-runtime
- org.opengis:geoapi
- org.apache.tika:tika-parser-scientific-module 4.0.0-SNAPSHOT
- org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT test
- ${project.groupId}:tika-parser-jdbc-commons ${project.version}
- org.xerial:sqlite-jdbc ${sqlite.version}
- ${project.groupId}:tika-parser-sqlite3-module ${project.version}
- ${project.groupId}:tika-parser-scientific-module ${project.version} test
- ${project.groupId}:tika-parser-sqlite3-module ${project.version} test
- ${project.groupId}:tika-parser-sqlite3-package ${project.version} test
- ${project.groupId}:tika-parsers-standard-package ${project.version} test
- org.apache.logging.log4j:log4j-core test
- org.apache.logging.log4j:log4j-slf4j2-impl test
- ${project.groupId}:tika-core ${project.version} provided
- org.apache.ctakes:ctakes-core ${ctakes.version} provided
- ${project.groupId}:tika-parser-pdf-module ${project.version}
- com.github.openjson:openjson ${openjson.version}
- com.google.code.gson:gson
- com.googlecode.json-simple:json-simple
- commons-codec:commons-codec
- edu.usc.ir:sentiment-analysis-parser 0.1
- jakarta.annotation:jakarta.annotation-api
- org.apache.cxf:cxf-rt-rs-client
- org.apache.httpcomponents:httpclient
- org.apache.httpcomponents:httpcore
- org.slf4j:log4j-over-slf4j test
- org.apache.tika:tika-parser-nlp-module 4.0.0-SNAPSHOT
- com.fasterxml.jackson.core:jackson-databind
- com.googlecode.json-simple:json-simple
- javax.xml.bind:jaxb-api 2.3.1
- software.amazon.awssdk:s3
- software.amazon.awssdk:sts
- software.amazon.awssdk:transcribe
- org.slf4j:slf4j-simple test
- ${project.groupId}:tika-core ${project.version} provided
- org.apache.logging.log4j:log4j-core test
- org.apache.logging.log4j:log4j-slf4j2-impl test
- ${project.groupId}:tika-parser-zip-commons ${project.version}
- com.googlecode.plist:dd-plist ${ddplist.version}
- com.drewnoakes:metadata-extractor
- ${project.groupId}:tika-parser-microsoft-module ${project.version}
- com.fasterxml.jackson.core:jackson-core
- com.fasterxml.jackson.core:jackson-databind
- ${project.groupId}:tika-parser-text-module ${project.version}
- com.epam:parso ${parso.version}
- org.apache.commons:commons-lang3
- org.codelibs:jhighlight ${jhighlight.version}
- org.jsoup:jsoup
- org.ow2.asm:asm ${asm.version}
- org.tallison:jmatio 1.5
- org.bouncycastle:bcjmail-jdk18on
- org.bouncycastle:bcprov-jdk18on
- commons-codec:commons-codec
- org.bouncycastle:bcjmail-jdk18on
- org.bouncycastle:bcprov-jdk18on
- org.apache.pdfbox:fontbox ${pdfbox.version}
- commons-codec:commons-codec
- org.jsoup:jsoup
- ${project.groupId}:tika-parser-text-module ${project.version} test
- ${project.groupId}:tika-parser-xmp-commons ${project.version}
- com.drewnoakes:metadata-extractor
- com.github.jai-imageio:jai-imageio-core
- org.apache.pdfbox:jbig2-imageio ${jbig2.version}
- ${project.groupId}:tika-parser-xmp-commons ${project.version} test
- com.github.jai-imageio:jai-imageio-jpeg2000 ${imageio.version} test
- org.mockito:mockito-core test
- org.apache.james:apache-mime4j-core ${mime4j.version}
- org.apache.james:apache-mime4j-dom ${mime4j.version}
- ${project.groupId}:tika-parser-html-module ${project.version}
- ${project.groupId}:tika-parser-mail-commons ${project.version}
- ${project.groupId}:tika-parser-text-module ${project.version}
- ${project.groupId}:tika-parser-ocr-module ${project.version} test
- org.mockito:mockito-core test
- ${project.groupId}:tika-parser-html-module ${project.version}
- ${project.groupId}:tika-parser-mail-commons ${project.version}
- ${project.groupId}:tika-parser-text-module ${project.version}
- ${project.groupId}:tika-parser-xml-module ${project.version}
- ${project.groupId}:tika-parser-zip-commons ${project.version}
- com.healthmarketscience.jackcess:jackcess
- com.healthmarketscience.jackcess:jackcess-encrypt
- com.pff:java-libpst ${libpst.version}
- commons-codec:commons-codec
- commons-logging:commons-logging
- org.apache.commons:commons-lang3
- org.apache.poi:poi
- org.apache.poi:poi-ooxml
- org.apache.poi:poi-scratchpad ${poi.version}
- org.bouncycastle:bcjmail-jdk18on
- org.bouncycastle:bcprov-jdk18on
- org.slf4j:slf4j-api
- ${project.groupId}:tika-parser-mail-module ${project.version} test
- ${project.groupId}:tika-parser-text-module ${project.version}
- ${project.groupId}:tika-parser-xml-module ${project.version}
- ${project.groupId}:tika-parser-xmp-commons ${project.version}
- ${project.groupId}:tika-parser-zip-commons ${project.version}
- commons-codec:commons-codec
- org.apache.commons:commons-collections4
- org.apache.commons:commons-lang3
- org.apache.poi:poi
- org.glassfish.jaxb:jaxb-runtime
- com.rometools:rome ${rome.version}
- org.slf4j:slf4j-api
- org.apache.commons:commons-exec
- org.apache.commons:commons-lang3
- ${project.groupId}:tika-parser-image-module ${project.version} test
- ${project.groupId}:tika-parser-xmp-commons ${project.version}
- org.apache.pdfbox:jempbox ${jempbox.version}
- org.apache.pdfbox:pdfbox ${pdfbox.version}
- org.apache.pdfbox:pdfbox-tools ${pdfbox.version}
- org.bouncycastle:bcjmail-jdk18on
- org.bouncycastle:bcprov-jdk18on
- org.glassfish.jaxb:jaxb-runtime
- com.github.jai-imageio:jai-imageio-core test
- org.slf4j:jcl-over-slf4j test
- actions/checkout v4 composite
- actions/setup-java v4 composite
- ${project.groupId}:tika-core ${project.version}
- com.fasterxml.woodstox:woodstox-core