An open API service for producing an overview of a list of open source projects.

https://github.com/apache/tika

content extraction java metadata tika

Score: 36.53298337360196

Last synced: about 8 hours ago

Repository metadata:

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).


Committers metadata

Last synced: about 1 month ago

Total Commits: 8,944
Total Committers: 182
Avg Commits per committer: 49.143
Development Distribution Score (DDS): 0.695

Commits in past year: 943
Committers in past year: 23
Avg Commits per committer in past year: 41.0
Development Distribution Score (DDS) in past year: 0.6

Name                     Email          Commits
tallison                 t****n@a****g  2726
dependabot[bot]          4****]         1532
Tilman Hausherr          t****n@a****g  966
Jukka Zitting            j****a@a****g  960
Nick Burch               n****k@a****g  932
Chris Mattmann           m****n@a****g  437
David Meikle             d****e@a****g  137
Tyler Palsulich          t****h@a****g  121
Michael McCandless       m****d@a****g  98
Maxim Valyanskiy         m****m@a****g  67
ThejanW                  t****4@c****k  63
Konstantin Gribov        g****s@g****m  62
Nicholas DiPiazza        n****a@l****m  51
Lee                      5****e         49
Thamme Gowda             t****a@a****g  41
Kenneth William Krugler  k****r@a****g  35
Ray Gauss II             r****s@a****g  34
Hong-Thai Nguyen         t****4@a****g  30
Lewis John McGibbney     l****y@g****m  25
Luis Nassif              l****f@g****m  23
manali                   m****1@g****m  22
Kranthi Kiran GV         k****v@g****m  20
Rohan Surana             r****0@g****m  19
bitsgalore               j****f@k****l  18
Bob Paulin               b****b@b****m  17
Dmitry Kryukov           d****k         16
Madhav Sharan            g****v@g****m  16
Zarana Parekh            z****7@g****m  15
nprate2                  4****2         14
ashankbehara             a****2@i****u  14
and 152 more...
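
The summary figures above can be reproduced from the totals and the committer table. The sketch below assumes that the Development Distribution Score is one minus the top committer's share of all commits. That definition is not stated on this page, but it is consistent with the listed numbers. The function names are illustrative only.

```python
def avg_commits_per_committer(total_commits: int, total_committers: int) -> float:
    """Mean number of commits per committer, rounded to three decimals."""
    return round(total_commits / total_committers, 3)


def development_distribution_score(top_committer_commits: int, total_commits: int) -> float:
    """Assumed definition: 1 - (top committer's commits / all commits)."""
    return round(1 - top_committer_commits / total_commits, 3)


# All-time figures from the listing: 8,944 commits, 182 committers,
# and 2,726 commits by the top committer (tallison).
all_time_avg = avg_commits_per_committer(8944, 182)
all_time_dds = development_distribution_score(2726, 8944)

# Past-year average: 943 commits by 23 committers.
past_year_avg = avg_commits_per_committer(943, 23)

print(all_time_avg, all_time_dds, past_year_avg)
```

The past-year DDS of 0.6 cannot be rechecked the same way, because the page does not list per-committer counts for the past year.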

Issue and Pull Request metadata

Last synced: 5 months ago

Total issues: 7
Total pull requests: 1,475
Average time to close issues: about 1 hour
Average time to close pull requests: 16 days
Total issue authors: 2
Total pull request authors: 57
Average comments per issue: 0.0
Average comments per pull request: 0.31
Merged pull requests: 1,276
Bot issues: 5
Bot pull requests: 1,076

Past year issues: 2
Past year pull requests: 370
Past year average time to close issues: 24 minutes
Past year average time to close pull requests: 1 day
Past year issue authors: 2
Past year pull request authors: 22
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.21
Past year merged pull requests: 304
Past year bot issues: 1
Past year bot pull requests: 244
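
The raw counts above are easier to compare as shares of all pull requests. The following is a small sketch that derives the merged and bot-authored percentages from the listed totals. The percentages are computed here, not reported by the page, and the names `share`, `stats`, and `derived` are made up for this illustration.

```python
def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Counts copied from the listing above.
stats = {
    "all time": {"pull_requests": 1475, "merged": 1276, "bot": 1076},
    "past year": {"pull_requests": 370, "merged": 304, "bot": 244},
}

derived = {
    period: {
        "merged_pct": share(s["merged"], s["pull_requests"]),
        "bot_pct": share(s["bot"], s["pull_requests"]),
        "non_bot_pull_requests": s["pull_requests"] - s["bot"],
    }
    for period, s in stats.items()
}

for period, values in derived.items():
    print(period, values)
```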

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/apache/tika

Top Issue Authors

  • dependabot[bot] (5)
  • tballison (2)

Top Pull Request Authors

  • dependabot[bot] (1,076)
  • tballison (249)
  • dk2k (29)
  • nddipiazza (21)
  • alexey-pelykh (8)
  • bartek (5)
  • subbudvk (5)
  • jogerh (4)
  • ldh5574 (4)
  • gastaldi (3)
  • lsliwko (3)
  • ruwi-next (3)
  • rob975 (2)
  • Lonzak (2)
  • sunluman (2)

Top Issue Labels

  • dependencies (5)
  • java (1)

Top Pull Request Labels

  • dependencies (1,076)
  • java (156)

Package metadata

repo1.maven.org: org.apache.tika:tika-core

This is the core Apache Tika™ toolkit library from which all other modules inherit functionality. It also includes the core facades for the Tika API.
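
Each package heading pairs a registry host with a Maven coordinate in `groupId:artifactId` form. As a minimal sketch, assuming the standard Maven repository layout and the usual `/maven2` base path on repo1.maven.org (neither is stated on this page), the coordinate can be mapped to the directory that holds its published versions. The helper name `artifact_directory_url` is hypothetical.

```python
MAVEN_CENTRAL = "https://repo1.maven.org/maven2"  # assumed base path


def artifact_directory_url(coordinate: str, base: str = MAVEN_CENTRAL) -> str:
    """Map 'groupId:artifactId' to its directory in a Maven-layout repository."""
    group_id, artifact_id = coordinate.split(":")
    # Dots in the groupId become path separators; the artifactId is used as-is.
    return f"{base}/{group_id.replace('.', '/')}/{artifact_id}/"


url = artifact_directory_url("org.apache.tika:tika-core")
print(url)
```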

repo1.maven.org: org.apache.tika:tika-parsers

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

repo1.maven.org: org.apache.tika:tika-parsers-standard-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-05T09:51:23.677Z (15 days ago)
  • Versions: 26
  • Dependent Packages: 54
  • Dependent Repositories: 234
  • Docker Downloads: 10,733,392
  • Rankings:
    • Docker downloads count: 0.963%
    • Dependent repos count: 1.036%
    • Dependent packages count: 1.381%
    • Average: 3.869%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
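
The "Average" entry in each Rankings list appears to be the arithmetic mean of the other ranking values shown for that package. This is an inference from the numbers, not something the page states. A quick check for the package above:

```python
from statistics import mean

# Ranking values (in percent) listed for tika-parsers-standard-package,
# excluding the "Average" entry itself.
rankings = {
    "Docker downloads count": 0.963,
    "Dependent repos count": 1.036,
    "Dependent packages count": 1.381,
    "Forks count": 6.822,
    "Stargazers count": 9.144,
}

average = round(mean(list(rankings.values())), 3)
print(f"Average: {average}%")
```

Packages without a Docker downloads figure list only four rankings, and their averages match the mean of those four values.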

repo1.maven.org: org.apache.tika:tika-xmp

Converts Tika metadata to XMP

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-xmp/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-03T11:14:37.093Z (17 days ago)
  • Versions: 61
  • Dependent Packages: 24
  • Dependent Repositories: 72
  • Docker Downloads: 23,917,677
  • Rankings:
    • Docker downloads count: 0.795%
    • Dependent repos count: 2.377%
    • Dependent packages count: 2.646%
    • Average: 4.352%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-serialization

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-serialization/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-01T22:45:47.576Z (18 days ago)
  • Versions: 57
  • Dependent Packages: 23
  • Dependent Repositories: 57
  • Docker Downloads: 20,868,736
  • Rankings:
    • Docker downloads count: 0.825%
    • Dependent packages count: 2.752%
    • Dependent repos count: 2.761%
    • Average: 4.456%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-java7

Java-7 reliant components, including FileTypeDetector implementations

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-java7/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-14T14:45:55.903Z (about 1 month ago)
  • Versions: 59
  • Dependent Packages: 26
  • Dependent Repositories: 59
  • Docker Downloads: 2,325,584
  • Rankings:
    • Docker downloads count: 1.496%
    • Dependent packages count: 2.554%
    • Dependent repos count: 2.7%
    • Average: 4.539%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-pdf-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

repo1.maven.org: org.apache.tika:tika-app

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-app/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-28T07:18:49.617Z (23 days ago)
  • Versions: 69
  • Dependent Packages: 17
  • Dependent Repositories: 160
  • Docker Downloads: 665,943
  • Rankings:
    • Dependent repos count: 1.342%
    • Docker downloads count: 2.024%
    • Dependent packages count: 3.655%
    • Average: 4.593%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-langdetect

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-30T20:15:41.281Z (20 days ago)
  • Versions: 50
  • Dependent Packages: 21
  • Dependent Repositories: 126
  • Docker Downloads: 17,208
  • Rankings:
    • Dependent repos count: 1.604%
    • Docker downloads count: 2.961%
    • Dependent packages count: 2.996%
    • Average: 4.701%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-microsoft-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-microsoft-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-28T06:04:00.085Z (23 days ago)
  • Versions: 27
  • Dependent Packages: 13
  • Dependent Repositories: 94
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 1.981%
    • Average: 4.704%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-html-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-18T04:04:42.486Z (about 1 month ago)
  • Versions: 27
  • Dependent Packages: 12
  • Dependent Repositories: 91
  • Docker Downloads: 1,011,074,933
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.022%
    • Average: 4.712%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-xml-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-xml-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-12T04:01:58.897Z (8 days ago)
  • Versions: 27
  • Dependent Packages: 10
  • Dependent Repositories: 90
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.038%
    • Average: 4.932%
    • Dependent packages count: 6.583%
    • Forks count: 6.812%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-zip-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-zip-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-09T17:01:29.604Z (10 days ago)
  • Versions: 27
  • Dependent Packages: 10
  • Dependent Repositories: 53
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.883%
    • Average: 4.986%
    • Dependent packages count: 5.981%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-image-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-image-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T10:15:41.299Z (26 days ago)
  • Versions: 28
  • Dependent Packages: 12
  • Dependent Repositories: 48
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.078%
    • Average: 5.017%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-langdetect-optimaize

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-optimaize/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-18T19:31:33.566Z (1 day ago)
  • Versions: 27
  • Dependent Packages: 11
  • Dependent Repositories: 67
  • Docker Downloads: 20,767,540
  • Rankings:
    • Docker downloads count: 0.825%
    • Dependent repos count: 2.485%
    • Average: 5.052%
    • Dependent packages count: 5.981%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-xmp-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-xmp-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-30T04:31:28.185Z (21 days ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 53
  • Docker Downloads: 1,011,074,892
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.883%
    • Average: 5.258%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-pkg-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Status: removed
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-pkg-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-04T12:39:56.578Z (about 2 months ago)
  • Versions: 27
  • Dependent Packages: 10
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.272%
    • Dependent packages count: 6.583%
    • Forks count: 6.822%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-apple-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-apple-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-26T04:03:13.581Z (25 days ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 90
  • Docker Downloads: 1,011,073,986
  • Rankings:
    • Docker downloads count: 0.099%
    • Dependent repos count: 2.038%
    • Average: 5.273%
    • Forks count: 6.822%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-parser-audiovideo-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-audiovideo-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-26T06:17:37.401Z (25 days ago)
  • Versions: 27
  • Dependent Packages: 9
  • Dependent Repositories: 48
  • Docker Downloads: 64,151,040
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.078%
    • Average: 5.386%
    • Forks count: 6.812%
    • Dependent packages count: 7.341%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-font-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-font-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-17T00:19:02.483Z (3 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.603%
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-crypto-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-crypto-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-19T12:41:52.708Z (about 1 month ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.603%
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-code-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-code-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-10T00:16:42.241Z (10 days ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.603%
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%

repo1.maven.org: org.apache.tika:tika-parser-news-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-news-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-10T13:15:44.646Z (10 days ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.608%
    • Forks count: 6.822%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.144%

repo1.maven.org: org.apache.tika:tika-translate

This is the translate Apache Tika™ toolkit. Translator implementations may depend on web services.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-translate/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-19T05:30:34.526Z (about 1 month ago)
  • Versions: 57
  • Dependent Packages: 5
  • Dependent Repositories: 32
  • Docker Downloads: 20,251,923
  • Rankings:
    • Docker downloads count: 0.827%
    • Dependent repos count: 3.996%
    • Average: 6.418%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent packages count: 11.323%

repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-11T13:51:46.522Z (9 days ago)
  • Versions: 28
  • Dependent Packages: 3
  • Dependent Repositories: 43
  • Docker Downloads: 32,367,042
  • Rankings:
    • Docker downloads count: 0.669%
    • Dependent repos count: 3.306%
    • Forks count: 6.822%
    • Average: 7.455%
    • Stargazers count: 9.144%
    • Dependent packages count: 17.332%

repo1.maven.org: org.apache.tika:tika-parser-sqlite3-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/tika-parsers/tika-parsers-extended/tika-parser-sqlite3-module/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-module/
  • Licenses: Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-05-29T12:47:38.792Z (22 days ago)
  • Versions: 26
  • Dependent Packages: 5
  • Dependent Repositories: 49
  • Rankings:
    • Dependent repos count: 3.038%
    • Forks count: 6.822%
    • Average: 7.582%
    • Stargazers count: 9.144%
    • Dependent packages count: 11.323%

repo1.maven.org: org.apache.tika:tika-parser-html-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.9.4 (published about 1 year ago)
  • Last Synced: 2026-05-22T04:04:07.588Z (29 days ago)
  • Versions: 18
  • Dependent Packages: 14
  • Dependent Repositories: 2
  • Docker Downloads: 30,276,116
  • Rankings:
    • Docker downloads count: 0.711%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Average: 7.629%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%

repo1.maven.org: org.apache.tika:tika-batch

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-batch/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-07T07:16:09.948Z (13 days ago)
  • Versions: 55
  • Dependent Packages: 4
  • Dependent Repositories: 12
  • Docker Downloads: 616,888
  • Rankings:
    • Docker downloads count: 2.039%
    • Forks count: 6.822%
    • Dependent repos count: 6.963%
    • Average: 7.744%
    • Stargazers count: 9.144%
    • Dependent packages count: 13.753%

repo1.maven.org: org.apache.tika:tika-parser-digest-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-digest-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-12T04:00:34.053Z (8 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 2
  • Docker Downloads: 31,777,847
  • Rankings:
    • Docker downloads count: 0.71%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Average: 8.002%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%

repo1.maven.org: org.apache.tika:tika-emitter-fs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-fs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-03T19:31:33.709Z (16 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 2
  • Docker Downloads: 20,240,607
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Average: 8.026%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%

repo1.maven.org: org.apache.tika:tika-server-core

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-core/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-12T04:00:30.986Z (8 days ago)
  • Versions: 26
  • Dependent Packages: 4
  • Dependent Repositories: 3
  • Docker Downloads: 20,238,617
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.812%
    • Average: 8.838%
    • Stargazers count: 9.132%
    • Dependent repos count: 13.663%
    • Dependent packages count: 13.753%

repo1.maven.org: org.apache.tika:tika-langdetect-tika

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-tika/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-10T03:33:20.729Z (10 days ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 53
  • Docker Downloads: 946,929,633
  • Rankings:
    • Docker downloads count: 0.11%
    • Dependent repos count: 2.883%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 10.339%
    • Dependent packages count: 32.733%

repo1.maven.org: org.apache.tika:tika-httpclient-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-httpclient-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-27T08:36:19.267Z (24 days ago)
  • Versions: 26
  • Dependent Packages: 7
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%
    • Average: 11.213%
    • Dependent repos count: 20.645%

repo1.maven.org: org.apache.tika:tika-eval-core

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-core/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-02T21:16:24.445Z (17 days ago)
  • Versions: 26
  • Dependent Packages: 4
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 11.428%
    • Dependent packages count: 13.753%
    • Dependent repos count: 15.993%

repo1.maven.org: org.apache.tika:tika-langdetect-test-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-test-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-12T05:02:12.777Z (8 days ago)
  • Versions: 25
  • Dependent Packages: 6
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent packages count: 9.56%
    • Average: 11.543%
    • Dependent repos count: 20.645%

repo1.maven.org: org.apache.tika:tika-parent

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parent/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T05:18:22.941Z (26 days ago)
  • Versions: 71
  • Dependent Packages: 2
  • Dependent Repositories: 7
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 9.173%
    • Average: 12.014%
    • Dependent packages count: 22.916%

repo1.maven.org: org.apache.tika:tika-fetcher-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-14T08:23:34.699Z (6 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%

repo1.maven.org: org.apache.tika:tika-pipes-iterator-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-18T04:46:19.481Z (2 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%

repo1.maven.org: org.apache.tika:tika-emitter-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-04T16:46:25.403Z (15 days ago)
  • Versions: 27
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%

repo1.maven.org: org.apache.tika:tika-parser-nlp-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-nlp-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-06T02:16:14.299Z (14 days ago)
  • Versions: 27
  • Dependent Packages: 2
  • Dependent Repositories: 4
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 12.011%
    • Average: 12.723%
    • Dependent packages count: 22.916%

repo1.maven.org: org.apache.tika:tika-parser-sqlite3-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-06T19:31:20.729Z (13 days ago)
  • Versions: 24
  • Dependent Packages: 2
  • Dependent Repositories: 3
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 13.131%
    • Dependent repos count: 13.663%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-solr

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-solr/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-12T15:20:55.633Z (7 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 13.486%
    • Dependent packages count: 17.332%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-emitter-solr

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-solr/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-02T07:00:17.957Z (18 days ago)
  • Versions: 26
  • Dependent Packages: 3
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 13.486%
    • Dependent packages count: 17.332%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-parser-scientific-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-scientific-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-31T01:16:29.744Z (20 days ago)
  • Versions: 24
  • Dependent Packages: 1
  • Dependent Repositories: 8
  • Rankings:
    • Forks count: 6.822%
    • Dependent repos count: 8.595%
    • Stargazers count: 9.144%
    • Average: 14.324%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-eval

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-16T16:31:26.581Z (3 days ago)
  • Versions: 49
  • Dependent Packages: 1
  • Dependent Repositories: 6
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent repos count: 9.917%
    • Average: 14.648%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-langdetect-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-05-22T21:02:53.741Z (28 days ago)
  • Versions: 1
  • Dependent Packages: 4
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Dependent packages count: 13.421%
    • Average: 14.685%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-transcribe-aws

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-transcribe-aws/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-02T16:04:57.098Z (17 days ago)
  • Versions: 25
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 14.876%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-parser-jdbc-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-jdbc-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-16T20:46:41.378Z (3 days ago)
  • Versions: 27
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 14.876%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-emitter-opensearch

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-opensearch/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T03:45:35.399Z (26 days ago)
  • Versions: 25
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 14.882%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-dl

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-dl/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-17T22:32:11.422Z (2 days ago)
  • Versions: 47
  • Dependent Packages: 1
  • Dependent Repositories: 4
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 12.011%
    • Average: 15.178%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-18T11:02:11.204Z (1 day ago)
  • Versions: 63
  • Dependent Packages: 0
  • Dependent Repositories: 8
  • Docker Downloads: 13,306
  • Rankings:
    • Docker downloads count: 3.067%
    • Forks count: 6.822%
    • Dependent repos count: 8.595%
    • Stargazers count: 9.144%
    • Average: 15.503%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-fuzzing

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fuzzing/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-29T20:31:58.785Z (21 days ago)
  • Versions: 36
  • Dependent Packages: 1
  • Dependent Repositories: 3
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent repos count: 13.663%
    • Average: 15.585%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-jdbc

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-jdbc/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-10T15:30:35.886Z (9 days ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.173%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-fetcher-http

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-http/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T15:58:11.735Z (25 days ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.173%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server-standard

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-01T17:15:27.697Z (18 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Docker Downloads: 20,238,617
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.534%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-async-cli

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-async-cli/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-03T01:47:29.495Z (17 days ago)
  • Versions: 17
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Docker Downloads: 9
  • Rankings:
    • Forks count: 5.227%
    • Stargazers count: 7.527%
    • Average: 16.774%
    • Dependent packages count: 22.361%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-kafka

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-kafka/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-12T14:12:11.540Z (8 days ago)
  • Versions: 18
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.628%
    • Stargazers count: 7.48%
    • Average: 16.862%
    • Dependent packages count: 22.361%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-langdetect-lingo24

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-lingo24/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-24T14:31:14.296Z (27 days ago)
  • Versions: 27
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 17.33%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-jdbc

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-jdbc/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-18T14:45:38.270Z (1 day ago)
  • Versions: 17
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 17.33%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Status: removed
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-05T19:06:07.485Z (about 2 months ago)
  • Versions: 24
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 17.33%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-21T16:18:07.583Z (29 days ago)
  • Versions: 24
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-age-recogniser

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-age-recogniser/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-06T23:45:35.542Z (13 days ago)
  • Versions: 27
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-langdetect-mitll-text

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-mitll-text/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-04T16:32:03.433Z (15 days ago)
  • Versions: 29
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server-client

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-client/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-13T11:49:22.762Z (7 days ago)
  • Versions: 27
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-bundle-standard

OSGi bundle that contains the tika-parsers-standard component and all its upstream dependencies that aren't OSGi bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-19T11:48:51.741Z (about 1 month ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-fetcher-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-17T13:48:37.456Z (3 days ago)
  • Versions: 24
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-example

This module contains examples of how to use Apache Tika.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-example/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-13T20:03:19.287Z (about 1 month ago)
  • Versions: 58
  • Dependent Packages: 0
  • Dependent Repositories: 20
  • Rankings:
    • Dependent repos count: 5.315%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.792%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-detectors

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detectors/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-26T18:20:40.080Z (24 days ago)
  • Versions: 17
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.227%
    • Stargazers count: 7.527%
    • Average: 19.183%
    • Dependent repos count: 31.98%
    • Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-bom

Apache Tika Bill of Materials

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bom/
  • Licenses: Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-08T12:01:13.508Z (about 1 month ago)
  • Versions: 21
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 20.461%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-eval-app

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Status: removed
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-app/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-05-04T22:02:27.421Z (about 2 months ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent repos count: 20.645%
    • Average: 21.618%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-nlp

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-nlp/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 1.28.5 (published almost 4 years ago)
  • Last Synced: 2026-05-11T00:16:32.726Z (about 1 month ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 20.645%
    • Average: 21.624%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-parsers-standard-modules

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-modules/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-02T17:17:35.349Z (17 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-emitters

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitters/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-22T01:31:38.207Z (29 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-10T10:31:17.965Z (10 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic-modules

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic-modules/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-06-11T16:46:37.423Z (8 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-05T11:13:44.539Z (15 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-04T15:31:07.428Z (15 days ago)
  • Versions: 29
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-classic

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-05-22T08:45:42.187Z (29 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-eval

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-eval/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-21T16:50:36.496Z (29 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundle-classic

OSGi bundle that contains the tika-parsers-classic component and all its upstream dependencies that aren't OSGi bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-05-17T21:32:57.674Z (about 1 month ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-06-09T19:31:30.121Z (10 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-opensearch-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-opensearch-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-05-03T03:31:07.393Z (about 2 months ago)
  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-15T20:46:40.769Z (4 days ago)
  • Versions: 27
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-emitter-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-az-blob/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-22T01:48:56.632Z (29 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-az-blob/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-15T13:46:46.198Z (about 1 month ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-ml

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-ml/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.1 (published about 1 month ago)
  • Last Synced: 2026-06-08T07:56:18.243Z (12 days ago)
  • Versions: 28
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-advanced

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-advanced/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published over 5 years ago)
  • Last Synced: 2026-06-13T10:42:07.513Z (7 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-s3-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-s3-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-02T11:16:42.184Z (18 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundles

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundles/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-20T04:03:27.628Z (about 1 month ago)
  • Versions: 27
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-fetcher-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-az-blob/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-08T00:31:25.149Z (about 1 month ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-solr-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-solr-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-18T02:15:25.334Z (2 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterators

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterators/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T21:02:41.573Z (25 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-27T07:32:52.885Z (24 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-standard

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-25T15:35:04.577Z (25 days ago)
  • Versions: 27
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-kafka-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-kafka-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-20T00:19:31.710Z (about 1 month ago)
  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporter-fs-status

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporter-fs-status/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-11T17:08:13.077Z (about 1 month ago)
  • Versions: 18
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporters

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporters/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-06-01T15:01:34.449Z (19 days ago)
  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-resource-loading-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-resource-loading-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 9 months ago)
  • Last Synced: 2026-06-15T13:59:16.680Z (5 days ago)
  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-detector-siegfried

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detector-siegfried/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-30T14:32:00.263Z (21 days ago)
  • Versions: 17
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Dependent repos count: 31.98%
    • Average: 31.989%
    • Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-fetchers

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetchers/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 3 months ago)
  • Last Synced: 2026-05-26T06:30:57.655Z (25 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Dependent repos count: 31.98%
    • Average: 40.42%
    • Dependent packages count: 48.86%
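
The "Average" value in each Rankings block above appears to be the plain arithmetic mean of the other percentile rankings listed for that package. This is an inference from the numbers shown here, not a documented formula. A minimal sketch that checks the arithmetic against figures from the listing (the function name `average_ranking` and the dictionary keys are hypothetical, chosen only for illustration):

```python
from statistics import mean


def average_ranking(percentiles):
    """Return the arithmetic mean of the percentile rankings, rounded to 3 places.

    Assumption: the listing's "Average" is a simple unweighted mean of
    whichever individual rankings are present for the package.
    """
    return round(mean(percentiles.values()), 3)


# Figures copied from the tika-parsers-extended entry (four rankings present).
tika_parsers_extended = {
    "forks_count": 5.513,
    "stargazers_count": 7.827,
    "dependent_repos_count": 31.98,
    "dependent_packages_count": 48.86,
}

# Figures copied from the tika-fetchers entry (only two rankings present).
tika_fetchers = {
    "dependent_repos_count": 31.98,
    "dependent_packages_count": 48.86,
}

if __name__ == "__main__":
    print(average_ranking(tika_parsers_extended))
    print(average_ranking(tika_fetchers))
```

Under this assumption the computed means agree with the listed averages (23.545% and 40.42%), which would also explain why entries with fewer individual rankings show a noticeably different average.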

Dependencies

tika-app/pom.xml maven
  • org.apache.logging.log4j:log4j-core ${log4j2.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.tika:tika-batch 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
  • org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
  • org.slf4j:jcl-over-slf4j
tika-bom/pom.xml maven
  • org.apache.tika:tika-age-recogniser 2.4.2-SNAPSHOT
  • org.apache.tika:tika-bundle-standard 2.4.2-SNAPSHOT
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-dl 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-opensearch 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-solr 2.4.2-SNAPSHOT
  • org.apache.tika:tika-eval-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-http 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fuzzing 2.4.2-SNAPSHOT
  • org.apache.tika:tika-httpclient-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-java7 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-lingo24 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-mitll-text 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-opennlp 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-test-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-tika 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-advancedmedia-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-apple-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-audiovideo-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-cad-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-code-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-crypto-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-digest-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-font-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-html-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-html-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-image-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-jdbc-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-mail-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-mail-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-microsoft-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-miscoffice-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-news-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-nlp-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-ocr-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-pdf-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-pkg-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-scientific-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-scientific-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-sqlite3-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-sqlite3-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-text-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-xml-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-xmp-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-zip-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-csv 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-jdbc 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-solr 2.4.2-SNAPSHOT
  • org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
  • org.apache.tika:tika-server-client 2.4.2-SNAPSHOT
  • org.apache.tika:tika-server-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-transcribe-aws 2.4.2-SNAPSHOT
  • org.apache.tika:tika-translate 2.4.2-SNAPSHOT
  • org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
tika-bundles/pom.xml maven
  • org.osgi:org.osgi.compendium ${osgi.compendium.version}
tika-bundles/tika-bundle-standard/pom.xml maven
  • ${project.groupId}:tika-core 2.4.2-SNAPSHOT
  • ${project.groupId}:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • com.sun.activation:javax.activation 1.2.0
  • org.apache.logging.log4j:log4j-api
  • com.sun.xml.fastinfoset:FastInfoset 2.1.0 test
  • javax.inject:javax.inject 1 test
  • org.apache.felix:org.apache.felix.framework 7.0.5 test
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version} test
  • org.ops4j.pax.exam:pax-exam-container-native ${pax.exam.version} test
  • org.ops4j.pax.exam:pax-exam-junit4 ${pax.exam.version} test
  • org.ops4j.pax.exam:pax-exam-link-assembly ${pax.exam.version} test
  • org.ops4j.pax.url:pax-url-aether 2.6.1 test
  • org.osgi:org.osgi.core ${osgi.core.version} test
  • org.slf4j:slf4j-simple ${slf4j.version} test
tika-core/pom.xml maven
  • biz.aQute.bnd:biz.aQute.bndlib provided
  • org.osgi:org.osgi.compendium ${osgi.compendium.version} provided
  • org.osgi:org.osgi.core ${osgi.core.version} provided
  • commons-io:commons-io ${commons.io.version}
  • org.slf4j:slf4j-api
  • com.google.guava:guava ${guava.version} test
  • com.martensigwart:fakeload ${fakeload.version} test
  • org.apache.logging.log4j:log4j-core ${log4j2.version} test
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version} test
  • org.junit.jupiter:junit-jupiter-api ${junit5.version} test
  • org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
tika-eval/tika-eval-app/pom.xml maven
  • com.h2database:h2 ${h2.version}
  • org.apache.logging.log4j:log4j-core ${log4j2.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.poi:poi-ooxml ${poi.version}
  • org.apache.tika:tika-batch ${project.version}
  • org.apache.tika:tika-eval-core ${project.version}
  • org.apache.tika:tika-batch ${project.version} test
  • org.apache.tika:tika-core ${project.version} test
tika-eval/tika-eval-core/pom.xml maven
  • ${project.groupId}:tika-core ${project.version}
  • ${project.groupId}:tika-langdetect-opennlp ${project.version}
  • ${project.groupId}:tika-serialization ${project.version}
  • com.fasterxml.jackson.core:jackson-databind
  • commons-codec:commons-codec ${commons.codec.version}
  • org.apache.commons:commons-lang3 ${commons.lang3.version}
  • org.apache.commons:commons-math3 ${commons.math3.version}
  • org.apache.lucene:lucene-analyzers-common ${lucene.version}
  • org.apache.lucene:lucene-analyzers-icu ${lucene.version}
  • org.apache.lucene:lucene-core ${lucene.version}
  • org.ccil.cowan.tagsoup:tagsoup 1.2.1
  • org.apache.lucene:lucene-memory ${lucene.version} test
  • org.apache.tika:tika-core ${project.version} test
tika-example/pom.xml maven
  • ${project.groupId}:tika-langdetect-optimaize ${project.version}
  • javax.jcr:jcr ${javax.jcr.version}
  • org.apache.jackrabbit:jackrabbit-core ${jackrabbit.version}
  • org.apache.jackrabbit:jackrabbit-jcr-server ${jackrabbit.version}
  • org.apache.lucene:lucene-core ${lucene.version}
  • org.apache.tika:tika-app ${project.version}
  • org.apache.tika:tika-eval-core ${project.version}
  • org.apache.tika:tika-serialization ${project.version}
  • org.apache.tika:tika-transcribe-aws ${project.version}
  • org.apache.tika:tika-translate ${project.version}
  • org.osgi:org.osgi.compendium ${osgi.compendium.version}
  • org.springframework:spring-context ${spring.version}
  • org.apache.tika:tika-core ${project.version} test
tika-fuzzing/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • commons-cli:commons-cli ${commons.cli.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.tika:tika-parser-digest-commons ${project.version}
  • org.apache.tika:tika-parser-pdf-module ${project.version}
  • org.apache.tika:tika-parser-pkg-module ${project.version}
  • org.slf4j:jcl-over-slf4j
  • ${project.groupId}:tika-core ${project.version} test
tika-integration-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-serialization ${project.version} test
  • org.junit.vintage:junit-vintage-engine ${junit5.version} test
tika-integration-tests/tika-pipes-opensearch-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-emitter-opensearch ${project.version} test
  • net.java.dev.jna:jna ${jna.version} test
  • org.testcontainers:testcontainers ${test.containers.version} test
tika-integration-tests/tika-pipes-s3-integration-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-emitter-s3 ${project.version} test
  • ${project.groupId}:tika-fetcher-s3 ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-s3 ${project.version} test
tika-integration-tests/tika-pipes-solr-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-emitter-solr ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-solr ${project.version} test
  • org.apache.solr:solr-solrj ${solrj.version} test
  • org.testcontainers:testcontainers ${test.containers.version} test
tika-integration-tests/tika-resource-loading-tests/pom.xml maven
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT test
tika-java7/pom.xml maven
  • biz.aQute.bnd:biz.aQute.bndlib provided
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
tika-langdetect/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • org.junit.jupiter:junit-jupiter-api ${junit5.version} test
  • org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
tika-langdetect/tika-langdetect-lingo24/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • org.apache.cxf:cxf-rt-rs-client ${cxf.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-mitll-text/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • org.apache.cxf:cxf-rt-rs-client ${cxf.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-opennlp/pom.xml maven
  • org.apache.opennlp:opennlp-tools ${opennlp.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
.github/workflows/main-jdk17-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v1 composite
.github/workflows/main-jdk21-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v1 composite
tika-integration-tests/tika-pipes-s3-integration-tests/src/test/resources/docker-compose.yml docker
  • quay.io/minio/minio latest
pom.xml maven
tika-detectors/pom.xml maven
tika-detectors/tika-detector-siegfried/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • com.fasterxml.jackson.core:jackson-databind
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-parsers-standard-package ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-eval/pom.xml maven
tika-handlers/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-handlers/tika-handler-boilerpipe/pom.xml maven
  • de.l3s.boilerpipe:boilerpipe 1.1.0
tika-integration-tests/tika-pipes-kafka-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-emitter-kafka ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-kafka ${project.version} test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
  • org.testcontainers:junit-jupiter test
  • org.testcontainers:kafka test
  • org.testcontainers:testcontainers test
tika-grpc/pom.xml maven
  • org.apache.tomcat:annotations-api 6.0.53 provided
  • com.beust:jcommander
  • com.fasterxml.jackson.module:jackson-module-jsonSchema
  • com.google.guava:guava
  • com.google.j2objc:j2objc-annotations 3.0.0
  • com.google.protobuf:protobuf-java-util ${protobuf.version}
  • io.grpc:grpc-netty-shaded
  • io.grpc:grpc-protobuf
  • io.grpc:grpc-services
  • io.grpc:grpc-stub
  • org.apache.logging.log4j:log4j-core
  • org.apache.logging.log4j:log4j-slf4j2-impl
  • org.apache.tika:tika-async-cli 4.0.0-SNAPSHOT
  • org.apache.tika:tika-core 4.0.0-SNAPSHOT
  • org.apache.tika:tika-fetcher-http 4.0.0-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT
  • org.slf4j:jcl-over-slf4j
  • com.asarkar.grpc:grpc-test 1.2.2 test
  • io.grpc:grpc-testing test
  • org.awaitility:awaitility 4.2.2 test
  • org.eclipse.jetty:jetty-server test
  • org.mockito:mockito-core test
tika-detectors/tika-detector-magika/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • com.fasterxml.jackson.core:jackson-databind
  • ${project.groupId}:tika-core ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-langdetect/tika-langdetect-optimaize/pom.xml maven
  • com.optimaize.languagedetector:language-detector ${optimaize.version}
  • org.jetbrains:annotations 26.0.2
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-test-commons/pom.xml maven
tika-langdetect/tika-langdetect-tika/pom.xml maven
  • com.optimaize.languagedetector:language-detector ${optimaize.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-parent/pom.xml maven
  • org.junit.jupiter:junit-jupiter-api 5.13.0-M3 test
  • org.junit.jupiter:junit-jupiter-engine 5.13.0-M3 test
tika-parsers/pom.xml maven
  • org.apache.tika:tika-core ${project.version} test
  • org.junit.jupiter:junit-jupiter-api test
  • org.junit.jupiter:junit-jupiter-engine test
tika-parsers/tika-parsers-extended/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-parsers/tika-parsers-extended/tika-parser-scientific-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • edu.ucar:grib ${netcdf-java.version}
  • edu.ucar:netcdf4 ${netcdf-java.version}
  • javax.measure:unit-api
  • net.jcip:jcip-annotations 1.0
  • org.apache.commons:commons-csv
  • org.apache.sis.core:sis-metadata
  • org.apache.sis.core:sis-utility
  • org.apache.sis.storage:sis-netcdf
  • org.glassfish.jaxb:jaxb-runtime
  • org.opengis:geoapi
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/pom.xml maven
  • org.apache.tika:tika-parser-scientific-module 4.0.0-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT test
tika-parsers/tika-parsers-extended/tika-parser-sqlite3-module/pom.xml maven
  • ${project.groupId}:tika-parser-jdbc-commons ${project.version}
  • org.xerial:sqlite-jdbc ${sqlite.version}
tika-parsers/tika-parsers-extended/tika-parser-sqlite3-package/pom.xml maven
  • ${project.groupId}:tika-parser-sqlite3-module ${project.version}
tika-parsers/tika-parsers-extended/tika-parsers-extended-integration-tests/pom.xml maven
  • ${project.groupId}:tika-parser-scientific-module ${project.version} test
  • ${project.groupId}:tika-parser-sqlite3-module ${project.version} test
  • ${project.groupId}:tika-parser-sqlite3-package ${project.version} test
  • ${project.groupId}:tika-parsers-standard-package ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-parsers/tika-parsers-ml/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-parsers/tika-parsers-ml/tika-parser-nlp-module/pom.xml maven
  • org.apache.ctakes:ctakes-core ${ctakes.version} provided
  • ${project.groupId}:tika-parser-pdf-module ${project.version}
  • com.github.openjson:openjson ${openjson.version}
  • com.google.code.gson:gson
  • com.googlecode.json-simple:json-simple
  • commons-codec:commons-codec
  • edu.usc.ir:sentiment-analysis-parser 0.1
  • jakarta.annotation:jakarta.annotation-api
  • org.apache.cxf:cxf-rt-rs-client
  • org.apache.httpcomponents:httpclient
  • org.apache.httpcomponents:httpcore
  • org.slf4j:log4j-over-slf4j test
tika-parsers/tika-parsers-ml/tika-parser-nlp-package/pom.xml maven
  • org.apache.tika:tika-parser-nlp-module 4.0.0-SNAPSHOT
tika-parsers/tika-parsers-ml/tika-transcribe-aws/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • com.googlecode.json-simple:json-simple
  • javax.xml.bind:jaxb-api 2.3.1
  • software.amazon.awssdk:s3
  • software.amazon.awssdk:sts
  • software.amazon.awssdk:transcribe
  • org.slf4j:slf4j-simple test
tika-parsers/tika-parsers-standard/pom.xml maven
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-apple-module/pom.xml maven
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • com.googlecode.plist:dd-plist ${ddplist.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/pom.xml maven
  • com.drewnoakes:metadata-extractor
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-cad-module/pom.xml maven
  • ${project.groupId}:tika-parser-microsoft-module ${project.version}
  • com.fasterxml.jackson.core:jackson-core
  • com.fasterxml.jackson.core:jackson-databind
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-code-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • com.epam:parso ${parso.version}
  • org.apache.commons:commons-lang3
  • org.codelibs:jhighlight ${jhighlight.version}
  • org.jsoup:jsoup
  • org.ow2.asm:asm ${asm.version}
  • org.tallison:jmatio 1.5
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-crypto-module/pom.xml maven
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-digest-commons/pom.xml maven
  • commons-codec:commons-codec
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-font-module/pom.xml maven
  • org.apache.pdfbox:fontbox ${pdfbox.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-html-module/pom.xml maven
  • commons-codec:commons-codec
  • org.jsoup:jsoup
  • ${project.groupId}:tika-parser-text-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-image-module/pom.xml maven
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • com.drewnoakes:metadata-extractor
  • com.github.jai-imageio:jai-imageio-core
  • org.apache.pdfbox:jbig2-imageio ${jbig2.version}
  • ${project.groupId}:tika-parser-xmp-commons ${project.version} test
  • com.github.jai-imageio:jai-imageio-jpeg2000 ${imageio.version} test
  • org.mockito:mockito-core test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-jdbc-commons/pom.xml maven
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-mail-commons/pom.xml maven
  • org.apache.james:apache-mime4j-core ${mime4j.version}
  • org.apache.james:apache-mime4j-dom ${mime4j.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-mail-module/pom.xml maven
  • ${project.groupId}:tika-parser-html-module ${project.version}
  • ${project.groupId}:tika-parser-mail-commons ${project.version}
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-ocr-module ${project.version} test
  • org.mockito:mockito-core test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/pom.xml maven
  • ${project.groupId}:tika-parser-html-module ${project.version}
  • ${project.groupId}:tika-parser-mail-commons ${project.version}
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-xml-module ${project.version}
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • com.healthmarketscience.jackcess:jackcess
  • com.healthmarketscience.jackcess:jackcess-encrypt
  • com.pff:java-libpst ${libpst.version}
  • commons-codec:commons-codec
  • commons-logging:commons-logging
  • org.apache.commons:commons-lang3
  • org.apache.poi:poi
  • org.apache.poi:poi-ooxml
  • org.apache.poi:poi-scratchpad ${poi.version}
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
  • org.slf4j:slf4j-api
  • ${project.groupId}:tika-parser-mail-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-xml-module ${project.version}
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • commons-codec:commons-codec
  • org.apache.commons:commons-collections4
  • org.apache.commons:commons-lang3
  • org.apache.poi:poi
  • org.glassfish.jaxb:jaxb-runtime
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-news-module/pom.xml maven
  • com.rometools:rome ${rome.version}
  • org.slf4j:slf4j-api
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-ocr-module/pom.xml maven
  • org.apache.commons:commons-exec
  • org.apache.commons:commons-lang3
  • ${project.groupId}:tika-parser-image-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/pom.xml maven
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • org.apache.pdfbox:jempbox ${jempbox.version}
  • org.apache.pdfbox:pdfbox ${pdfbox.version}
  • org.apache.pdfbox:pdfbox-tools ${pdfbox.version}
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
  • org.glassfish.jaxb:jaxb-runtime
  • com.github.jai-imageio:jai-imageio-core test
  • org.slf4j:jcl-over-slf4j test
.github/workflows/main-jdk25-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v4 composite
tika-integration-tests/tika-woodstox-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version}
  • com.fasterxml.woodstox:woodstox-core