An open API service for producing an overview of a list of open source projects.

https://github.com/apache/tika

content extraction java metadata tika

Score: 36.52899645978604

Last synced: about 14 hours ago
JSON representation

Repository metadata:

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).


Owner metadata:


GitHub Events

Total
Last Year

Committers metadata

Last synced: 2 months ago

Total Commits: 8,634
Total Committers: 178
Avg Commits per committer: 48.506
Development Distribution Score (DDS): 0.699

Commits in past year: 801
Committers in past year: 23
Avg Commits per committer in past year: 34.826
Development Distribution Score (DDS) in past year: 0.541

Name Email Commits
tallison t****n@a****g 2599
dependabot[bot] 4****] 1444
Jukka Zitting j****a@a****g 960
Nick Burch n****k@a****g 932
Tilman Hausherr t****n@a****g 889
Chris Mattmann m****n@a****g 437
David Meikle d****e@a****g 137
Tyler Palsulich t****h@a****g 121
Michael McCandless m****d@a****g 98
Maxim Valyanskiy m****m@a****g 67
ThejanW t****4@c****k 63
Konstantin Gribov g****s@g****m 62
Lee 5****e 49
Thamme Gowda t****a@a****g 41
Nicholas DiPiazza n****a@l****m 39
Kenneth William Krugler k****r@a****g 35
Ray Gauss II r****s@a****g 34
Hong-Thai Nguyen t****4@a****g 30
Lewis John McGibbney l****y@g****m 25
Luis Nassif l****f@g****m 23
manali m****1@g****m 22
Kranthi Kiran GV k****v@g****m 20
Rohan Surana r****0@g****m 19
bitsgalore j****f@k****l 18
Bob Paulin b****b@b****m 17
Madhav Sharan g****v@g****m 16
Dmitry Kryukov d****k 16
Zarana Parekh z****7@g****m 15
nprate2 4****2 14
ashankbehara a****2@i****u 14
and 148 more...

Issue and Pull Request metadata

Last synced: about 2 months ago

Total issues: 7
Total pull requests: 1,475
Average time to close issues: about 1 hour
Average time to close pull requests: 16 days
Total issue authors: 2
Total pull request authors: 57
Average comments per issue: 0.0
Average comments per pull request: 0.31
Merged pull request: 1,276
Bot issues: 5
Bot pull requests: 1,076

Past year issues: 2
Past year pull requests: 370
Past year average time to close issues: 24 minutes
Past year average time to close pull requests: 1 day
Past year issue authors: 2
Past year pull request authors: 22
Past year average comments per issue: 0.0
Past year average comments per pull request: 0.21
Past year merged pull request: 304
Past year bot issues: 1
Past year bot pull requests: 244

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/apache/tika

Top Issue Authors

  • dependabot[bot] (5)
  • tballison (2)

Top Pull Request Authors

  • dependabot[bot] (1,076)
  • tballison (249)
  • dk2k (29)
  • nddipiazza (21)
  • alexey-pelykh (8)
  • bartek (5)
  • subbudvk (5)
  • jogerh (4)
  • ldh5574 (4)
  • gastaldi (3)
  • lsliwko (3)
  • ruwi-next (3)
  • rob975 (2)
  • Lonzak (2)
  • sunluman (2)

Top Issue Labels

  • dependencies (5)
  • java (1)

Top Pull Request Labels

  • dependencies (1,076)
  • java (156)

Package metadata

repo1.maven.org: org.apache.tika:tika-core

This is the core Apache Tika™ toolkit library from which all other modules inherit functionality. It also includes the core facades for the Tika API.

repo1.maven.org: org.apache.tika:tika-parsers-standard-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-25T03:38:16.283Z (2 days ago)
  • Versions: 26
  • Dependent Packages: 54
  • Dependent Repositories: 234
  • Docker Downloads: 10,733,392
  • Rankings:
    • Docker downloads count: 0.963%
    • Dependent repos count: 1.036%
    • Dependent packages count: 1.381%
    • Average: 3.869%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-xmp

Converts Tika metadata to XMP

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-xmp/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-27T00:36:02.500Z (about 15 hours ago)
  • Versions: 61
  • Dependent Packages: 24
  • Dependent Repositories: 72
  • Docker Downloads: 23,917,677
  • Rankings:
    • Docker downloads count: 0.795%
    • Dependent repos count: 2.377%
    • Dependent packages count: 2.646%
    • Average: 4.352%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-text-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-text-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-18T16:35:08.951Z (9 days ago)
  • Versions: 26
  • Dependent Packages: 18
  • Dependent Repositories: 105
  • Docker Downloads: 1,025,388,925
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 1.854%
    • Dependent packages count: 3.857%
    • Average: 4.355%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-serialization

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-serialization/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-20T09:22:07.721Z (7 days ago)
  • Versions: 56
  • Dependent Packages: 23
  • Dependent Repositories: 57
  • Docker Downloads: 20,868,736
  • Rankings:
    • Docker downloads count: 0.825%
    • Dependent packages count: 2.752%
    • Dependent repos count: 2.761%
    • Average: 4.456%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-java7

Java-7 reliant components, including FileTypeDetector implementations

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-java7/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-04T11:48:19.589Z (23 days ago)
  • Versions: 57
  • Dependent Packages: 26
  • Dependent Repositories: 59
  • Docker Downloads: 2,325,584
  • Rankings:
    • Docker downloads count: 1.496%
    • Dependent packages count: 2.554%
    • Dependent repos count: 2.7%
    • Average: 4.539%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-pdf-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

repo1.maven.org: org.apache.tika:tika-app

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-app/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-11T13:02:26.607Z (16 days ago)
  • Versions: 68
  • Dependent Packages: 17
  • Dependent Repositories: 160
  • Docker Downloads: 665,943
  • Rankings:
    • Dependent repos count: 1.342%
    • Docker downloads count: 2.024%
    • Dependent packages count: 3.655%
    • Average: 4.593%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-langdetect

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-18T04:47:50.157Z (9 days ago)
  • Versions: 49
  • Dependent Packages: 21
  • Dependent Repositories: 126
  • Docker Downloads: 17,208
  • Rankings:
    • Dependent repos count: 1.604%
    • Docker downloads count: 2.961%
    • Dependent packages count: 2.996%
    • Average: 4.701%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-microsoft-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-microsoft-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-18T14:32:23.564Z (9 days ago)
  • Versions: 26
  • Dependent Packages: 13
  • Dependent Repositories: 94
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 1.981%
    • Average: 4.704%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-html-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-28T14:45:15.039Z (27 days ago)
  • Versions: 26
  • Dependent Packages: 12
  • Dependent Repositories: 91
  • Docker Downloads: 1,011,074,933
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.022%
    • Average: 4.712%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-miscoffice-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-miscoffice-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-03T10:41:36.924Z (24 days ago)
  • Versions: 26
  • Dependent Packages: 10
  • Dependent Repositories: 97
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 1.942%
    • Average: 4.913%
    • Dependent packages count: 6.583%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-zip-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-zip-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-21T14:02:40.376Z (about 1 month ago)
  • Versions: 26
  • Dependent Packages: 10
  • Dependent Repositories: 53
  • Docker Downloads: 1,011,074,872
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.883%
    • Average: 4.986%
    • Dependent packages count: 5.981%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-image-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-image-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-22T17:43:15.384Z (5 days ago)
  • Versions: 26
  • Dependent Packages: 12
  • Dependent Repositories: 48
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.078%
    • Average: 5.017%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-langdetect-optimaize

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-optimaize/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-26T17:02:35.096Z (29 days ago)
  • Versions: 26
  • Dependent Packages: 11
  • Dependent Repositories: 67
  • Docker Downloads: 20,767,540
  • Rankings:
    • Docker downloads count: 0.825%
    • Dependent repos count: 2.485%
    • Average: 5.052%
    • Dependent packages count: 5.981%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-xmp-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-xmp-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-22T17:23:25.373Z (5 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 53
  • Docker Downloads: 1,011,074,892
  • Rankings:
    • Docker downloads count: 0.098%
    • Dependent repos count: 2.883%
    • Average: 5.258%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-pkg-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-pkg-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-21T14:01:35.669Z (about 1 month ago)
  • Versions: 26
  • Dependent Packages: 10
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.272%
    • Dependent packages count: 6.583%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-apple-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-apple-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T08:49:19.329Z (11 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 90
  • Docker Downloads: 1,011,073,986
  • Rankings:
    • Docker downloads count: 0.099%
    • Dependent repos count: 2.038%
    • Average: 5.273%
    • Forks count: 6.822%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-mail-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-mail-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T13:47:43.424Z (8 days ago)
  • Versions: 26
  • Dependent Packages: 9
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.424%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-crypto-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-crypto-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-15T06:24:13.368Z (12 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.603%
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-code-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-code-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-24T21:48:43.219Z (3 days ago)
  • Versions: 27
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.603%
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%
repo1.maven.org: org.apache.tika:tika-parser-cad-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-cad-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-12T12:45:51.147Z (15 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.608%
    • Forks count: 6.822%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-news-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-news-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-04T16:03:02.984Z (23 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 44
  • Docker Downloads: 64,144,938
  • Rankings:
    • Docker downloads count: 0.568%
    • Dependent repos count: 3.242%
    • Average: 5.608%
    • Forks count: 6.822%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-translate

This is the translate Apache Tika™ toolkit. Translator implementations may depend on web services.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-translate/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T07:51:27.277Z (11 days ago)
  • Versions: 56
  • Dependent Packages: 5
  • Dependent Repositories: 32
  • Docker Downloads: 20,251,923
  • Rankings:
    • Docker downloads count: 0.827%
    • Dependent repos count: 3.996%
    • Average: 6.418%
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent packages count: 11.323%
repo1.maven.org: org.apache.tika:tika-bundle

OSGi bundle that contains the tika-parsers component and all its upstream dependencies that aren't OSGI bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 1.28.5 (published over 3 years ago)
  • Last Synced: 2026-03-18T17:48:04.184Z (9 days ago)
  • Versions: 41
  • Dependent Packages: 8
  • Dependent Repositories: 47
  • Docker Downloads: 1,285,275
  • Rankings:
    • Dependent repos count: 3.111%
    • Docker downloads count: 6.259%
    • Average: 6.536%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Stargazers count: 9.144%
repo1.maven.org: org.apache.tika:tika-parser-ocr-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-ocr-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-07T07:02:54.930Z (20 days ago)
  • Versions: 26
  • Dependent Packages: 10
  • Dependent Repositories: 5
  • Docker Downloads: 31,777,907
  • Rankings:
    • Docker downloads count: 0.71%
    • Dependent packages count: 6.583%
    • Average: 6.816%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 10.821%
repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-22T22:06:17.290Z (about 1 month ago)
  • Versions: 26
  • Dependent Packages: 3
  • Dependent Repositories: 43
  • Docker Downloads: 32,367,042
  • Rankings:
    • Docker downloads count: 0.669%
    • Dependent repos count: 3.306%
    • Forks count: 6.822%
    • Average: 7.455%
    • Stargazers count: 9.144%
    • Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-parser-sqlite3-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/tika-parsers/tika-parsers-extended/tika-parser-sqlite3-module/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-module/
  • Licenses: Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-05T03:02:34.680Z (23 days ago)
  • Versions: 26
  • Dependent Packages: 5
  • Dependent Repositories: 49
  • Rankings:
    • Dependent repos count: 3.038%
    • Forks count: 6.822%
    • Average: 7.582%
    • Stargazers count: 9.144%
    • Dependent packages count: 11.323%
repo1.maven.org: org.apache.tika:tika-parser-scientific-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-scientific-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-20T03:24:40.135Z (7 days ago)
  • Versions: 26
  • Dependent Packages: 5
  • Dependent Repositories: 47
  • Rankings:
    • Dependent repos count: 3.111%
    • Forks count: 6.822%
    • Average: 7.6%
    • Stargazers count: 9.144%
    • Dependent packages count: 11.323%
repo1.maven.org: org.apache.tika:tika-parser-html-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-html-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.9.4 (published 11 months ago)
  • Last Synced: 2026-03-02T16:46:56.987Z (25 days ago)
  • Versions: 18
  • Dependent Packages: 14
  • Dependent Repositories: 2
  • Docker Downloads: 30,276,116
  • Rankings:
    • Docker downloads count: 0.711%
    • Dependent packages count: 5.473%
    • Forks count: 6.822%
    • Average: 7.629%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-batch

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-batch/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-22T15:41:07.482Z (about 1 month ago)
  • Versions: 54
  • Dependent Packages: 4
  • Dependent Repositories: 12
  • Docker Downloads: 616,888
  • Rankings:
    • Docker downloads count: 2.039%
    • Forks count: 6.822%
    • Dependent repos count: 6.963%
    • Average: 7.744%
    • Stargazers count: 9.144%
    • Dependent packages count: 13.753%
repo1.maven.org: org.apache.tika:tika-parser-digest-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-digest-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-22T15:50:00.961Z (5 days ago)
  • Versions: 26
  • Dependent Packages: 8
  • Dependent Repositories: 2
  • Docker Downloads: 31,777,847
  • Rankings:
    • Docker downloads count: 0.71%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Average: 8.002%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-emitter-fs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-fs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-17T01:51:53.921Z (11 days ago)
  • Versions: 25
  • Dependent Packages: 8
  • Dependent Repositories: 2
  • Docker Downloads: 20,240,607
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.822%
    • Dependent packages count: 7.341%
    • Average: 8.026%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-server-core

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-core/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-20T09:21:49.036Z (7 days ago)
  • Versions: 26
  • Dependent Packages: 4
  • Dependent Repositories: 3
  • Docker Downloads: 20,238,617
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.812%
    • Average: 8.838%
    • Stargazers count: 9.132%
    • Dependent repos count: 13.663%
    • Dependent packages count: 13.753%
repo1.maven.org: org.apache.tika:tika-parser-mail-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-mail-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-12T16:36:30.067Z (15 days ago)
  • Versions: 26
  • Dependent Packages: 6
  • Dependent Repositories: 1
  • Docker Downloads: 64,145,824
  • Rankings:
    • Docker downloads count: 0.568%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 9.348%
    • Dependent packages count: 9.56%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-langdetect-tika

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-tika/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-21T12:35:36.826Z (about 1 month ago)
  • Versions: 25
  • Dependent Packages: 1
  • Dependent Repositories: 53
  • Docker Downloads: 946,929,633
  • Rankings:
    • Docker downloads count: 0.11%
    • Dependent repos count: 2.883%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 10.339%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-httpclient-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-httpclient-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-11T07:47:21.623Z (16 days ago)
  • Versions: 25
  • Dependent Packages: 7
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Dependent packages count: 8.262%
    • Stargazers count: 9.132%
    • Average: 11.213%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-eval-core

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-core/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T14:06:19.184Z (8 days ago)
  • Versions: 26
  • Dependent Packages: 4
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 11.428%
    • Dependent packages count: 13.753%
    • Dependent repos count: 15.993%
repo1.maven.org: org.apache.tika:tika-langdetect-test-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-test-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-22T16:44:06.817Z (5 days ago)
  • Versions: 25
  • Dependent Packages: 6
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent packages count: 9.56%
    • Average: 11.543%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-parent

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parent/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-21T15:27:22.162Z (6 days ago)
  • Versions: 69
  • Dependent Packages: 2
  • Dependent Repositories: 7
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 9.173%
    • Average: 12.014%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-fetcher-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T10:17:48.537Z (8 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-emitter-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-24T17:05:58.876Z (about 1 month ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-s3

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-s3/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-17T21:03:47.735Z (10 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 12.323%
    • Dependent repos count: 15.993%
    • Dependent packages count: 17.332%
repo1.maven.org: org.apache.tika:tika-parser-sqlite3-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-sqlite3-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-26T17:46:38.589Z (29 days ago)
  • Versions: 23
  • Dependent Packages: 2
  • Dependent Repositories: 3
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 13.131%
    • Dependent repos count: 13.663%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-emitter-solr

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-solr/
  • Licenses: Apache-2.0
  • Latest release: 3.0.0 (published over 1 year ago)
  • Last Synced: 2026-02-21T14:00:52.253Z (about 1 month ago)
  • Versions: 18
  • Dependent Packages: 3
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 13.486%
    • Dependent packages count: 17.332%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-solr

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-solr/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-17T18:23:55.608Z (10 days ago)
  • Versions: 25
  • Dependent Packages: 3
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 13.486%
    • Dependent packages count: 17.332%
    • Dependent repos count: 20.645%
repo1.maven.org: org.apache.tika:tika-parser-webarchive-module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-webarchive-module/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T03:01:51.925Z (9 days ago)
  • Versions: 19
  • Dependent Packages: 3
  • Dependent Repositories: 0
  • Docker Downloads: 31,499,356
  • Rankings:
    • Docker downloads count: 0.72%
    • Forks count: 6.765%
    • Stargazers count: 9.095%
    • Average: 14.313%
    • Dependent packages count: 22.565%
    • Dependent repos count: 32.421%
repo1.maven.org: org.apache.tika:tika-parser-scientific-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-scientific-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-18T11:48:15.113Z (9 days ago)
  • Versions: 23
  • Dependent Packages: 1
  • Dependent Repositories: 8
  • Rankings:
    • Forks count: 6.822%
    • Dependent repos count: 8.595%
    • Stargazers count: 9.144%
    • Average: 14.324%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-eval

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-07T19:45:57.441Z (20 days ago)
  • Versions: 47
  • Dependent Packages: 1
  • Dependent Repositories: 6
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent repos count: 9.917%
    • Average: 14.648%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-langdetect-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-langdetect-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-15T06:46:23.279Z (12 days ago)
  • Versions: 1
  • Dependent Packages: 4
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Dependent packages count: 13.421%
    • Average: 14.685%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-parser-jdbc-commons

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-jdbc-commons/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-03T22:35:11.019Z (24 days ago)
  • Versions: 26
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 14.876%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-transcribe-aws

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-transcribe-aws/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-14T20:34:37.414Z (13 days ago)
  • Versions: 25
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 14.876%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-emitter-opensearch

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-opensearch/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-17T22:02:45.392Z (10 days ago)
  • Versions: 24
  • Dependent Packages: 2
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 14.882%
    • Dependent repos count: 20.645%
    • Dependent packages count: 22.916%
repo1.maven.org: org.apache.tika:tika-dl

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-dl/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-22T18:22:16.253Z (5 days ago)
  • Versions: 47
  • Dependent Packages: 1
  • Dependent Repositories: 4
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 12.011%
    • Average: 15.178%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-26T09:10:26.309Z (1 day ago)
  • Versions: 61
  • Dependent Packages: 0
  • Dependent Repositories: 8
  • Docker Downloads: 13,306
  • Rankings:
    • Docker downloads count: 3.067%
    • Forks count: 6.822%
    • Dependent repos count: 8.595%
    • Stargazers count: 9.144%
    • Average: 15.503%
    • Dependent packages count: 49.885%
  • Advisories:
repo1.maven.org: org.apache.tika:tika-fetcher-http

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-http/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-08T21:34:34.362Z (19 days ago)
  • Versions: 25
  • Dependent Packages: 1
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.173%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-jdbc

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-jdbc/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-23T14:46:35.288Z (4 days ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.173%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-server-standard

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-20T02:34:31.848Z (8 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Docker Downloads: 20,238,617
  • Rankings:
    • Docker downloads count: 0.828%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 16.534%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-async-cli

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-async-cli/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-24T07:21:38.465Z (3 days ago)
  • Versions: 17
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Docker Downloads: 9
  • Rankings:
    • Forks count: 5.227%
    • Stargazers count: 7.527%
    • Average: 16.774%
    • Dependent packages count: 22.361%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-kafka

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-kafka/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-26T01:52:44.813Z (1 day ago)
  • Versions: 18
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.628%
    • Stargazers count: 7.48%
    • Average: 16.862%
    • Dependent packages count: 22.361%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-emitter-kafka

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-kafka/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T09:36:24.846Z (11 days ago)
  • Versions: 17
  • Dependent Packages: 2
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.628%
    • Stargazers count: 7.48%
    • Average: 16.862%
    • Dependent packages count: 22.361%
    • Dependent repos count: 31.98%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-04T19:33:38.516Z (23 days ago)
  • Versions: 23
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 17.33%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-jdbc

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-jdbc/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T17:48:41.942Z (8 days ago)
  • Versions: 17
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Average: 17.33%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-age-recogniser

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-age-recogniser/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-14T15:32:18.868Z (13 days ago)
  • Versions: 26
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-bundle-standard

OSGi bundle that contains the tika-parsers-standard component and all its upstream dependencies that aren't OSGI bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-21T13:41:02.497Z (about 1 month ago)
  • Versions: 25
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-fetcher-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-07T19:47:07.937Z (20 days ago)
  • Versions: 23
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-emitter-gcs

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-gcs/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T09:37:22.046Z (11 days ago)
  • Versions: 23
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.336%
    • Dependent repos count: 20.645%
    • Dependent packages count: 32.733%
repo1.maven.org: org.apache.tika:tika-example

This module contains examples of how to use Apache Tika.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-example/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-03T18:48:34.686Z (24 days ago)
  • Versions: 56
  • Dependent Packages: 0
  • Dependent Repositories: 20
  • Rankings:
    • Dependent repos count: 5.315%
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Average: 17.792%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-detectors

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detectors/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-21T15:42:36.234Z (6 days ago)
  • Versions: 16
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.227%
    • Stargazers count: 7.527%
    • Average: 19.183%
    • Dependent repos count: 31.98%
    • Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-bom

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bom/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-07T09:03:33.931Z (20 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 15.993%
    • Average: 20.461%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-eval-app

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-eval-app/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T08:47:21.658Z (11 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.812%
    • Stargazers count: 9.132%
    • Dependent repos count: 20.645%
    • Average: 21.618%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-nlp

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://maven.apache.org
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-nlp/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 1.28.5 (published over 3 years ago)
  • Last Synced: 2026-03-04T01:47:49.010Z (24 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Rankings:
    • Forks count: 6.822%
    • Stargazers count: 9.144%
    • Dependent repos count: 20.645%
    • Average: 21.624%
    • Dependent packages count: 49.885%
repo1.maven.org: org.apache.tika:tika-emitter-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-emitter-az-blob/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-15T06:40:08.748Z (12 days ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-solr-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-solr-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-26T14:18:11.839Z (29 days ago)
  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterator-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterator-az-blob/
  • Licenses: Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-07T10:01:37.209Z (20 days ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parser-advancedmedia-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-advancedmedia-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-25T02:20:14.479Z (3 days ago)
  • Versions: 20
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundles

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundles/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-18T01:34:01.897Z (10 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-bundle-classic

OSGi bundle that contains the tika-parsers-classic component and all its upstream dependencies that aren't OSGI bundles by themselves. This bundle exports no packages, only the Parser and Detector services from the tika-parsers component.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-bundle-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-02T14:40:47.306Z (25 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-18T05:49:18.529Z (9 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-ml

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-ml/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T09:38:25.145Z (11 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-BETA (published almost 5 years ago)
  • Last Synced: 2026-02-28T22:32:04.519Z (27 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-advanced

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-advanced/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-07T20:01:46.478Z (20 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-classic-modules

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-classic-modules/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-18T07:33:47.474Z (9 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-standard

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-20T09:01:38.172Z (7 days ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-22T18:19:59.465Z (about 1 month ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-iterators

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-iterators/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-23T15:49:57.392Z (4 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-standard-modules

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-standard-modules/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-25T18:36:31.984Z (2 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T17:35:05.228Z (11 days ago)
  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-classic

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: http://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-classic/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 2.0.0-ALPHA (published about 5 years ago)
  • Last Synced: 2026-03-14T23:07:21.719Z (13 days ago)
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-s3-integration-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-s3-integration-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T17:44:14.253Z (11 days ago)
  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parser-nlp-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parser-nlp-package/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T17:46:13.178Z (11 days ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-02-21T13:42:00.385Z (about 1 month ago)
  • Versions: 25
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-parsers-extended

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-parsers-extended/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-05T19:19:24.636Z (22 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-fetcher-az-blob

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetcher-az-blob/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-03T08:51:40.210Z (24 days ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-server-eval

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-server-eval/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-05T18:18:21.531Z (22 days ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.513%
    • Stargazers count: 7.827%
    • Average: 23.545%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporter-fs-status

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporter-fs-status/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-05T04:46:59.812Z (22 days ago)
  • Versions: 17
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-resource-loading-tests

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-resource-loading-tests/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-16T18:06:21.561Z (11 days ago)
  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-pipes-reporters

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-pipes-reporters/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-24T18:46:09.806Z (3 days ago)
  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Forks count: 5.58%
    • Stargazers count: 8.488%
    • Average: 23.727%
    • Dependent repos count: 31.98%
    • Dependent packages count: 48.86%
repo1.maven.org: org.apache.tika:tika-detector-siegfried

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-detector-siegfried/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.2.3 (published 7 months ago)
  • Last Synced: 2026-03-19T14:05:21.887Z (8 days ago)
  • Versions: 16
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Rankings:
    • Dependent repos count: 31.98%
    • Average: 31.989%
    • Dependent packages count: 31.998%
repo1.maven.org: org.apache.tika:tika-fetchers

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

  • Homepage: https://tika.apache.org/
  • Documentation: https://appdoc.app/artifact/org.apache.tika/tika-fetchers/
  • Licenses: Apache-2.0,Apache-2.0
  • Latest release: 3.3.0 (published 9 days ago)
  • Last Synced: 2026-03-25T05:56:06.041Z (2 days ago)
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Rankings:
    • Dependent repos count: 31.98%
    • Average: 40.42%
    • Dependent packages count: 48.86%

Dependencies

tika-app/pom.xml maven
  • org.apache.logging.log4j:log4j-core ${log4j2.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.tika:tika-batch 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
  • org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
  • org.slf4j:jcl-over-slf4j
tika-bom/pom.xml maven
  • org.apache.tika:tika-age-recogniser 2.4.2-SNAPSHOT
  • org.apache.tika:tika-bundle-standard 2.4.2-SNAPSHOT
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-dl 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-fs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-opensearch 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-emitter-solr 2.4.2-SNAPSHOT
  • org.apache.tika:tika-eval-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-http 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fetcher-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-fuzzing 2.4.2-SNAPSHOT
  • org.apache.tika:tika-httpclient-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-java7 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-lingo24 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-mitll-text 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-opennlp 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-optimaize 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-test-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-langdetect-tika 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-advancedmedia-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-apple-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-audiovideo-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-cad-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-code-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-crypto-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-digest-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-font-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-html-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-html-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-image-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-jdbc-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-mail-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-mail-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-microsoft-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-miscoffice-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-news-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-nlp-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-ocr-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-pdf-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-pkg-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-scientific-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-scientific-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-sqlite3-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-sqlite3-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-text-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-xml-module 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-xmp-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parser-zip-commons 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-csv 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-gcs 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-jdbc 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-s3 2.4.2-SNAPSHOT
  • org.apache.tika:tika-pipes-iterator-solr 2.4.2-SNAPSHOT
  • org.apache.tika:tika-serialization 2.4.2-SNAPSHOT
  • org.apache.tika:tika-server-client 2.4.2-SNAPSHOT
  • org.apache.tika:tika-server-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-transcribe-aws 2.4.2-SNAPSHOT
  • org.apache.tika:tika-translate 2.4.2-SNAPSHOT
  • org.apache.tika:tika-xmp 2.4.2-SNAPSHOT
tika-bundles/pom.xml maven
  • org.osgi:org.osgi.compendium ${osgi.compendium.version}
tika-bundles/tika-bundle-standard/pom.xml maven
  • ${project.groupId}:tika-core 2.4.2-SNAPSHOT
  • ${project.groupId}:tika-parsers-standard-package 2.4.2-SNAPSHOT
  • com.sun.activation:javax.activation 1.2.0
  • org.apache.logging.log4j:log4j-api
  • com.sun.xml.fastinfoset:FastInfoset 2.1.0 test
  • javax.inject:javax.inject 1 test
  • org.apache.felix:org.apache.felix.framework 7.0.5 test
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version} test
  • org.ops4j.pax.exam:pax-exam-container-native ${pax.exam.version} test
  • org.ops4j.pax.exam:pax-exam-junit4 ${pax.exam.version} test
  • org.ops4j.pax.exam:pax-exam-link-assembly ${pax.exam.version} test
  • org.ops4j.pax.url:pax-url-aether 2.6.1 test
  • org.osgi:org.osgi.core ${osgi.core.version} test
  • org.slf4j:slf4j-simple ${slf4j.version} test
tika-core/pom.xml maven
  • biz.aQute.bnd:biz.aQute.bndlib provided
  • org.osgi:org.osgi.compendium ${osgi.compendium.version} provided
  • org.osgi:org.osgi.core ${osgi.core.version} provided
  • commons-io:commons-io ${commons.io.version}
  • org.slf4j:slf4j-api
  • com.google.guava:guava ${guava.version} test
  • com.martensigwart:fakeload ${fakeload.version} test
  • org.apache.logging.log4j:log4j-core ${log4j2.version} test
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version} test
  • org.junit.jupiter:junit-jupiter-api ${junit5.version} test
  • org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
tika-eval/tika-eval-app/pom.xml maven
  • com.h2database:h2 ${h2.version}
  • org.apache.logging.log4j:log4j-core ${log4j2.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.poi:poi-ooxml ${poi.version}
  • org.apache.tika:tika-batch ${project.version}
  • org.apache.tika:tika-eval-core ${project.version}
  • org.apache.tika:tika-batch ${project.version} test
  • org.apache.tika:tika-core ${project.version} test
tika-eval/tika-eval-core/pom.xml maven
  • ${project.groupId}:tika-core ${project.version}
  • ${project.groupId}:tika-langdetect-opennlp ${project.version}
  • ${project.groupId}:tika-serialization ${project.version}
  • com.fasterxml.jackson.core:jackson-databind
  • commons-codec:commons-codec ${commons.codec.version}
  • org.apache.commons:commons-lang3 ${commons.lang3.version}
  • org.apache.commons:commons-math3 ${commons.math3.version}
  • org.apache.lucene:lucene-analyzers-common ${lucene.version}
  • org.apache.lucene:lucene-analyzers-icu ${lucene.version}
  • org.apache.lucene:lucene-core ${lucene.version}
  • org.ccil.cowan.tagsoup:tagsoup 1.2.1
  • org.apache.lucene:lucene-memory ${lucene.version} test
  • org.apache.tika:tika-core ${project.version} test
tika-example/pom.xml maven
  • ${project.groupId}:tika-langdetect-optimaize ${project.version}
  • javax.jcr:jcr ${javax.jcr.version}
  • org.apache.jackrabbit:jackrabbit-core ${jackrabbit.version}
  • org.apache.jackrabbit:jackrabbit-jcr-server ${jackrabbit.version}
  • org.apache.lucene:lucene-core ${lucene.version}
  • org.apache.tika:tika-app ${project.version}
  • org.apache.tika:tika-eval-core ${project.version}
  • org.apache.tika:tika-serialization ${project.version}
  • org.apache.tika:tika-transcribe-aws ${project.version}
  • org.apache.tika:tika-translate ${project.version}
  • org.osgi:org.osgi.compendium ${osgi.compendium.version}
  • org.springframework:spring-context ${spring.version}
  • org.apache.tika:tika-core ${project.version} test
tika-fuzzing/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • commons-cli:commons-cli ${commons.cli.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.apache.tika:tika-parser-digest-commons ${project.version}
  • org.apache.tika:tika-parser-pdf-module ${project.version}
  • org.apache.tika:tika-parser-pkg-module ${project.version}
  • org.slf4j:jcl-over-slf4j
  • ${project.groupId}:tika-core ${project.version} test
tika-integration-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-serialization ${project.version} test
  • org.junit.vintage:junit-vintage-engine ${junit5.version} test
tika-integration-tests/tika-pipes-opensearch-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-emitter-opensearch ${project.version} test
  • net.java.dev.jna:jna ${jna.version} test
  • org.testcontainers:testcontainers ${test.containers.version} test
tika-integration-tests/tika-pipes-s3-integration-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-emitter-s3 ${project.version} test
  • ${project.groupId}:tika-fetcher-s3 ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-s3 ${project.version} test
tika-integration-tests/tika-pipes-solr-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-emitter-solr ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-solr ${project.version} test
  • org.apache.solr:solr-solrj ${solrj.version} test
  • org.testcontainers:testcontainers ${test.containers.version} test
tika-integration-tests/tika-resource-loading-tests/pom.xml maven
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT test
tika-java7/pom.xml maven
  • biz.aQute.bnd:biz.aQute.bndlib provided
  • org.apache.tika:tika-core 2.4.2-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 2.4.2-SNAPSHOT
tika-langdetect/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • org.junit.jupiter:junit-jupiter-api ${junit5.version} test
  • org.junit.jupiter:junit-jupiter-engine ${junit5.version} test
tika-langdetect/tika-langdetect-lingo24/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • org.apache.cxf:cxf-rt-rs-client ${cxf.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-mitll-text/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • org.apache.cxf:cxf-rt-rs-client ${cxf.version}
  • org.apache.logging.log4j:log4j-slf4j-impl ${log4j2.version}
  • org.glassfish.jaxb:jaxb-runtime ${jaxb.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-opennlp/pom.xml maven
  • org.apache.opennlp:opennlp-tools ${opennlp.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
.github/workflows/main-jdk17-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v1 composite
.github/workflows/main-jdk21-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v1 composite
tika-integration-tests/tika-pipes-s3-integration-tests/src/test/resources/docker-compose.yml docker
  • quay.io/minio/minio latest
pom.xml maven
tika-detectors/pom.xml maven
tika-detectors/tika-detector-siegfried/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • com.fasterxml.jackson.core:jackson-databind
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-parsers-standard-package ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-eval/pom.xml maven
tika-handlers/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-handlers/tika-handler-boilerpipe/pom.xml maven
  • de.l3s.boilerpipe:boilerpipe 1.1.0
tika-integration-tests/tika-pipes-kafka-integration-tests/pom.xml maven
  • ${project.groupId}:tika-app ${project.version} test
  • ${project.groupId}:tika-core ${project.version} test
  • ${project.groupId}:tika-emitter-kafka ${project.version} test
  • ${project.groupId}:tika-pipes-iterator-kafka ${project.version} test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
  • org.testcontainers:junit-jupiter test
  • org.testcontainers:kafka test
  • org.testcontainers:testcontainers test
tika-grpc/pom.xml maven
  • org.apache.tomcat:annotations-api 6.0.53 provided
  • com.beust:jcommander
  • com.fasterxml.jackson.module:jackson-module-jsonSchema
  • com.google.guava:guava
  • com.google.j2objc:j2objc-annotations 3.0.0
  • com.google.protobuf:protobuf-java-util ${protobuf.version}
  • io.grpc:grpc-netty-shaded
  • io.grpc:grpc-protobuf
  • io.grpc:grpc-services
  • io.grpc:grpc-stub
  • org.apache.logging.log4j:log4j-core
  • org.apache.logging.log4j:log4j-slf4j2-impl
  • org.apache.tika:tika-async-cli 4.0.0-SNAPSHOT
  • org.apache.tika:tika-core 4.0.0-SNAPSHOT
  • org.apache.tika:tika-fetcher-http 4.0.0-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT
  • org.slf4j:jcl-over-slf4j
  • com.asarkar.grpc:grpc-test 1.2.2 test
  • io.grpc:grpc-testing test
  • org.awaitility:awaitility 4.2.2 test
  • org.eclipse.jetty:jetty-server test
  • org.mockito:mockito-core test
tika-detectors/tika-detector-magika/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • com.fasterxml.jackson.core:jackson-databind
  • ${project.groupId}:tika-core ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-langdetect/tika-langdetect-optimaize/pom.xml maven
  • com.optimaize.languagedetector:language-detector ${optimaize.version}
  • org.jetbrains:annotations 26.0.2
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-langdetect/tika-langdetect-test-commons/pom.xml maven
tika-langdetect/tika-langdetect-tika/pom.xml maven
  • com.optimaize.languagedetector:language-detector ${optimaize.version}
  • ${project.groupId}:tika-langdetect-test-commons ${project.version} test
tika-parent/pom.xml maven
  • org.junit.jupiter:junit-jupiter-api 5.13.0-M3 test
  • org.junit.jupiter:junit-jupiter-engine 5.13.0-M3 test
tika-parsers/pom.xml maven
  • org.apache.tika:tika-core ${project.version} test
  • org.junit.jupiter:junit-jupiter-api test
  • org.junit.jupiter:junit-jupiter-engine test
tika-parsers/tika-parsers-extended/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-parsers/tika-parsers-extended/tika-parser-scientific-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • edu.ucar:grib ${netcdf-java.version}
  • edu.ucar:netcdf4 ${netcdf-java.version}
  • javax.measure:unit-api
  • net.jcip:jcip-annotations 1.0
  • org.apache.commons:commons-csv
  • org.apache.sis.core:sis-metadata
  • org.apache.sis.core:sis-utility
  • org.apache.sis.storage:sis-netcdf
  • org.glassfish.jaxb:jaxb-runtime
  • org.opengis:geoapi
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/pom.xml maven
  • org.apache.tika:tika-parser-scientific-module 4.0.0-SNAPSHOT
  • org.apache.tika:tika-parsers-standard-package 4.0.0-SNAPSHOT test
tika-parsers/tika-parsers-extended/tika-parser-sqlite3-module/pom.xml maven
  • ${project.groupId}:tika-parser-jdbc-commons ${project.version}
  • org.xerial:sqlite-jdbc ${sqlite.version}
tika-parsers/tika-parsers-extended/tika-parser-sqlite3-package/pom.xml maven
  • ${project.groupId}:tika-parser-sqlite3-module ${project.version}
tika-parsers/tika-parsers-extended/tika-parsers-extended-integration-tests/pom.xml maven
  • ${project.groupId}:tika-parser-scientific-module ${project.version} test
  • ${project.groupId}:tika-parser-sqlite3-module ${project.version} test
  • ${project.groupId}:tika-parser-sqlite3-package ${project.version} test
  • ${project.groupId}:tika-parsers-standard-package ${project.version} test
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-parsers/tika-parsers-ml/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
tika-parsers/tika-parsers-ml/tika-parser-nlp-module/pom.xml maven
  • org.apache.ctakes:ctakes-core ${ctakes.version} provided
  • ${project.groupId}:tika-parser-pdf-module ${project.version}
  • com.github.openjson:openjson ${openjson.version}
  • com.google.code.gson:gson
  • com.googlecode.json-simple:json-simple
  • commons-codec:commons-codec
  • edu.usc.ir:sentiment-analysis-parser 0.1
  • jakarta.annotation:jakarta.annotation-api
  • org.apache.cxf:cxf-rt-rs-client
  • org.apache.httpcomponents:httpclient
  • org.apache.httpcomponents:httpcore
  • org.slf4j:log4j-over-slf4j test
tika-parsers/tika-parsers-ml/tika-parser-nlp-package/pom.xml maven
  • org.apache.tika:tika-parser-nlp-module 4.0.0-SNAPSHOT
tika-parsers/tika-parsers-ml/tika-transcribe-aws/pom.xml maven
  • com.fasterxml.jackson.core:jackson-databind
  • com.googlecode.json-simple:json-simple
  • javax.xml.bind:jaxb-api 2.3.1
  • software.amazon.awssdk:s3
  • software.amazon.awssdk:sts
  • software.amazon.awssdk:transcribe
  • org.slf4j:slf4j-simple test
tika-parsers/tika-parsers-standard/pom.xml maven
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/pom.xml maven
  • ${project.groupId}:tika-core ${project.version} provided
  • org.apache.logging.log4j:log4j-core test
  • org.apache.logging.log4j:log4j-slf4j2-impl test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-apple-module/pom.xml maven
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • com.googlecode.plist:dd-plist ${ddplist.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/pom.xml maven
  • com.drewnoakes:metadata-extractor
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-cad-module/pom.xml maven
  • ${project.groupId}:tika-parser-microsoft-module ${project.version}
  • com.fasterxml.jackson.core:jackson-core
  • com.fasterxml.jackson.core:jackson-databind
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-code-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • com.epam:parso ${parso.version}
  • org.apache.commons:commons-lang3
  • org.codelibs:jhighlight ${jhighlight.version}
  • org.jsoup:jsoup
  • org.ow2.asm:asm ${asm.version}
  • org.tallison:jmatio 1.5
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-crypto-module/pom.xml maven
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-digest-commons/pom.xml maven
  • commons-codec:commons-codec
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-font-module/pom.xml maven
  • org.apache.pdfbox:fontbox ${pdfbox.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-html-module/pom.xml maven
  • commons-codec:commons-codec
  • org.jsoup:jsoup
  • ${project.groupId}:tika-parser-text-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-image-module/pom.xml maven
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • com.drewnoakes:metadata-extractor
  • com.github.jai-imageio:jai-imageio-core
  • org.apache.pdfbox:jbig2-imageio ${jbig2.version}
  • ${project.groupId}:tika-parser-xmp-commons ${project.version} test
  • com.github.jai-imageio:jai-imageio-jpeg2000 ${imageio.version} test
  • org.mockito:mockito-core test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-jdbc-commons/pom.xml maven
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-mail-commons/pom.xml maven
  • org.apache.james:apache-mime4j-core ${mime4j.version}
  • org.apache.james:apache-mime4j-dom ${mime4j.version}
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-mail-module/pom.xml maven
  • ${project.groupId}:tika-parser-html-module ${project.version}
  • ${project.groupId}:tika-parser-mail-commons ${project.version}
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-ocr-module ${project.version} test
  • org.mockito:mockito-core test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/pom.xml maven
  • ${project.groupId}:tika-parser-html-module ${project.version}
  • ${project.groupId}:tika-parser-mail-commons ${project.version}
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-xml-module ${project.version}
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • com.healthmarketscience.jackcess:jackcess
  • com.healthmarketscience.jackcess:jackcess-encrypt
  • com.pff:java-libpst ${libpst.version}
  • commons-codec:commons-codec
  • commons-logging:commons-logging
  • org.apache.commons:commons-lang3
  • org.apache.poi:poi
  • org.apache.poi:poi-ooxml
  • org.apache.poi:poi-scratchpad ${poi.version}
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
  • org.slf4j:slf4j-api
  • ${project.groupId}:tika-parser-mail-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/pom.xml maven
  • ${project.groupId}:tika-parser-text-module ${project.version}
  • ${project.groupId}:tika-parser-xml-module ${project.version}
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • ${project.groupId}:tika-parser-zip-commons ${project.version}
  • commons-codec:commons-codec
  • org.apache.commons:commons-collections4
  • org.apache.commons:commons-lang3
  • org.apache.poi:poi
  • org.glassfish.jaxb:jaxb-runtime
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-news-module/pom.xml maven
  • com.rometools:rome ${rome.version}
  • org.slf4j:slf4j-api
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-ocr-module/pom.xml maven
  • org.apache.commons:commons-exec
  • org.apache.commons:commons-lang3
  • ${project.groupId}:tika-parser-image-module ${project.version} test
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/pom.xml maven
  • ${project.groupId}:tika-parser-xmp-commons ${project.version}
  • org.apache.pdfbox:jempbox ${jempbox.version}
  • org.apache.pdfbox:pdfbox ${pdfbox.version}
  • org.apache.pdfbox:pdfbox-tools ${pdfbox.version}
  • org.bouncycastle:bcjmail-jdk18on
  • org.bouncycastle:bcprov-jdk18on
  • org.glassfish.jaxb:jaxb-runtime
  • com.github.jai-imageio:jai-imageio-core test
  • org.slf4j:jcl-over-slf4j test
.github/workflows/main-jdk25-build.yml actions
  • actions/checkout v4 composite
  • actions/setup-java v4 composite
tika-integration-tests/tika-woodstox-tests/pom.xml maven
  • ${project.groupId}:tika-core ${project.version}
  • com.fasterxml.woodstox:woodstox-core