https://github.com/microsoft/markitdown
autogen autogen-extension langchain markdown microsoft-office openai pdf
Score: 32.18677900947582
Last synced: about 8 hours ago
JSON representation
Repository metadata:
Python tool for converting files and office documents to Markdown.
- Host: GitHub
- URL: https://github.com/microsoft/markitdown
- Owner: microsoft
- License: mit
- Created: 2024-11-13T19:56:40.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2026-05-26T22:41:34.000Z (24 days ago)
- Last Synced: 2026-06-08T21:31:26.634Z (11 days ago)
- Topics: autogen, autogen-extension, langchain, markdown, microsoft-office, openai, pdf
- Language: Python
- Homepage:
- Size: 4.09 MB
- Stars: 148,355
- Watchers: 483
- Forks: 10,180
- Open Issues: 824
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
- Support: SUPPORT.md
Owner metadata:
- Name: Microsoft
- Login: microsoft
- Email: opensource@microsoft.com
- Kind: organization
- Description: Open source projects and samples from Microsoft
- Website: https://opensource.microsoft.com
- Location: Redmond, WA
- Twitter: OpenAtMicrosoft
- Company:
- Icon url: https://avatars.githubusercontent.com/u/6154722?v=4
- Repositories: 7900
- Last Synced at: 2026-06-13T00:24:32.671Z
- Profile URL: https://github.com/microsoft
Committers metadata
Last synced: 5 days ago
Total Commits: 220
Total Committers: 80
Avg Commits per committer: 2.75
Development Distribution Score (DDS): 0.659
Commits in past year: 30
Committers in past year: 18
Avg Commits per committer in past year: 1.667
Development Distribution Score (DDS) in past year: 0.7
| Name | Commits | |
|---|---|---|
| afourney | a****o@m****m | 75 |
| gagb | g****b | 16 |
| Petr@AP Consulting | 1****g | 8 |
| Josh XT | j****h@d****m | 6 |
| Soulter | 9****2@q****m | 6 |
| Microsoft Open Source | m****e | 5 |
| Sugato Ray | s****y | 5 |
| lesyk | l****k | 5 |
| dependabot[bot] | 4****] | 4 |
| lumin | 7****n | 4 |
| KennyZhang1 | 9****1 | 3 |
| Simon Willison | s****n@g****m | 3 |
| SH4DOW4RE | a****0@g****m | 2 |
| Tomasz Kalinowski | k****t@g****m | 2 |
| Ville Puuska | p****e@g****m | 2 |
| Yuzhong Zhang | 1****I | 2 |
| kevinbabou | k****u@g****m | 2 |
| lumin | 7****n | 2 |
| sakasegawa | n****a@g****m | 2 |
| Richard Ye | 3****1 | 2 |
| Om Gupta | a****a@g****m | 2 |
| Michele Adduci | m****i@g****e | 2 |
| CharlesCNorton | 1****n | 2 |
| t3tra | a****n@t****t | 2 |
| Ebrahim Tayabali | 4****n | 1 |
| Dmitry | 9****t | 1 |
| Emanuele Meazzo | 6****p@g****m | 1 |
| Divyansh Singh | 4****d | 1 |
| Divit | 5****9 | 1 |
| Ikko Eltociear Ashimine | e****r@g****m | 1 |
| and 50 more... | ||
Issue and Pull Request metadata
Last synced: about 1 month ago
Total issues: 499
Total pull requests: 617
Average time to close issues: 17 days
Average time to close pull requests: 8 days
Total issue authors: 326
Total pull request authors: 229
Average comments per issue: 0.65
Average comments per pull request: 1.2
Merged pull request: 265
Bot issues: 0
Bot pull requests: 8
Past year issues: 92
Past year pull requests: 173
Past year average time to close issues: 4 days
Past year average time to close pull requests: 11 days
Past year issue authors: 90
Past year pull request authors: 111
Past year average comments per issue: 0.99
Past year average comments per pull request: 0.79
Past year merged pull request: 21
Past year bot issues: 0
Past year bot pull requests: 2
Top Issue Authors
- rayan3030 (61)
- bentomusique (41)
- Mindihap (22)
- nessamyaio (19)
- gagb (7)
- Viddesh1 (4)
- snake-plant-care-indoors (3)
- imDarshanGK (3)
- t-kalinowski (3)
- AdrianVollmer (2)
- luis440 (2)
- tanreinama (2)
- shoang22 (2)
- ImDiPhErEnT (2)
- asrar-mared (2)
Top Pull Request Authors
- afourney (117)
- gagb (23)
- l-lumin (20)
- KennyZhang1 (8)
- FeuRicardo (8)
- Jah-yee (8)
- dependabot[bot] (8)
- Unizzr (7)
- Viddesh1 (7)
- t3tra-dev (7)
- lh0x00 (6)
- yeungadrian (6)
- Abdujabbar (6)
- 0xRaduan (6)
- BetterAndBetterII (6)
Top Issue Labels
- enhancement (12)
- open for contribution (11)
- bug (5)
- question (3)
- good first issue (2)
- documentation (2)
- dependencies (1)
- duplicate (1)
- help wanted (1)
Top Pull Request Labels
- dependencies (8)
- awaiting op response (5)
- open for reviewing (4)
- github_actions (2)
- help wanted (2)
Package metadata
- Total packages: 20
-
Total downloads:
- pypi: 7,974,805 last-month
- conda: 405 total
- Total dependent packages: 0 (may contain duplicates)
- Total dependent repositories: 0 (may contain duplicates)
- Total versions: 83
- Total maintainers: 13
proxy.golang.org: github.com/microsoft/markitdown
- Homepage:
- Documentation: https://pkg.go.dev/github.com/microsoft/markitdown#section-documentation
- Licenses: mit
- Latest release: v0.1.6 (published 24 days ago)
- Last Synced: 2026-06-15T15:11:41.973Z (5 days ago)
- Versions: 9
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.695%
- Average: 5.885%
- Dependent repos count: 6.076%
proxy.golang.org: github.com/microsoft/MarkItDown
- Homepage:
- Documentation: https://pkg.go.dev/github.com/microsoft/MarkItDown#section-documentation
- Licenses: mit
- Latest release: v0.1.6 (published 24 days ago)
- Last Synced: 2026-06-15T15:11:44.210Z (5 days ago)
- Versions: 9
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.696%
- Average: 5.887%
- Dependent repos count: 6.078%
pypi.org: iflow-mcp_markitdown-mcp
An MCP server for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.0.1a4 (published 7 months ago)
- Last Synced: 2026-06-15T15:11:41.817Z (5 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 50 Last month
-
Rankings:
- Stargazers count: 0.05%
- Forks count: 0.286%
- Dependent packages count: 8.244%
- Average: 13.795%
- Dependent repos count: 46.6%
- Maintainers (1)
pypi.org: markitdowng
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 1.0.1 (published 10 months ago)
- Last Synced: 2026-06-15T15:11:42.718Z (5 days ago)
- Versions: 11
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 105 Last month
-
Rankings:
- Stargazers count: 0.064%
- Forks count: 0.367%
- Dependent packages count: 8.681%
- Average: 14.509%
- Dependent repos count: 48.922%
- Maintainers (1)
pypi.org: markitdown-gjx
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: mit
- Latest release: 0.1.2 (published 10 months ago)
- Last Synced: 2026-06-15T15:11:42.969Z (5 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Stargazers count: 0.064%
- Forks count: 0.367%
- Dependent packages count: 8.685%
- Average: 14.515%
- Dependent repos count: 48.945%
pypi.org: markitdown-sample-plugin
A sample plugin for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.0a1 (published over 1 year ago)
- Last Synced: 2026-06-15T15:11:40.509Z (5 days ago)
- Versions: 3
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 204 Last month
-
Rankings:
- Stargazers count: 0.197%
- Forks count: 1.441%
- Dependent packages count: 9.644%
- Average: 16.396%
- Dependent repos count: 54.302%
- Maintainers (2)
pypi.org: markitdown-paddleocr
Intelligent PDF/Image to Markdown converter using PaddleOCR cloud API
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.2.3 (published 18 days ago)
- Last Synced: 2026-06-15T15:11:42.703Z (5 days ago)
- Versions: 4
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 591 Last month
-
Rankings:
- Dependent packages count: 7.179%
- Downloads: 23.256%
- Average: 23.685%
- Dependent repos count: 40.621%
- Maintainers (1)
pypi.org: markitdownapi-sdk
Python gRPC SDK for MarkItDown microservice — MarkItDown, Storage, Pipeline, and message schemas
- Homepage: https://github.com/microsoft/markitdown
- Documentation: https://markitdownapi-sdk.readthedocs.io/
- Licenses: MIT
- Latest release: 0.2.0 (published 25 days ago)
- Last Synced: 2026-06-15T15:11:39.214Z (5 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 185 Last month
-
Rankings:
- Dependent packages count: 7.15%
- Average: 23.797%
- Dependent repos count: 40.445%
- Maintainers (1)
pypi.org: markitdown-glmocr
Intelligent PDF to Markdown converter using glmocr SDK
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.2.3 (published 18 days ago)
- Last Synced: 2026-06-15T15:11:41.793Z (5 days ago)
- Versions: 4
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 494 Last month
-
Rankings:
- Dependent packages count: 7.228%
- Average: 24.059%
- Dependent repos count: 40.889%
- Maintainers (1)
pypi.org: markitdown-wccyzxy
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.2 (published about 1 month ago)
- Last Synced: 2026-06-15T15:11:40.530Z (5 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 60 Last month
-
Rankings:
- Dependent packages count: 7.334%
- Average: 24.423%
- Downloads: 24.454%
- Dependent repos count: 41.48%
- Maintainers (1)
pypi.org: markitdown-docx-image-plugin
将Word文档中的图片提取到目录中,并在Markdown中使用相对路径引用
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.3.0 (published 8 months ago)
- Last Synced: 2026-06-15T15:11:42.707Z (5 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 312 Last month
-
Rankings:
- Dependent packages count: 8.453%
- Downloads: 22.946%
- Average: 26.394%
- Dependent repos count: 47.783%
- Maintainers (1)
pypi.org: markitdown-ocr
OCR plugin for MarkItDown - Extracts text from images in PDF, DOCX, PPTX, and XLSX via LLM Vision
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.0 (published 3 months ago)
- Last Synced: 2026-06-15T15:11:49.993Z (5 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 40,039 Last month
-
Rankings:
- Dependent packages count: 7.67%
- Average: 26.535%
- Downloads: 28.563%
- Dependent repos count: 43.373%
- Maintainers (3)
pypi.org: markitdown-no-magika
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.2 (published 12 months ago)
- Last Synced: 2026-06-15T15:11:40.511Z (5 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 115,848 Last month
-
Rankings:
- Dependent packages count: 8.914%
- Average: 29.573%
- Dependent repos count: 50.232%
- Maintainers (1)
pypi.org: markitdown-rtf-plugin
A RTF plugin via striprtf for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.0 (published about 1 year ago)
- Last Synced: 2026-06-15T15:11:42.728Z (5 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 8,220 Last month
-
Rankings:
- Dependent packages count: 9.394%
- Average: 31.151%
- Dependent repos count: 52.909%
- Maintainers (1)
pypi.org: markitdown-mcp
An MCP server for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.0.1a4 (published about 1 year ago)
- Last Synced: 2026-06-15T15:11:39.818Z (5 days ago)
- Versions: 4
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 127,057 Last month
-
Rankings:
- Dependent packages count: 9.425%
- Average: 31.253%
- Dependent repos count: 53.081%
- Maintainers (2)
pypi.org: markitdown
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.6 (published 24 days ago)
- Last Synced: 2026-06-15T15:11:40.525Z (5 days ago)
- Versions: 22
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 7,681,640 Last month
-
Rankings:
- Dependent packages count: 10.04%
- Average: 33.273%
- Dependent repos count: 56.505%
- Maintainers (2)
anaconda.org: markitdown
Python tool for converting files and office documents to Markdown.
- Homepage: https://github.com/microsoft/markitdown
- Licenses: MIT
- Latest release: 0.1.5 (published about 2 months ago)
- Last Synced: 2026-04-27T16:04:53.873Z (about 2 months ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 405 Total
-
Rankings:
- Dependent packages count: 45.275%
- Average: 47.525%
- Dependent repos count: 49.775%
nixpkgs-unstable: markitdown-mcp
MCP server for the markitdown library
- Homepage: https://github.com/microsoft/markitdown/tree/main/packages/markitdown-mcp
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/by-name/ma/markitdown-mcp/package.nix#L42
- Licenses: MIT
- Latest release: 0.1.5 (published 3 months ago)
- Last Synced: 2026-03-19T01:06:58.504Z (3 months ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-unstable: python313Packages.markitdown
Python tool for converting files and office documents to Markdown
- Homepage: https://github.com/microsoft/markitdown
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/markitdown/default.nix#L106
- Licenses: MIT
- Latest release: 0.1.4 (published 5 months ago)
- Last Synced: 2026-03-06T16:07:41.990Z (4 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Maintainers (1)
nixpkgs-unstable: python314Packages.markitdown
Python tool for converting files and office documents to Markdown
- Homepage: https://github.com/microsoft/markitdown
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/markitdown/default.nix#L106
- Licenses: MIT
- Latest release: 0.1.4 (published 5 months ago)
- Last Synced: 2026-03-08T04:35:10.330Z (3 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- SpeechRecognition *
- beautifulsoup4 *
- mammoth *
- markdownify *
- numpy *
- openpyxl *
- pandas *
- pathvalidate *
- pdfminer.six *
- puremagic *
- pydub *
- python-pptx *
- requests *
- youtube-transcript-api *
- python 3.13-slim-bullseye build