https://github.com/microsoft/markitdown
autogen autogen-extension langchain markdown microsoft-office openai pdf
Score: 30.53043131619862
Last synced: about 15 hours ago
JSON representation
Repository metadata:
Python tool for converting files and office documents to Markdown.
- Host: GitHub
- URL: https://github.com/microsoft/markitdown
- Owner: microsoft
- License: mit
- Created: 2024-11-13T19:56:40.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2026-02-20T19:41:55.000Z (about 1 month ago)
- Last Synced: 2026-03-09T09:14:09.590Z (18 days ago)
- Topics: autogen, autogen-extension, langchain, markdown, microsoft-office, openai, pdf
- Language: Python
- Homepage:
- Size: 3.57 MB
- Stars: 90,410
- Watchers: 320
- Forks: 5,325
- Open Issues: 461
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
- Support: SUPPORT.md
Owner metadata:
- Name: Microsoft
- Login: microsoft
- Email: opensource@microsoft.com
- Kind: organization
- Description: Open source projects and samples from Microsoft
- Website: https://opensource.microsoft.com
- Location: Redmond, WA
- Twitter: OpenAtMicrosoft
- Company:
- Icon url: https://avatars.githubusercontent.com/u/6154722?v=4
- Repositories: 7638
- Last Synced at: 2026-03-13T04:51:14.710Z
- Profile URL: https://github.com/microsoft
Committers metadata
Last synced: 19 days ago
Total Commits: 213
Total Committers: 78
Avg Commits per committer: 2.731
Development Distribution Score (DDS): 0.662
Commits in past year: 70
Committers in past year: 31
Avg Commits per committer in past year: 2.258
Development Distribution Score (DDS) in past year: 0.486
| Name | Commits | |
|---|---|---|
| afourney | a****o@m****m | 72 |
| gagb | g****b | 16 |
| Petr@AP Consulting | 1****g | 8 |
| Josh XT | j****h@d****m | 6 |
| Soulter | 9****2@q****m | 6 |
| Microsoft Open Source | m****e | 5 |
| Sugato Ray | s****y | 5 |
| dependabot[bot] | 4****] | 4 |
| lumin | 7****n | 4 |
| KennyZhang1 | 9****1 | 3 |
| Simon Willison | s****n@g****m | 3 |
| lesyk | l****k | 3 |
| SH4DOW4RE | a****0@g****m | 2 |
| Tomasz Kalinowski | k****t@g****m | 2 |
| Ville Puuska | p****e@g****m | 2 |
| Yuzhong Zhang | 1****I | 2 |
| kevinbabou | k****u@g****m | 2 |
| lumin | 7****n | 2 |
| sakasegawa | n****a@g****m | 2 |
| Richard Ye | 3****1 | 2 |
| Om Gupta | a****a@g****m | 2 |
| Michele Adduci | m****i@g****e | 2 |
| CharlesCNorton | 1****n | 2 |
| t3tra | a****n@t****t | 2 |
| Ebrahim Tayabali | 4****n | 1 |
| Emanuele Meazzo | 6****p@g****m | 1 |
| Dmitry | 9****t | 1 |
| Divyansh Singh | 4****d | 1 |
| Ikko Eltociear Ashimine | e****r@g****m | 1 |
| James Hickey | j****h | 1 |
| and 48 more... | ||
Issue and Pull Request metadata
Last synced: 24 days ago
Total issues: 491
Total pull requests: 574
Average time to close issues: 18 days
Average time to close pull requests: 8 days
Total issue authors: 318
Total pull request authors: 204
Average comments per issue: 0.59
Average comments per pull request: 1.23
Merged pull request: 263
Bot issues: 0
Bot pull requests: 8
Past year issues: 163
Past year pull requests: 281
Past year average time to close issues: 5 days
Past year average time to close pull requests: 7 days
Past year issue authors: 158
Past year pull request authors: 124
Past year average comments per issue: 0.65
Past year average comments per pull request: 0.79
Past year merged pull request: 103
Past year bot issues: 0
Past year bot pull requests: 2
Top Issue Authors
- rayan3030 (61)
- bentomusique (41)
- Mindihap (22)
- nessamyaio (19)
- gagb (7)
- Viddesh1 (4)
- snake-plant-care-indoors (3)
- t-kalinowski (3)
- imDarshanGK (3)
- asrar-mared (2)
- tanreinama (2)
- ImDiPhErEnT (2)
- janutara (2)
- luis440 (2)
- AdrianVollmer (2)
Top Pull Request Authors
- afourney (116)
- gagb (23)
- l-lumin (20)
- dependabot[bot] (8)
- KennyZhang1 (8)
- FeuRicardo (8)
- t3tra-dev (7)
- Viddesh1 (7)
- rong-xyz (6)
- Abdujabbar (6)
- BetterAndBetterII (6)
- Soulter (6)
- 0xRaduan (6)
- lh0x00 (6)
- yeungadrian (6)
Top Issue Labels
- enhancement (12)
- open for contribution (11)
- bug (4)
- question (3)
- documentation (2)
- good first issue (2)
- help wanted (1)
- duplicate (1)
- dependencies (1)
Top Pull Request Labels
- dependencies (8)
- awaiting op response (5)
- open for reviewing (4)
- help wanted (2)
- github_actions (2)
Package metadata
- Total packages: 15
-
Total downloads:
- pypi: 2,562,575 last-month
- Total dependent packages: 0 (may contain duplicates)
- Total dependent repositories: 0 (may contain duplicates)
- Total versions: 66
- Total maintainers: 8
proxy.golang.org: github.com/microsoft/markitdown
- Homepage:
- Documentation: https://pkg.go.dev/github.com/microsoft/markitdown#section-documentation
- Licenses: mit
- Latest release: v0.1.5 (published about 1 month ago)
- Last Synced: 2026-03-03T17:08:47.407Z (24 days ago)
- Versions: 8
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.695%
- Average: 5.885%
- Dependent repos count: 6.076%
proxy.golang.org: github.com/microsoft/MarkItDown
- Homepage:
- Documentation: https://pkg.go.dev/github.com/microsoft/MarkItDown#section-documentation
- Licenses: mit
- Latest release: v0.1.5 (published about 1 month ago)
- Last Synced: 2026-03-03T17:08:47.167Z (24 days ago)
- Versions: 8
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 5.696%
- Average: 5.887%
- Dependent repos count: 6.078%
pypi.org: iflow-mcp_markitdown-mcp
An MCP server for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.0.1a4 (published 4 months ago)
- Last Synced: 2026-03-03T17:08:44.833Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 37 Last month
-
Rankings:
- Stargazers count: 0.05%
- Forks count: 0.286%
- Dependent packages count: 8.244%
- Average: 13.795%
- Dependent repos count: 46.6%
- Maintainers (1)
pypi.org: markitdowng
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 1.0.1 (published 7 months ago)
- Last Synced: 2026-03-03T17:08:44.904Z (24 days ago)
- Versions: 11
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 31 Last month
-
Rankings:
- Stargazers count: 0.064%
- Forks count: 0.367%
- Dependent packages count: 8.681%
- Average: 14.509%
- Dependent repos count: 48.922%
- Maintainers (1)
pypi.org: markitdown-gjx
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: mit
- Latest release: 0.1.2 (published 7 months ago)
- Last Synced: 2026-03-03T17:08:44.892Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Stargazers count: 0.064%
- Forks count: 0.367%
- Dependent packages count: 8.685%
- Average: 14.515%
- Dependent repos count: 48.945%
pypi.org: markitdown-sample-plugin
A sample plugin for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.0a1 (published about 1 year ago)
- Last Synced: 2026-03-03T17:08:44.473Z (24 days ago)
- Versions: 3
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 30 Last month
-
Rankings:
- Stargazers count: 0.197%
- Forks count: 1.441%
- Dependent packages count: 9.644%
- Average: 16.396%
- Dependent repos count: 54.302%
- Maintainers (2)
pypi.org: markitdown-docx-image-plugin
将Word文档中的图片提取到目录中,并在Markdown中使用相对路径引用
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.3.0 (published 6 months ago)
- Last Synced: 2026-03-03T17:08:45.382Z (24 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 64 Last month
-
Rankings:
- Dependent packages count: 8.453%
- Downloads: 22.946%
- Average: 26.394%
- Dependent repos count: 47.783%
- Maintainers (1)
pypi.org: markitdown-no-magika
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.2 (published 9 months ago)
- Last Synced: 2026-03-03T17:08:45.126Z (24 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 28,064 Last month
-
Rankings:
- Dependent packages count: 8.914%
- Average: 29.573%
- Dependent repos count: 50.232%
- Maintainers (1)
pypi.org: markitdown-rtf-plugin
A RTF plugin via striprtf for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.0 (published 12 months ago)
- Last Synced: 2026-03-03T14:04:15.339Z (24 days ago)
- Versions: 2
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 7,860 Last month
-
Rankings:
- Dependent packages count: 9.394%
- Average: 31.151%
- Dependent repos count: 52.909%
- Maintainers (1)
pypi.org: markitdown-mcp
An MCP server for the "markitdown" library.
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.0.1a4 (published 10 months ago)
- Last Synced: 2026-03-03T17:08:44.372Z (24 days ago)
- Versions: 4
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 56,129 Last month
-
Rankings:
- Dependent packages count: 9.425%
- Average: 31.253%
- Dependent repos count: 53.081%
- Maintainers (2)
pypi.org: markitdown
Utility tool for converting various files to Markdown
- Homepage:
- Documentation: https://github.com/microsoft/markitdown#readme
- Licenses: MIT
- Latest release: 0.1.5 (published about 1 month ago)
- Last Synced: 2026-03-03T14:04:15.150Z (24 days ago)
- Versions: 21
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 2,470,360 Last month
-
Rankings:
- Dependent packages count: 10.04%
- Average: 33.273%
- Dependent repos count: 56.505%
- Maintainers (2)
anaconda.org: markitdown
Python tool for converting files and office documents to Markdown.
- Homepage: https://github.com/microsoft/markitdown
- Licenses: MIT
- Latest release: 0.1.1 (published 11 months ago)
- Last Synced: 2026-02-19T12:04:14.955Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent packages count: 45.275%
- Average: 47.525%
- Dependent repos count: 49.775%
nixpkgs-unstable: markitdown-mcp
MCP server for the markitdown library
- Homepage: https://github.com/microsoft/markitdown/tree/main/packages/markitdown-mcp
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/by-name/ma/markitdown-mcp/package.nix#L42
- Licenses: MIT
- Latest release: 0.1.5b1 (published about 1 month ago)
- Last Synced: 2026-02-19T00:05:45.610Z (about 1 month ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 100%
- Maintainers (1)
nixpkgs-unstable: python313Packages.markitdown
Python tool for converting files and office documents to Markdown
- Homepage: https://github.com/microsoft/markitdown
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/markitdown/default.nix#L106
- Licenses: MIT
- Latest release: 0.1.4 (published about 2 months ago)
- Last Synced: 2026-03-06T16:07:41.990Z (21 days ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Maintainers (1)
nixpkgs-unstable: python314Packages.markitdown
Python tool for converting files and office documents to Markdown
- Homepage: https://github.com/microsoft/markitdown
- Documentation: https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/markitdown/default.nix#L106
- Licenses: MIT
- Latest release: 0.1.4 (published about 2 months ago)
- Last Synced: 2026-02-03T22:03:32.712Z (about 2 months ago)
- Versions: 1
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- SpeechRecognition *
- beautifulsoup4 *
- mammoth *
- markdownify *
- numpy *
- openpyxl *
- pandas *
- pathvalidate *
- pdfminer.six *
- puremagic *
- pydub *
- python-pptx *
- requests *
- youtube-transcript-api *
- python 3.13-slim-bullseye build