https://github.com/docarray/docarray
cross-modal data-structures dataclass deep-learning docarray elasticsearch fastapi machine-learning multi-modal multimodal nearest-neighbor-search nested-data neural-search protobuf pydantic pytorch qdrant semantic-search weaviate
Score: 25.343470401197997
Last synced: about 8 hours ago
JSON representation
Repository metadata:
Represent, send, store and search multimodal data
- Host: GitHub
- URL: https://github.com/docarray/docarray
- Owner: docarray
- License: apache-2.0
- Created: 2021-12-14T15:26:24.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2025-06-17T04:10:35.000Z (8 months ago)
- Last Synced: 2025-12-26T11:51:03.638Z (about 1 month ago)
- Topics: cross-modal, data-structures, dataclass, deep-learning, docarray, elasticsearch, fastapi, machine-learning, multi-modal, multimodal, nearest-neighbor-search, nested-data, neural-search, protobuf, pydantic, pytorch, qdrant, semantic-search, weaviate
- Language: Python
- Homepage: https://docs.docarray.org/
- Size: 242 MB
- Stars: 3,113
- Watchers: 46
- Forks: 234
- Open Issues: 105
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.md
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
- Governance: GOVERNANCE.md
Owner metadata:
- Name: DocArray
- Login: docarray
- Email: info@lfaidata.foundation
- Kind: organization
- Description: DocArray is an open source project offering the data structure for multimodal data. It is hosted in incubation in the LF AI & Data Foundation.
- Website: https://docs.docarray.org
- Location: United States of America
- Twitter: docarray
- Company:
- Icon url: https://avatars.githubusercontent.com/u/117445116?v=4
- Repositories: 4
- Last Synced at: 2024-05-11T22:43:44.364Z
- Profile URL: https://github.com/docarray
GitHub Events
Total
- Create event: 10
- Delete event: 7
- Fork event: 7
- Issue comment event: 19
- Issues event: 3
- Pull request event: 28
- Pull request review comment event: 3
- Pull request review event: 2
- Push event: 25
- Release event: 1
- Watch event: 147
- Total: 252
Last Year
- Create event: 9
- Delete event: 7
- Fork event: 5
- Issue comment event: 13
- Issues event: 2
- Pull request event: 24
- Pull request review comment event: 3
- Pull request review event: 2
- Push event: 25
- Release event: 1
- Watch event: 99
- Total: 190
Committers metadata
Last synced: 27 days ago
Total Commits: 1,456
Total Committers: 76
Avg Commits per committer: 19.158
Development Distribution Score (DDS): 0.746
Commits in past year: 5
Committers in past year: 2
Avg Commits per committer in past year: 2.5
Development Distribution Score (DDS) in past year: 0.2
| Name | Commits | |
|---|---|---|
| Han Xiao | h****o@j****i | 370 |
| samsja | 5****a | 189 |
| Jina Dev Bot | d****t@j****i | 159 |
| Joan Fontanals | j****z@j****i | 137 |
| Johannes Messner | 4****r | 91 |
| Charlotte Gerhaher | c****r@j****i | 80 |
| AlaeddineAbdessalem | a****3@l****r | 78 |
| Anne Yang | e****n@f****m | 37 |
| Saba Sturua | 4****z | 31 |
| David Buchaca Prats | d****a@g****m | 28 |
| felix-wang | 3****3 | 21 |
| Alex Cureton-Griffiths | a****1 | 20 |
| maxwelljin2 | g****n@b****u | 18 |
| Jackmin801 | 5****1 | 17 |
| dependabot[bot] | 4****] | 13 |
| Nan Wang | n****g@j****i | 11 |
| Michael Günther | g****0@g****m | 10 |
| Winston Wong | w****2@g****m | 9 |
| dong xiang | i****g@g****m | 9 |
| Aziz Belaweid | 4****z | 8 |
| Aman Agarwal | a****0@g****m | 8 |
| Puneeth K | 3****8 | 8 |
| Alvin Prayuda | a****a@g****m | 8 |
| Wang Bo | b****g@j****i | 7 |
| Delgermurun | d****n | 6 |
| Mohammad Kalim Akram | k****m@g****m | 5 |
| Andrey Vasnetsov | a****y@v****m | 4 |
| cristian | c****r | 4 |
| Casey Clements | c****s | 3 |
| Kacper Łukawski | k****i | 3 |
| and 46 more... | ||
Issue and Pull Request metadata
Last synced: about 1 month ago
Total issues: 159
Total pull requests: 284
Average time to close issues: about 1 month
Average time to close pull requests: 15 days
Total issue authors: 56
Total pull request authors: 41
Average comments per issue: 2.79
Average comments per pull request: 2.6
Merged pull request: 185
Bot issues: 1
Bot pull requests: 69
Past year issues: 2
Past year pull requests: 21
Past year average time to close issues: about 1 month
Past year average time to close pull requests: 10 days
Past year issue authors: 2
Past year pull request authors: 4
Past year average comments per issue: 2.0
Past year average comments per pull request: 0.52
Past year merged pull request: 6
Past year bot issues: 0
Past year bot pull requests: 12
Top Issue Authors
- samsja (34)
- jupyterjazz (19)
- JoanFM (19)
- JohannesMessner (15)
- movchan74 (6)
- hsm207 (5)
- alaeddine-13 (4)
- oytuntez (2)
- Jackmin801 (2)
- girishc13 (2)
- wizrds (2)
- vincetrep (2)
- AnneYang720 (2)
- beidongjiedeguang (2)
- hugocool (2)
Top Pull Request Authors
- JoanFM (81)
- dependabot[bot] (69)
- samsja (30)
- jupyterjazz (16)
- JohannesMessner (12)
- punndcoder28 (8)
- caseyclements (6)
- AnneYang720 (5)
- alaeddine-13 (5)
- ai-naymul (3)
- yxtay (3)
- anna-charlotte (3)
- James4Ever0 (2)
- jay-bhambhani (2)
- srini047 (2)
Top Issue Labels
- good-first-issue (15)
- area/document-index (8)
- index/weaviate (7)
- pydantic-v2 (7)
- type/bug (4)
- area/docs (2)
- DocArray v2 (2)
- difficulty/medium (1)
- old docarray (1)
- area/typing (1)
- dependencies (1)
- python (1)
Top Pull Request Labels
- area/core (97)
- area/testing (77)
- dependencies (68)
- python (65)
- size/s (63)
- size/xs (46)
- component/array (28)
- size/m (25)
- area/housekeeping (25)
- area/cicd (24)
- area/typing (22)
- area/docs (21)
- area/setup (21)
- size/xl (13)
- area/entrypoint (12)
- size/l (7)
- component/proto (5)
- github_actions (2)
Package metadata
- Total packages: 3
-
Total downloads:
- pypi: 109,091 last-month
- Total docker downloads: 303,570
- Total dependent packages: 61 (may contain duplicates)
- Total dependent repositories: 1,391 (may contain duplicates)
- Total versions: 1,004
- Total maintainers: 1
- Total advisories: 1
pypi.org: docarray
The data structure for multimodal data
- Homepage: https://docs.docarray.org/
- Documentation: https://docs.docarray.org
- Licenses: Apache 2.0
- Latest release: 0.41.0 (published 11 months ago)
- Last Synced: 2026-01-08T08:35:31.617Z (27 days ago)
- Versions: 741
- Dependent Packages: 58
- Dependent Repositories: 1,390
- Downloads: 109,091 Last month
- Docker Downloads: 303,570
-
Rankings:
- Dependent repos count: 0.306%
- Downloads: 0.928%
- Docker downloads count: 0.945%
- Stargazers count: 1.449%
- Average: 1.578%
- Dependent packages count: 2.156%
- Forks count: 3.687%
- Maintainers (1)
- Advisories:
proxy.golang.org: github.com/docarray/docarray
- Homepage:
- Documentation: https://pkg.go.dev/github.com/docarray/docarray#section-documentation
- Licenses: apache-2.0
- Latest release: v0.40.1 (published 11 months ago)
- Last Synced: 2026-01-04T18:05:33.803Z (about 1 month ago)
- Versions: 164
- Dependent Packages: 0
- Dependent Repositories: 0
-
Rankings:
- Stargazers count: 1.331%
- Forks count: 1.949%
- Average: 5.686%
- Dependent packages count: 8.899%
- Dependent repos count: 10.567%
conda-forge.org: docarray
DocArray is a library for nested, unstructured data such as text, image, audio, video, 3D mesh. It allows deep learning engineers to efficiently process, embed, search, recommend, store, transfer the data with Pythonic API. 🌌 **All data types**: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. 🐍 **Pythonic experience**: designed to be as easy as Python list. If you know how to Python, you know how to DocArray. Intuitive idioms and type annotation simplify the code you write. 🧑🔬 **Data science powerhouse**: greatly accelerate data scientists work on embedding, matching, visualizing, evaluating via Torch/Tensorflow/ONNX/PaddlePaddle on CPU/GPU. 🚡 **Portable**: ready-to-wire at anytime with efficient and compact serialization from/to Protobuf, bytes, JSON, CSV, dataframe. PyPI: [https://pypi.org/project/docarray](https://pypi.org/project/docarray)
- Homepage: https://github.com/docarray/docarray
- Licenses: Apache-2.0
- Latest release: 0.16.5 (published over 3 years ago)
- Last Synced: 2025-12-31T16:02:45.644Z (about 1 month ago)
- Versions: 99
- Dependent Packages: 3
- Dependent Repositories: 1
-
Rankings:
- Stargazers count: 9.082%
- Forks count: 14.512%
- Dependent packages count: 15.649%
- Average: 15.875%
- Dependent repos count: 24.258%
Dependencies
- epsilla/vectordb latest
- actions/checkout v2.5.0 composite
- actions/checkout v3 composite
- tj-actions/changed-files v34 composite
- actions/checkout v2.5.0 composite
- actions/setup-python v4 composite
- andstor/file-existence-action v1 composite
- codecov/codecov-action v3.1.1 composite
- technote-space/workflow-conclusion-action v2 composite
- actions/checkout v2.5.0 composite
- actions/github-script v3 composite
- actions/labeler v3 composite
- actions/setup-node v2 composite
- actions/setup-python v4 composite
- codelytv/pr-size-labeler v1 composite
- peter-evans/create-or-update-comment v1 composite
- peter-evans/find-comment v1 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v2.5.0 composite
- actions/setup-python v4 composite
- ad-m/github-push-action v0.6.0 composite
- actions/checkout v3 composite
- actions/checkout v2.5.0 composite
- actions/create-release v1 composite
- actions/setup-python v4 composite
- benc-uk/workflow-dispatch v1 composite
- actions/checkout v2.5.0 composite
- actions/setup-python v4 composite
- docker.elastic.co/elasticsearch/elasticsearch 7.10.2
- docker.elastic.co/elasticsearch/elasticsearch 8.6.2
- milvusdb/milvus v2.2.11
- minio/minio RELEASE.2023-03-20T20-16-18Z
- quay.io/coreos/etcd v3.5.5
- qdrant/qdrant v1.1.2
- semitechnologies/weaviate 1.18.3
- minio/minio RELEASE.2023-03-13T19-46-17Z
- 212 dependencies
- black >=22.10.0 develop
- blacken-docs >=1.13.0 develop
- coverage ==6.2 develop
- isort >=5.10.1 develop
- jupyterlab >=3.5.0 develop
- mypy >=1 develop
- pre-commit >=2.20.0 develop
- pytest >=7.0 develop
- pytest-cov 3.0.0 develop
- ruff >=0.0.243 develop
- types-protobuf >=3.20.4 develop
- types-redis >=4.6.0.0 develop
- av >=10.0.0
- elastic-transport ^8.4.0
- elasticsearch >=7.10.1
- fastapi >=0.87.0
- hnswlib >=0.7.0
- jax >=0.4.10
- jina-hubble-sdk >=0.34.0
- lz4 >=1.0.0
- numpy >=1.17.3
- orjson >=3.8.2
- pandas >=1.1.0
- pillow >=9.3.0
- protobuf >=3.20.0
- pydantic >=1.10.2,<2.0.0
- pydub ^0.25.1
- pymilvus ^2.2.12
- python >=3.8,<4.0
- qdrant-client >=1.1.4
- redis ^4.6.0
- rich >=13.1.0
- smart-open >=6.3.0
- torch >=1.0.0
- trimesh >=3.17.1
- types-pillow >=9.3.0.1
- types-requests >=2.28.11.6
- typing-inspect >=0.8.0
- weaviate-client >=3.17, <3.18