awesome-llama: https://github.com/vectorch-ai/ScaleLLM
cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer
Score: 12.049909942941962
Last synced: about 2 hours ago
JSON representation
Repository metadata:
A high-performance inference system for large language models, designed for production environments.
- Host: GitHub
- URL: https://github.com/vectorch-ai/ScaleLLM
- Owner: vectorch-ai
- License: apache-2.0
- Created: 2023-07-24T20:14:28.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-11-03T04:00:01.000Z (3 months ago)
- Last Synced: 2025-11-03T06:05:10.735Z (3 months ago)
- Topics: cuda, efficiency, gpu, inference, llama, llama3, llm, llm-inference, model, performance, production, serving, speculative, transformer
- Language: C++
- Homepage: https://docs.vectorch.com/
- Size: 19.3 MB
- Stars: 481
- Watchers: 16
- Forks: 37
- Open Issues: 57
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Owner metadata:
- Name: vectorch-ai
- Login: vectorch-ai
- Email:
- Kind: organization
- Description:
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/136120791?v=4
- Repositories: 1
- Last Synced at: 2023-07-24T21:41:44.702Z
- Profile URL: https://github.com/vectorch-ai
GitHub Events
Total
- Create event: 123
- Delete event: 100
- Fork event: 9
- Issue comment event: 4
- Issues event: 18
- Pull request event: 233
- Pull request review event: 4
- Push event: 419
- Release event: 4
- Watch event: 79
- Total: 993
Last Year
- Create event: 120
- Delete event: 98
- Fork event: 9
- Issue comment event: 4
- Issues event: 16
- Pull request event: 230
- Pull request review event: 4
- Push event: 405
- Release event: 3
- Watch event: 76
- Total: 965
Committers metadata
Last synced: 3 months ago
Total Commits: 770
Total Committers: 6
Avg Commits per committer: 128.333
Development Distribution Score (DDS): 0.038
Commits in past year: 144
Committers in past year: 1
Avg Commits per committer in past year: 144.0
Development Distribution Score (DDS) in past year: 0.0
| Name | Commits | |
|---|---|---|
| Michael Mi | g****i@g****m | 741 |
| Tongxuan Liu | t****x@m****n | 22 |
| Xiao Liang | s****n@g****m | 4 |
| xianzhedong | 8****e | 1 |
| panhengyu | 4****5 | 1 |
| Kevin Li | k****5@g****m | 1 |
Issue and Pull Request metadata
Last synced: 4 months ago
Total issues: 87
Total pull requests: 607
Average time to close issues: 26 days
Average time to close pull requests: 1 day
Total issue authors: 26
Total pull request authors: 5
Average comments per issue: 1.29
Average comments per pull request: 0.04
Merged pull request: 504
Bot issues: 0
Bot pull requests: 0
Past year issues: 18
Past year pull requests: 254
Past year average time to close issues: about 18 hours
Past year average time to close pull requests: about 18 hours
Past year issue authors: 3
Past year pull request authors: 1
Past year average comments per issue: 0.06
Past year average comments per pull request: 0.0
Past year merged pull request: 218
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
- guocuimi (53)
- liutongxuan (6)
- Arcmoon-Hu (2)
- BUJIDAOVS (2)
- sitabulaixizawaluduo (2)
- tp-nan (2)
- dengyingxu (1)
- WangErXiao (1)
- zyxcambridge (1)
- Missliuff (1)
- dalamudx (1)
- paulcx (1)
- omarmhaimdat (1)
- gunpal5 (1)
- candowu (1)
Top Pull Request Authors
- guocuimi (568)
- liutongxuan (23)
- dongxianzhe (11)
- 936187425 (3)
- ShijiaTang (2)
Top Issue Labels
- enhancement (8)
- good first issue (6)
- bug (5)
- investigation needed (3)
- backlog (1)
- duplicate (1)
Top Pull Request Labels
Package metadata
- Total packages: 1
-
Total downloads:
- pypi: 52 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 19
- Total maintainers: 1
pypi.org: scalellm
A high-performance inference system for large language models.
- Homepage: https://github.com/vectorch-ai/ScaleLLM
- Documentation: https://scalellm.readthedocs.io/
- Licenses: Apache 2.0
- Latest release: 0.2.6 (published 5 months ago)
- Last Synced: 2025-11-03T06:07:41.335Z (3 months ago)
- Versions: 19
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 52 Last month
-
Rankings:
- Dependent packages count: 10.965%
- Average: 36.354%
- Dependent repos count: 61.743%
- Maintainers (1)
Dependencies
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- docker/build-push-action v5 composite
- docker/login-action v3 composite
- docker/setup-buildx-action v3 composite
- docker/setup-qemu-action v3 composite
- olegtarasov/get-tag v2.1 composite
- $BASE_IMAGE latest build
- ubuntu 22.04 build
- golang 1.21-alpine build
- huggingface_hub *
- abseil >=20230125.3
- benchmark >=1.8.0
- glog >=0.6.0#2
- gtest >=1.13.0
- sentencepiece >=0.1.97
- github.com/golang/glog v1.1.0
- github.com/golang/protobuf v1.5.3
- github.com/grpc-ecosystem/grpc-gateway/v2 v2.17.1
- golang.org/x/net v0.14.0
- golang.org/x/sys v0.11.0
- golang.org/x/text v0.12.0
- google.golang.org/genproto v0.0.0-20230803162519-f966b187b2e5
- google.golang.org/genproto/googleapis/api v0.0.0-20230822172742-b8732ec3820d
- google.golang.org/genproto/googleapis/rpc v0.0.0-20230822172742-b8732ec3820d
- google.golang.org/grpc v1.57.0
- google.golang.org/protobuf v1.31.0
- github.com/golang/glog v1.1.0
- github.com/golang/protobuf v1.5.0
- github.com/golang/protobuf v1.5.3
- github.com/google/go-cmp v0.5.5
- github.com/google/go-cmp v0.5.9
- github.com/grpc-ecosystem/grpc-gateway/v2 v2.17.1
- golang.org/x/net v0.14.0
- golang.org/x/sys v0.11.0
- golang.org/x/text v0.12.0
- golang.org/x/xerrors v0.0.0-20191204190536-9bdfabe68543
- google.golang.org/genproto v0.0.0-20230803162519-f966b187b2e5
- google.golang.org/genproto/googleapis/api v0.0.0-20230822172742-b8732ec3820d
- google.golang.org/genproto/googleapis/rpc v0.0.0-20230822172742-b8732ec3820d
- google.golang.org/grpc v1.57.0
- google.golang.org/protobuf v1.26.0-rc.1
- google.golang.org/protobuf v1.26.0
- google.golang.org/protobuf v1.31.0
- torch *