An open API service for producing an overview of a list of open source projects.

awesome-llama: https://github.com/vectorch-ai/ScaleLLM

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Score: 12.049909942941962

Last synced: about 2 hours ago
JSON representation

Repository metadata:

A high-performance inference system for large language models, designed for production environments.


Owner metadata:


GitHub Events

Total
Last Year

Committers metadata

Last synced: 3 months ago

Total Commits: 770
Total Committers: 6
Avg Commits per committer: 128.333
Development Distribution Score (DDS): 0.038

Commits in past year: 144
Committers in past year: 1
Avg Commits per committer in past year: 144.0
Development Distribution Score (DDS) in past year: 0.0

Name Email Commits
Michael Mi g****i@g****m 741
Tongxuan Liu t****x@m****n 22
Xiao Liang s****n@g****m 4
xianzhedong 8****e 1
panhengyu 4****5 1
Kevin Li k****5@g****m 1

Issue and Pull Request metadata

Last synced: 4 months ago

Total issues: 87
Total pull requests: 607
Average time to close issues: 26 days
Average time to close pull requests: 1 day
Total issue authors: 26
Total pull request authors: 5
Average comments per issue: 1.29
Average comments per pull request: 0.04
Merged pull request: 504
Bot issues: 0
Bot pull requests: 0

Past year issues: 18
Past year pull requests: 254
Past year average time to close issues: about 18 hours
Past year average time to close pull requests: about 18 hours
Past year issue authors: 3
Past year pull request authors: 1
Past year average comments per issue: 0.06
Past year average comments per pull request: 0.0
Past year merged pull request: 218
Past year bot issues: 0
Past year bot pull requests: 0

More stats: https://issues.ecosyste.ms/repositories/lookup?url=https://github.com/vectorch-ai/ScaleLLM

Top Issue Authors

  • guocuimi (53)
  • liutongxuan (6)
  • Arcmoon-Hu (2)
  • BUJIDAOVS (2)
  • sitabulaixizawaluduo (2)
  • tp-nan (2)
  • dengyingxu (1)
  • WangErXiao (1)
  • zyxcambridge (1)
  • Missliuff (1)
  • dalamudx (1)
  • paulcx (1)
  • omarmhaimdat (1)
  • gunpal5 (1)
  • candowu (1)

Top Pull Request Authors

  • guocuimi (568)
  • liutongxuan (23)
  • dongxianzhe (11)
  • 936187425 (3)
  • ShijiaTang (2)

Top Issue Labels

  • enhancement (8)
  • good first issue (6)
  • bug (5)
  • investigation needed (3)
  • backlog (1)
  • duplicate (1)

Top Pull Request Labels


Package metadata

pypi.org: scalellm

A high-performance inference system for large language models.

  • Homepage: https://github.com/vectorch-ai/ScaleLLM
  • Documentation: https://scalellm.readthedocs.io/
  • Licenses: Apache 2.0
  • Latest release: 0.2.6 (published 5 months ago)
  • Last Synced: 2025-11-03T06:07:41.335Z (3 months ago)
  • Versions: 19
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 52 Last month
  • Rankings:
    • Dependent packages count: 10.965%
    • Average: 36.354%
    • Dependent repos count: 61.743%
  • Maintainers (1)

Dependencies

.github/workflows/build.yml actions
  • actions/cache v3 composite
  • actions/checkout v3 composite
.github/workflows/docker.yml actions
  • actions/checkout v3 composite
  • docker/build-push-action v5 composite
  • docker/login-action v3 composite
  • docker/setup-buildx-action v3 composite
  • docker/setup-qemu-action v3 composite
  • olegtarasov/get-tag v2.1 composite
Dockerfile docker
  • $BASE_IMAGE latest build
  • ubuntu 22.04 build
gateway/Dockerfile docker
  • golang 1.21-alpine build
requirements.txt pypi
  • huggingface_hub *
vcpkg.json vcpkg
  • abseil >=20230125.3
  • benchmark >=1.8.0
  • glog >=0.6.0#2
  • gtest >=1.13.0
  • sentencepiece >=0.1.97
gateway/go.mod go
  • github.com/golang/glog v1.1.0
  • github.com/golang/protobuf v1.5.3
  • github.com/grpc-ecosystem/grpc-gateway/v2 v2.17.1
  • golang.org/x/net v0.14.0
  • golang.org/x/sys v0.11.0
  • golang.org/x/text v0.12.0
  • google.golang.org/genproto v0.0.0-20230803162519-f966b187b2e5
  • google.golang.org/genproto/googleapis/api v0.0.0-20230822172742-b8732ec3820d
  • google.golang.org/genproto/googleapis/rpc v0.0.0-20230822172742-b8732ec3820d
  • google.golang.org/grpc v1.57.0
  • google.golang.org/protobuf v1.31.0
gateway/go.sum go
  • github.com/golang/glog v1.1.0
  • github.com/golang/protobuf v1.5.0
  • github.com/golang/protobuf v1.5.3
  • github.com/google/go-cmp v0.5.5
  • github.com/google/go-cmp v0.5.9
  • github.com/grpc-ecosystem/grpc-gateway/v2 v2.17.1
  • golang.org/x/net v0.14.0
  • golang.org/x/sys v0.11.0
  • golang.org/x/text v0.12.0
  • golang.org/x/xerrors v0.0.0-20191204190536-9bdfabe68543
  • google.golang.org/genproto v0.0.0-20230803162519-f966b187b2e5
  • google.golang.org/genproto/googleapis/api v0.0.0-20230822172742-b8732ec3820d
  • google.golang.org/genproto/googleapis/rpc v0.0.0-20230822172742-b8732ec3820d
  • google.golang.org/grpc v1.57.0
  • google.golang.org/protobuf v1.26.0-rc.1
  • google.golang.org/protobuf v1.26.0
  • google.golang.org/protobuf v1.31.0
src/huggingface/Cargo.toml cargo
python/setup.py pypi
  • torch *