awesome-llama: https://github.com/nptt9/illama
exllama exllamav2 flash-attention-2 inference llama llama2 llama3 llm-inference paged-attention server
Score: -Infinity
Last synced: about 7 hours ago
JSON representation
Repository metadata:
A fast, lightweight, parallel inference server for Llama LLMs.
- Host: GitHub
- URL: https://github.com/nptt9/illama
- Owner: nickpotafiy
- License: mit
- Created: 2024-05-23T05:37:08.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-07-30T14:33:37.000Z (over 1 year ago)
- Last Synced: 2024-09-28T04:01:45.401Z (over 1 year ago)
- Topics: exllama, exllamav2, flash-attention-2, inference, llama, llama2, llama3, llm-inference, paged-attention, server
- Language: Python
- Homepage:
- Size: 45.9 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Owner metadata:
- Name: Nick Potafiy
- Login: nickpotafiy
- Email:
- Kind: user
- Description:
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/12480886?u=84b3f46a873b6de37537f20a6252beb325aada5c&v=4
- Repositories: 1
- Last Synced at: 2023-07-26T21:42:38.437Z
- Profile URL: https://github.com/nickpotafiy
GitHub Events
Total
- Total: 0
Last Year
- Total: 0
Committers metadata
Last synced: 24 days ago
Issue and Pull Request metadata
Last synced: over 1 year ago
Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull request: 0
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
Top Pull Request Authors
Top Issue Labels
Top Pull Request Labels
Dependencies
- fastapi *
- flash-attn *
- tokenizers *