Collections: awesome-llama
https://github.com/xorbitsai/inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Last synced: 01 Jun 2026
https://github.com/JuliusHaring/chatbot-template
A comprehensive chatbot system with integrated LLM querying and Messenger Bot interfacing capabilities. To be used as a template for implementation.
chatbot chatgpt chatgpt-api llama llamaindex telegram telegram-bot vectorstore
Last synced: 02 Jun 2026
https://github.com/ayaka14732/llama-2-jax
JAX implementation of the Llama 2 model
jax llama llama2 natural-language-processing nlp
Last synced: 02 Jun 2026
https://github.com/kyegomez/SwitchTransformers
Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"
ai gpt4 llama mixture-model mixture-of-experts mixture-of-models ml moe multi-modal
Last synced: 02 Jun 2026
https://github.com/vemonet/libre-chat
🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline-capable, and easy to set up.
chatbot chatgpt langchain large-language-models llm llm-inference openapi self-hosted
Last synced: 02 Jun 2026
https://github.com/c0sogi/llama-api
An OpenAI-like LLaMA inference API
api exllama fastapi llama llamacpp
Last synced: 02 Jun 2026
https://github.com/Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python
Last synced: 01 Jun 2026
https://github.com/ErikBjare/are-copilots-local-yet
Are Copilots Local Yet? The frontier of local LLM Copilots for code completion, project generation, shell assistance, and more. Find tools shaping tomorrow's developer experience, today!
copilot github-copilot llama llm openai starcoder wizardcoder
Last synced: 02 Jun 2026
https://github.com/run-llama/llama_cloud_services
Knowledge Agents and Management in the Cloud
document document-parser document-parsing docx-to-markdown parsing pdf pdf-document-processor pdf-to-excel pdf-to-json pdf-to-markdown pdf-to-text ppt-to-json ppt-to-markdown pptx structured-data tables
Last synced: 01 Jun 2026
https://github.com/ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
ai-crawler ai-scraping ai-search crawler data-extraction firecrawl-alternative large-language-model llm markdown rag scraping scraping-python web-crawler web-crawlers web-data web-data-extraction web-scraper web-scraping web-search webscraping
Last synced: 01 Jun 2026
https://github.com/kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
artificial-intelligence finetuning gpt4 gpt4-api gpt4vision llama machine-learning
Last synced: 02 Jun 2026
https://github.com/shoutsid/townhall
A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and function orchestration, enhancing and expanding traditional chatbot capabilities.
agent-based agent-based-framework ai autogen chat-application chatbot gpt gpt-2 gpt-3 gpt2 gpt3-turbo gpt4 llama llm tinygrad
Last synced: 02 Jun 2026
https://github.com/laelhalawani/gguf_llama
Wrapper for simplified use of Llama2 GGUF quantized models.
cpu-inference gguf llama llama2 llamacpp quantization
Last synced: 02 Jun 2026
https://github.com/cycneuramus/signal-aichat
An AI chatbot for Signal powered by Google Bard, Bing Chat, ChatGPT, HuggingChat, and llama.cpp
ai-bot bard bing-chat chatgpt chatgpt-bot google-bard huggingchat llama llamacpp signal-bot signal-messenger
Last synced: 02 Jun 2026
https://github.com/aiplanethub/beyondllm
Build, evaluate and observe LLM apps
ai artificial-intelligence embeddings evaluate-llm genai generative-ai hacktoberfest hacktoberfest-accepted hacktoberfest2024 large-language-models llm llms rag
Last synced: 02 Jun 2026
https://github.com/vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference kimi llama llm llm-serving model-serving moe openai pytorch qwen qwen3 tpu transformer
Last synced: 01 Jun 2026
https://github.com/rbourgeat/llm-rp
✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙‍♂️
ai characterai chat game ggml llama llama-cpp llm roleplay stable-diffusion
Last synced: 01 Jun 2026
https://github.com/aiplanethub/genai-stack
An End to End GenAI Framework
ai chatgpt data-engineering datascientist genai hacktoberfest hacktoberfest-accepted hacktoberfest2023 langchain llama llama-index llm llmops mlops
Last synced: 02 Jun 2026
https://github.com/shroominic/codeinterpreter-api
👾 Open source implementation of the ChatGPT Code Interpreter
chatgpt chatgpt-code-generation code-interpreter codeinterpreter langchain llm-agent
Last synced: 02 Jun 2026
https://github.com/kyegomez/AttnWithConvolutions
Interleaved attention with convolutions for text modeling
artificial-intelligence attention attention-mechanism convolution convolutional-neural-networks gpt4 llama machine-learning machine-learning-algorithms
Last synced: 02 Jun 2026
https://github.com/HubertKasperek/ai-companion-py
Python bindings for ai-companion (only backend, without WebUI)
chatbot library llama llm python
Last synced: 02 Jun 2026
https://github.com/livingbio/fuzzy-json
Fuzzy-JSON is a compact Python package with no dependencies, designed to address the pesky JSONDecodeError that sometimes occurs when utilizing OpenAI's powerful call function.
json llama llm openai openai-chatgpt python
Last synced: 01 Jun 2026
https://github.com/SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec
Last synced: 02 Jun 2026
https://github.com/RAHB-REALTORS-Association/transcriber-describer
Transcribes videos and describes them with OpenAI APIs or local models.
ai automation docker llama llama-cpp local-ai openai openai-api python whisper whisper-cpp
Last synced: 02 Jun 2026
https://github.com/hitz-zentroa/GoLLIE
Guideline following Large Language Model for Information Extraction
code-llama event-extraction gollie guidelines hugginface-hub huggingface inference information-extraction llama llama2 llm llms named-entity-recognition relation-extraction state-of-the-art text-generation training transformer
Last synced: 02 Jun 2026
https://github.com/higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
cluster-management deep-learning distributed llama llama2 llm machine-learning mlops pytorch
Last synced: 01 Jun 2026
https://github.com/maclandrol/molfeat-hype
Can ChatGPT generate molecular features?
Last synced: 01 Jun 2026
https://github.com/unifyai/unify
Notion for AI Observability 📊
ai claude gpt gpt-4 llama2 llm llm-inference llms mixtral openai python
Last synced: 02 Jun 2026
https://github.com/flojud/DocsChat
The chatbot utilizes a conversational retrieval chain to answer user queries based on the content of embedded documents. It leverages various NLP techniques, including language models and embeddings, to provide relevant responses.
Last synced: 01 Jun 2026
https://github.com/karpathy/llama2.c
Inference Llama 2 in one file of pure C
Last synced: 02 Jun 2026
https://github.com/LLukas22/llm-rs-python
Unofficial Python bindings for the Rust llm library. 🐍❤️🦀
llama llm python rust
Last synced: 02 Jun 2026
https://github.com/laelhalawani/gguf_modeldb
A quick and optimized solution to manage llama-based GGUF quantized models: download GGUF files, retrieve message formatting, add more models from Hugging Face repos, and more. It's easy to use and comes prepackaged with preconfigured open-source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b, and zephyr 3b.
database hugginface inference llama llama-cpp-python llama2 llm model-database python3
Last synced: 02 Jun 2026
https://github.com/djokester/groqeval
Use groq for evaluations
generative-ai groq llama3 llm llm-as-a-judge llm-as-evaluator mixtral
Last synced: 02 Jun 2026
https://github.com/chelsey0527/ai-resume-rater
An AI-powered resume feedback site
llama2 openai python
Last synced: 02 Jun 2026
https://github.com/internlm/xtuner
A Next-Generation Training Engine Built for Ultra-Large MoE Models
agent deepseek-v3 gpt-oss intern-s1 internvl kimi-k2 llm multimodal qwen3-moe qwen3-vl reinforcement-learning
Last synced: 01 Jun 2026
https://github.com/zhudotexe/kani
kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)
chatgpt framework function-calling gpt-4 large-language-models llama llms microframework openai tool-use
Last synced: 01 Jun 2026
https://github.com/kyegomez/GATS
Implementation of GATS from the paper: "GATS: Gather-Attend-Scatter" in PyTorch and Zeta
ai attention attention-is-all-you-need attention-mechanism gpt4 llama ml multi-modal multi-modality multimodal open-source
Last synced: 02 Jun 2026
https://github.com/NVIDIA/nim-anywhere
Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
genai langchain lcel llama llama3 llm nim nvidia nvwb-project rag
Last synced: 02 Jun 2026
https://github.com/PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
ai-safety beaver datasets gpt human-feedback human-feedback-data language-model large-language-model llama llm llms rlhf safe-rlhf safety
Last synced: 02 Jun 2026
https://github.com/h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
ai chatbot chatgpt fedramp fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training
Last synced: 02 Jun 2026
https://github.com/microsoft/sarathi-serve
A low-latency & high-throughput serving engine for LLMs
llama llm-inference pytorch transformer
Last synced: 02 Jun 2026
https://github.com/sonnhfit/SonAgent
Self-repairing autonomous agent for digital consciousness backup, built on large language models (LLMs) with code generation that can edit and debug its own source code
agent ai autonomus-robots chatgpt code-generation language-model large-language-models llama2 llm ml self self-coding self-debugging self-editing-its-own-source-code self-editing-source-code self-repairing
Last synced: 02 Jun 2026
https://github.com/AGiXT/python-sdk
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent agi agixt ai artificial automation chromadb intelligence llama llm llmops openai python
Last synced: 02 Jun 2026
https://github.com/momegas/megabots
🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
chatbot faiss fastapi gpt-35-turbo gpt-4 information-retrieval langchain llama natural-language-processing nlp pinecone prompt-engineering python question-answering s3
Last synced: 02 Jun 2026
https://github.com/internlm/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind
Last synced: 01 Jun 2026
https://github.com/PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie
Last synced: 01 Jun 2026
https://github.com/RAHB-REALTORS-Association/email-autodrafts
Email Auto-ReplAI is a Python tool that uses AI to automate drafting responses to unread Gmail messages, streamlining email management tasks.
ai automation docker email email-draft gmail gmail-api llama llama-cpp local-ai openai openai-api python
Last synced: 02 Jun 2026
https://github.com/SteelPh0enix/unreasonable-llama
Python API for llama.cpp webserver
Last synced: 01 Jun 2026
https://github.com/minggnim/nlp-models
A repository for training transformer based models
chatbot chatbots ctransformers deeplearning falcon fine-tuning gpt-2 langchain llama2 llms multi-label-classification multi-task-learning nlp pytorch qdrant-vector-database transformers
Last synced: 02 Jun 2026
https://github.com/leftmove/cria
Run LLMs locally with as little friction as possible.
collaborate github github-codespaces github-copilot llama llm ollama python
Last synced: 02 Jun 2026
https://github.com/kyegomez/Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minimal learning curve.
inference-engine llama2 llama2-7b llamacpp llamas llm-inference llms opensource
Last synced: 02 Jun 2026
https://github.com/UbiquitousLearning/mllm
Fast Multimodal LLM on Mobile Devices
ai llama llm mobile multimodal
Last synced: 01 Jun 2026
https://github.com/PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 02 Jun 2026
https://github.com/kyegomez/eaot
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
artificial-intelligence gpt4 llama llama2 machine-learning prompt-engineering prompting
Last synced: 02 Jun 2026
https://github.com/adalkiran/llama-nuts-and-bolts
A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.
deep-learning educational-project go golang large-language-models llama llama3-1 machine-learning ml transformers unicode utf-8
Last synced: 02 Jun 2026
https://github.com/sae-llm-coconut/coconut-ai
Python library that eases the installation of Stable Diffusion and lets you generate images through an easy-to-use API.
llama2 python-library stable-diffusion
Last synced: 02 Jun 2026
https://github.com/yihong1120/Traffic-Violation-Report-System
A platform for users to upload and share the responses from law enforcement agencies to their traffic violation reports in Taiwan. This system aims to increase transparency and public oversight of traffic law enforcement.
big-query cloud-computing computer-vision database-design django gcp gemini google-maps-api llama2 machine-learning nginx python raspberry-pi taiwan ubuntu web-development yolov8
Last synced: 02 Jun 2026
https://github.com/jpmanson/llm_templates
Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging Face models.
chatbot cohere gemma huggingface jinja2 library llama2 llama3 llm mistral nlp nlp-library phi3 template
Last synced: 02 Jun 2026
https://github.com/artitw/text2text
Text2Text Language Modeling Toolkit
chatbot chatgpt cross-lingual embeddings information-retrieval levenshtein-distance llama llm multi-lingual nlp question-generation rag search tf-idf tokenizer transformers translator
Last synced: 01 Jun 2026
https://github.com/huggingface/text-generation-inference
Large Language Model Text Generation Inference
bloom deep-learning falcon gpt inference nlp pytorch starcoder transformer
Last synced: 01 Jun 2026
https://github.com/friendliai/friendli-client
[⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI
ai generative-ai gpt gpt3 inference inference-engine inference-server llama2 llm llm-inference llm-ops llm-serving llmops llms mistral ml mlops serving stable-diffusion
Last synced: 02 Jun 2026
https://github.com/dylanhogg/llmgraph
Create knowledge graphs with LLMs
chatgpt gephi gexf graph graphml knowledge-graph large-language-model llama2 llm
Last synced: 02 Jun 2026
https://github.com/withcatai/node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
ai bindings catai cmake cmake-js cuda embedding function-calling gguf gpu grammar json-schema llama llama-cpp llm metal nodejs prebuilt-binaries self-hosted vulkan
Last synced: 01 Jun 2026
https://github.com/LennardZuendorf/thesis-webapp
Web app implementation of my thesis about XAI and interpretability of Transformer models.
bertviz gradio huggingface interpretable-ai llama2 mistral shap xai
Last synced: 02 Jun 2026
https://github.com/bigsk1/voice-chat-ai
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro, Typecast or xAI
ai-speech ai-voice ai-voice-agent anthropic-claude conversational-ai elevenlabs-api fastapi ollama selfhosted tts typecast voice-ai webrtc whisper-ai xai xai-tts
Last synced: 02 Jun 2026
https://github.com/Loguru-AI/Loguru-CLI
An interactive command-line interface that brings intelligence to your logs.
ai artificial-intelligence gen-ai generative-ai llama llama3 llm log log-ai log-analysis log-analytics log-intelligence logs-ai logs-intelligence ollama
Last synced: 02 Jun 2026
https://github.com/bolna-ai/bolna
Conversational voice AI agents
agentic-ai agents ai-agents cartesia conversational-ai deepgram deepseek deepseek-chat elevenlabs function-calling gpt-4 llama openai plivo twilio voice-agents voice-ai-agents voice-assistant whisper
Last synced: 01 Jun 2026
https://github.com/kyegomez/M2PT
Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities"
ai attention attention-is-all-you-need gpt4 gpt5 llama ml models mulit-modality multi-modal
Last synced: 02 Jun 2026
https://github.com/HectorPulido/discord-bot-LLama
A chatbot made with Python that simulates natural conversation with users. It is designed for the Discord platform and provides an interactive experience. LLaMA can run on the user's hardware or in Colab.
ai chatbot llama
Last synced: 02 Jun 2026
https://github.com/ParthSareen/ducky
Natural language to bash commands. Run, understand, copy bash commands generated by an LLM
agent ai-agent bash llm ollama
Last synced: 02 Jun 2026
https://github.com/Dino-Kupinic/blackrose
fastapi llama3 meta-ai ollama python3
Last synced: 02 Jun 2026
https://github.com/explosion/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta
Last synced: 01 Jun 2026
https://github.com/AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 01 Jun 2026
https://github.com/hyperonym/basaran
Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
generative gpt huggingface language-model llama llm model natural-language-processing nlp openai-api python text-generation transformers
Last synced: 01 Jun 2026
https://github.com/scisharp/llamasharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel
Last synced: 01 Jun 2026
https://github.com/srikanth235/benchllama
Benchmark your local LLMs.
ai benchmark code-completion codellama deepseek-coder gen-ai llm ollama
Last synced: 01 Jun 2026
https://github.com/theodo-group/GenossGPT
One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT with any LLMs in your app with one line.
api gpt gpt4all huggingface inference llama llm openai private public
Last synced: 01 Jun 2026
https://github.com/Strvm/meta-ai-api
Llama 3 API 70B & 405B (MetaAI Reverse Engineered)
405b 70b ai api llama llama2 llama3 meta
Last synced: 01 Jun 2026
https://github.com/Riccorl/llama-trainer
Llama Trainer Utility
huggingface llama llm llm-inference llm-training llms transformer
Last synced: 01 Jun 2026
https://github.com/AkashKobal/Blog-Generation-Platform
This repository contains code for generating blog content using the LLama 2 language model. It integrates with Streamlit for easy user interaction. Simply input your blog topic, desired word count, and writing style to generate engaging blog content.
akash akashkobal artificialintelligence blog-generation-platform bloggeneration github huggingface huggingface-models llama llama2 machinelearning naturallanguageprocessing nlp nlp-machine-learning python python3 streamlit streamlit-webapp textgeneration
Last synced: 01 Jun 2026
https://github.com/Simatwa/python-tgpt
AI Chat in Terminal + Package + REST-API
ai blackboxai chatgp chatgpt fastapi gemini gpt koboldai llama llama2 novita openai perplexity poe python-tgpt terminal-gpt tgpt
Last synced: 01 Jun 2026
https://github.com/alexeichhorn/typegpt
Make GPT safe for production
gpt llama llm openai prompt-engineering
Last synced: 01 Jun 2026
https://github.com/ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch
Last synced: 01 Jun 2026
https://github.com/Tiiny-AI/PowerInfer
High-speed Large Language Model Serving for Local Deployment
large-language-models llama llm llm-inference local-inference
Last synced: 01 Jun 2026
https://github.com/TUDB-Labs/mLoRA
An Efficient "Factory" to Build Multiple LoRA Adapters
baichuan chatglm dpo finetune gpu llama llama2 llm lora mlora peft rlhf
Last synced: 01 Jun 2026
https://github.com/shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
chatgpt dpo gpt llama llm medical medicalgpt
Last synced: 01 Jun 2026
https://github.com/Agora-Lab-AI/Atom
a suite of finetuned LLMs for atomically precise function calling 🧪
ai artificial-intelligence convolutional-neural-networks function-calling gpt-4 llama llama2 llamacpp ml multi-modal open-source rpa rpc task-automation tool-usage transformer workflow-automation
Last synced: 01 Jun 2026
https://github.com/tenstorrent/tt-metal
🤘 TT-NN operator library, and TT-Metalium low-level kernel programming model.
accelerator ai cuda deepseek gpu img-gen kernels llama llm metal scale-out stable-diffusion tenstorrent video-gen
Last synced: 01 Jun 2026
https://github.com/vinhnx/VT.ai
VT.ai - multimodal AI chat app with dynamic conversation routing
agent ai assistant assistant-chat-bots chatbot dalle function-calling llama llamacpp llm llms multimodal ollama openai python tool-use
Last synced: 01 Jun 2026
https://github.com/h2oai/h2ogpt
Private chat with local GPT over documents, images, video, etc. 100% private, Apache 2.0. Supports Ollama, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
ai chatgpt embeddings fedramp generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore
Last synced: 01 Jun 2026
https://github.com/shibing624/textgen
TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet, and so on. Implements training and inference for models including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, and UDA, ready to use out of the box.
bart bert chatglm chatgpt gpt2 llama seq2seq t5 text-generation textgen xlnet
Last synced: 01 Jun 2026
https://github.com/Picovoice/picollm
On-device LLM Inference Powered by X-Bit Quantization
compression efficient-inference gemma generative-ai language-model language-models large-language-model llama llama2 llama3 llm llm-inference llms mistral mixtral model-compression natural-language-processing quantization self-hosted
Last synced: 01 Jun 2026
Statistics
- Projects: 2,239
- Last updated: almost 2 years ago