Collections: awesome-llama

https://github.com/xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 01 Jun 2026

https://github.com/JuliusHaring/chatbot-template

A comprehensive chatbot system with integrated LLM querying and Messenger Bot interfacing capabilities. To be used as a template for implementation.

chatbot chatgpt chatgpt-api llama llamaindex telegram telegram-bot vectorstore

Last synced: 02 Jun 2026

https://github.com/ayaka14732/llama-2-jax

JAX implementation of the Llama 2 model

jax llama llama2 natural-language-processing nlp

Last synced: 02 Jun 2026

https://github.com/kyegomez/SwitchTransformers

Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"

ai gpt4 llama mixture-model mixture-of-experts mixture-of-models ml moe multi-modal

Last synced: 02 Jun 2026

https://github.com/vemonet/libre-chat

🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline capable and easy to set up.

chatbot chatgpt langchain large-language-models llm llm-inference openapi self-hosted

Last synced: 02 Jun 2026

https://github.com/c0sogi/llama-api

An OpenAI-like LLaMA inference API

api exllama fastapi llama llamacpp

Last synced: 02 Jun 2026

https://github.com/Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 01 Jun 2026

https://github.com/ErikBjare/are-copilots-local-yet

Are Copilots Local Yet? The frontier of local LLM Copilots for code completion, project generation, shell assistance, and more. Find tools shaping tomorrow's developer experience, today!

copilot github-copilot llama llm openai starcoder wizardcoder

Last synced: 02 Jun 2026

https://github.com/run-llama/llama_cloud_services

Knowledge Agents and Management in the Cloud

document document-parser document-parsing docx-to-markdown parsing pdf pdf-document-processor pdf-to-excel pdf-to-json pdf-to-markdown pdf-to-text ppt-to-json ppt-to-markdown pptx structured-data tables

Last synced: 01 Jun 2026

https://github.com/ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

ai-crawler ai-scraping ai-search crawler data-extraction firecrawl-alternative large-language-model llm markdown rag scraping scraping-python web-crawler web-crawlers web-data web-data-extraction web-scraper web-scraping web-search webscraping

Last synced: 01 Jun 2026

https://github.com/kyegomez/Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OpenAI

artificial-intelligence finetuning gpt4 gpt4-api gpt4vision llama machine-learning

Last synced: 02 Jun 2026

https://github.com/shoutsid/townhall

A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and function orchestration, enhancing and expanding traditional chatbot capabilities.

agent-based agent-based-framework ai autogen chat-application chatbot gpt gpt-2 gpt-3 gpt2 gpt3-turbo gpt4 llama llm tinygrad

Last synced: 02 Jun 2026

https://github.com/ggml-org/llama.cpp

LLM inference in C/C++

ggml

Last synced: 01 Jun 2026

https://github.com/laelhalawani/gguf_llama

Wrapper for simplified use of Llama2 GGUF quantized models.

cpu-inference gguf llama llama2 llamacpp quantization

Last synced: 02 Jun 2026

https://github.com/cycneuramus/signal-aichat

An AI chatbot for Signal powered by Google Bard, Bing Chat, ChatGPT, HuggingChat, and llama.cpp

ai-bot bard bing-chat chatgpt chatgpt-bot google-bard huggingchat llama llamacpp signal-bot signal-messenger

Last synced: 02 Jun 2026

https://github.com/aiplanethub/beyondllm

Build, evaluate and observe LLM apps

ai artificial-intelligence embeddings evaluate-llm genai generative-ai hacktoberfest hacktoberfest-accepted hacktoberfest2024 large-language-models llm llms rag

Last synced: 02 Jun 2026

https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference kimi llama llm llm-serving model-serving moe openai pytorch qwen qwen3 tpu transformer

Last synced: 01 Jun 2026

https://github.com/rbourgeat/llm-rp

✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙‍♂️

ai characterai chat game ggml llama llama-cpp llm roleplay stable-diffusion

Last synced: 01 Jun 2026

https://github.com/aiplanethub/genai-stack

An End to End GenAI Framework

ai chatgpt data-engineering datascientist genai hacktoberfest hacktoberfest-accepted hacktoberfest2023 langchain llama llama-index llm llmops mlops

Last synced: 02 Jun 2026

https://github.com/shroominic/codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

chatgpt chatgpt-code-generation code-interpreter codeinterpreter langchain llm-agent

Last synced: 02 Jun 2026

https://github.com/kyegomez/AttnWithConvolutions

Interleaved attention with convolutions for text modeling

artificial-intelligence attention attention-mechanism convolution convolutional-neural-networks gpt4 llama machine-learning machine-learning-algorithms

Last synced: 02 Jun 2026

https://github.com/HubertKasperek/ai-companion-py

Python bindings for ai-companion (only backend, without WebUI)

chatbot library llama llm python

Last synced: 02 Jun 2026

https://github.com/livingbio/fuzzy-json

Fuzzy-JSON is a compact Python package with no dependencies, designed to address the pesky JSONDecodeError that sometimes occurs when utilizing OpenAI's powerful call function.

json llama llm openai openai-chatgpt python

Last synced: 01 Jun 2026

https://github.com/SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec

Last synced: 02 Jun 2026

https://github.com/RAHB-REALTORS-Association/transcriber-describer

Transcribes videos and describes them with OpenAI APIs or local models.

ai automation docker llama llama-cpp local-ai openai openai-api python whisper whisper-cpp

Last synced: 02 Jun 2026

https://github.com/hitz-zentroa/GoLLIE

Guideline following Large Language Model for Information Extraction

code-llama event-extraction gollie guidelines hugginface-hub huggingface inference information-extraction llama llama2 llm llms named-entity-recognition relation-extraction state-of-the-art text-generation training transformer

Last synced: 02 Jun 2026

https://github.com/higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

cluster-management deep-learning distributed llama llama2 llm machine-learning mlops pytorch

Last synced: 01 Jun 2026

https://github.com/maclandrol/molfeat-hype

Can ChatGPT generate molecular features?

Last synced: 01 Jun 2026

https://github.com/unifyai/unify

Notion for AI Observability 📊

ai claude gpt gpt-4 llama2 llm llm-inference llms mixtral openai python

Last synced: 02 Jun 2026

https://github.com/flojud/DocsChat

The chatbot utilizes a conversational retrieval chain to answer user queries based on the content of embedded documents. It leverages various NLP techniques, including language models and embeddings, to provide relevant responses.

Last synced: 01 Jun 2026

https://github.com/karpathy/llama2.c

Inference Llama 2 in one file of pure C

Last synced: 02 Jun 2026

https://github.com/di-osc/osc-llm

Lightweight inference engine for large models

llama llama-3 llm

Last synced: 01 Jun 2026

https://github.com/LLukas22/llm-rs-python

Unofficial python bindings for the rust llm library. 🐍❤️🦀

llama llm python rust

Last synced: 02 Jun 2026

https://github.com/laelhalawani/gguf_modeldb

A quick and optimized solution to manage Llama-based GGUF quantized models, download GGUF files, retrieve message formatting, add more models from Hugging Face repos, and more. It's super easy to use and comes prepackaged with the best preconfigured open-source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b, and zephyr 3b.

database hugginface inference llama llama-cpp-python llama2 llm model-database python3

Last synced: 02 Jun 2026

https://github.com/djokester/groqeval

Use groq for evaluations

generative-ai groq llama3 llm llm-as-a-judge llm-as-evaluator mixtral

Last synced: 02 Jun 2026

https://github.com/chelsey0527/ai-resume-rater

An AI-powered resume feedback site

llama2 openai python

Last synced: 02 Jun 2026

https://github.com/internlm/xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

agent deepseek-v3 gpt-oss intern-s1 internvl kimi-k2 llm multimodal qwen3-moe qwen3-vl reinforcement-learning

Last synced: 01 Jun 2026

https://github.com/zhudotexe/kani

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

chatgpt framework function-calling gpt-4 large-language-models llama llms microframework openai tool-use

Last synced: 01 Jun 2026

https://github.com/kyegomez/GATS

Implementation of GATS from the paper: "GATS: Gather-Attend-Scatter" in PyTorch and Zeta

ai attention attention-is-all-you-need attention-mechanism gpt4 llama ml multi-modal multi-modality multimodal open-source

Last synced: 02 Jun 2026

https://github.com/NVIDIA/nim-anywhere

Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench

genai langchain lcel llama llama3 llm nim nvidia nvwb-project rag

Last synced: 02 Jun 2026

https://github.com/PKU-Alignment/beavertails

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

ai-safety beaver datasets gpt human-feedback human-feedback-data language-model large-language-model llama llm llms rlhf safe-rlhf safety

Last synced: 02 Jun 2026

https://github.com/h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

ai chatbot chatgpt fedramp fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training

Last synced: 02 Jun 2026

https://github.com/microsoft/sarathi-serve

A low-latency & high-throughput serving engine for LLMs

llama llm-inference pytorch transformer

Last synced: 02 Jun 2026

https://github.com/sonnhfit/SonAgent

Self-repairing autonomous agent for digital consciousness backup, built on large language models (LLMs) with powerful code generation and the ability to edit and debug its own source code

agent ai autonomus-robots chatgpt code-generation language-model large-language-models llama2 llm ml self self-coding self-debugging self-editing-its-own-source-code self-editing-source-code self-repairing

Last synced: 02 Jun 2026

https://github.com/AGiXT/python-sdk

AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent agi agixt ai artificial automation chromadb intelligence llama llm llmops openai python

Last synced: 02 Jun 2026

https://github.com/momegas/megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵

chatbot faiss fastapi gpt-35-turbo gpt-4 information-retrieval langchain llama natural-language-processing nlp pinecone prompt-engineering python question-answering s3

Last synced: 02 Jun 2026

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 01 Jun 2026

https://github.com/PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie

Last synced: 01 Jun 2026

https://github.com/RAHB-REALTORS-Association/email-autodrafts

Email Auto-ReplAI is a Python tool that uses AI to automate drafting responses to unread Gmail messages, streamlining email management tasks.

ai automation docker email email-draft gmail gmail-api llama llama-cpp local-ai openai openai-api python

Last synced: 02 Jun 2026

https://github.com/SteelPh0enix/unreasonable-llama

Python API for llama.cpp webserver

Last synced: 01 Jun 2026

https://github.com/minggnim/nlp-models

A repository for training transformer based models

chatbot chatbots ctransformers deeplearning falcon fine-tuning gpt-2 langchain llama2 llms multi-label-classification multi-task-learning nlp pytorch qdrant-vector-database transformers

Last synced: 02 Jun 2026

https://github.com/leftmove/cria

Run LLMs locally with as little friction as possible.

collaborate github github-codespaces github-copilot llama llm ollama python

Last synced: 02 Jun 2026

https://github.com/kyegomez/Exa

Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minimal learning curve.

inference-engine llama2 llama2-7b llamacpp llamas llm-inference llms opensource

Last synced: 02 Jun 2026

https://github.com/UbiquitousLearning/mllm

Fast Multimodal LLM on Mobile Devices

ai llama llm mobile multimodal

Last synced: 01 Jun 2026

https://github.com/PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna

Last synced: 02 Jun 2026

https://github.com/kyegomez/eaot

The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"

artificial-intelligence gpt4 llama llama2 machine-learning prompt-engineering prompting

Last synced: 02 Jun 2026

https://github.com/adalkiran/llama-nuts-and-bolts

A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.

deep-learning educational-project go golang large-language-models llama llama3-1 machine-learning ml transformers unicode utf-8

Last synced: 02 Jun 2026

https://github.com/sae-llm-coconut/coconut-ai

Python library that eases the installation of Stable Diffusion and lets you generate images through an easy-to-use API.

llama2 python-library stable-diffusion

Last synced: 02 Jun 2026

https://github.com/yihong1120/Traffic-Violation-Report-System

A platform for users to upload and share the responses from law enforcement agencies to their traffic violation reports in Taiwan. This system aims to increase transparency and public oversight of traffic law enforcement.

big-query cloud-computing computer-vision database-design django gcp gemini google-maps-api llama2 machine-learning nginx python raspberry-pi taiwan ubuntu web-development yolov8

Last synced: 02 Jun 2026

https://github.com/jpmanson/llm_templates

Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging Face models.

chatbot cohere gemma huggingface jinja2 library llama2 llama3 llm mistral nlp nlp-library phi3 template

Last synced: 02 Jun 2026

https://github.com/artitw/text2text

Text2Text Language Modeling Toolkit

chatbot chatgpt cross-lingual embeddings information-retrieval levenshtein-distance llama llm multi-lingual nlp question-generation rag search tf-idf tokenizer transformers translator

Last synced: 01 Jun 2026

https://github.com/huggingface/text-generation-inference

Large Language Model Text Generation Inference

bloom deep-learning falcon gpt inference nlp pytorch starcoder transformer

Last synced: 01 Jun 2026

https://github.com/friendliai/friendli-client

[⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI

ai generative-ai gpt gpt3 inference inference-engine inference-server llama2 llm llm-inference llm-ops llm-serving llmops llms mistral ml mlops serving stable-diffusion

Last synced: 02 Jun 2026

https://github.com/dylanhogg/llmgraph

Create knowledge graphs with LLMs

chatgpt gephi gexf graph graphml knowledge-graph large-language-model llama2 llm

Last synced: 02 Jun 2026

https://github.com/withcatai/node-llama-cpp

Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.

ai bindings catai cmake cmake-js cuda embedding function-calling gguf gpu grammar json-schema llama llama-cpp llm metal nodejs prebuilt-binaries self-hosted vulkan

Last synced: 01 Jun 2026

https://github.com/LennardZuendorf/thesis-webapp

Web application implementation of my thesis on XAI and the interpretability of Transformer models.

bertviz gradio huggingface interpretable-ai llama2 mistral shap xai

Last synced: 02 Jun 2026

https://github.com/bigsk1/voice-chat-ai

🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro, Typecast or xAI

ai-speech ai-voice ai-voice-agent anthropic-claude conversational-ai elevenlabs-api fastapi ollama selfhosted tts typecast voice-ai webrtc whisper-ai xai xai-tts

Last synced: 02 Jun 2026

https://github.com/Loguru-AI/Loguru-CLI

An interactive command-line interface that brings intelligence to your logs.

ai artificial-intelligence gen-ai generative-ai llama llama3 llm log log-ai log-analysis log-analytics log-intelligence logs-ai logs-intelligence ollama

Last synced: 02 Jun 2026

https://github.com/bolna-ai/bolna

Conversational voice AI agents

agentic-ai agents ai-agents cartesia conversational-ai deepgram deepseek deepseek-chat elevenlabs function-calling gpt-4 llama openai plivo twilio voice-agents voice-ai-agents voice-assistant whisper

Last synced: 01 Jun 2026

https://github.com/kyegomez/M2PT

Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities"

ai attention attention-is-all-you-need gpt4 gpt5 llama ml models mulit-modality multi-modal

Last synced: 02 Jun 2026

https://github.com/HectorPulido/discord-bot-LLama

A chatbot made with Python that simulates natural conversation with users. It is designed to be used on the Discord platform, providing an interactive experience for users. LLaMA can run on the user's hardware or in Colab.

ai chatbot llama

Last synced: 02 Jun 2026

https://github.com/ParthSareen/ducky

Natural language to bash commands. Run, understand, copy bash commands generated by an LLM

agent ai-agent bash llm ollama

Last synced: 02 Jun 2026

https://github.com/Dino-Kupinic/blackrose

fastapi llama3 meta-ai ollama python3

Last synced: 02 Jun 2026

https://github.com/explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta

Last synced: 01 Jun 2026

https://github.com/AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 01 Jun 2026

https://github.com/hyperonym/basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

generative gpt huggingface language-model llama llm model natural-language-processing nlp openai-api python text-generation transformers

Last synced: 01 Jun 2026

https://github.com/scisharp/llamasharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

chatbot gpt llama llama-cpp llama2 llama3 llamacpp llava llm multi-modal semantic-kernel

Last synced: 01 Jun 2026

https://github.com/srikanth235/benchllama

Benchmark your local LLMs.

ai benchmark code-completion codellama deepseek-coder gen-ai llm ollama

Last synced: 01 Jun 2026

https://github.com/theodo-group/GenossGPT

One API for all LLMs, private or public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT with any LLM in your app with one line.

api gpt gpt4all huggingface inference llama llm openai private public

Last synced: 01 Jun 2026

https://github.com/Strvm/meta-ai-api

Llama 3 API 70B & 405B (MetaAI Reverse Engineered)

405b 70b ai api llama llama2 llama3 meta

Last synced: 01 Jun 2026

https://github.com/Riccorl/llama-trainer

Llama Trainer Utility

huggingface llama llm llm-inference llm-training llms transformer

Last synced: 01 Jun 2026

https://github.com/AkashKobal/Blog-Generation-Platform

This repository contains code for generating blog content using the Llama 2 language model. It integrates with Streamlit for easy user interaction. Simply input your blog topic, desired word count, and writing style to generate engaging blog content.

akash akashkobal artificialintelligence blog-generation-platform bloggeneration github huggingface huggingface-models llama llama2 machinelearning naturallanguageprocessing nlp nlp-machine-learning python python3 streamlit streamlit-webapp textgeneration

Last synced: 01 Jun 2026

https://github.com/Simatwa/python-tgpt

AI Chat in Terminal + Package + REST-API

ai blackboxai chatgp chatgpt fastapi gemini gpt koboldai llama llama2 novita openai perplexity poe python-tgpt terminal-gpt tgpt

Last synced: 01 Jun 2026

https://github.com/alexeichhorn/typegpt

Make GPT safe for production

gpt llama llm openai prompt-engineering

Last synced: 01 Jun 2026

https://github.com/ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch

Last synced: 01 Jun 2026

https://github.com/Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

large-language-models llama llm llm-inference local-inference

Last synced: 01 Jun 2026

https://github.com/TUDB-Labs/mLoRA

An Efficient "Factory" to Build Multiple LoRA Adapters

baichuan chatglm dpo finetune gpu llama llama2 llm lora mlora peft rlhf

Last synced: 01 Jun 2026

https://github.com/shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

chatgpt dpo gpt llama llm medical medicalgpt

Last synced: 01 Jun 2026

https://github.com/Agora-Lab-AI/Atom

a suite of finetuned LLMs for atomically precise function calling 🧪

ai artificial-intelligence convolutional-neural-networks function-calling gpt-4 llama llama2 llamacpp ml multi-modal open-source rpa rpc task-automation tool-usage transformer workflow-automation

Last synced: 01 Jun 2026

https://github.com/tenstorrent/tt-metal

🤘 TT-NN operator library, and TT-Metalium low-level kernel programming model.

accelerator ai cuda deepseek gpu img-gen kernels llama llm metal scale-out stable-diffusion tenstorrent video-gen

Last synced: 01 Jun 2026

https://github.com/vinhnx/VT.ai

VT.ai - multimodal AI chat app with dynamic conversation routing

agent ai assistant assistant-chat-bots chatbot dalle function-calling llama llamacpp llm llms multimodal ollama openai python tool-use

Last synced: 01 Jun 2026

https://github.com/h2oai/h2ogpt

Private chat with local GPT with documents, images, video, etc. 100% private, Apache 2.0. Supports Ollama, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

ai chatgpt embeddings fedramp generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore

Last synced: 01 Jun 2026

https://github.com/shibing624/textgen

TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. Implements training and prediction for models including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, and UDA, ready to use out of the box.

bart bert chatglm chatgpt gpt2 llama seq2seq t5 text-generation textgen xlnet

Last synced: 01 Jun 2026

https://github.com/Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

compression efficient-inference gemma generative-ai language-model language-models large-language-model llama llama2 llama3 llm llm-inference llms mistral mixtral model-compression natural-language-processing quantization self-hosted