Ecosyste.ms: Summary
An open API service providing a high-level summary for open source projects.
Collections: awesome-llama
https://github.com/vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
amd cuda gpt inference inferentia llama llm llm-serving llmops mlops model-serving pytorch rocm tpu trainium transformer xpu
Last synced: 14 Sep 2024
https://github.com/nomic-ai/gpt4all
gpt4all: run open-source LLMs anywhere
llm-inference
Last synced: 14 Sep 2024
https://github.com/huggingface/text-generation-inference
Large Language Model Text Generation Inference
bloom deep-learning falcon gpt inference nlp pytorch starcoder transformer
Last synced: 14 Sep 2024
https://github.com/PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis, etc.
bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie
Last synced: 14 Sep 2024
https://github.com/lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), multimodal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT chat application.
ai azure-openai chat chatglm chatgpt claude dalle-3 function-calling gemini gpt gpt-4 gpt-4-vision llama2 nextjs ollama openai tts
Last synced: 14 Sep 2024
https://github.com/zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search
Last synced: 14 Sep 2024
https://github.com/run-llama/llama_parse
Parse files for optimal RAG
document parsing pdf pdf-document-processor ppt pptx structured-data
Last synced: 14 Sep 2024
https://github.com/langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
agent ai anthropic backend-as-a-service chatbot gemini genai gpt gpt-4 llama3 llm llmops nextjs openai orchestration python rag workflow workflows
Last synced: 14 Sep 2024
https://github.com/chatchat-space/langchain-chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM), a local-knowledge-based RAG and Agent app built with Langchain and LLMs such as ChatGLM, Qwen, and Llama
chatbot chatchat chatglm chatgpt embedding faiss fastchat gpt knowledge-base langchain langchain-chatglm llama llm milvus ollama qwen rag retrieval-augmented-generation streamlit xinference
Last synced: 14 Sep 2024
https://github.com/internlm/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind
Last synced: 14 Sep 2024
https://github.com/run-llama/LlamaIndexTS
LlamaIndex is a data framework for your LLM applications
agent anthr chatbot claude claude-ai create-llama embedding firewo groq-ai javascript llama llama-index llama2 llama3 llm mistr nodejs openai react typescript
Last synced: 14 Sep 2024
https://github.com/ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
gemma gemma2 go golang llama llama2 llama3 llava llm llms mistral ollama phi3
Last synced: 14 Sep 2024
https://github.com/bentoml/OpenLLM
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint, locally and in the cloud.
ai bentoml falcon fine-tuning llama llama2 llm llm-inference llm-ops llm-serving llmops mistral ml mlops model-inference mpt open-source-llm openllm stablelm vicuna
Last synced: 14 Sep 2024
https://github.com/ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch
Last synced: 14 Sep 2024
https://github.com/mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
chatgpt deep-learning language-model llm tvm webgpu webml
Last synced: 14 Sep 2024
https://github.com/hiyouga/llama-factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers
Last synced: 14 Sep 2024
https://github.com/xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Last synced: 14 Sep 2024
https://github.com/sobelio/llm-chain
`llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tasks
chatgpt langchain llama llm openai rust text-summary
Last synced: 14 Sep 2024
https://github.com/run-llama/llama-hub
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
Last synced: 14 Sep 2024
https://github.com/mangiucugna/json_repair
A python module to repair invalid JSON, commonly used to parse the output of LLMs
deep-learning gpt-4 json llama3 llm machine-learning mistral openai-api parser repair
Last synced: 14 Sep 2024
https://github.com/TheR1D/shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.
chatgpt cheat-sheet cli commands gpt-3 gpt-4 linux llama llm ollama openai productivity python shell terminal
Last synced: 14 Sep 2024
https://github.com/josh-xt/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python
Last synced: 14 Sep 2024
https://github.com/sigoden/aichat
All-in-one AI CLI tool featuring Chat-REPL, Shell Assistant, RAG, AI tools & agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
ai ai-agents all-in-one azure-openai bedrock chatbot claude cli function-calling gemini llm ollama openai rag tool-use vertexai
Last synced: 14 Sep 2024
https://github.com/unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
ai fine-tuning finetuning gemma llama llama3 llms lora mistral phi3 qlora unsloth
Last synced: 14 Sep 2024
https://github.com/haotian-liu/llava
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning
Last synced: 14 Sep 2024
https://github.com/explosion/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta
Last synced: 14 Sep 2024
https://github.com/oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
Last synced: 14 Sep 2024
https://github.com/explosion/spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
anthropic claude cohere dolly falcon gpt-3 gpt-4 large-language-models llama llm machine-learning named-entity-recognition natural-language-processing nlp openai prompt-engineering spacy text-classification
Last synced: 14 Sep 2024
https://github.com/open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai
Last synced: 14 Sep 2024
https://github.com/predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
fine-tuning gpt llama llm llm-inference llm-serving llmops lora model-serving pytorch transformers
Last synced: 14 Sep 2024
https://github.com/floneum/floneum
A toolkit for controllable, private AI on consumer hardware in rust
ai candle floneum-v3 kalosm llama llamacpp llm mistral rust
Last synced: 14 Sep 2024
https://github.com/langroid/langroid
Harness LLMs with Multi-Agent Programming
agents ai chatgpt function-calling gpt gpt-4 gpt4 information-retrieval language-model llama llm llm-agent llm-framework local-llm multi-agent-systems openai-api rag retrieval-augmented-generation
Last synced: 14 Sep 2024
https://github.com/mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
llm machine-learning quantization
Last synced: 14 Sep 2024
https://github.com/internlm/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning
Last synced: 14 Sep 2024
https://github.com/google/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 14 Sep 2024
https://github.com/snowby666/poe-api-wrapper
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀
api chatbot chatgpt claude code-llama dall-e gemini gpt-4 groq llama mistral openai palm2 poe poe-api python quora qwen reverse-engineering stable-diffusion
Last synced: 14 Sep 2024
https://github.com/h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
ai chatgpt embeddings generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore
Last synced: 14 Sep 2024
https://github.com/bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing
Last synced: 14 Sep 2024
https://github.com/withcatai/node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
ai bindings catai cmake cmake-js cuda gguf grammar json-schema llama llama-cpp llm metal nodejs prebuilt-binaries self-hosted
Last synced: 14 Sep 2024
https://github.com/shroominic/codeinterpreter-api
👾 Open source implementation of the ChatGPT Code Interpreter
chatgpt chatgpt-code-generation code-interpreter codeinterpreter langchain llm-agent
Last synced: 14 Sep 2024
https://github.com/kyegomez/zeta
Build high-performance AI models with modular building blocks
artificial-intelligence deep-learning gpt4 llama2 longnet multi-agent-systems multi-modal multi-modal-learning multi-platform pytorch speech-recognition transformer transformers
Last synced: 14 Sep 2024
https://github.com/entropy-research/Devon
Devon: An open-source pair programmer
agent agent-based-framework agent-based-model ai ai-developer ai-software ai-software-engineer code-assistant code-generation developer-tool developer-tools gpt-4 gpt-4o groq llama3 ollama vscode
Last synced: 14 Sep 2024
https://github.com/juncongmoo/pyllama
LLaMA: Open and Efficient Foundation Language Models
Last synced: 14 Sep 2024
https://github.com/SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted
Last synced: 14 Sep 2024
https://github.com/k8sgpt-ai/k8sgpt
Giving Kubernetes Superpowers to everyone
ai devops kubernetes llama openai sre tooling
Last synced: 14 Sep 2024
https://github.com/Atome-FE/llama-node
Believe in AI democratization. llama for Node.js, backed by llama-rs, llama.cpp and rwkv.cpp; works locally on your laptop CPU. Supports llama/alpaca/gpt4all/vicuna/rwkv models.
ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv
Last synced: 14 Sep 2024
https://github.com/flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Last synced: 14 Sep 2024
https://github.com/sendbird/sendbird-chat-sdk-javascript
Sendbird Chat SDK for JavaScript.
api-for-chat bard chat-api chat-api-platform chat-platform chat-sdk chatbot-api chatbot-sdk chatgpt communications-platform genai-chatbot genai-chatbot-api genai-chatbot-sdk gpt-powered-chatbot instant-messaging-api llama2 messaging-api messaging-platform messaging-sdk palm2
Last synced: 14 Sep 2024
https://github.com/eidolon-ai/eidolon
The first AI Agent Server: Eidolon is a pluggable Agent SDK and enterprise-ready deployment server for agentic applications
agents generative-ai langchain llama llm openai python services
Last synced: 14 Sep 2024
https://github.com/yoshoku/llama_cpp.rb
llama_cpp provides Ruby bindings for llama.cpp
ai gem llama llm ruby
Last synced: 14 Sep 2024
https://github.com/sendbird/sendbird-uikit-react-native
Build chat in minutes with Sendbird UIKit open source code.
api-for-chat bard chat-api chat-api-platform chat-platform chat-sdk chat-ui chatbot-api chatbot-ui chatgpt communications-platform genai-chatbot genai-chatbot-api gpt-powered-chatbot gpt-ui llama2 messaging-api messaging-platform messaging-sdk palm2
Last synced: 16 Sep 2024
https://github.com/friendliai/friendli-client
Friendli: the fastest serving engine for generative AI
ai generative-ai gpt gpt3 inference inference-engine inference-server llama2 llm llm-inference llm-ops llm-serving llmops llms mistral ml mlops serving stable-diffusion
Last synced: 14 Sep 2024
https://github.com/Simatwa/python-tgpt
AI Chat in Terminal + Package + REST-API
ai chatgpt fastapi gemini gpt python-tgpt terminal-gpt tgpt
Last synced: 14 Sep 2024
https://github.com/mdrokz/rust-llama.cpp
Rust bindings for llama.cpp
api-bindings cpp crates-io ffi llama llama-cpp machine-learning model rust
Last synced: 14 Sep 2024
https://github.com/zjunlp/EasyEdit
[Knowledge Editing] [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
artificial-intelligence baichuan chatgpt easyedit efficient gpt knowledge-editing knowlm large-language-models llama llama2 mistral mmedit model-editing natural-language-processing open-source-project safeedit tool trustworthy-ai unlearning
Last synced: 14 Sep 2024
https://github.com/unifyai/unify
LLMs Run Riot in Production. Get Back in The Driving Seat. Build Your Own Evals, Iterate Quickly, and Go from Prototype to Production in No Time ⚡
ai claude gpt gpt-4 llama2 llm llm-inference llms mixtral openai python
Last synced: 14 Sep 2024
https://github.com/liltom-eth/llama2-webui
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
llama-2 llama2 llm llm-inference
Last synced: 14 Sep 2024
https://github.com/melih-unsal/demogpt
Create 🦜️🔗 LangChain apps by just using prompts. 🌟 Star to support our work!
agent agents ai artificial-intelligence autogpt autonomous-agents chatgpt chatgpt-api demo gpt-4 gpt3-turbo langchain langchain-app langchain-python llama2 llms openai python streamlit streamlit-application
Last synced: 14 Sep 2024
https://github.com/Tongjilibo/bert4torch
An elegant PyTorch implementation of transformers
belle bert bert4keras bert4torch chatglm large-language-models llama llm named-entity-recognition nlp pytorch relation-extraction seq2seq text-classification transformers
Last synced: 14 Sep 2024
https://github.com/smallcloudai/refact
WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding
ai autocompletion chat developer-tools devtools fine-tuning llama2 llms refactoring self-hosted starchat starcoder wizardlm
Last synced: 14 Sep 2024
https://github.com/himself65/LlamaIndexTS
LlamaIndex is a data framework for your LLM applications
Last synced: 14 Sep 2024
https://github.com/abdeladim-s/pyllamacpp
Python bindings for llama.cpp
langchain llama llamacpp llms
Last synced: 14 Sep 2024
https://github.com/dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
large-language-model llm multi-modal segmentation
Last synced: 14 Sep 2024
https://github.com/aiplanethub/beyondllm
Build, evaluate and observe LLM apps
ai artificial-intelligence embeddings evaluate-llm genai generative-ai large-language-models llm llms rag
Last synced: 14 Sep 2024
https://github.com/aws-samples/foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.
bedrock benchmark benchmarking foundation-models generative-ai inferentia llama2 p4d sagemaker
Last synced: 14 Sep 2024
https://github.com/gbaptista/ollama-ai
A Ruby gem for interacting with Ollama's API that allows you to run open source AI LLMs (Large Language Models) locally.
ai alpaca bakllava dolphin llama llama2 llava llm mistral mistral-ai mixtral nano-bots ollama ollama-api openorca vicuna
Last synced: 14 Sep 2024
https://github.com/darrenburns/elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
ai chatgpt claude gemma gpt large-language-models llama llama3 llm mistral mistral-ai mixtral ollama ollama-client ollama-interface phi-3 python terminal tui
Last synced: 14 Sep 2024
https://github.com/georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
ablation-study classification falcon fine-tuning finetuning flan-t5 large-language-models llama2 llm-test lora mistral-7b nlp nlp-machine-learning qlora redpajama summarization unit-testing zephyr
Last synced: 14 Sep 2024
https://github.com/SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
bamboo-7b falcon large-language-models llama llm llm-inference local-inference
Last synced: 14 Sep 2024
https://github.com/artitw/text2text
Text2Text: Crosslingual NLP/G toolkit
backtranslation chatgpt cross-lingual embeddings information-retrieval levenshtein-distance llama llm multi-lingual natural-language-generation natural-language-processing nlp question-answering question-generation search summarization tf-idf tokenizer transformers translator
Last synced: 14 Sep 2024
https://github.com/belladoreai/llama3-tokenizer-js
JS tokenizer for LLaMA 3 and LLaMA 3.1
llama llama3 llm tokenizer
Last synced: 14 Sep 2024
https://github.com/Noeda/rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
Last synced: 14 Sep 2024
https://github.com/dzhng/zod-gpt
Get structured, fully typed, and validated JSON outputs from OpenAI and Anthropic models.
Last synced: 14 Sep 2024
https://github.com/tairov/llama2.py
Inference Llama 2 in one file of pure Python
inference llama llama2 llm machine-learning ml python small-code
Last synced: 14 Sep 2024
https://github.com/dzhng/llm-api
Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.
Last synced: 14 Sep 2024
https://github.com/mybigday/llama.rn
React Native binding of llama.cpp
android ios llama llama-cpp llm react-native
Last synced: 14 Sep 2024
https://github.com/SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec
Last synced: 14 Sep 2024
https://github.com/bolna-ai/bolna
End-to-end platform for building voice first multimodal agents
anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts
Last synced: 14 Sep 2024
https://github.com/zhudotexe/kani
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
chatgpt claude-2 framework function-calling gpt-3 gpt-4 large-language-models llama llama-2 llms microframework openai tool-use
Last synced: 14 Sep 2024
https://github.com/axflow/axflow
The TypeScript framework for AI development
ai llm typescript
Last synced: 14 Sep 2024
https://github.com/safevideo/autollm
Ship RAG based LLM web apps in seconds.
anthropic bedrock cohere fastapi gradio langchain large-language-models llama-index llama2 llm openai palm pypi python retrieval-augmented-generation vector-database vertex-ai
Last synced: 14 Sep 2024
https://github.com/karpathy/llama2.c
Inference Llama 2 in one file of pure C
Last synced: 14 Sep 2024
https://github.com/expectedparrot/edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
anthropic data-labeling deepinfra domain-specific-language experiments llama2 llm llm-agent llm-framework llm-inference market-research mixtral open-source openai python social-science surveys synthetic-data
Last synced: 14 Sep 2024
https://github.com/sendbird/chat-ai-widget
Build AI Chatbot in minutes with Sendbird Chatbot Widget.
bard chatbot chatgpt genai-chatbot gpt-powered-chatbot llama2 widget
Last synced: 14 Sep 2024
https://github.com/aj-archipelago/cortex
Simplify and accelerate AI-powered application development with structured interfaces to models and powerful prompt execution environments.
ai chatgpt gpt-3 gpt-35-turbo gpt-4 graphql langchain llama llama-cpp llamacpp llm openai palm palm2 rest-api vertex-ai
Last synced: 14 Sep 2024
https://github.com/eliranwong/letmedoit
An advanced AI assistant that leverages the capabilities of ChatGPT API, Gemini Pro, AutoGen, and open-source LLMs, enabling it both to engage in conversations and to execute computing tasks on local devices.
ai api autogen chatgpt gemini google interpreter microsoft multimodal openai rag
Last synced: 14 Sep 2024
https://github.com/mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
ai artificial-intelligence browser browser-automation gpt gpt-4 langchain llama llm openai playwright puppeteer scraper
Last synced: 14 Sep 2024
Statistics
- Projects: 2,243
- Last updated: about 2 months ago