Ecosyste.ms: Summary

An open API service providing a high level summary for open source projects.

Collections: awesome-llama

https://github.com/ggerganov/llama.cpp

LLM inference in C/C++

ggml llama

Last synced: 14 Sep 2024

https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

amd cuda gpt inference inferentia llama llm llm-serving llmops mlops model-serving pytorch rocm tpu trainium transformer xpu

Last synced: 14 Sep 2024

https://github.com/nomic-ai/gpt4all

gpt4all: run open-source LLMs anywhere

llm-inference

Last synced: 14 Sep 2024

https://github.com/huggingface/text-generation-inference

Large Language Model Text Generation Inference

bloom deep-learning falcon gpt inference nlp pytorch starcoder transformer

Last synced: 14 Sep 2024

https://github.com/PaddlePaddle/PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie

Last synced: 14 Sep 2024

https://github.com/lobehub/lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

ai azure-openai chat chatglm chatgpt claude dalle-3 function-calling gemini gpt gpt-4 gpt-4-vision llama2 nextjs ollama openai tts

Last synced: 14 Sep 2024

https://github.com/zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search

Last synced: 14 Sep 2024

https://github.com/run-llama/llama_parse

Parse files for optimal RAG

document parsing pdf pdf-document-processor ppt pptx structured-data

Last synced: 14 Sep 2024

https://github.com/langgenius/dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

agent ai anthropic backend-as-a-service chatbot gemini genai gpt gpt-4 llama3 llm llmops nextjs openai orchestration python rag workflow workflows

Last synced: 14 Sep 2024

https://github.com/chatchat-space/langchain-chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

chatbot chatchat chatglm chatgpt embedding faiss fastchat gpt knowledge-base langchain langchain-chatglm llama llm milvus ollama qwen rag retrieval-augmented-generation streamlit xinference

Last synced: 14 Sep 2024

https://github.com/chatchat-space/Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

chatbot chatchat chatglm chatgpt embedding faiss fastchat gpt knowledge-base langchain langchain-chatglm llama llm milvus ollama qwen rag retrieval-augmented-generation streamlit xinference

Last synced: 14 Sep 2024

https://github.com/internlm/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 14 Sep 2024

https://github.com/run-llama/LlamaIndexTS

LlamaIndex is a data framework for your LLM applications

agent anthr chatbot claude claude-ai create-llama embedding firewo groq-ai javascript llama llama-index llama2 llama3 llm mistr nodejs openai react typescript

Last synced: 14 Sep 2024

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

codellama cuda-kernels deepspeed fastertransformer internlm llama llama2 llama3 llm llm-inference turbomind

Last synced: 14 Sep 2024

https://github.com/ollama/ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

gemma gemma2 go golang llama llama2 llama3 llava llm llms mistral ollama phi3

Last synced: 14 Sep 2024

https://github.com/bentoml/OpenLLM

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint, locally and in the cloud.

ai bentoml falcon fine-tuning llama llama2 llm llm-inference llm-ops llm-serving llmops mistral ml mlops model-inference mpt open-source-llm openllm stablelm vicuna

Last synced: 14 Sep 2024

https://github.com/bentoml/openllm

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint, locally and in the cloud.

ai bentoml falcon fine-tuning llama llama2 llm llm-inference llm-ops llm-serving llmops mistral ml mlops model-inference mpt open-source-llm openllm stablelm vicuna

Last synced: 14 Sep 2024

https://github.com/ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch

Last synced: 14 Sep 2024

https://github.com/mlc-ai/web-llm

High-performance In-browser LLM Inference Engine

chatgpt deep-learning language-model llm tvm webgpu webml

Last synced: 14 Sep 2024

https://github.com/hiyouga/llama-factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 14 Sep 2024

https://github.com/hiyouga/LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers

Last synced: 14 Sep 2024

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 14 Sep 2024

https://github.com/sobelio/llm-chain

`llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tasks

chatgpt langchain llama llm openai rust text-summary

Last synced: 14 Sep 2024

https://github.com/run-llama/llama-hub

A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain

Last synced: 14 Sep 2024

https://github.com/mangiucugna/json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

deep-learning gpt-4 json llama3 llm machine-learning mistral openai-api parser repair

Last synced: 14 Sep 2024

https://github.com/TheR1D/shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

chatgpt cheat-sheet cli commands gpt-3 gpt-4 linux llama llm ollama openai productivity python shell terminal

Last synced: 14 Sep 2024

https://github.com/josh-xt/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 14 Sep 2024

https://github.com/josh-xt/agixt

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 14 Sep 2024

https://github.com/Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 14 Sep 2024

https://github.com/sigoden/aichat

All-in-one AI CLI tool featuring Chat-REPL, Shell Assistant, RAG, AI tools & agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

ai ai-agents all-in-one azure-openai bedrock chatbot claude cli function-calling gemini llm ollama openai rag tool-use vertexai

Last synced: 14 Sep 2024

https://github.com/unslothai/unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

ai fine-tuning finetuning gemma llama llama3 llms lora mistral phi3 qlora unsloth

Last synced: 14 Sep 2024

https://github.com/haotian-liu/llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning

Last synced: 14 Sep 2024

https://github.com/haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning

Last synced: 14 Sep 2024

https://github.com/explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta

Last synced: 14 Sep 2024

https://github.com/oobabooga/text-generation-webui

A Gradio web UI for Large Language Models.

Last synced: 14 Sep 2024

https://github.com/explosion/spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

anthropic claude cohere dolly falcon gpt-3 gpt-4 large-language-models llama llm machine-learning named-entity-recognition natural-language-processing nlp openai prompt-engineering spacy text-classification

Last synced: 14 Sep 2024

https://github.com/open-compass/opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

benchmark chatgpt evaluation large-language-model llama2 llama3 llm openai

Last synced: 14 Sep 2024

https://github.com/predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuning gpt llama llm llm-inference llm-serving llmops lora model-serving pytorch transformers

Last synced: 14 Sep 2024

https://github.com/floneum/floneum

A toolkit for controllable, private AI on consumer hardware in rust

ai candle floneum-v3 kalosm llama llamacpp llm mistral rust

Last synced: 14 Sep 2024

https://github.com/langroid/langroid

Harness LLMs with Multi-Agent Programming

agents ai chatgpt function-calling gpt gpt-4 gpt4 information-retrieval language-model llama llm llm-agent llm-framework local-llm multi-agent-systems openai-api rag retrieval-augmented-generation

Last synced: 14 Sep 2024

https://github.com/mobiusml/hqq

Official implementation of Half-Quadratic Quantization (HQQ)

llm machine-learning quantization

Last synced: 14 Sep 2024

https://github.com/internlm/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 14 Sep 2024

https://github.com/InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning

Last synced: 14 Sep 2024

https://github.com/google/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 14 Sep 2024

https://github.com/snowby666/poe-api-wrapper

👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀

api chatbot chatgpt claude code-llama dall-e gemini gpt-4 groq llama mistral openai palm2 poe poe-api python quora qwen reverse-engineering stable-diffusion

Last synced: 14 Sep 2024

https://github.com/h2oai/h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

ai chatgpt embeddings generative gpt gpt4all llama2 llm mixtral pdf private privategpt vectorstore

Last synced: 14 Sep 2024

https://github.com/bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing

Last synced: 14 Sep 2024

https://github.com/withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Force a JSON schema on the model output on the generation level

ai bindings catai cmake cmake-js cuda gguf grammar json-schema llama llama-cpp llm metal nodejs prebuilt-binaries self-hosted

Last synced: 14 Sep 2024

https://github.com/shroominic/codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

chatgpt chatgpt-code-generation code-interpreter codeinterpreter langchain llm-agent

Last synced: 14 Sep 2024

https://github.com/kyegomez/zeta

Build high-performance AI models with modular building blocks

artificial-intelligence deep-learning gpt4 llama2 longnet multi-agent-systems multi-modal multi-modal-learning multi-platform pytorch speech-recognition transformer transformers

Last synced: 14 Sep 2024

https://github.com/entropy-research/Devon

Devon: An open-source pair programmer

agent agent-based-framework agent-based-model ai ai-developer ai-software ai-software-engineer code-assistant code-generation developer-tool developer-tools gpt-4 gpt-4o groq llama3 ollama vscode

Last synced: 14 Sep 2024

https://github.com/juncongmoo/pyllama

LLaMA: Open and Efficient Foundation Language Models

Last synced: 14 Sep 2024

https://github.com/SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

ai auto-completion developer-tools ide language-client llama llamacpp llm lsp mistral openai self-hosted

Last synced: 14 Sep 2024

https://github.com/ggerganov/llama.cpp/

LLM inference in C/C++

ggml llama

Last synced: 14 Sep 2024

https://github.com/k8sgpt-ai/k8sgpt

Giving Kubernetes Superpowers to everyone

ai devops kubernetes llama openai sre tooling

Last synced: 14 Sep 2024

https://github.com/Atome-FE/llama-node

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.

ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv

Last synced: 14 Sep 2024

https://github.com/atome-fe/llama-node

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.

ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv

Last synced: 14 Sep 2024

https://github.com/flexflow/FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Last synced: 14 Sep 2024

https://github.com/sendbird/sendbird-chat-sdk-javascript

Sendbird Chat SDK for JavaScript.

api-for-chat bard chat-api chat-api-platform chat-platform chat-sdk chatbot-api chatbot-sdk chatgpt communications-platform genai-chatbot genai-chatbot-api genai-chatbot-sdk gpt-powered-chatbot instant-messaging-api llama2 messaging-api messaging-platform messaging-sdk palm2

Last synced: 14 Sep 2024

https://github.com/eidolon-ai/eidolon

The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications

agents generative-ai langchain llama llm openai python services

Last synced: 14 Sep 2024

https://github.com/yoshoku/llama_cpp.rb

llama_cpp provides Ruby bindings for llama.cpp

ai gem llama llm ruby

Last synced: 14 Sep 2024

https://github.com/sendbird/sendbird-uikit-react-native

Build chat in minutes with Sendbird UIKit open source code.

api-for-chat bard chat-api chat-api-platform chat-platform chat-sdk chat-ui chatbot-api chatbot-ui chatgpt communications-platform genai-chatbot genai-chatbot-api gpt-powered-chatbot gpt-ui llama2 messaging-api messaging-platform messaging-sdk palm2

Last synced: 16 Sep 2024

https://github.com/friendliai/friendli-client

Friendli: the fastest serving engine for generative AI

ai generative-ai gpt gpt3 inference inference-engine inference-server llama2 llm llm-inference llm-ops llm-serving llmops llms mistral ml mlops serving stable-diffusion

Last synced: 14 Sep 2024

https://github.com/Simatwa/python-tgpt

AI Chat in Terminal + Package + REST-API

ai chatgpt fastapi gemini gpt python-tgpt terminal-gpt tgpt

Last synced: 14 Sep 2024

https://github.com/mdrokz/rust-llama.cpp

LLama.cpp rust bindings

api-bindings cpp crates-io ffi llama llama-cpp machine-learning model rust

Last synced: 14 Sep 2024

https://github.com/zjunlp/EasyEdit

[知识编辑] [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

artificial-intelligence baichuan chatgpt easyedit efficient gpt knowledge-editing knowlm large-language-models llama llama2 mistral mmedit model-editing natural-language-processing open-source-project safeedit tool trustworthy-ai unlearning

Last synced: 14 Sep 2024

https://github.com/unifyai/unify

LLMs Run Riot in Production. Get Back in The Driving Seat. Build Your Own Evals, Iterate Quickly, and Go from Prototype to Production in No Time ⚡

ai claude gpt gpt-4 llama2 llm llm-inference llms mixtral openai python

Last synced: 14 Sep 2024

https://github.com/liltom-eth/llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.

llama-2 llama2 llm llm-inference

Last synced: 14 Sep 2024

https://github.com/melih-unsal/demogpt

Create 🦜️🔗 LangChain apps by just using prompts🌟 Star to support our work! | 只需使用句子即可创建 LangChain 应用程序。 给个star支持我们的工作吧!

agent agents ai artificial-intelligence autogpt autonomous-agents chatgpt chatgpt-api demo gpt-4 gpt3-turbo langchain langchain-app langchain-python llama2 llms openai python streamlit streamlit-application

Last synced: 14 Sep 2024

https://github.com/Tongjilibo/bert4torch

An elegent pytorch implement of transformers

belle bert bert4keras bert4torch chatglm large-language-models llama llm named-entity-recognition nlp pytorch relation-extraction seq2seq text-classification transformers

Last synced: 14 Sep 2024

https://github.com/tongjilibo/bert4torch

An elegent pytorch implement of transformers

belle bert bert4keras bert4torch chatglm large-language-models llama llm named-entity-recognition nlp pytorch relation-extraction seq2seq text-classification transformers

Last synced: 14 Sep 2024

https://github.com/melih-unsal/DemoGPT

Create 🦜️🔗 LangChain apps by just using prompts🌟 Star to support our work! | 只需使用句子即可创建 LangChain 应用程序。 给个star支持我们的工作吧!

agent agents ai artificial-intelligence autogpt autonomous-agents chatgpt chatgpt-api demo gpt-4 gpt3-turbo langchain langchain-app langchain-python llama2 llms openai python streamlit streamlit-application

Last synced: 14 Sep 2024

https://github.com/smallcloudai/refact

WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding

ai autocompletion chat developer-tools devtools fine-tuning llama2 llms refactoring self-hosted starchat starcoder wizardlm

Last synced: 14 Sep 2024

https://github.com/himself65/LlamaIndexTS

LlamaIndex is a data framework for your LLM applications

Last synced: 14 Sep 2024

https://github.com/abdeladim-s/pyllamacpp

Python bindings for llama.cpp

langchain llama llamacpp llms

Last synced: 14 Sep 2024

https://github.com/dvlab-research/LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

large-language-model llm multi-modal segmentation

Last synced: 14 Sep 2024

https://github.com/aiplanethub/beyondllm

Build, evaluate and observe LLM apps

ai artificial-intelligence embeddings evaluate-llm genai generative-ai large-language-models llm llms rag

Last synced: 14 Sep 2024

https://github.com/aws-samples/foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.

bedrock benchmark benchmarking foundation-models generative-ai inferentia llama2 p4d sagemaker

Last synced: 14 Sep 2024

https://github.com/gbaptista/ollama-ai

A Ruby gem for interacting with Ollama's API that allows you to run open source AI LLMs (Large Language Models) locally.

ai alpaca bakllava dolphin llama llama2 llava llm mistral mistral-ai mixtral nano-bots ollama ollama-api openorca vicuna

Last synced: 14 Sep 2024

https://github.com/darrenburns/elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

ai chatgpt claude gemma gpt large-language-models llama llama3 llm mistral mistral-ai mixtral ollama ollama-client ollama-interface phi-3 python terminal tui

Last synced: 14 Sep 2024

https://github.com/georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

ablation-study classification falcon fine-tuning finetuning flan-t5 large-language-models llama2 llm-test lora mistral-7b nlp nlp-machine-learning qlora redpajama summarization unit-testing zephyr

Last synced: 14 Sep 2024

https://github.com/SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

bamboo-7b falcon large-language-models llama llm llm-inference local-inference

Last synced: 14 Sep 2024

https://github.com/artitw/text2text

Text2Text: Crosslingual NLP/G toolkit

backtranslation chatgpt cross-lingual embeddings information-retrieval levenshtein-distance llama llm multi-lingual natural-language-generation natural-language-processing nlp question-answering question-generation search summarization tf-idf tokenizer transformers translator

Last synced: 14 Sep 2024

https://github.com/belladoreai/llama3-tokenizer-js

JS tokenizer for LLaMA 3 and LLaMA 3.1

llama llama3 llm tokenizer

Last synced: 14 Sep 2024

https://github.com/Noeda/rllama

Rust+OpenCL+AVX2 implementation of LLaMA inference code

Last synced: 14 Sep 2024

https://github.com/dzhng/zod-gpt

Get structured, fully typed, and validated JSON outputs from OpenAI and Anthropic models.

Last synced: 14 Sep 2024

https://github.com/tairov/llama2.py

Inference Llama 2 in one file of pure Python

inference llama llama2 llm machine-learning ml python small-code

Last synced: 14 Sep 2024

https://github.com/dzhng/llm-api

Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.

Last synced: 14 Sep 2024

https://github.com/mybigday/llama.rn

React Native binding of llama.cpp

android ios llama llama-cpp llm react-native

Last synced: 14 Sep 2024

https://github.com/SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec

Last synced: 14 Sep 2024

https://github.com/bolna-ai/bolna

End-to-end platform for building voice first multimodal agents

anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts

Last synced: 14 Sep 2024

https://github.com/zhudotexe/kani

kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)

chatgpt claude-2 framework function-calling gpt-3 gpt-4 large-language-models llama llama-2 llms microframework openai tool-use

Last synced: 14 Sep 2024

https://github.com/axflow/axflow

The TypeScript framework for AI development

ai llm typescript

Last synced: 14 Sep 2024

https://github.com/safevideo/autollm

Ship RAG based LLM web apps in seconds.

anthropic bedrock cohere fastapi gradio langchain large-language-models llama-index llama2 llm openai palm pypi python retrieval-augmented-generation vector-database vertex-ai

Last synced: 14 Sep 2024

https://github.com/karpathy/llama2.c

Inference Llama 2 in one file of pure C

Last synced: 14 Sep 2024

https://github.com/expectedparrot/edsl

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

anthropic data-labeling deepinfra domain-specific-language experiments llama2 llm llm-agent llm-framework llm-inference market-research mixtral open-source openai python social-science surveys synthetic-data

Last synced: 14 Sep 2024

https://github.com/sendbird/chat-ai-widget

Build AI Chatbot in minutes with Sendbird Chatbot Widget.

bard chatbot chatgpt genai-chatbot gpt-powered-chatbot llama2 widget

Last synced: 14 Sep 2024

https://github.com/aj-archipelago/cortex

Simplify and accelerate AI-powered application development with structured interfaces to models and powerful prompt execution environments.

ai chatgpt gpt-3 gpt-35-turbo gpt-4 graphql langchain llama llama-cpp llamacpp llm openai palm palm2 rest-api vertex-ai

Last synced: 14 Sep 2024

https://github.com/eliranwong/letmedoit

An advanced AI assistant that leverages the capabilities of ChatGPT API, Gemini Pro, AutoGen, and open-source LLMs, enabling it both to engage in conversations and to execute computing tasks on local devices.

ai api autogen chatgpt gemini google interpreter microsoft multimodal openai rag

Last synced: 14 Sep 2024

https://github.com/mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

ai artificial-intelligence browser browser-automation gpt gpt-4 langchain llama llm openai playwright puppeteer scraper

Last synced: 14 Sep 2024

Statistics

  • Projects: 2,243
  • Last updated: about 2 months ago