open-webui/open-webui
open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Open source AI stack map for local, private, and own-infrastructure deployments.
Self-hosted AI directory
Use this page when you want the complete self-hosted AI map: chat interfaces, runtimes, RAG systems, retrieval databases, agent/workflow builders, and coding tools.
Stack map, not mega-list
The recommendations are exact registry-backed projects across the main self-hosted AI layers instead of a fuzzy keyword dump.
Choose the right layer
Use chat apps for users, runtimes for local models, RAG tools for private knowledge, vector databases for retrieval, and agents/dev tools for automation.
Why it works
Chat and local inference
Open WebUI, LobeChat, LibreChat, AnythingLLM, Onyx, Jan, Ollama, LocalAI, llama.cpp, and vLLM cover the user interface and model-serving layers.
Private knowledge and retrieval
DocsGPT, Kotaemon, LlamaIndex, Haystack, R2R, RAGFlow, Qdrant, Chroma, Weaviate, Milvus, LanceDB, and pgvector cover RAG applications and retrieval infrastructure.
Agents, workflows, and code
Dify, CrewAI, AutoGen, the n8n starter kit, Tabby, Continue, and Aider cover self-hosted app building, automation, and developer workflows.
Curated repositories
open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
danny-avila
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
Mintplex-Labs
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
onyx-dot-app
Open Source AI Platform - AI Chat with advanced features that works with every LLM
janhq
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
ollama
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
mudler
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
ggml-org
LLM inference in C/C++
vllm-project
A high-throughput and memory-efficient inference and serving engine for LLMs
arc53
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Cinnamon
An open-source RAG-based tool for chatting with your documents.
run-llama
LlamaIndex is the leading document agent and OCR platform
deepset-ai
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
SciPhi-AI
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
infiniflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
weaviate
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
rag-web-ui
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.
microsoft
A modular graph-based Retrieval-Augmented Generation (RAG) system
HKUDS
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
microsoft
Research project. A Memory solution for users, teams, and applications.
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
chroma-core
Data infrastructure for AI
weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
milvus-io
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
lancedb
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
pgvector
Open-source vector similarity search for Postgres
langgenius
Production-ready platform for agentic workflow development.
crewAIInc
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
microsoft
A programming framework for agentic AI
n8n-io
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.
TabbyML
Self-hosted AI coding assistant
continuedev
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
Aider-AI
aider is AI pair programming in your terminal
Related pages
Self-hosted ChatGPT alternatives
Private assistant apps and team chat portals for people who want a familiar front end around local or private models.
Local model runtimes and inference servers
Private inference stacks for running models locally or exposing an OpenAI-compatible endpoint inside your own infrastructure.
Self-hosted RAG tools
Document search, connectors, and knowledge assistants for private corpora and retrieval-heavy AI products.
Vector databases and retrieval storage
Storage and search layers for embeddings, filtering, persistence, and semantic retrieval at scale.
Agents, workflows, and app builders
Workflow engines, agent systems, and app builders for repeatable internal automation instead of one-off chat.
AI developer tools
Self-hostable coding assistants and repo-aware tools for local or private developer workflows.
FAQ
A self-hosted AI tool is an open source project that can reasonably run on local hardware, private servers, containers, or your own cloud account as part of an AI workflow.
No. Some repositories are models, examples, API clients, or low-level libraries. This page favors projects that form a useful self-hosted stack layer: interface, runtime, RAG system, retrieval store, agent platform, or coding tool.