onyx-dot-app/onyx
onyx-dot-app
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Self-hosted RAG
Use this page when your AI system needs to search private documents, connect to internal knowledge, or ground answers in your own data.
Apps and building blocks
The list includes ready-to-use document assistants as well as frameworks and engines for custom RAG pipelines.
Private data focus
These projects are useful when retrieval quality, connectors, indexing, and deployment control matter.
Why it works
Document chat apps
AnythingLLM, Onyx, DocsGPT, Kotaemon, Khoj, Verba, and RAG Web UI are closer to ready-to-use private knowledge assistants.
RAG frameworks and engines
LlamaIndex, Haystack, R2R, RAGFlow, txtai, llmware, RAG-Anything, GraphRAG, and LightRAG fit teams building or customizing the retrieval pipeline.
Visual and advanced RAG builders
Dify, Flowise, Langflow, GraphRAG, and LightRAG are useful when teams need workflows, agents, or graph-based retrieval instead of a simple document-chat app.
Curated repositories
onyx-dot-app
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Mintplex-Labs
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
arc53
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Cinnamon
An open-source RAG-based tool for chatting with your documents.
run-llama
LlamaIndex is the leading document agent and OCR platform
deepset-ai
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
SciPhi-AI
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
infiniflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
weaviate
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
rag-web-ui
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.
microsoft
A modular graph-based Retrieval-Augmented Generation (RAG) system
HKUDS
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
khoj-ai
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
langgenius
Production-ready platform for agentic workflow development.
FlowiseAI
Build AI Agents, Visually
langflow-ai
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
neuml
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
llmware-ai
Unified framework for building enterprise RAG pipelines with small, specialized models
HKUDS
"RAG-Anything: All-in-One RAG Framework"
Selection guide
Separate ready-to-use knowledge-base apps from developer frameworks and retrieval infrastructure. The right choice depends on whether users need a product UI or engineers need a pipeline.
For private knowledge assistants
Onyx, AnythingLLM, DocsGPT, Kotaemon, Khoj, Verba, and RAG Web UI are closer to end-user document chat products.
For custom RAG pipelines
LlamaIndex, Haystack, R2R, RAGFlow, txtai, llmware, RAG-Anything, GraphRAG, and LightRAG are better framed as frameworks, engines, or research-oriented building blocks.
For visual RAG builders
Dify, Flowise, and Langflow fit teams that want to assemble knowledge-base apps, agents, and retrieval flows without starting from raw framework code.
Production fit
A rankable RAG page should make the distinction clear. Some projects are production-facing apps; others are libraries, methods, or experimental memory systems.
Strongest app fit
Onyx, AnythingLLM, DocsGPT, Kotaemon, and Khoj are the best matches for private knowledge-base and document-chat intent.
Adjacent but useful
Dify, Flowise, Langflow, GraphRAG, and LightRAG are valuable, but should not be sold as simple turnkey document-chat apps.
Retrieval stack
Many RAG systems use vector search, but the vector database is only one layer. Ingestion, chunking, metadata, reranking, citations, access control, and evaluation can matter more than the store alone.
Use vector stores as infrastructure
Qdrant, Chroma, Weaviate, Milvus, LanceDB, and pgvector support RAG, but they are not complete RAG products by themselves.
Suggested additions
PrivateGPT
zylon-ai/private-gpt
A private document Q&A project focused on local/offline RAG workflows. Strong fit for this page once it is present in the upstream registry.
View repositoryFastGPT
labring/FastGPT
A knowledge-base platform with RAG retrieval, data processing, and visual AI workflows. Strong product fit, with licensing details to review before commercial use.
View repositoryMaxKB
1Panel-dev/MaxKB
A self-hostable knowledge-base and agent platform built around RAG pipelines, document ingestion, and enterprise-style assistant workflows.
View repositoryQuivr
QuivrHQ/quivr
A self-hostable second-brain style RAG project. Relevant to private knowledge-base intent, though its current positioning is broader than simple document chat.
View repositoryDB-GPT
eosphoros-ai/DB-GPT
A self-hosted AI data assistant with private knowledge and RAG capabilities. Better for data-heavy assistant workflows than simple file-based document chat.
View repositorylocalGPT
PromtEngineer/localGPT
A local document-chat project focused on private, on-device Q&A. Useful candidate, but narrower and less platform-like than the main curated options.
View repositorySurfSense
MODSetter/SurfSense
A privacy-focused NotebookLM-style project for teams. Promising fit for private knowledge workflows, but newer and less established than the main list.
View repositoryRelated pages
Self-hosted ChatGPT alternatives
Chat interfaces and assistant apps you can run with local models, private endpoints, or your own hosted providers.
Local model runtimes and inference servers
Inference servers and local runtimes for serving models on your machine, server, or private cloud.
Vector databases and retrieval storage
Databases and search layers for embeddings, metadata filtering, persistence, and semantic retrieval.
Agents, workflows, and app builders
Agent frameworks, workflow engines, and app builders for repeatable AI-powered processes.
AI developer tools
Coding assistants and repo-aware tools that can run locally or inside private development environments.
FAQ
It is an open source project that helps language model apps retrieve private or custom knowledge while running in your own environment as an app, framework, service, or retrieval database.
Often, but not always. Many RAG systems use vector search, but smaller or specialized deployments may use hybrid search, full-text search, reranking, or framework-managed indexes.
Only when they are part of a broader RAG product. Tools that only parse PDFs, crawl websites, or convert files are useful inputs, but they are not enough for this page on their own.
A RAG app gives users a product interface for document search or chat. A RAG framework gives developers building blocks for ingestion, retrieval, reranking, evaluation, or custom pipelines.
Onyx and AnythingLLM are the strongest fits for team-oriented private knowledge bases. DocsGPT, Khoj, RAGFlow, R2R, Dify, Flowise, Langflow, and Kotaemon can also fit depending on whether the team needs a product UI, API-first retrieval, visual workflows, or deeper document processing.