Open source retrieval and knowledge tools for private document AI.
Self-hosted RAG
Use this page to choose open source RAG building blocks that can run in your own environment: document-chat apps, retrieval frameworks, graph RAG engines, and vector databases.
Split by job
The list is grouped into end-user document chat apps, production RAG frameworks, and retrieval storage, rather than mixing every adjacent ingestion utility into one list.
Registry-backed
Every recommendation comes from the synced upstream registry, using exact repositories rather than fuzzy keyword matches.
Why it works
Document chat apps
AnythingLLM, Onyx, DocsGPT, Kotaemon, Verba, and RAG Web UI are closer to ready-to-use private knowledge assistants.
RAG frameworks and engines
LlamaIndex, Haystack, R2R, RAGFlow, GraphRAG, LightRAG, and Kernel Memory are better when you are building or customizing the pipeline.
Vector and retrieval storage
Qdrant, Chroma, Weaviate, Milvus, LanceDB, and pgvector cover the storage/search layer behind many private RAG deployments.
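To make the storage/search layer concrete, here is a minimal sketch of what a vector store does at query time: rank stored embeddings by cosine similarity to a query embedding. It is plain Python with toy data and is not the API of any database listed above. The names `INDEX`, `cosine`, and `top_k`, the file names, and the three-dimensional vectors are all hypothetical. Real deployments typically use approximate nearest-neighbor indexes rather than a full scan.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, index, k=2):
    """Return the k (doc_id, score) pairs most similar to query_vec.

    `index` maps a document id to its embedding vector. This is a brute-force
    scan, which is only reasonable for very small collections.
    """
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy "embeddings"; a real system would get these from an embedding model.
INDEX = {
    "handbook.pdf": [0.9, 0.1, 0.0],
    "invoice.pdf": [0.0, 0.2, 0.9],
    "onboarding.md": [0.8, 0.3, 0.1],
}

if __name__ == "__main__":
    for doc_id, score in top_k([1.0, 0.2, 0.0], INDEX):
        print(f"{doc_id}: {score:.3f}")
```

A dedicated vector database adds persistence, metadata filtering, and indexes that avoid scanning every vector, which is why it becomes its own layer once collections grow.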
Curated repositories
onyx-dot-app
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Mintplex-Labs
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
arc53
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Cinnamon
An open-source RAG-based tool for chatting with your documents.
run-llama
LlamaIndex is the leading document agent and OCR platform
deepset-ai
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
SciPhi-AI
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
infiniflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
weaviate
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
rag-web-ui
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.
microsoft
A modular graph-based Retrieval-Augmented Generation (RAG) system
HKUDS
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
microsoft
Research project. A Memory solution for users, teams, and applications.
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
chroma-core
Data infrastructure for AI
weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
milvus-io
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
lancedb
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
pgvector
Open-source vector similarity search for Postgres
Related pages
Self-hosted ChatGPT alternatives
Private assistant apps and team chat portals for people who want a familiar front end around local or private models.
Local model runtimes and inference servers
Private inference stacks for running models locally or exposing an OpenAI-compatible endpoint inside your own infrastructure.
Vector databases and retrieval storage
Storage and search layers for embeddings, filtering, persistence, and semantic retrieval at scale.
Agents, workflows, and app builders
Workflow engines, agent systems, and app builders for repeatable internal automation instead of one-off chat.
AI developer tools
Self-hostable coding assistants and repo-aware tools for local or private developer workflows.
Self-hosted AI tools
Browse open source AI tools you can run on your own infrastructure, from local LLM apps to RAG, agents, inference, and production tooling.
FAQ
A self-hosted RAG tool is an open source project that retrieves private or custom knowledge for language model apps and can run in your own environment as an app, framework, service, or retrieval database.
A vector database is often needed, but not always. Many RAG systems use vector search, but smaller or specialized deployments may use hybrid search, full-text search, reranking, or framework-managed indexes.
Ingestion utilities are included only when they are part of a broader RAG product. Tools that only parse PDFs, crawl websites, or convert files are useful inputs, but they are not enough for this page's main recommendation set.
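The hybrid search mentioned in the FAQ can be illustrated with reciprocal rank fusion (RRF), one common way to merge a full-text ranking with a vector ranking. This is a sketch under stated assumptions, not how any specific tool on this page works. The function name `reciprocal_rank_fusion`, the document ids, and the two input rankings are hypothetical, and `k=60` is a conventional smoothing constant rather than a required value.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document ids into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in
    (rank starts at 1); scores are summed and sorted in descending order.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: scores[doc_id], reverse=True)


# Hypothetical results from two separate retrievers for the same query.
keyword_hits = ["policy.md", "faq.md", "notes.txt"]  # e.g. full-text search
vector_hits = ["faq.md", "notes.txt", "policy.md"]   # e.g. embedding search

if __name__ == "__main__":
    print(reciprocal_rank_fusion([keyword_hits, vector_hits]))
```

Because RRF uses only rank positions, it avoids having to normalize keyword scores and similarity scores onto a common scale. A reranking step, if used, would then reorder the top few fused results.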