
opendatalab/MinerU
opendatalab
61.2k
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
extract-datalayout-analysisocr

deepset-ai/haystack
deepset-ai
25k
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
nlpquestion-answeringpytorch
A system for agentic LLM-powered data processing and ETL

langgenius/dify
langgenius
139.2k
Production-ready platform for agentic workflow development.
21.8k|TypeScript
NOASSERTIONaigptllm

infiniflow/ragflow
infiniflow
79k
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

bytedance/deer-flow
bytedance
63.8k
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.
agentagenticagentic-framework