🔥 HKUDS/LightRAG
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
🎉 News
- [2026.03] 🎯 [New Feature]: Integrated OpenSearch as a unified storage backend, providing comprehensive support for all four LightRAG storage types.
- [2026.03] 🎯 [New Feature]: Introduced a setup wizard, with support for local deployment of embedding, reranking, and storage backends via Docker.
- [2025.11] 🎯 [New Feature]: Integrated RAGAS for evaluation and Langfuse for tracing. Updated the API to return retrieved contexts alongside query results to support context-precision metrics.
- [2025.10] 🎯 [Scalability Enhancement]: Eliminated processing bottlenecks to support large-scale datasets efficiently.
- [2025.09] 🎯 [New Feature]: Enhanced knowledge graph extraction accuracy for open-source LLMs such as Qwen3-30B-A3B.
- [2025.08] 🎯 [New Feature]: Reranker models are now supported, significantly boosting performance for mix queries (set as the default query mode).
- [2025.08] 🎯 [New Feature]: Added document deletion with automatic KG regeneration to ensure optimal query performance.
- [2025.06] 🎯 [New Release]: Our team has released RAG-Anything, an all-in-one multimodal RAG system for seamless processing of text, images, tables, and equations.
- [2025.06] 🎯 [New Feature]: LightRAG now supports comprehensive multimodal data handling through RAG-Anything integration, enabling seamless document parsing and RAG capabilities across diverse formats including PDFs, images, Office documents, tables, and formulas. Please refer to the multimodal section for details.
- [2025.03] 🎯 [New Feature]: LightRAG now supports citation functionality, enabling proper source attribution and enhanced document traceability.
- [2025.02] 🎯 [New Feature]: You can now use MongoDB as an all-in-one storage solution for unified data management.
- [2025.02] 🎯 [New Release]: Our team has released VideoRAG, a RAG system for understanding extremely long-context videos.
- [2025.01] 🎯 [New Release]: Our team has released MiniRAG, making RAG simpler with small models.
- [2025.01] 🎯 You can now use PostgreSQL as an all-in-one storage solution for data management.
- [2024.11] 🎯 [New Resource]: A comprehensive guide to LightRAG is now available on LearnOpenCV; explore in-depth tutorials and best practices. Many thanks to the blog author for this excellent contribution!
- [2024.11] 🎯 [New Feature]: Introducing the LightRAG WebUI, an interface that allows you to insert, query, and visualize LightRAG knowledge through an intuitive web-based dashboard.
- [2024.11] 🎯 [New Feature]: You can now use Neo4j for storage, enabling graph database support.
- [2024.10] 🎯 [New Feature]: We've added a link to a LightRAG introduction video, a walkthrough of LightRAG's capabilities. Thanks to the author for this excellent contribution!
- [2024.10] 🎯 [New Channel]: We have created a Discord channel! 💬 Welcome to join our community for sharing, discussions, and collaboration! 🎉
Algorithm Flowchart
Figure 1: LightRAG Indexing Flowchart (Source)
Figure 2: LightRAG Retrieval and Querying Flowchart (Source)
Installation
💡 Using uv for Package Management: This project uses uv for fast and reliable Python package management. Install uv first: curl -LsSf https://astral.sh/uv/install.sh | sh (Unix/macOS) or powershell -c "irm https://astral.sh/uv/install.ps1 | iex" (Windows)
Note: You can also use pip if you prefer, but uv is recommended for better performance and more reliable dependency management.
📦 Offline Deployment: For offline or air-gapped environments, see the Offline Deployment Guide for instructions on pre-installing all dependencies and cache files.
Install LightRAG Server
The LightRAG Server is designed to provide Web UI and API support. The Web UI facilitates document indexing, knowledge graph exploration, and a simple RAG query interface. The LightRAG Server also provides an Ollama-compatible interface, aiming to emulate LightRAG as an Ollama chat model. This allows AI chat bots, such as Open WebUI, to access LightRAG easily.
- Install from PyPI
Install the LightRAG Server as a tool using uv (recommended):

```shell
uv tool install "lightrag-hku[api]"
```

Or using pip:

```shell
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install "lightrag-hku[api]"
```
Build front-end artifacts:

```shell
cd lightrag_webui
bun install --frozen-lockfile
bun run build
cd ..
```

Set up the env file. Obtain `env.example` by downloading it from the GitHub repository root, or by copying it from a local source checkout:

```shell
cp env.example .env  # Update the .env with your LLM and embedding configurations
```

Launch the server:

```shell
lightrag-server
```
- Installation from Source
```shell
git clone https://github.com/HKUDS/LightRAG.git
cd LightRAG
```

Bootstrap the development environment (recommended):

```shell
make dev
source .venv/bin/activate  # Activate the virtual environment (Linux/macOS)
# Or on Windows: .venv\Scripts\activate
```

`make dev` installs the test toolchain plus the full offline stack (API, storage backends, and provider integrations), then builds the frontend. Run `make env-base` or copy `env.example` to `.env` before starting the server.

Equivalent manual steps with uv (note: `uv sync` automatically creates a virtual environment in `.venv/`):

```shell
uv sync --extra test --extra offline
source .venv/bin/activate  # Activate the virtual environment (Linux/macOS)
# Or on Windows: .venv\Scripts\activate
```

Or using pip with a virtual environment:

```shell
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install -e ".[test,offline]"
```

Build front-end artifacts:

```shell
cd lightrag_webui
bun install --frozen-lockfile
bun run build
cd ..
```

Set up the env file:

```shell
make env-base  # Or: cp env.example .env and update it manually
```

Launch the API/WebUI server:

```shell
lightrag-server
```
- Launching the LightRAG Server with Docker Compose
```shell
git clone https://github.com/HKUDS/LightRAG.git
cd LightRAG
cp env.example .env  # Update the .env with your LLM and embedding configurations
docker compose up
```
Historical versions of LightRAG docker images can be found here: LightRAG Docker Images
Create .env File With Setup Tool
Instead of editing env.example by hand, use the interactive setup wizard to generate a configured .env and, when needed, docker-compose.final.yml:
```shell
make env-base            # Required first step: LLM, embedding, reranker
make env-storage         # Optional: storage backends and database services
make env-server          # Optional: server port, auth, and SSL
make env-base-rewrite    # Optional: force-regenerate wizard-managed compose services
make env-storage-rewrite # Optional: force-regenerate wizard-managed compose services
make env-security-check  # Optional: audit the current .env for security risks
```

For a full description of every target, see docs/InteractiveSetup.md. The setup wizards update configuration only; run `make env-security-check` separately to audit the current `.env` for security risks before deployment. By default, rerunning the setup preserves unchanged wizard-managed compose service blocks; use a `-rewrite` target only when you need to rebuild those managed blocks from the bundled templates.
Install LightRAG Core
- Install from source (Recommended)
```shell
git clone https://github.com/HKUDS/LightRAG.git
cd LightRAG
# Note: uv sync automatically creates a virtual environment in .venv/
uv sync
source .venv/bin/activate  # Activate the virtual environment (Linux/macOS)
# Or on Windows: .venv\Scripts\activate
# Or: pip install -e .
```
- Install from PyPI
```shell
uv pip install lightrag-hku
# Or: pip install lightrag-hku
```
Quick Start
LLM and Technology Stack Requirements for LightRAG
LightRAG's demands on the capabilities of Large Language Models (LLMs) are significantly higher than those of traditional RAG, as it requires the LLM to perform entity-relationship extraction tasks from documents. Configuring appropriate Embedding and Reranker models is also crucial for improving query performance.
- LLM Selection:
We recommend an LLM with at least 32 billion parameters and a context length of at least 32KB, with 64KB recommended. Reasoning models are not recommended during the document indexing stage. During the query stage, choosing a model with stronger capabilities than the one used for indexing can achieve better query results.
- Embedding Model:
A high-performance Embedding model is essential for RAG. We recommend using mainstream multilingual Embedding models, such as: BAAI/bge-m3 and text-embedding-3-large. Important Note: The Embedding model must be determined before document indexing, and the same model must be used during the document query phase. For certain storage solutions (e.g., PostgreSQL), the vector dimension must be defined upon initial table creation. Therefore, when changing embedding models, it is necessary to delete the existing vector-related tables and allow LightRAG to recreate them with the new dimensions.
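The dimension constraint above can be made concrete with a small guard. This is an illustrative sketch, not part of the LightRAG API; the function name and the source of `stored_dim` (e.g., read from your vector store's metadata) are assumptions:

```python
def check_embedding_dim(stored_dim: int, model_dim: int) -> None:
    """Fail fast when the configured embedding model no longer matches
    the dimension the vector tables were created with."""
    if stored_dim != model_dim:
        raise ValueError(
            f"Vector store was created with dim={stored_dim}, but the current "
            f"embedding model outputs dim={model_dim}. Drop the vector tables "
            "and re-index with the new model."
        )

# For example, bge-m3 outputs 1024-dim vectors, while
# text-embedding-3-large outputs 3072-dim vectors, so swapping
# one for the other without re-indexing would fail this check.
check_embedding_dim(stored_dim=1024, model_dim=1024)  # same model: OK
```

Running a check like this before querying turns a confusing runtime error deep in the storage layer into an actionable message.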
- Reranker Model Configuration:
Configuring a Reranker model can significantly enhance LightRAG's retrieval performance. When a Reranker model is enabled, it is recommended to set the "mix mode" as the default query mode. We recommend using mainstream Reranker models, such as: BAAI/bge-reranker-v2-m3 or models provided by services like Jina.
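To illustrate what the reranker stage contributes in mix mode, here is a minimal sketch of the rerank-then-truncate step. The scoring function below is a toy stand-in; in a real deployment the score would come from a cross-encoder such as bge-reranker-v2-m3:

```python
def rerank(query: str, chunks: list[str], score_fn, top_k: int = 5) -> list[str]:
    """Re-order retrieved chunks by a query-conditioned relevance score
    and keep only the top_k best, which then go into the LLM context."""
    scored = [(score_fn(query, chunk), chunk) for chunk in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:top_k]]

def overlap_score(query: str, chunk: str) -> int:
    """Toy relevance score: word overlap (stand-in for a cross-encoder)."""
    return len(set(query.lower().split()) & set(chunk.lower().split()))

# Candidates from both vector and graph retrieval are merged, then reranked:
best = rerank("graph storage", ["alpha beta", "graph storage backend"], overlap_score, top_k=1)
```

The key design point is that reranking happens after candidates from the vector and graph retrieval paths are merged, which is why it helps mix queries most.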
Quick Start for LightRAG Server
The LightRAG Server is designed to provide Web UI and API support. The LightRAG Server offers a comprehensive knowledge graph visualization feature. It supports various gravity layouts, node queries, subgraph filtering, and more. For more information about LightRAG Server, please refer to LightRAG Server.
Quick Start for LightRAG core
To get started with LightRAG core, refer to the sample code available in the examples folder. A video demo is also provided to guide you through the local setup process. If you already possess an OpenAI API key, you can run the demo right away:
```shell
# Run the demo code from the project folder
cd LightRAG

# Provide your OpenAI API key
export OPENAI_API_KEY="sk-...your_openai_key..."

# Download the demo document, "A Christmas Carol" by Charles Dickens
curl https://raw.githubusercontent.com/gusye1234/nano-graphrag/main/tests/mock_data.txt > ./book.txt

# Run the demo code
python examples/lightrag_openai_demo.py
```
For a streaming response implementation example, please see examples/lightrag_openai_compatible_demo.py. Prior to execution, ensure you modify the sample code's LLM and embedding configurations accordingly.
Note 1: When running the demo program, please be aware that different test scripts may use different embedding models. If you switch to a different embedding model, you must clear the data directory (./dickens); otherwise, the program may encounter errors. If you wish to retain the LLM cache, you can preserve the kv_store_llm_response_cache.json file while clearing the data directory.
Note 2: Only lightrag_openai_demo.py and lightrag_openai_compatible_demo.py are officially supported sample codes. Other sample files are community contributions that haven't undergone full testing and optimization.
Programming with LightRAG Core
For the complete Core API reference — including init parameters, QueryParam, LLM/embedding provider examples (OpenAI, Ollama, Azure, Gemini, HuggingFace, LlamaIndex), reranker injection, insert operations, entity/relation management, and delete/merge — see docs/ProgramingWithCore.md.
⚠️ If you would like to integrate LightRAG into your project, we recommend utilizing the REST API provided by the LightRAG Server. LightRAG Core is typically intended for embedded applications or for researchers who wish to conduct studies and evaluations.
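As a sketch of what calling the server from your own code looks like, the snippet below builds a query request using only the standard library. The `/query` path, the `query`/`mode` field names, and the default port 9621 are assumptions based on a typical LightRAG Server deployment; confirm them against your server's generated API documentation before use:

```python
import json
from urllib import request

def build_query_request(base_url: str, query: str, mode: str = "mix") -> request.Request:
    """Prepare (but do not send) a POST to the LightRAG Server query endpoint.

    Endpoint path and payload fields are assumptions; check your server's
    API docs page for the authoritative schema.
    """
    payload = json.dumps({"query": query, "mode": mode}).encode("utf-8")
    return request.Request(
        f"{base_url.rstrip('/')}/query",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Usage (against a running server):
# req = build_query_request("http://localhost:9621", "What does Scrooge learn?")
# with request.urlopen(req) as resp:
#     print(json.loads(resp.read()))
```

Going through the REST API keeps your application decoupled from LightRAG's internal storage and model configuration.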
Advanced Features
LightRAG provides additional capabilities including token usage tracking, knowledge graph data export, LLM cache management, Langfuse observability integration, and RAGAS-based evaluation. See docs/AdvancedFeatures.md.
Multimodal Document Processing (RAG-Anything Integration)
LightRAG integrates with RAG-Anything for end-to-end multimodal RAG across PDFs, Office documents, images, tables, and formulas. For setup and usage examples, see docs/AdvancedFeatures.md.
LightRAG Server will soon integrate RAG-Anything’s multimodal processing capabilities into its file processing pipeline. Stay tuned.
Replicating Findings in the Paper
LightRAG consistently outperforms NaiveRAG, RQ-RAG, HyDE, and GraphRAG across agriculture, computer science, legal, and mixed domains. For the full evaluation methodology, prompts, and reproduction steps, see docs/Reproduce.md.
Overall Performance Table
Each cell shows head-to-head win rates as baseline % / LightRAG %.

NaiveRAG vs LightRAG

| Metric | Agriculture | CS | Legal | Mix |
|---|---|---|---|---|
| Comprehensiveness | 32.4% / 67.6% | 38.4% / 61.6% | 16.4% / 83.6% | 38.8% / 61.2% |
| Diversity | 23.6% / 76.4% | 38.0% / 62.0% | 13.6% / 86.4% | 32.4% / 67.6% |
| Empowerment | 32.4% / 67.6% | 38.8% / 61.2% | 16.4% / 83.6% | 42.8% / 57.2% |
| Overall | 32.4% / 67.6% | 38.8% / 61.2% | 15.2% / 84.8% | 40.0% / 60.0% |

RQ-RAG vs LightRAG

| Metric | Agriculture | CS | Legal | Mix |
|---|---|---|---|---|
| Comprehensiveness | 31.6% / 68.4% | 38.8% / 61.2% | 15.2% / 84.8% | 39.2% / 60.8% |
| Diversity | 29.2% / 70.8% | 39.2% / 60.8% | 11.6% / 88.4% | 30.8% / 69.2% |
| Empowerment | 31.6% / 68.4% | 36.4% / 63.6% | 15.2% / 84.8% | 42.4% / 57.6% |
| Overall | 32.4% / 67.6% | 38.0% / 62.0% | 14.4% / 85.6% | 40.0% / 60.0% |

HyDE vs LightRAG

| Metric | Agriculture | CS | Legal | Mix |
|---|---|---|---|---|
| Comprehensiveness | 26.0% / 74.0% | 41.6% / 58.4% | 26.8% / 73.2% | 40.4% / 59.6% |
| Diversity | 24.0% / 76.0% | 38.8% / 61.2% | 20.0% / 80.0% | 32.4% / 67.6% |
| Empowerment | 25.2% / 74.8% | 40.8% / 59.2% | 26.0% / 74.0% | 46.0% / 54.0% |
| Overall | 24.8% / 75.2% | 41.6% / 58.4% | 26.4% / 73.6% | 42.4% / 57.6% |

GraphRAG vs LightRAG

| Metric | Agriculture | CS | Legal | Mix |
|---|---|---|---|---|
| Comprehensiveness | 45.6% / 54.4% | 48.4% / 51.6% | 48.4% / 51.6% | 50.4% / 49.6% |
| Diversity | 22.8% / 77.2% | 40.8% / 59.2% | 26.4% / 73.6% | 36.0% / 64.0% |
| Empowerment | 41.2% / 58.8% | 45.2% / 54.8% | 43.6% / 56.4% | 50.8% / 49.2% |
| Overall | 45.2% / 54.8% | 48.0% / 52.0% | 47.2% / 52.8% | 50.4% / 49.6% |
🔗 Related Projects
Ecosystem & Extensions
⭐ Star History
🤝 Contribution
We welcome contributions of all kinds — bug fixes, new features, documentation improvements, and more. Please read our Contributing Guide before submitting a pull request.
We thank all our contributors for their valuable contributions.
📖 Citation
```bibtex
@article{guo2024lightrag,
  title={LightRAG: Simple and Fast Retrieval-Augmented Generation},
  author={Zirui Guo and Lianghao Xia and Yanhua Yu and Tu Ao and Chao Huang},
  year={2024},
  eprint={2410.05779},
  archivePrefix={arXiv},
  primaryClass={cs.IR}
}
```

⭐ Thank you for visiting LightRAG! ⭐