yichuan-w/LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

10,564 Stars · 928 Forks · 74 Watchers · 60 Open Issues

Python·MIT License·Last commit Apr 4, 2026·by @yichuan-w·Published April 5, 2026·Analyzed 3d ago
Safety Rating A

No security concerns identified. The repository is a legitimate open-source academic project from Berkeley Sky Computing Lab with a published arXiv paper. No hardcoded secrets, malicious code patterns, suspicious dependencies, or prompt injection attempts were detected. The README explicitly emphasizes privacy and local-only data processing, and the project encourages use of local LLM backends for maximum privacy. The installation instructions reference standard, well-known dependencies (faiss, langchain, llama-index, ollama, etc.).

AI-assisted review, not a professional security audit.

AI Analysis

LEANN is a storage-efficient vector index and RAG framework that uses graph-based selective recomputation with high-degree preserving pruning to achieve up to 97% storage savings compared to traditional vector databases like FAISS. Instead of storing all embeddings, it stores a pruned graph structure and recomputes embeddings on demand during search. It supports indexing personal data from diverse sources (documents, emails, browser history, WeChat, iMessage, ChatGPT/Claude exports, Slack, Twitter), runs fully locally for privacy, integrates with LLM backends (Ollama, OpenAI, HuggingFace, Anthropic), and includes native MCP server support for Claude Code integration. The project is backed by a peer-reviewed paper from Berkeley Sky Computing Lab.
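The core idea described above — storing only a pruned neighbor graph plus raw text, and recomputing embeddings on demand during graph traversal — can be illustrated with a minimal sketch. This is not LEANN's actual API; the toy `embed` function and the greedy best-first search are stand-ins chosen only to show the storage/compute trade-off.

```python
# Illustrative sketch (NOT LEANN's real implementation): best-first search over
# a pruned proximity graph, recomputing embeddings on demand instead of
# storing one vector per chunk.
import heapq
import math


def embed(text):
    # Toy embedding: normalized character-frequency vector, standing in for a
    # real embedding model that would be invoked during search.
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - ord('a')] += 1.0
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def cosine(a, b):
    return sum(x * y for x, y in zip(a, b))


def search(query, chunks, graph, entry, k=2):
    """Greedy best-first search over a pruned neighbor graph.

    Only `graph` (adjacency lists) and `chunks` (raw text) are stored;
    embeddings are recomputed for the nodes the search actually visits,
    trading extra compute at query time for much smaller storage.
    """
    q = embed(query)
    visited = {entry}
    # Max-heap via negated similarity.
    frontier = [(-cosine(q, embed(chunks[entry])), entry)]
    results = []
    while frontier:
        neg_sim, node = heapq.heappop(frontier)
        results.append((-neg_sim, node))
        for nb in graph[node]:
            if nb not in visited:
                visited.add(nb)
                heapq.heappush(frontier, (-cosine(q, embed(chunks[nb])), nb))
    results.sort(reverse=True)
    return [chunks[i] for _, i in results[:k]]


chunks = ["apples and pears", "graph pruning", "vector search", "fruit salad recipes"]
graph = {0: [1, 3], 1: [0, 2], 2: [1], 3: [0]}
print(search("fresh fruit", chunks, graph, entry=0, k=2))
```

In the real system, high-degree preserving pruning keeps well-connected "hub" nodes so that a search starting from any entry point can still reach most of the graph with few hops, which bounds how many embeddings must be recomputed per query.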

Use Cases

  • Building local, privacy-preserving RAG systems on personal devices with minimal storage overhead
  • Semantic search across personal data sources such as emails, browser history, chat history, and documents
  • Indexing and querying large document corpora (up to 60M chunks) on consumer hardware
  • Integrating as a semantic search MCP service with Claude Code for codebase retrieval
  • Replacing heavyweight vector databases (FAISS, etc.) in memory-constrained or on-device deployments
  • Multimodal PDF retrieval using ColQwen/ColPali vision-language models
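The privacy-preserving RAG use cases above all follow the same retrieve-then-prompt loop. A minimal, hypothetical sketch of that loop (the `retrieve` and `build_prompt` helpers are illustrative stand-ins, not LEANN functions; in practice the retriever would be a LEANN index and the prompt would go to a local backend such as Ollama):

```python
# Hypothetical local-first RAG loop. `retrieve` is a naive keyword-overlap
# stand-in for vector search; no data leaves the machine.
def retrieve(query, corpus, k=2):
    # Score each chunk by word overlap with the query (stand-in for
    # embedding similarity), highest first.
    q = set(query.lower().split())
    scored = sorted(corpus, key=lambda c: -len(q & set(c.lower().split())))
    return scored[:k]


def build_prompt(query, passages):
    # Ground the model by restricting it to retrieved context.
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


corpus = [
    "LEANN recomputes embeddings during search.",
    "Pruned graphs keep only high-degree hubs.",
    "Bananas are yellow.",
]
prompt = build_prompt("How does LEANN save storage?",
                      retrieve("LEANN storage embeddings", corpus))
print(prompt)
```

The resulting prompt string is what would be sent to a locally running LLM, keeping both the indexed data and the query on-device.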

Tags

#rag #vector-database #embeddings #local-first #llm #memory #mcp #knowledge-graph #cli-tool #library #framework #self-hosted #privacy #data

Project Connections

Alternative to

HKUDS/LightRAG

Both are RAG frameworks with vector search capabilities; LightRAG combines vector search with knowledge graphs while LEANN focuses on storage-efficient graph-based recomputation for local/on-device use.

Alternative to

ruvnet/ruvector

Both are vector database solutions targeting AI/RAG workloads; ruvector is a feature-rich Rust-based platform while LEANN focuses specifically on extreme storage efficiency through embedding recomputation.

Complements

thedotmack/claude-mem

claude-mem provides persistent memory for Claude Code sessions using a hybrid vector DB backend; LEANN's MCP server integration could serve as a drop-in semantic search backend for such memory systems.

Complements

tirth8205/code-review-graph

code-review-graph provides structural code intelligence via MCP for AI coding assistants, while LEANN provides semantic search over codebases via MCP — the two could be combined for richer code retrieval.

Complements

BAI-LAB/MemoryOS

MemoryOS provides hierarchical memory management for AI agents using ChromaDB vector storage; LEANN could serve as a more storage-efficient vector backend for its long-term memory layer.