
from graphmemory149
Embedded GraphRAG: DuckDB-backed hybrid graph + vector store for extracting, storing, deduping, and querying entity/relationship graphs with BM25, HNSW vectors,
GraphMemory provides an embedded GraphRAG engine built on DuckDB that combines vector search (HNSW), full-text (BM25), and hybrid retrieval with a fluent query builder and multi-hop traversal. It adds extraction tools (DSPy) to convert text into nodes and edges, fuzzy deduplication and merge strategies, NetworkX-based graph algorithms (PageRank, centrality), and a lightweight D3.js visualizer. The library is single-package and file-backed, intended for local or prototype deployments where a serverless, single-writer graph is desirable.
Use GraphMemory when your retrieval needs go beyond pure vectors or pure graph databases — e.g., RAG workflows that require traversals across entities, or when you want to extract structured knowledge (entities & relations) from documents and keep it deduplicated and queryable. It is suited for prototyping, small-to-medium datasets, and applications where embedding model vector lengths are fixed. Avoid when you need distributed multi-writer graph databases at massive scale (hundreds of millions of nodes).
examples/ covering extraction and ingest patterns.ask() patterns, DSPy extraction usage, two-pass dedup best practices, and gotchas for production use.Likely compatible with agents that can run Python tooling and call LLMs for extraction (Copilot/Codex/Gemini CLI style workflows).
This skill has not been reviewed by our automated audit pipeline yet.