nano-graphrag Manual - Doramagic.ai

Doramagic Project Pack · Human Manual

nano-graphrag

A simple, easy-to-hack GraphRAG implementation

Project Overview & System Architecture

Related topics: Configuration, Query Modes & Prompt Customization, Storage Backends, LLM & Embedding Integrations

Section Related Pages

Continue reading this section for the full explanation and source context.

Project Overview & System Architecture

Purpose and Scope

nano-graphrag is a compact, hackable GraphRAG implementation that re-implements the core ideas from Microsoft's GraphRAG paper in roughly 1100 lines of code (excluding tests and prompts), as stated in readme.md. The project explicitly targets developers who found the official implementation difficult to read or extend, and it positions itself as a "smaller, faster, cleaner" alternative that preserves the central graph-augmented retrieval behaviour.

The library exposes two primary entry points — GraphRAG and QueryParam — both defined as Python dataclasses and re-exported from nano_graphrag/__init__.py. GraphRAG orchestrates the indexing pipeline and query answering, while QueryParam controls per-query behaviour such as retrieval mode and community budget.

High-Level Architecture

The system follows a pluggable, storage-agnostic pipeline. The GraphRAG class in nano_graphrag/graphrag.py wires together three abstract storage interfaces defined in nano_graphrag/base.py:

BaseKVStorage — key/json pairs (default: disk files)
BaseVectorStorage — embedding indices (default: nano-vectordb)
BaseGraphStorage — knowledge graph (default: networkx)

Each interface has built-in implementations and documented extension points, so users can swap Neo4j, Milvus, HNSWlib, or file-backed storage without modifying the pipeline.

flowchart LR
    A[Text Chunks] --> B[Entity & Relation Extraction<br/>via best_model_func]
    B --> C[Graph Storage<br/>networkx / neo4j]
    B --> D[Vector Storage<br/>nano-vectordb / hnswlib / milvus]
    C --> E[Community Detection<br/>& Report Summarization]
    E --> F[KV Storage<br/>community reports cache]
    Q[User Query] --> G{Query Mode}
    G -->|local| H[Entity lookup → neighborhood]
    G -->|global| I[Top-K community reports]
    G -->|naive| J[Vector similarity over chunks]
    H --> K[cheap_model_func response]
    I --> K
    J --> K
    K --> R[Answer]

The diagram mirrors the indexing/query flow described in readme.md. During indexing, text chunks produced by nano_graphrag/_op.py are sent through entity extraction, graph and vector writes, and a community-report pass. During querying, the same orchestrator dispatches based on QueryParam.mode to local, global, or naive retrieval.

Two-Model LLM Strategy

A distinctive design choice documented in readme.md and implemented in nano_graphrag/_llm.py is the use of two LLM roles:

best_model_func (default gpt-4o) — used for planning tasks that demand quality: entity/relation extraction, JSON-constrained parsing, and final answer generation.
cheap_model_func (default gpt-4o-mini) — used for bulk summarisation, such as community-report generation.

Both functions share the same signature (async def my_llm_complete(prompt, system_prompt=None, history_messages=[], **kwargs) -> str) and may be overridden per GraphRAG instance. A convert_response_to_json_func hook lets users repair malformed JSON returned by open-source models, addressing a common stability problem noted in readme.md under the "Json Output" section.

Pluggable Components

The following table summarises the supported components and their sources, as listed in readme.md under the "Components" section. Community issue #2 explicitly tracks requests for additional storages (e.g. MongoDB) and LLMs (e.g. Ollama), most of which already ship as examples under examples/.

Type	Options	Status
LLM	OpenAI, Azure OpenAI, Amazon Bedrock, DeepSeek, Ollama	Built-in / examples
Embedding	OpenAI, Amazon Bedrock, sentence-transformers	Built-in / examples
Vector DB	nano-vectordb, hnswlib, milvus-lite, faiss	Built-in / examples
Graph Storage	networkx, neo4j	Built-in
Chunking	by token size, by text splitter	Built-in
Visualisation	graphml export	examples

Custom chunking functions are supported by passing chunk_func=chunking_by_seperators to GraphRAG, as shown in the using_custom_chunking_method.py example referenced in readme.md.

Query Modes and Prompts

QueryParam.mode selects among three retrieval strategies implemented in graphrag.py:

local — seed entities are resolved from the query, and the surrounding graph neighborhood is fed to the model. Governed by local_max_token and local_community_single_one.
global — uses the top-K most important community reports (controlled by global_max_consider_community, default 512). As noted in the "Issues" section of readme.md, this differs from Microsoft's map-reduce style that fills the context with all communities.
naive — a flat vector similarity search over chunks, enabled via GraphRAG(enable_naive_rag=True).

All prompt templates live in nano_graphrag/prompt.py inside the PROMPTS dictionary. The most important keys, per readme.md, are entity_extraction, community_report, local_rag_response, global_reduce_rag_response, and fail_response. Community issue #56 raised a question about repeated sections inside local_rag_response; the prompts are intentionally user-editable, so this can be modified by overriding the dictionary entry.

Async, Persistence, and Known Limitations

Every synchronous method on GraphRAG has an a-prefixed async counterpart (e.g. ainsert, aquery), as described in readme.md. Indexing state is persisted under working_dir; re-instantiating GraphRAG from the same directory reloads the graph, vectors, and community reports without recomputation.

Two functional gaps are explicitly acknowledged in the "Issues" section of readme.md:

nano-graphrag does not implement the covariates feature of the original GraphRAG.
The global search uses top-K communities rather than Microsoft's map-reduce across all communities.

Community issue #99 also tracks a feature request to add Microsoft's DRIFT search variant, which is not yet present. Installation pain on Windows (encoding errors from cp1252) is tracked in issue #125; users on Windows may need to set PYTHONIOENCODING=utf-8 before running pip install.

Configuration, Query Modes & Prompt Customization

Related topics: Project Overview & System Architecture, Storage Backends, LLM & Embedding Integrations

Section Related Pages

Continue reading this section for the full explanation and source context.

Section LLM and embedding slots

Continue reading this section for the full explanation and source context.

Section Storage and chunking slots

Continue reading this section for the full explanation and source context.

Section JSON output and repair hooks

Continue reading this section for the full explanation and source context.

Configuration, Query Modes & Prompt Customization

nano-graphrag exposes nearly all of its behavior through two Python @dataclasses — GraphRAG and QueryParam — plus a mutable prompt dictionary nano_graphrag.prompt.PROMPTS. This page documents how to wire those configuration surfaces together: the constructor parameters that select LLMs, embeddings, and storage backends; the QueryParam fields that choose between local, global, and naive retrieval; and the prompt templates that control entity extraction, community summarization, and answer generation.

Configuration Surface: `GraphRAG` and `QueryParam`

Both GraphRAG and QueryParam are dataclasses, so every option is type-hinted and discoverable through help(...) (see the README "Available Parameters" section). GraphRAG is the long-lived object that owns the working directory, storage handles, and the insert/query pipeline; QueryParam is a lightweight per-call configuration passed to query(...) or aquery(...) (readme.md).

LLM and embedding slots

nano-graphrag requires two LLM tiers by design: a "great" model for planning and response generation, and a "cheap" model for summarization. The defaults are gpt-4o and gpt-4o-mini respectively, implemented in nano_graphrag/_llm.py. You replace them with GraphRAG(best_model_func=..., cheap_model_func=...), together with best_model_max_token_size, best_model_max_async, and the analogous cheap_model_* knobs. The reference signature that any custom LLM must follow is:

async def my_llm_complete(prompt, system_prompt=None, history_messages=[], **kwargs) -> str:
    hashing_kv: BaseKVStorage = kwargs.pop("hashing_kv", None)  # optional LLM cache
    response = await call_your_LLM(messages, **kwargs)
    return response

Cloud and local providers plug into this same slot: Azure OpenAI is enabled with GraphRAG(..., using_azure_openai=True, ...) (see .env.example.azure); Amazon Bedrock with using_amazon_bedrock=True plus best_model_id / cheap_model_id (examples/using_amazon_bedrock.py); ollama and DeepSeek via the examples in examples/using_ollama_as_llm.py and examples/using_deepseek_as_llm.py. Embeddings follow the same pattern through the EmbeddingFunc protocol decorated with wrap_embedding_func_with_attrs(embedding_dim=..., max_token_size=...) in nano_graphrag/_utils.py.

Storage and chunking slots

Storage is injected through three abstract base classes defined in nano_graphrag/base.py: BaseKVStorage (key→JSON), BaseVectorStorage (embeddings), and BaseGraphStorage (the knowledge graph). Defaults are disk JSON files, nano-vectordb, and networkx. Substitutes are wired in with key_string_value_json_storage_cls=, vector_db_storage_cls=, and graph_storage_cls=. The README lists hnswlib, milvus-lite, and faiss as vector alternatives, and Neo4j as a graph alternative with a dedicated tutorial at docs/use_neo4j_for_graphrag.md.

Chunking is a callable parameter, not a class. The default is token-size chunking; switching to the text-splitter implementation is one constructor argument:

from nano_graphrag._op import chunking_by_seperators
GraphRAG(..., chunk_func=chunking_by_seperators, ...)

User-defined chunkers follow the same pattern, demonstrated in examples/using_custom_chunking_method.py.

Query Modes

QueryParam.mode selects between three retrieval strategies. The first two — local and global — are the two GraphRAG query modes described in the original Microsoft paper; naive is a vector-only baseline that is only active when GraphRAG(..., enable_naive_rag=True, ...) is set at construction time.

flowchart TD
    Q[User query] --> P{QueryParam.mode}
    P -->|local| L[Entity-centric neighborhood search<br/>+ community reports]
    P -->|global| G[Top-K important communities<br/>via global_max_consider_community]
    P -->|naive| N[Vector similarity over chunks<br/>requires enable_naive_rag=True]
    L --> R[best_model_func generates answer]
    G --> R
    N --> R
    R --> Out[Final response<br/>or only_need_context output]

Key QueryParam fields worth knowing:

mode — "local", "global", or "naive".
only_need_context=True — returns just the assembled context (community reports for local; analyst reports for global) without calling the final answer model. This is the documented integration point for users who want to feed the context into their own prompt (readme.md).
global_max_consider_community — caps the number of top-importance communities that the global search fills into the prompt; the default is 512. The README explicitly notes that nano-graphrag's global search differs from Microsoft's reference implementation, which uses a map-reduce to fill *all* communities.

A frequently-requested fourth mode, DRIFT search (added by Microsoft after local and global), is tracked in community issue #99 and is not yet implemented. For now, QueryParam.mode is limited to the three values above.

Prompt Customization

All prompt templates live in a single mutable dictionary: nano_graphrag.prompt.PROMPTS (nano_graphrag/prompt.py). Mutating it in place is the supported way to override behavior — the README explicitly invites users to "play with it and replace any prompt inside." The most important keys are:

Key	Role in the pipeline
`entity_extraction`	Used to extract entities and relations from each text chunk during indexing.
`community_report`	Used to summarize each detected graph cluster.
`local_rag_response`	System prompt template for local-search answer generation.
`global_reduce_rag_response`	System prompt template for global-search answer generation.
`fail_response`	Fallback returned when no community/context is relevant to the query.

JSON output and repair hooks

nano-graphrag instructs best_model_func to emit JSON via OpenAI's "response_format": {"type": "json_object"} argument. Open-source models (e.g., ollama) often produce unstable JSON, so the library exposes convert_response_to_json_func to post-process the raw string into a dict. A common third-party choice is json_repair (readme.md).

A note on prompt repetition

Community issue #56 observed that the local_rag_response template repeats its Goal and Target response length and format sections. This repetition is present in the source and is intentional within the current template, but it is exactly the kind of friction the in-place PROMPTS dictionary is designed to fix — users can rewrite the template to a non-repeated form without forking the project.

Common Customization Patterns

Putting the three configuration surfaces together, the typical customization patterns are:

Swap the LLM and embedding only — leave storage and chunking at defaults. This is the path used by the DeepSeek, ollama, and Bedrock examples.
Swap storage to a managed service — keep the OpenAI defaults, but pass vector_db_storage_cls= or graph_storage_cls=Neo4jStorage. The Neo4j tutorial is the canonical walkthrough.
Turn the library into a context provider — call graph_func.query(..., param=QueryParam(mode="local", only_need_context=True)) and pipe the returned CSV-style report into your own downstream prompt.
Run a fully local stack with no API keys — combine ollama for the LLM, sentence-transformers for embeddings (examples/using_local_embedding_model.py), and an in-memory vector store, as shown in examples/no_openai_key_at_all.py.

Together, the GraphRAG constructor, the QueryParam per-call object, and the PROMPTS dictionary cover every documented customization surface in nano-graphrag.

Storage Backends, LLM & Embedding Integrations

Related topics: Project Overview & System Architecture, Configuration, Query Modes & Prompt Customization, Benchmarks, Roadmap & Known Limitations

Section Related Pages

Continue reading this section for the full explanation and source context.

Section KV Storage (BaseKVStorage)

Continue reading this section for the full explanation and source context.

Section Vector Database Storage (BaseVectorStorage)

Continue reading this section for the full explanation and source context.

Section Graph Database Storage (BaseGraphStorage)

Continue reading this section for the full explanation and source context.

Storage Backends, LLM & Embedding Integrations

Purpose and Scope

nano-graphrag is designed as a small, hackable GraphRAG implementation whose core value proposition is portability: every external dependency (LLM provider, embedding model, vector index, graph store, KV cache) is hidden behind a base class so it can be swapped at construction time without touching pipeline code. This page documents the extension points for those three categories of integrations and the defaults shipped with the library.

The high-level abstraction lives in nano_graphrag/base.py, which defines abstract BaseKVStorage, BaseVectorStorage, and BaseGraphStorage types; the _storage/__init__.py module wires the default concrete implementations into the GraphRAG dataclass declared in nano_graphrag/__init__.py. Source: nano_graphrag/base.py.

flowchart LR
    A[GraphRAG dataclass] -->|best_model_func| L[(LLM Provider)]
    A -->|cheap_model_func| L
    A -->|embedding_func| E[(Embedding Model)]
    A -->|key_string_value_json_storage_cls| KV[(KV Storage)]
    A -->|vector_db_storage_cls| VDB[(Vector DB)]
    A -->|graph_storage_cls| GDB[(Graph DB)]
    L -.async .-> A
    E -.async .-> A

Storage Backends

KV Storage (`BaseKVStorage`)

The default implementation persists JSON values to disk under working_dir and is implemented in nano_graphrag/_storage/kv_json.py (JsonKVStorage). It is used internally for LLM response caching, chunk deduplication, full-document storage, and community reports. The base interface declares async get_by_id, get_by_ids, filter_keys, upsert, drop, and is_empty methods that any backend must implement. Source: nano_graphrag/_storage/kv_json.py, nano_graphrag/base.py.

To swap in an alternative backend (for example MongoDB or Redis), pass it via GraphRAG(..., key_string_value_json_storage_cls=YOURS, ...). If every storage is replaced with a non-file backend, set always_create_working_dir=False to skip the directory-creation step.

Vector Database Storage (`BaseVectorStorage`)

Two built-in implementations ship with the library and both implement the same interface from nano_graphrag/base.py:

Backend	File	Notes
`nano-vectordb` (default)	`nano_graphrag/_storage/vdb_nanovectordb.py`	Pure-Python, in-memory + on-disk persistence under `working_dir`
`hnswlib`	`nano_graphrag/_storage/vdb_hnswlib.py`	Built-in C++ ANN index, recommended for larger corpora

Additional examples in the repository demonstrate milvus-lite and faiss. Any subclass of BaseVectorStorage implementing upsert, query, delete_entity, and is_empty can be injected through GraphRAG(..., vector_db_storage_cls=YOURS, ...). Sources: nano_graphrag/_storage/vdb_nanovectordb.py, nano_graphrag/_storage/vdb_hnswlib.py.

Graph Database Storage (`BaseGraphStorage`)

The default graph backend is NetworkXStorage in nano_graphrag/_storage/gdb_networkx.py, which exposes the abstract API: has_node, has_edge, get_node, upsert_node, upsert_edge, get_all_edges, get_neighbors, get_edge, delete_node, remove_nodes, remove_edges, get_nodes_by_chunk_ids, and community-detection helpers (clustering, community_schema). A Neo4jStorage implementation is also shipped and documented in docs/use_neo4j_for_graphrag.md. Source: nano_graphrag/_storage/gdb_networkx.py.

LLM Integrations

nano-graphrag distinguishes two LLM roles passed to GraphRAG:

best_model_func — used for entity extraction, community summarization planning, and final response generation.
cheap_model_func — used for cheap summarization and intermediate rewriting.

Built-in providers live in nano_graphrag/_llm.py and include gpt_4o_complete, gpt_4o_mini_complete, Azure OpenAI (gated behind using_azure_openai=True), and Amazon Bedrock (gated behind using_amazon_bedrock=True plus best_model_id / cheap_model_id). By default the great model is gpt-4o and the cheap model is gpt-4o-mini.

The async signature that all LLM functions must satisfy is:

async def my_llm_complete(
    prompt: str,
    system_prompt: str | None = None,
    history_messages: list = [],
    **kwargs,
) -> str:
    hashing_kv: BaseKVStorage = kwargs.pop("hashing_kv", None)  # for caching
    ...

Inject the function via GraphRAG(best_model_func=..., best_model_max_token_size=..., best_model_max_async=...) and similarly for the cheap model. Because the LLM call passes through the hashing_kv storage, swapping the KV backend also swaps the response cache. Sources: nano_graphrag/_llm.py, README.md.

For LLM providers that produce unstable JSON when using response_format={"type": "json_object"} (a common pain point raised in community discussion around open-source models), GraphRAG accepts convert_response_to_json_func=YOUR_STRING_TO_JSON_FUNC so callers can repair the output (e.g., via json_repair).

Embedding Function Integration

Embeddings are an EmbeddingFunc instance decorated with @wrap_embedding_func_with_attrs(embedding_dim=..., max_token_size=...) from nano_graphrag/_utils.py. The decorated function must be async and accept a list[str], returning a numpy.ndarray of shape (len(texts), embedding_dim). The default uses OpenAI's text-embedding-3-small (1536 dimensions, 8192 token ceiling). Source: nano_graphrag/_utils.py.

Replacement is performed via GraphRAG(embedding_func=your_embed_func, embedding_batch_num=..., embedding_func_max_async=...). The examples/using_local_embedding_model.py script demonstrates sentence-transformers as a fully offline backend. Because embedding_dim is captured by the decorator, changing the model automatically re-tunes the vector storage.

Community Notes and Customization

The repository's most-discussed feature request (issue #2, "Add more Storages and LLMs") explicitly tracks community contributions for MongoDB, ollama, and other backends; the storage/embedding/LLM sections above map directly to the three extension points that contributors need to subclass. The README reiterates that any backend implementation must conform to the base interfaces in nano_graphrag/base.py, and the project maintains a benchmark suite under examples/benchmarks for comparing components.

A known operational caveat from the Windows installation issue (#125) is the platform encoding: nano-graphrag reads and writes JSON files using UTF-8 by default, but Windows terminals default to cp1252. Setting PYTHONUTF8=1 (or running from a UTF-8 locale) before launching resolves the UnicodeDecodeError reported during installation and first run.

For visualization of the persisted graph, nano-graphrag can export the NetworkXStorage contents to GraphML; the graphml_visualize.py example was patched in v0.0.8 to handle long entity names that previously truncated the output.

Benchmarks, Roadmap & Known Limitations

Related topics: Project Overview & System Architecture, Configuration, Query Modes & Prompt Customization

Section Related Pages

Continue reading this section for the full explanation and source context.

Section 2.1 English & Chinese Benchmarks

Continue reading this section for the full explanation and source context.

Section 2.2 Multi-Hop RAG Evaluation

Continue reading this section for the full explanation and source context.

Section 2.3 Component-Level Comparisons

Continue reading this section for the full explanation and source context.

Benchmarks, Roadmap & Known Limitations

1. Overview

nano-graphrag positions itself as a "simple, easy-to-hack GraphRAG implementation" — a smaller, faster, and cleaner alternative to Microsoft's official GraphRAG. The project is intentionally minimal: excluding tests and prompts, the entire library is about 1,100 lines of code (readme.md). The repository publishes benchmarks, a roadmap, and a transparent list of known limitations so users can evaluate fit-for-purpose before adopting it.

This page consolidates the publicly available benchmark results, planned enhancements, known constraints, and notable community-reported issues.

2. Benchmarks

The project publishes comparative benchmarks against the original GraphRAG implementation, plus a community-driven multi-hop evaluation.

2.1 English & Chinese Benchmarks

Two separate benchmark documents are maintained:

docs/benchmark-en.md — English-language benchmark
docs/benchmark-zh.md — Chinese-language benchmark

These document comparative performance and quality metrics between nano-graphrag and the reference GraphRAG implementation, allowing users to verify that the simplified codebase preserves core functionality. Source: readme.md section "## Benchmark".

2.2 Multi-Hop RAG Evaluation

A Jupyter notebook examples/benchmarks/eval_naive_graphrag_on_multi_hop.ipynb evaluates the naive RAG mode against the MultiHop-RAG dataset. The naive RAG mode can be enabled with:

graph_func = GraphRAG(working_dir="./dickens", enable_naive_rag=True)
print(rag.query("...", param=QueryParam(mode="naive")))

Source: readme.md section "## Naive RAG".

2.3 Component-Level Comparisons

The README points users to examples/benchmarks for component-level comparisons (LLMs, vector DBs, graph stores, chunking strategies). As stated in the README: *"Check examples/benchmarks to see few comparisons between components. Always welcome to contribute more components."* Source: readme.md section "## Components".

3. Roadmap

The canonical roadmap lives at docs/ROADMAP.md. Source: readme.md section "## Roadmap". The roadmap informs the project's forward direction and helps contributors prioritize work.

3.1 Recent Milestones (v0.0.8)

The v0.0.8 release incorporated community contributions (readme.md section "### Latest Release: v0.0.8"):

Change	Contributor	PR
Add Azure OpenAI as an option in `_llm.py`	@SliverBulle	#31
Add DSPy for entity extraction	@NumberChiffre	#27
Fix `graphml_visualize.py` for long data	@akai-shuuichi	related PR

3.2 Contribution Guidelines

Anyone can contribute; the project documents the workflow in docs/CONTRIBUTING.md. Source: readme.md section "## Contribute".

4. Known Limitations

The README transparently enumerates two functional limitations under its "Issues" section. Source: readme.md section "## Issues".

4.1 `covariates` Feature Not Implemented

nano-graphrag does not implement the covariates feature of GraphRAG. This is a deliberate omission noted in the README.

4.2 Global Search Differs from Original

nano-graphrag implements global search differently from the original. The reference implementation uses a map-reduce-like style to fill all communities into context. In contrast, nano-graphrag selects only the top-K most important and central communities. This is controlled via QueryParam.global_max_consider_community, which defaults to 512 communities. Source: readme.md section "## Issues".

4.3 Windows Encoding Issue

Community issue #125 reports an installation error on Windows machines, surfacing as cp1252.py decode errors. This indicates non-UTF-8 default encodings on some Windows environments can interfere with package operations.

4.4 Repetitive Prompt Content

Community issue #56 notes that PROMPTS["local_rag_response"] contains duplicated "Goal" and "Target response length and format" sections. Users should verify whether this is intentional before editing prompts. Source: readme.md section "Prompt" describes the relevant prompt variables.

4.5 Missing DRIFT Search

Community feature request #99 asks for Microsoft's DRIFT search (released after local and global search) to be ported to nano-graphrag.

4.6 Component Coverage

Community issue #2 ("Add more Storages and LLMs?") is the most-discussed enhancement request. The README's component table (Source: readme.md section "## Components") shows which backends are built-in versus example-only:

Type	Backend	Status
LLM	OpenAI / Amazon Bedrock	Built-in
LLM	DeepSeek / Ollama	examples/
Vector DB	nano-vectordb / hnswlib	Built-in
Vector DB	milvus-lite / faiss	examples/
Graph	networkx	Built-in
Graph	neo4j	Built-in (doc)

5. See Also

Project README — architecture, components, quick start
FAQ — frequently asked questions
Contributing Guide — how to contribute
Roadmap — forward-looking plans
English Benchmark, Chinese Benchmark, DSPy Entity Extraction Benchmark

Source: https://github.com/gusye1234/nano-graphrag / Human Manual

Doramagic Pitfall Log

Source-linked risks stay visible on the manual page so the preview does not read like a recommendation.

high Configuration risk requires verification

May increase setup, validation, or first-run risk for the user.

medium Installation risk requires verification

May increase setup, validation, or first-run risk for the user.

medium Installation risk requires verification

May increase setup, validation, or first-run risk for the user.

medium Configuration risk requires verification

May increase setup, validation, or first-run risk for the user.

Doramagic Pitfall Log

Found 13 structured pitfall item(s), including 1 high/blocking item(s). Top priority: Configuration risk - Configuration risk requires verification.

1. Configuration risk: Configuration risk requires verification

Severity: high
Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/75

2. Installation risk: Installation risk requires verification

Severity: medium
Finding: Project evidence flags a installation risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/163

3. Installation risk: Installation risk requires verification

Severity: medium
Finding: Project evidence flags a installation risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/166

4. Configuration risk: Configuration risk requires verification

Severity: medium
Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: capability.host_targets | https://github.com/gusye1234/nano-graphrag

5. Configuration risk: Configuration risk requires verification

Severity: medium
Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/167

6. Capability evidence risk: Capability evidence risk requires verification

Severity: medium
Finding: README/documentation is current enough for a first validation pass.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: capability.assumptions | https://github.com/gusye1234/nano-graphrag

7. Maintenance risk: Maintenance risk requires verification

Severity: medium
Finding: Project evidence flags a maintenance risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: evidence.maintainer_signals | https://github.com/gusye1234/nano-graphrag

8. Security or permission risk: Security or permission risk requires verification

Severity: medium
Finding: no_demo
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: downstream_validation.risk_items | https://github.com/gusye1234/nano-graphrag

9. Security or permission risk: Security or permission risk requires verification

Severity: medium
Finding: no_demo
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: risks.scoring_risks | https://github.com/gusye1234/nano-graphrag

10. Security or permission risk: Security or permission risk requires verification

Severity: medium
Finding: Project evidence flags a security or permission risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/125

11. Security or permission risk: Security or permission risk requires verification

Severity: medium
Finding: Project evidence flags a security or permission risk. Review the linked source before relying on this workflow.
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: community_evidence:github | https://github.com/gusye1234/nano-graphrag/issues/173

12. Maintenance risk: Maintenance risk requires verification

Severity: low
Finding: issue_or_pr_quality=unknown。
User impact: May increase setup, validation, or first-run risk for the user.
Recommended check: Reproduce the official install and quickstart path in an isolated environment.
Evidence: evidence.maintainer_signals | https://github.com/gusye1234/nano-graphrag

Source: Doramagic discovery, validation, and Project Pack records

Community Discussion Evidence

These external discussion links are review inputs, not standalone proof that the project is production-ready.

Sources 12

Count of project-level external discussion links exposed on this manual page.

Use Review before install

Open the linked issues or discussions before treating the pack as ready for your environment.

Community Discussion Evidence

Doramagic exposes project-level community discussion separately from official documentation. Review these links before using nano-graphrag with real data or production workflows.

do you have option in code not to use node2vec , but use just similarity - github / github_issue
[[Feature] Add ArcadeDB as graph storage backend](https://github.com/gusye1234/nano-graphrag/issues/173) - github / github_issue
Installation issue on Windows machine related to encoding - github / github_issue
"'charmap' codec can't decode" error encountered when installing on wind - github / github_issue
JSONDecodeError using no_openai_key_at_all.py - github / github_issue
Writing graph with 0 edges triggers leiden error - github / github_issue
Community source 7 - github / github_issue
INFO:openai._base_client:Retrying request to /chat/completions in X seco - github / github_issue
v0.0.8 - github / github_release
v0.0.7 - github / github_release
v0.0.6 - github / github_release
v0.0.5 - github / github_release

Source: Project Pack community evidence and pitfall evidence