# slopo - Doramagic AI Context Pack

> 定位：安装前体验与判断资产。它帮助宿主 AI 有一个好的开始，但不代表已经安装、执行或验证目标项目。

## 充分原则

- **充分原则，不是压缩原则**：AI Context Pack 应该充分到让宿主 AI 在开工前理解项目价值、能力边界、使用入口、风险和证据来源；它可以分层组织，但不以最短摘要为目标。
- **压缩策略**：只压缩噪声和重复内容，不压缩会影响判断和开工质量的上下文。

## 给宿主 AI 的使用方式

你正在读取 Doramagic 为 slopo 编译的 AI Context Pack。请把它当作开工前上下文：帮助用户理解适合谁、能做什么、如何开始、哪些必须安装后验证、风险在哪里。不要声称你已经安装、运行或执行了目标项目。

## Claim 消费规则

- **事实来源**：Repo Evidence + Claim/Evidence Graph；Human Wiki 只提供显著性、术语和叙事结构。
- **事实最低状态**：`supported`
- `supported`：可以作为项目事实使用，但回答中必须引用 claim_id 和证据路径。
- `weak`：只能作为低置信度线索，必须要求用户继续核实。
- `inferred`：只能用于风险提示或待确认问题，不能包装成项目事实。
- `unverified`：不得作为事实使用，应明确说证据不足。
- `contradicted`：必须展示冲突来源，不得替用户强行选择一个版本。

## 它最适合谁

- **想在安装前理解开源项目价值和边界的用户**：当前证据主要来自项目文档。 证据：`README.md` Claim：`clm_0002` supported 0.86

## 它能做什么

- **命令行启动或安装流程**（需要安装后验证）：项目文档中存在可执行命令，真实使用需要在本地或宿主环境中运行这些命令。 证据：`README.md` Claim：`clm_0001` supported 0.86

## 怎么开始

- `uv tool install slopo` 证据：`README.md` Claim：`clm_0003` supported 0.86

## 继续前判断卡

- **当前建议**：需要管理员/安全审批
- **为什么**：继续前可能涉及密钥、账号、外部服务或敏感上下文，建议先经过管理员或安全审批。

### 30 秒判断

- **现在怎么做**：需要管理员/安全审批
- **最小安全下一步**：先跑 Prompt Preview；若涉及凭证或企业环境，先审批再试装
- **先别相信**：真实输出质量不能在安装前相信。
- **继续会触碰**：命令执行、本地环境或项目文件、环境变量 / API Key

### 现在可以相信

- **适合人群线索：想在安装前理解开源项目价值和边界的用户**（supported）：有 supported claim 或项目证据支撑，但仍不等于真实安装效果。 证据：`README.md` Claim：`clm_0002` supported 0.86
- **能力存在：命令行启动或安装流程**（supported）：可以相信项目包含这类能力线索；是否适合你的具体任务仍要试用或安装后验证。 证据：`README.md` Claim：`clm_0001` supported 0.86
- **存在 Quick Start / 安装命令线索**（supported）：可以相信项目文档出现过启动或安装入口；不要因此直接在主力环境运行。 证据：`README.md` Claim：`clm_0003` supported 0.86

### 现在还不能相信

- **真实输出质量不能在安装前相信。**（unverified）：Prompt Preview 只能展示引导方式，不能证明真实项目中的结果质量。
- **宿主 AI 版本兼容性不能在安装前相信。**（unverified）：Claude、Cursor、Codex、Gemini 等宿主加载规则和版本差异必须在真实环境验证。
- **不会污染现有宿主 AI 行为，不能直接相信。**（inferred）：Skill、plugin、AGENTS/CLAUDE/GEMINI 指令可能改变宿主 AI 的默认行为。
- **可安全回滚不能默认相信。**（unverified）：除非项目明确提供卸载和恢复说明，否则必须先在隔离环境验证。
- **真实安装后是否与用户当前宿主 AI 版本兼容？**（unverified）：兼容性只能通过实际宿主环境验证。
- **项目输出质量是否满足用户具体任务？**（unverified）：安装前预览只能展示流程和边界，不能替代真实评测。
- **安装命令是否需要网络、权限或全局写入？**（unverified）：这影响企业环境和个人环境的安装风险。 证据：`README.md`

### 继续会触碰什么

- **命令执行**：包管理器、网络下载、本地插件目录、项目配置或用户主目录。 原因：运行第一条命令就可能产生环境改动；必须先判断是否值得跑。 证据：`README.md`
- **本地环境或项目文件**：安装结果、插件缓存、项目配置或本地依赖目录。 原因：安装前无法证明写入范围和回滚方式，需要隔离验证。 证据：`README.md`
- **环境变量 / API Key**：项目入口文档明确出现 API key、token、secret 或账号凭证配置。 原因：如果真实安装需要凭证，应先使用测试凭证并经过权限/合规判断。 证据：`README.md`, `src/slopo/config.py`
- **宿主 AI 上下文**：AI Context Pack、Prompt Preview、Skill 路由、风险规则和项目事实。 原因：导入上下文会影响宿主 AI 后续判断，必须避免把未验证项包装成事实。

### 最小安全下一步

- **先跑 Prompt Preview**：用安装前交互式试用判断工作方式是否匹配，不需要授权或改环境。（适用：任何项目都适用，尤其是输出质量未知时。）
- **只在隔离目录或测试账号试装**：避免安装命令污染主力宿主 AI、真实项目或用户主目录。（适用：存在命令执行、插件配置或本地写入线索时。）
- **不要使用真实生产凭证**：环境变量/API key 一旦进入宿主或工具链，可能产生账号和合规风险。（适用：出现 API、TOKEN、KEY、SECRET 等环境线索时。）
- **安装后只验证一个最小任务**：先验证加载、兼容、输出质量和回滚，再决定是否深用。（适用：准备从试用进入真实工作流时。）

### 退出方式

- **保留安装前状态**：记录原始宿主配置和项目状态，后续才能判断是否可恢复。
- **记录安装命令和写入路径**：没有明确卸载说明时，至少要知道哪些目录或配置需要手动清理。
- **准备撤销测试 API key 或 token**：测试凭证泄露或误用时，可以快速止损。
- **如果没有回滚路径，不进入主力环境**：不可回滚是继续前阻断项，不应靠信任或运气继续。

## 哪些只能预览

- 解释项目适合谁和能做什么
- 基于项目文档演示典型对话流程
- 帮助用户判断是否值得安装或继续研究

## 哪些必须安装后验证

- 真实安装 Skill、插件或 CLI
- 执行脚本、修改本地文件或访问外部服务
- 验证真实输出质量、性能和兼容性

## 边界与风险判断卡

- **把安装前预览误认为真实运行**：用户可能高估项目已经完成的配置、权限和兼容性验证。 处理方式：明确区分 prompt_preview_can_do 与 runtime_required。 Claim：`clm_0004` inferred 0.45
- **命令执行会修改本地环境**：安装命令可能写入用户主目录、宿主插件目录或项目配置。 处理方式：先在隔离环境或测试账号中运行。 证据：`README.md` Claim：`clm_0005` supported 0.86
- **待确认**：真实安装后是否与用户当前宿主 AI 版本兼容？。原因：兼容性只能通过实际宿主环境验证。
- **待确认**：项目输出质量是否满足用户具体任务？。原因：安装前预览只能展示流程和边界，不能替代真实评测。
- **待确认**：安装命令是否需要网络、权限或全局写入？。原因：这影响企业环境和个人环境的安装风险。

## 开工前工作上下文

### 加载顺序

- 先读取 how_to_use.host_ai_instruction，建立安装前判断资产的边界。
- 读取 claim_graph_summary，确认事实来自 Claim/Evidence Graph，而不是 Human Wiki 叙事。
- 再读取 intended_users、capabilities 和 quick_start_candidates，判断用户是否匹配。
- 需要执行具体任务时，优先查 role_skill_index，再查 evidence_index。
- 遇到真实安装、文件修改、网络访问、性能或兼容性问题时，转入 risk_card 和 boundaries.runtime_required。

### 任务路由

- **命令行启动或安装流程**：先说明这是安装后验证能力，再给出安装前检查清单。 边界：必须真实安装或运行后验证。 证据：`README.md` Claim：`clm_0001` supported 0.86

### 上下文规模

- 文件总数：59
- 重要文件覆盖：32/59
- 证据索引条目：32
- 角色 / Skill 条目：12

### 证据不足时的处理

- **missing_evidence**：说明证据不足，要求用户提供目标文件、README 段落或安装后验证记录；不要补全事实。
- **out_of_scope_request**：说明该任务超出当前 AI Context Pack 证据范围，并建议用户先查看 Human Manual 或真实安装后验证。
- **runtime_request**：给出安装前检查清单和命令来源，但不要替用户执行命令或声称已执行。
- **source_conflict**：同时展示冲突来源，标记为待核实，不要强行选择一个版本。

## Prompt Recipes

### 适配判断

- 目标：判断这个项目是否适合用户当前任务。
- 预期输出：适配结论、关键理由、证据引用、安装前可预览内容、必须安装后验证内容、下一步建议。

```text
请基于 slopo 的 AI Context Pack，先问我 3 个必要问题，然后判断它是否适合我的任务。回答必须包含：适合谁、能做什么、不能做什么、是否值得安装、证据来自哪里。所有项目事实必须引用 evidence_refs、source_paths 或 claim_id。
```

### 安装前体验

- 目标：让用户在安装前感受核心工作流，同时避免把预览包装成真实能力或营销承诺。
- 预期输出：一段带边界标签的体验剧本、安装后验证清单和谨慎建议；不含真实运行承诺或强营销表述。

```text
请把 slopo 当作安装前体验资产，而不是已安装工具或真实运行环境。

请严格输出四段：
1. 先问我 3 个必要问题。
2. 给出一段“体验剧本”：用 [安装前可预览]、[必须安装后验证]、[证据不足] 三种标签展示它可能如何引导工作流。
3. 给出安装后验证清单：列出哪些能力只有真实安装、真实宿主加载、真实项目运行后才能确认。
4. 给出谨慎建议：只能说“值得继续研究/试装”“先补充信息后再判断”或“不建议继续”，不得替项目背书。

硬性边界：
- 不要声称已经安装、运行、执行测试、修改文件或产生真实结果。
- 不要写“自动适配”“确保通过”“完美适配”“强烈建议安装”等承诺性表达。
- 如果描述安装后的工作方式，必须使用“如果安装成功且宿主正确加载 Skill，它可能会……”这种条件句。
- 体验剧本只能写成“示例台词/假设流程”：使用“可能会询问/可能会建议/可能会展示”，不要写“已写入、已生成、已通过、正在运行、正在生成”。
- Prompt Preview 不负责给安装命令；如用户准备试装，只能提示先阅读 Quick Start 和 Risk Card，并在隔离环境验证。
- 所有项目事实必须来自 supported claim、evidence_refs 或 source_paths；inferred/unverified 只能作风险或待确认项。

```

### 角色 / Skill 选择

- 目标：从项目里的角色或 Skill 中挑选最匹配的资产。
- 预期输出：候选角色或 Skill 列表，每项包含适用场景、证据路径、风险边界和是否需要安装后验证。

```text
请读取 role_skill_index，根据我的目标任务推荐 3-5 个最相关的角色或 Skill。每个推荐都要说明适用场景、可能输出、风险边界和 evidence_refs。
```

### 风险预检

- 目标：安装或引入前识别环境、权限、规则冲突和质量风险。
- 预期输出：环境、权限、依赖、许可、宿主冲突、质量风险和未知项的检查清单。

```text
请基于 risk_card、boundaries 和 quick_start_candidates，给我一份安装前风险预检清单。不要替我执行命令，只说明我应该检查什么、为什么检查、失败会有什么影响。
```

### 宿主 AI 开工指令

- 目标：把项目上下文转成一次对话开始前的宿主 AI 指令。
- 预期输出：一段边界明确、证据引用明确、适合复制给宿主 AI 的开工前指令。

```text
请基于 slopo 的 AI Context Pack，生成一段我可以粘贴给宿主 AI 的开工前指令。这段指令必须遵守 not_runtime=true，不能声称项目已经安装、运行或产生真实结果。
```

## 角色 / Skill 索引

- 共索引 12 个角色 / Skill / 项目文档条目。

- **Slopo**（project_doc）：! https://raw.githubusercontent.com/rafal-qa/slopo/refs/heads/main/doc/logo.png 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`README.md`
- **1 score 0.96-1.00**（project_doc）：- slopo/indexing/parsing/lang/csharp.py lines 108-112 - slopo/indexing/parsing/lang/go.py lines 104-108 - slopo/indexing/parsing/lang/java.py lines 90-94 - slopo/indexing/parsing/lang/javascript.py lines 97-101 - slopo/indexing/parsing/lang/kotlin.py lines 100-104 - slopo/indexing/parsing/lang/rust.py lines 90-94 - slopo/indexing/parsing/lang/typescript.py lines 97-101 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-01.md`
- **2 score 0.95-1.00**（project_doc）：- slopo/indexing/parsing/lang/kotlin.py lines 70-80 - slopo/indexing/parsing/lang/python.py lines 40-50 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-02.md`
- **3 score 1.00-1.00**（project_doc）：- slopo/indexing/parsing/lang/csharp.py lines 22-26 - slopo/indexing/parsing/lang/go.py lines 14-18 - slopo/indexing/parsing/lang/java.py lines 14-18 - slopo/indexing/parsing/lang/javascript.py lines 20-24 - slopo/indexing/parsing/lang/kotlin.py lines 14-18 - slopo/indexing/parsing/lang/python.py lines 12-16 - slopo/indexing/parsing/lang/rust.py lines 14-18 - slopo/indexing/parsing/lang/typescript.py lines 20-24 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-03.md`
- **4 score 0.94-1.00**（project_doc）：- slopo/indexing/parsing/lang/javascript.py lines 53-66 - slopo/indexing/parsing/lang/typescript.py lines 53-66 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-04.md`
- **5 score 0.96-1.00**（project_doc）：- slopo/indexing/parsing/lang/csharp.py lines 29-43 - slopo/indexing/parsing/lang/go.py lines 21-35 - slopo/indexing/parsing/lang/java.py lines 21-35 - slopo/indexing/parsing/lang/javascript.py lines 27-41 - slopo/indexing/parsing/lang/kotlin.py lines 21-35 - slopo/indexing/parsing/lang/rust.py lines 21-35 - slopo/indexing/parsing/lang/typescript.py lines 27-41 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-05.md`
- **6 score 0.93-1.00**（project_doc）：- slopo/indexing/parsing/lang/javascript.py lines 44-50 - slopo/indexing/parsing/lang/typescript.py lines 44-50 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-06.md`
- **7 score 0.98-1.00**（project_doc）：- slopo/indexing/parsing/lang/csharp.py lines 83-88 - slopo/indexing/parsing/lang/go.py lines 89-94 - slopo/indexing/parsing/lang/java.py lines 75-80 - slopo/indexing/parsing/lang/javascript.py lines 82-87 - slopo/indexing/parsing/lang/kotlin.py lines 83-88 - slopo/indexing/parsing/lang/rust.py lines 75-80 - slopo/indexing/parsing/lang/typescript.py lines 82-87 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-07.md`
- **8 score 0.99-1.00**（project_doc）：- slopo/indexing/parsing/lang/go.py lines 97-101 - slopo/indexing/parsing/lang/javascript.py lines 90-94 - slopo/indexing/parsing/lang/rust.py lines 83-87 - slopo/indexing/parsing/lang/typescript.py lines 90-94 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-08.md`
- **9 score 0.93-0.97**（project_doc）：- slopo/indexing/parsing/lang/python.py lines 61-65 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-09.md`
- **10 score 0.93-0.95**（project_doc）：- slopo/analysis/rerank.py lines 30-46 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/cluster-10.md`
- **Index**（project_doc）：Cluster Hash Score Code units Unique files ----------------------------- -------------- ----------- ------------ -------------- Cluster 1 cluster-01.md 344750d2372b 0.96-1.00 8 8 Cluster 2 cluster-02.md dcd405ffdfbc 0.95-1.00 8 8 Cluster 3 cluster-03.md e6c369338dc2 1.00-1.00 8 8 Cluster 4 cluster-04.md a3e5a5a93c89 0.94-1.00 7 7 Cluster 5 cluster-05.md 05d70662b75d 0.96-1.00 8 8 Cluster 6 cluster-06.md fc46752d2ce6… 激活提示：当用户需要理解项目结构、安装方式或边界时参考。 证据：`doc/example-report/index.md`

## 证据索引

- 共索引 32 条证据。

- **Slopo**（documentation）：! https://raw.githubusercontent.com/rafal-qa/slopo/refs/heads/main/doc/logo.png 证据：`README.md`
- **License**（source_file）：GNU AFFERO GENERAL PUBLIC LICENSE Version 3, 19 November 2007 证据：`LICENSE`
- **1 score 0.96-1.00**（documentation）：- slopo/indexing/parsing/lang/csharp.py lines 108-112 - slopo/indexing/parsing/lang/go.py lines 104-108 - slopo/indexing/parsing/lang/java.py lines 90-94 - slopo/indexing/parsing/lang/javascript.py lines 97-101 - slopo/indexing/parsing/lang/kotlin.py lines 100-104 - slopo/indexing/parsing/lang/rust.py lines 90-94 - slopo/indexing/parsing/lang/typescript.py lines 97-101 证据：`doc/example-report/cluster-01.md`
- **2 score 0.95-1.00**（documentation）：- slopo/indexing/parsing/lang/kotlin.py lines 70-80 - slopo/indexing/parsing/lang/python.py lines 40-50 证据：`doc/example-report/cluster-02.md`
- **3 score 1.00-1.00**（documentation）：- slopo/indexing/parsing/lang/csharp.py lines 22-26 - slopo/indexing/parsing/lang/go.py lines 14-18 - slopo/indexing/parsing/lang/java.py lines 14-18 - slopo/indexing/parsing/lang/javascript.py lines 20-24 - slopo/indexing/parsing/lang/kotlin.py lines 14-18 - slopo/indexing/parsing/lang/python.py lines 12-16 - slopo/indexing/parsing/lang/rust.py lines 14-18 - slopo/indexing/parsing/lang/typescript.py lines 20-24 证据：`doc/example-report/cluster-03.md`
- **4 score 0.94-1.00**（documentation）：- slopo/indexing/parsing/lang/javascript.py lines 53-66 - slopo/indexing/parsing/lang/typescript.py lines 53-66 证据：`doc/example-report/cluster-04.md`
- **5 score 0.96-1.00**（documentation）：- slopo/indexing/parsing/lang/csharp.py lines 29-43 - slopo/indexing/parsing/lang/go.py lines 21-35 - slopo/indexing/parsing/lang/java.py lines 21-35 - slopo/indexing/parsing/lang/javascript.py lines 27-41 - slopo/indexing/parsing/lang/kotlin.py lines 21-35 - slopo/indexing/parsing/lang/rust.py lines 21-35 - slopo/indexing/parsing/lang/typescript.py lines 27-41 证据：`doc/example-report/cluster-05.md`
- **6 score 0.93-1.00**（documentation）：- slopo/indexing/parsing/lang/javascript.py lines 44-50 - slopo/indexing/parsing/lang/typescript.py lines 44-50 证据：`doc/example-report/cluster-06.md`
- **7 score 0.98-1.00**（documentation）：- slopo/indexing/parsing/lang/csharp.py lines 83-88 - slopo/indexing/parsing/lang/go.py lines 89-94 - slopo/indexing/parsing/lang/java.py lines 75-80 - slopo/indexing/parsing/lang/javascript.py lines 82-87 - slopo/indexing/parsing/lang/kotlin.py lines 83-88 - slopo/indexing/parsing/lang/rust.py lines 75-80 - slopo/indexing/parsing/lang/typescript.py lines 82-87 证据：`doc/example-report/cluster-07.md`
- **8 score 0.99-1.00**（documentation）：- slopo/indexing/parsing/lang/go.py lines 97-101 - slopo/indexing/parsing/lang/javascript.py lines 90-94 - slopo/indexing/parsing/lang/rust.py lines 83-87 - slopo/indexing/parsing/lang/typescript.py lines 90-94 证据：`doc/example-report/cluster-08.md`
- **9 score 0.93-0.97**（documentation）：- slopo/indexing/parsing/lang/python.py lines 61-65 证据：`doc/example-report/cluster-09.md`
- **10 score 0.93-0.95**（documentation）：- slopo/analysis/rerank.py lines 30-46 证据：`doc/example-report/cluster-10.md`
- **Index**（documentation）：Cluster Hash Score Code units Unique files ----------------------------- -------------- ----------- ------------ -------------- Cluster 1 cluster-01.md 344750d2372b 0.96-1.00 8 8 Cluster 2 cluster-02.md dcd405ffdfbc 0.95-1.00 8 8 Cluster 3 cluster-03.md e6c369338dc2 1.00-1.00 8 8 Cluster 4 cluster-04.md a3e5a5a93c89 0.94-1.00 7 7 Cluster 5 cluster-05.md 05d70662b75d 0.96-1.00 8 8 Cluster 6 cluster-06.md fc46752d2ce6 0.93-1.00 7 7 Cluster 7 cluster-07.md 60849c6d52f1 0.98-1.00 8 8 Cluster 8 cluster-08.md e1c991a348b6 0.99-1.00 5 5 Cluster 9 cluster-09.md 38abfbd70dde 0.93-0.97 3 3 Cluster 10 cluster-10.md 6cbaa611475c 0.93-0.95 3 2 证据：`doc/example-report/index.md`
- **Boost**（source_file）：CROSS DIR MAX BOOST = 0.15 CROSS DIR MAX HOPS = 8 SAME FILE MAX BOOST = 0.1 SAME FILE STEP LINES = 250 SAME FILE MAX STEPS = 8 def cross dir hops: int - float def same file line distance: int - float ⋮---- steps = line distance // SAME FILE STEP LINES ⋮---- def distance boost distance: int, max distance: int, max boost: float - float ⋮---- capped = min distance, max distance 证据：`src/slopo/analysis/boost.py`
- **Clustering**（source_file）：def cluster pairs pairs: list SimilarPair - list set int ⋮---- groups: list set int = ⋮---- matching = g for g in groups if a in g or b in g ⋮---- merged: set int = set ⋮---- def sort cluster unit ids: set int , pairs: list SimilarPair - list int ⋮---- lookup: dict tuple int, int , float = {} ⋮---- def sim a: int, b: int - float sorted ids = sorted unit ids best pair = max path = best pair 0 , best pair 1 remaining = unit ids - {best pair 0 , best pair 1 } ⋮---- last = path -1 next unit = max remaining, key=lambda u: sim last, u ⋮---- result: list Cluster = ⋮---- members = set cluster.unit ids in cluster = ordered = sort cluster members, in cluster sims = p.similarity for p in in cluster mi… 证据：`src/slopo/analysis/clustering.py`
- **Command**（source_file）：BLOCK SIZE = 1000 ⋮---- embeddings = load embeddings conn ⋮---- pairs = find similar pairs embeddings, cfg.similarity threshold, BLOCK SIZE ⋮---- referenced ids = {uid for p in pairs for uid in p.unit id a, p.unit id b } units = load units conn, referenced ids pairs = exclude overlapping pairs pairs, units ⋮---- clusters = build clusters pairs reranked pairs = rerank all clusters clusters, pairs, units clusters = reorder clusters clusters, reranked pairs clusters = filter clusters clusters, cfg.rerank threshold ⋮---- ignored = load ignored cfg.ignore file ⋮---- kept = c for c in clusters if cluster hash c, units not in ignored ignored count = len clusters - len kept clusters = kept ⋮---- du… 证据：`src/slopo/analysis/command.py`
- **Dedup**（source_file）：folded: list Cluster = duplicates: dict int, list UnitRecord = {} ⋮---- kept ids: list int = first of hash: dict str, int = {} ⋮---- body hash = units unit id .body hash primary = first of hash.get body hash 证据：`src/slopo/analysis/dedup.py`
- **Ignore**（source_file）：HASH LENGTH = 12 HEADER = """\ def cluster hash cluster: Cluster, units: dict int, UnitRecord - str ⋮---- pairs = sorted canonical = "\n".join f"{path}\0{body hash}" for path, body hash in pairs digest = hashlib.sha256 canonical.encode "utf-8" .hexdigest ⋮---- def load ignored path: Path - set str ⋮---- hashes: set str = set ⋮---- stripped = line.split " ", 1 0 .strip ⋮---- def ensure ignore file path: Path - None 证据：`src/slopo/analysis/ignore.py`
- **Filesystem**（source_file）：total = len clusters ⋮---- filename = cluster filename i, total ⋮---- def clean report dir output dir: Path - None ⋮---- index = output dir / "index.md" 证据：`src/slopo/analysis/report/filesystem.py`
- **Markdown**（source_file）：LANG MAP = { ⋮---- total = len clusters headers = "Cluster", "Hash", "Score", "Code units", "Unique files" rows: list list str = ⋮---- link = f" Cluster {i} {cluster filename i, total } " records = units uid for uid in cluster.unit ids ⋮---- unit count = len records unique files = len {record.file path for record in records} ⋮---- timestamp = generated at.strftime "%Y-%m-%d %H:%M:%S" ⋮---- lines: list str = ⋮---- unit = units unit id lang = lang tag unit.file path records = unit, duplicates.get unit id, ⋮---- def format table headers: list str , rows: list list str - str ⋮---- widths = def render cells: list str - str ⋮---- padded = cells col .ljust widths col for col in range len cells ⋮--… 证据：`src/slopo/analysis/report/markdown.py`
- **Naming**（source_file）：CLUSTER FILE GLOB = "cluster- .md" CLUSTER FILE RE = re.compile r"cluster-\d+\.md" def cluster filename number: int, total: int - str ⋮---- width = len str total 证据：`src/slopo/analysis/report/naming.py`
- **Rerank**（source_file）：def path hops path a: str, path b: str - int ⋮---- dir a = PurePosixPath path a .parent.parts dir b = PurePosixPath path b .parent.parts common = 0 ⋮---- b = boost.same file line distance unit a, unit b ⋮---- b = boost.cross dir path hops unit a.file path, unit b.file path ⋮---- reranked: list SimilarPair = ⋮---- unit a = units pair.unit id a unit b = units pair.unit id b new score = rerank pair score pair, unit a, unit b ⋮---- members = set cluster.unit ids in cluster = ⋮---- def line distance a: UnitRecord, b: UnitRecord - int 证据：`src/slopo/analysis/rerank.py`
- **Similarity**（source_file）：unit ids = list embeddings.keys n = len unit ids ⋮---- matrix = np.stack embeddings uid for uid in unit ids ⋮---- pairs: list SimilarPair = ⋮---- end = min start + block size, n block = matrix start:end @ matrix.T ⋮---- global r = start + int r global c = int c 证据：`src/slopo/analysis/similarity.py`
- **Cli**（source_file）：app = typer.Typer DEFAULT CONFIG = Path "slopo.conf.yaml" def version callback value: bool - None ⋮---- @app.command def init ctx: typer.Context - None ⋮---- path = config path ctx ⋮---- @app.command name="show-config" def show config ctx: typer.Context - None ⋮---- cfg = load config or exit ctx ⋮---- value = getattr cfg, f.name ⋮---- value = mask api key value ⋮---- @app.command def index ctx: typer.Context - None ⋮---- """Scan a directory and store parsed code units.""" ⋮---- conn = open existing db or exit cfg ⋮---- conn = create db cfg ⋮---- @app.command def embed ctx: typer.Context - None ⋮---- @app.command def analyze ctx: typer.Context - None def main - None def config path ctx: type… 证据：`src/slopo/cli.py`
- **Config**（source_file）：CONFIG TEMPLATE = """\ class ConfigError Exception ⋮---- @dataclass class Config ⋮---- source dir: Path source dir exclude: list str db file: Path report dir: Path ignore file: Path embedding model: str embedding dimensions: int embedding api key: str embedding batch size: int embedding batch chars: int similarity threshold: float rerank threshold: float body node count threshold: int def load config path: Path - Config ⋮---- text = path.read text encoding="utf-8" source = str path ⋮---- raw = yaml.safe load text ⋮---- def parse config raw: Any, source: str - Config ⋮---- raw = {} ⋮---- def write config template path: Path - None def mask api key key: str - str def check missing space after… 证据：`src/slopo/config.py`
- **Command**（source_file）：total = count unembedded units conn ⋮---- embedded = 0 ⋮---- batch = load next batch ⋮---- batch embeddings = embed units batch, cfg 证据：`src/slopo/embedding/command.py`
- **Command**（source_file）：stats = sync index 证据：`src/slopo/indexing/command.py`
- **Python**（source_file）：LANGUAGE = Language tree sitter python.language PARSER = Parser LANGUAGE COMMENT TYPES = {"comment"} def parse source: bytes - list CodeUnit ⋮---- tree = PARSER.parse source units: list CodeUnit = ⋮---- def collect units node: Node, source: bytes, units: list CodeUnit - None ⋮---- name node = node.child by field name "name" name = body = body without comments node, source ⋮---- def body without comments function: Node, source: bytes - str ⋮---- comment spans: list tuple int, int = ⋮---- pieces: list bytes = cursor = function.start byte ⋮---- cursor = end ⋮---- def collect comment spans node: Node, spans: list tuple int, int - None def count body nodes function definition: Node - int ⋮---- b… 证据：`src/slopo/indexing/parsing/lang/python.py`
- **Rust**（source_file）：LANGUAGE = Language tree sitter rust.language PARSER = Parser LANGUAGE COMMENT TYPES = {"line comment", "block comment"} UNIT TYPES = {"function item", "function signature item", "closure expression"} def parse source: bytes - list CodeUnit ⋮---- tree = PARSER.parse source units: list CodeUnit = ⋮---- def collect units node: Node, source: bytes, units: list CodeUnit - None ⋮---- body = body without comments node, source ⋮---- def unit name node: Node - str ⋮---- name node = node.child by field name "name" ⋮---- name node = binding name node node ⋮---- def binding name node node: Node - Node None ⋮---- parent = node.parent ⋮---- left = parent.child by field name "left" ⋮---- def body without… 证据：`src/slopo/indexing/parsing/lang/rust.py`
- **Typescript**（source_file）：LANGUAGE = Language tree sitter typescript.language typescript PARSER = Parser LANGUAGE COMMENT TYPES = {"comment"} UNIT TYPES = { def parse source: bytes - list CodeUnit ⋮---- tree = PARSER.parse source units: list CodeUnit = ⋮---- def collect units node: Node, source: bytes, units: list CodeUnit - None ⋮---- body = body without comments node, source ⋮---- def unit name node: Node - str ⋮---- name node = node.child by field name "name" ⋮---- name node = binding name node node ⋮---- def binding name node node: Node - Node None ⋮---- parent = node.parent ⋮---- left = parent.child by field name "left" ⋮---- def body without comments unit: Node, source: bytes - str ⋮---- comment spans: list tu… 证据：`src/slopo/indexing/parsing/lang/typescript.py`
- **.gitignore**（source_file）：.py cod .idea/ .venv/ .env slopo.db slopo-report/ slopo.conf.yaml slopo.ignore.txt 证据：`.gitignore`
- **Pyproject**（source_file）：project name = "slopo" version = "0.2.0" description = "Embedding-based code duplication detector" license = "AGPL-3.0-or-later" license-files = "LICENSE" authors = { name = "Rafal Kochanowski" } readme = "README.md" requires-python = " =3.12" dependencies = "litellm~=1.89.2", "numpy~=2.4.6", "pathspec~=1.1.1", "python-dotenv~=1.2.2", "pyyaml~=6.0.3", "tree-sitter~=0.25.2", "tree-sitter-c-sharp~=0.23.5", "tree-sitter-go~=0.25.0", "tree-sitter-java~=0.23.5", "tree-sitter-javascript~=0.25.0", "tree-sitter-kotlin~=1.1.0", "tree-sitter-python~=0.25.0", "tree-sitter-rust~=0.24.2", "tree-sitter-typescript~=0.23.2", "typer~=0.26.7", 证据：`pyproject.toml`

## 宿主 AI 必须遵守的规则

- **把本资产当作开工前上下文，而不是运行环境。**：AI Context Pack 只包含证据化项目理解，不包含目标项目的可执行状态。 证据：`README.md`, `LICENSE`, `doc/example-report/cluster-01.md`
- **回答用户时区分可预览内容与必须安装后才能验证的内容。**：安装前体验的消费者价值来自降低误装和误判，而不是伪装成真实运行。 证据：`README.md`, `LICENSE`, `doc/example-report/cluster-01.md`

## 用户开工前应该回答的问题

- 你准备在哪个宿主 AI 或本地环境中使用它？
- 你只是想先体验工作流，还是准备真实安装？
- 你最在意的是安装成本、输出质量、还是和现有规则的冲突？

## 验收标准

- 所有能力声明都能回指到 evidence_refs 中的文件路径。
- AI_CONTEXT_PACK.md 没有把预览包装成真实运行。
- 用户能在 3 分钟内看懂适合谁、能做什么、如何开始和风险边界。

---

## Doramagic Context Augmentation

下面内容用于强化 Repomix/AI Context Pack 主体。Human Manual 只提供阅读骨架；踩坑日志会被转成宿主 AI 必须遵守的工作约束。

## Human Manual 骨架

使用规则：这里只是项目阅读路线和显著性信号，不是事实权威。具体事实仍必须回到 repo evidence / Claim Graph。

宿主 AI 硬性规则：
- 不得把页标题、章节顺序、摘要或 importance 当作项目事实证据。
- 解释 Human Manual 骨架时，必须明确说它只是阅读路线/显著性信号。
- 能力、安装、兼容性、运行状态和风险判断必须引用 repo evidence、source path 或 Claim Graph。

- **项目概览、安装与快速开始**：importance `high`
  - source_paths: README.md, pyproject.toml, src/slopo/cli.py
- **系统架构与数据流水线**：importance `high`
  - source_paths: src/slopo/cli.py, src/slopo/schema.py, src/slopo/db.py, src/slopo/indexing/scanner.py, src/slopo/indexing/parsing/base.py
- **相似度、聚类、重排序与 v0.2.0 精确副本处理**：importance `high`
  - source_paths: src/slopo/analysis/command.py, src/slopo/analysis/similarity.py, src/slopo/analysis/clustering.py, src/slopo/analysis/rerank.py, src/slopo/analysis/boost.py
- **配置参数、Markdown 报告与团队协作工作流**：importance `high`
  - source_paths: src/slopo/config.py, src/slopo/analysis/ignore.py, src/slopo/analysis/report/markdown.py, src/slopo/analysis/report/filesystem.py, src/slopo/analysis/report/naming.py

## Repo Inspection Evidence / 源码检查证据

- repo_clone_verified: true
- repo_inspection_verified: true
- repo_commit: `90f6f4fa5739452918e93f7c4b3b1f652591fec5`
- inspected_files: `README.md`, `pyproject.toml`, `uv.lock`, `src/slopo/__init__.py`, `src/slopo/analysis/__init__.py`, `src/slopo/analysis/boost.py`, `src/slopo/analysis/clustering.py`, `src/slopo/analysis/command.py`, `src/slopo/analysis/db.py`, `src/slopo/analysis/dedup.py`, `src/slopo/analysis/ignore.py`, `src/slopo/analysis/models.py`, `src/slopo/analysis/overlap.py`, `src/slopo/analysis/report/__init__.py`, `src/slopo/analysis/report/filesystem.py`, `src/slopo/analysis/report/markdown.py`, `src/slopo/analysis/report/naming.py`, `src/slopo/analysis/rerank.py`, `src/slopo/analysis/similarity.py`, `src/slopo/cli.py`

宿主 AI 硬性规则：
- 没有 repo_clone_verified=true 时，不得声称已经读过源码。
- 没有 repo_inspection_verified=true 时，不得把 README/docs/package 文件判断写成事实。
- 没有 quick_start_verified=true 时，不得声称 Quick Start 已跑通。

## Doramagic Pitfall Constraints / 踩坑约束

这些规则来自 Doramagic 发现、验证或编译过程中的项目专属坑点。宿主 AI 必须把它们当作工作约束，而不是普通说明文字。

### Constraint 1: 能力判断依赖假设

- Trigger: README/documentation is current enough for a first validation pass.
- Host AI rule: 将假设转成下游验证清单。
- Why it matters: 假设不成立时，用户拿不到承诺的能力。
- Evidence: capability.assumptions | https://news.ycombinator.com/item?id=48762038 | README/documentation is current enough for a first validation pass.
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。

### Constraint 2: 维护活跃度未知

- Trigger: 未记录 last_activity_observed。
- Host AI rule: 补 GitHub 最近 commit、release、issue/PR 响应信号。
- Why it matters: 新项目、停更项目和活跃项目会被混在一起，推荐信任度下降。
- Evidence: evidence.maintainer_signals | https://news.ycombinator.com/item?id=48762038 | last_activity_observed missing
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。

- Trigger: no_demo
- Evidence: downstream_validation.risk_items | https://news.ycombinator.com/item?id=48762038 | no_demo; severity=medium
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。

### Constraint 4: 存在评分风险

- Trigger: no_demo
- Why it matters: 风险会影响是否适合普通用户安装。
- Evidence: risks.scoring_risks | https://news.ycombinator.com/item?id=48762038 | no_demo; severity=medium
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。

### Constraint 5: issue/PR 响应质量未知

- Trigger: issue_or_pr_quality=unknown。
- Host AI rule: 抽样最近 issue/PR，判断是否长期无人处理。
- Why it matters: 用户无法判断遇到问题后是否有人维护。
- Evidence: evidence.maintainer_signals | https://news.ycombinator.com/item?id=48762038 | issue_or_pr_quality=unknown
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。

### Constraint 6: 发布节奏不明确

- Trigger: release_recency=unknown。
- Host AI rule: 确认最近 release/tag 和 README 安装命令是否一致。
- Why it matters: 安装命令和文档可能落后于代码，用户踩坑概率升高。
- Evidence: evidence.maintainer_signals | https://news.ycombinator.com/item?id=48762038 | release_recency=unknown
- Hard boundary: 不要把这个坑点包装成已解决、已验证或可忽略，除非后续验证证据明确证明它已经关闭。