# Pitfall Log / 踩坑日志

项目：promptfoo/promptfoo

摘要：发现 19 个潜在踩坑项，其中 0 个为 high/blocking；最高优先级：安装坑 - 失败模式：installation: 0.121.8。

## 1. 安装坑 · 失败模式：installation: 0.121.8

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this installation risk before relying on the project: 0.121.8
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.8
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.8 | 0.121.8

## 2. 安装坑 · 失败模式：installation: code-scan-action: 0.1.6

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this installation risk before relying on the project: code-scan-action: 0.1.6
- 对用户的影响：Upgrade or migration may change expected behavior: code-scan-action: 0.1.6
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/code-scan-action-0.1.6 | code-scan-action: 0.1.6

## 3. 配置坑 · 可能修改宿主 AI 配置

- 严重度：medium
- 证据强度：source_linked
- 发现：项目面向 Claude/Cursor/Codex/Gemini/OpenCode 等宿主，或安装命令涉及用户配置目录。
- 对用户的影响：安装可能改变本机 AI 工具行为，用户需要知道写入位置和回滚方法。
- 证据：capability.host_targets | https://github.com/promptfoo/promptfoo | host_targets=claude, chatgpt

## 4. 配置坑 · 失败模式：configuration: 0.121.15

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this configuration risk before relying on the project: 0.121.15
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.15
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.15 | 0.121.15

## 5. 配置坑 · 失败模式：configuration: Per-test-case `repeat` option to control how many times individual tests run

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this configuration risk before relying on the project: Per-test-case `repeat` option to control how many times individual tests run
- 对用户的影响：Developers may misconfigure credentials, environment, or host setup: Per-test-case `repeat` option to control how many times individual tests run
- 证据：failure_mode_cluster:github_issue | https://github.com/promptfoo/promptfoo/issues/9700 | Per-test-case `repeat` option to control how many times individual tests run

## 6. 配置坑 · 来源证据：Per-test-case `repeat` option to control how many times individual tests run

- 严重度：medium
- 证据强度：source_linked
- 发现：GitHub 社区证据显示该项目存在一个配置相关的待验证问题：Per-test-case `repeat` option to control how many times individual tests run
- 对用户的影响：可能增加新用户试用和生产接入成本。
- 证据：community_evidence:github | https://github.com/promptfoo/promptfoo/issues/9700 | 来源类型 github_issue 暴露的待验证使用条件。

## 7. 能力坑 · 能力判断依赖假设

- 严重度：medium
- 证据强度：source_linked
- 发现：README/documentation is current enough for a first validation pass.
- 对用户的影响：假设不成立时，用户拿不到承诺的能力。
- 证据：capability.assumptions | https://github.com/promptfoo/promptfoo | README/documentation is current enough for a first validation pass.

## 8. 运行坑 · 失败模式：runtime: 0.121.12

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this runtime risk before relying on the project: 0.121.12
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.12
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.12 | 0.121.12

## 9. 运行坑 · 失败模式：runtime: 0.121.14

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this runtime risk before relying on the project: 0.121.14
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.14
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.14 | 0.121.14

## 10. 维护坑 · 失败模式：migration: 0.121.13

- 严重度：medium
- 证据强度：source_linked
- 发现：Developers should check this migration risk before relying on the project: 0.121.13
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.13
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.13 | 0.121.13

## 11. 维护坑 · 维护活跃度未知

- 严重度：medium
- 证据强度：source_linked
- 发现：未记录 last_activity_observed。
- 对用户的影响：新项目、停更项目和活跃项目会被混在一起，推荐信任度下降。
- 证据：evidence.maintainer_signals | https://github.com/promptfoo/promptfoo | last_activity_observed missing

- 严重度：medium
- 证据强度：source_linked
- 发现：no_demo
- 证据：downstream_validation.risk_items | https://github.com/promptfoo/promptfoo | no_demo; severity=medium

## 13. 安全/权限坑 · 存在评分风险

- 严重度：medium
- 证据强度：source_linked
- 发现：no_demo
- 对用户的影响：风险会影响是否适合普通用户安装。
- 证据：risks.scoring_risks | https://github.com/promptfoo/promptfoo | no_demo; severity=medium

## 14. 运行坑 · 失败模式：performance: 0.121.10

- 严重度：low
- 证据强度：source_linked
- 发现：Developers should check this performance risk before relying on the project: 0.121.10
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.10
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.10 | 0.121.10

## 15. 维护坑 · issue/PR 响应质量未知

- 严重度：low
- 证据强度：source_linked
- 发现：issue_or_pr_quality=unknown。
- 对用户的影响：用户无法判断遇到问题后是否有人维护。
- 证据：evidence.maintainer_signals | https://github.com/promptfoo/promptfoo | issue_or_pr_quality=unknown

## 16. 维护坑 · 发布节奏不明确

- 严重度：low
- 证据强度：source_linked
- 发现：release_recency=unknown。
- 对用户的影响：安装命令和文档可能落后于代码，用户踩坑概率升高。
- 证据：evidence.maintainer_signals | https://github.com/promptfoo/promptfoo | release_recency=unknown

## 17. 维护坑 · 失败模式：maintenance: 0.121.11

- 严重度：low
- 证据强度：source_linked
- 发现：Developers should check this maintenance risk before relying on the project: 0.121.11
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.11
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.11 | 0.121.11

## 18. 维护坑 · 失败模式：maintenance: 0.121.9

- 严重度：low
- 证据强度：source_linked
- 发现：Developers should check this maintenance risk before relying on the project: 0.121.9
- 对用户的影响：Upgrade or migration may change expected behavior: 0.121.9
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/0.121.9 | 0.121.9

## 19. 维护坑 · 失败模式：maintenance: code-scan-action: 0.1.7

- 严重度：low
- 证据强度：source_linked
- 发现：Developers should check this maintenance risk before relying on the project: code-scan-action: 0.1.7
- 对用户的影响：Upgrade or migration may change expected behavior: code-scan-action: 0.1.7
- 证据：failure_mode_cluster:github_release | https://github.com/promptfoo/promptfoo/releases/tag/code-scan-action-0.1.7 | code-scan-action: 0.1.7
