# Pitfall Log

Project: adbar/trafilatura

Summary: Found 37 structured pitfall item(s), including 2 high/blocking item(s). Top priority: Runtime risk - Runtime risk requires verification.

## 1. Runtime risk - Runtime risk requires verification

- Severity: high
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/661

## 2. Security or permission risk - Security or permission risk requires verification

- Severity: high
- Evidence strength: source_linked
- Finding: Project evidence flags a security or permission risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/634

## 3. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this configuration risk before relying on the project: Duplicate paragraph extraction when a long sibling paragraph is present
- User impact: Developers may misconfigure credentials, environment, or host setup: Duplicate paragraph extraction when a long sibling paragraph is present
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/817

## 4. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this configuration risk before relying on the project: Duplicated lines when nested in <article> and <main>, with <br> in front
- User impact: Developers may misconfigure credentials, environment, or host setup: Duplicated lines when nested in <article> and <main>, with <br> in front
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/768

## 5. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this configuration risk before relying on the project: `include_images` changes text extraction
- User impact: Developers may misconfigure credentials, environment, or host setup: `include_images` changes text extraction
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/194

## 6. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this configuration risk before relying on the project: some extraction duplicated in xml
- User impact: Developers may misconfigure credentials, environment, or host setup: some extraction duplicated in xml
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/634

## 7. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/768

## 8. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/236

## 9. Configuration risk - Configuration risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a configuration risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/829

## 10. Capability evidence risk - Capability evidence risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: README/documentation is current enough for a first validation pass.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: capability.assumptions | https://github.com/adbar/trafilatura

## 11. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/755

## 12. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/78

## 13. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/842

## 14. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/471

## 15. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/825

## 16. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/396

## 17. Runtime risk - Runtime risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a runtime risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/411

## 18. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this migration risk before relying on the project: Investigate spacing in element tails
- User impact: Developers may hit a documented source-backed failure mode: Investigate spacing in element tails
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/661

## 19. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this migration risk before relying on the project: trafilatura-1.12.0
- User impact: Upgrade or migration may change expected behavior: trafilatura-1.12.0
- Evidence: failure_mode_cluster:github_release | https://github.com/adbar/trafilatura/releases/tag/v1.12.0

## 20. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this migration risk before relying on the project: trafilatura-1.12.1
- User impact: Upgrade or migration may change expected behavior: trafilatura-1.12.1
- Evidence: failure_mode_cluster:github_release | https://github.com/adbar/trafilatura/releases/tag/v1.12.1

## 21. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Developers should check this migration risk before relying on the project: trafilatura-2.0.0
- User impact: Upgrade or migration may change expected behavior: trafilatura-2.0.0
- Evidence: failure_mode_cluster:github_release | https://github.com/adbar/trafilatura/releases/tag/v2.0.0

## 22. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a maintenance risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/788

## 23. Maintenance risk - Maintenance risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a maintenance risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: evidence.maintainer_signals | https://github.com/adbar/trafilatura

## 24. Security or permission risk - Security or permission risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: no_demo
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: downstream_validation.risk_items | https://github.com/adbar/trafilatura

## 25. Security or permission risk - Security or permission risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: no_demo
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: risks.scoring_risks | https://github.com/adbar/trafilatura

## 26. Security or permission risk - Security or permission risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a security or permission risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/817

## 27. Security or permission risk - Security or permission risk requires verification

- Severity: medium
- Evidence strength: source_linked
- Finding: Project evidence flags a security or permission risk. Review the linked source before relying on this workflow.
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: community_evidence:github | https://github.com/adbar/trafilatura/issues/194

## 28. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this capability risk before relying on the project: Keeping all valid table information and formatting
- User impact: Developers may hit a documented source-backed failure mode: Keeping all valid table information and formatting
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/78

## 29. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this capability risk before relying on the project: Keeping images breaks parsing
- User impact: Developers may hit a documented source-backed failure mode: Keeping images breaks parsing
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/842

## 30. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this capability risk before relying on the project: `included_images` failed when trying to extract images in a table
- User impact: Developers may hit a documented source-backed failure mode: `included_images` failed when trying to extract images in a table
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/396

## 31. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this capability risk before relying on the project: include_links breaks the extraction for https://news.ycombinator.com
- User impact: Developers may hit a documented source-backed failure mode: include_links breaks the extraction for https://news.ycombinator.com
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/411

## 32. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this conceptual risk before relying on the project: Backticks produce extra line breaks
- User impact: Developers may hit a documented source-backed failure mode: Backticks produce extra line breaks
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/755

## 33. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this conceptual risk before relying on the project: HTML conversion: 'NoneType' object is not subscriptable
- User impact: Developers may hit a documented source-backed failure mode: HTML conversion: 'NoneType' object is not subscriptable
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/236

## 34. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this conceptual risk before relying on the project: Missing Yoast FAQ block headers
- User impact: Developers may hit a documented source-backed failure mode: Missing Yoast FAQ block headers
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/471

## 35. Capability evidence risk - Capability evidence risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: Developers should check this conceptual risk before relying on the project: Text dropped in table after setting `include_formatting=True`
- User impact: Developers may hit a documented source-backed failure mode: Text dropped in table after setting `include_formatting=True`
- Evidence: failure_mode_cluster:github_issue | https://github.com/adbar/trafilatura/issues/829

## 36. Maintenance risk - Maintenance risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: issue_or_pr_quality=unknown。
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: evidence.maintainer_signals | https://github.com/adbar/trafilatura

## 37. Maintenance risk - Maintenance risk requires verification

- Severity: low
- Evidence strength: source_linked
- Finding: release_recency=unknown。
- User impact: May increase setup, validation, or first-run risk for the user.
- Evidence: evidence.maintainer_signals | https://github.com/adbar/trafilatura
