# PaddleOCR

Canonical URL: https://doramagic.ai/en/projects/paddleocr/

Source repository: https://github.com/PaddlePaddle/PaddleOCR

## What it is

PaddleOCR is a lightweight OCR toolkit that turns PDFs and image documents into structured data for AI applications, bridging the gap between images/PDFs and LLMs. It supports 100+ languages.

## Capability boundary

skill, recipe, host_instruction, eval, preflight

## First safe verification

Run the smallest working path first, in an isolated environment such as a virtual environment or container, and keep a way to roll back if the install or first run fails.
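
One way to make this concrete is a throwaway virtual environment that is deleted after the check, so that cleanup doubles as the rollback. The sketch below uses only the Python standard library. The helper names (`env_python`, `can_import`, `preflight`) are hypothetical, and the package and module name `paddleocr` is an assumption about how the project is distributed; the repository README remains the authority on install steps, which may include extra dependencies this sketch does not install.

```python
import subprocess
import sys
import tempfile
import venv
from pathlib import Path


def env_python(env_dir: Path) -> Path:
    """Return the interpreter path inside a virtual environment."""
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def can_import(python: Path, module: str) -> bool:
    """Ask the given interpreter whether it can import `module`."""
    result = subprocess.run(
        [str(python), "-c", f"import {module}"],
        capture_output=True,
    )
    return result.returncode == 0


def preflight(package: str = "paddleocr", module: str = "paddleocr") -> bool:
    """Install `package` into a throwaway venv and try importing `module`.

    The package and module names are assumptions, not confirmed by this page.
    """
    # The temporary directory is removed when the `with` block exits,
    # which acts as the rollback: nothing is left on the host system.
    with tempfile.TemporaryDirectory() as tmp:
        env_dir = Path(tmp) / "env"
        venv.EnvBuilder(with_pip=True).create(env_dir)
        python = env_python(env_dir)
        install = subprocess.run(
            [str(python), "-m", "pip", "install", package],
            capture_output=True,
        )
        if install.returncode != 0:
            return False
        return can_import(python, module)
```

Calling `preflight()` returns `True` only if both the install and the import succeed inside the temporary environment. A `False` result says the smallest path is not yet working; it does not say why, so inspecting the captured pip output would be the next step.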

## Main risk

Developers may fail before the first successful local run (source: Link Checker Report).

## Evidence base

https://github.com/PaddlePaddle/PaddleOCR, https://github.com/PaddlePaddle/PaddleOCR#readme, Human Manual, Pitfall Log
