# unstructured

Canonical URL: https://doramagic.ai/en/projects/unstructured/

Source repository: https://github.com/Unstructured-IO/unstructured

## What it is

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models.  Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

## Capability boundary

skill, recipe, host_instruction, eval, preflight

## First safe verification

Verify the smallest path in an isolated environment and keep a rollback path.

## Main risk

May increase setup, validation, or first-run risk for the user.

## Evidence base

https://github.com/Unstructured-IO/unstructured, https://github.com/Unstructured-IO/unstructured#readme, Human Manual, Pitfall Log
