Doramagic.ai Chinese

Tag preview

Open Source Tool

6 preview projects.

Data Analysis & Investment Research Public

PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

PaddlePaddle/PaddleOCR
Customer Communication & Team Operations Public

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

Unstructured-IO/unstructured
Data Analysis & Investment Research Public

presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

microsoft/presidio
Data Analysis & Investment Research Public

seekdb

The AI-Native Search Database. Best for agent storage, it unifies vector, text, structured, and semi-structured data into a single engine. This all-in-one database makes agents smarter, easier to run, and more stable.

oceanbase/seekdb
MCP Tool Integration Public

webclaw

MCP tool integration project for safely connecting external tools, services, or data sources to an AI host.

0xMassi/webclaw
MCPTool callingHost configuration
Vector Retrieval and RAG Public

dsRAG

Vector retrieval project for checking embedding storage, query semantics, RAG integration, data boundaries, and rollback.

D-Star-AI/dsRAG
Vector databaseRAGEmbeddings