# https://github.com/firecrawl/firecrawl 项目说明书生成时间：2026-05-19 08:34:08 UTC ## 目录 - [Introduction to Firecrawl](#introduction) - [Project File Structure](#file-structure) - [System Architecture](#system-architecture) - [Search Functionality](#search-functionality) - [Web Scraper Engine](#scraper-engine) - [Agent and Deep Research](#agent-capabilities) - [Python SDK](#python-sdk) - [JavaScript/TypeScript SDK](#javascript-sdk) - [Other Language SDKs](#other-sdks) - [API v2 Endpoints](#api-v2-endpoints) ## Introduction to Firecrawl ### 相关页面相关主题：[System Architecture](#system-architecture), [Search Functionality](#search-functionality), [Web Scraper Engine](#scraper-engine)

相关源码文件

以下源码文件用于生成本页说明： - [README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) - [apps/python-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/python-sdk/README.md) - [apps/js-sdk/firecrawl/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/js-sdk/firecrawl/README.md) - [apps/go-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/go-sdk/README.md) - [apps/java-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/java-sdk/README.md) - [apps/dot-net-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/dot-net-sdk/README.md) - [apps/ruby-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/ruby-sdk/README.md)

# Introduction to Firecrawl Firecrawl is an intelligent web scraping and data extraction platform designed specifically for AI systems. It enables developers to search, scrape, and interact with the web through a unified API, supporting multiple programming languages through official SDKs. 资料来源：[README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) ## Core Features Overview Firecrawl provides four primary capabilities that form the foundation of its web interaction platform: ### Search Find information across the web through Firecrawl's search functionality, allowing AI applications to locate relevant sources and data. 资料来源：[README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) ### Scrape Extract clean, structured data from any webpage. The scrape feature supports multiple output formats including markdown, HTML, and links, with options for full-page or main-content-only extraction. 资料来源：[README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) ### Interact Click, navigate, and operate on web pages programmatically. This feature enables complex workflows like filling forms, navigating through multi-step processes, and performing authenticated operations. 资料来源：[README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) ### Agent Autonomous data gathering through AI-powered agents that can intelligently navigate websites, extract relevant information, and handle complex research tasks. 资料来源：[README.md](https://github.com/firecrawl/firecrawl/blob/main/README.md) ## Architecture Overview ```mermaid graph TD A[Client Applications] --> B[Firecrawl API] B --> C[Search Service] B --> D[Scrape Service] B --> E[Crawl Service] B --> F[Agent Service] C --> G[Search Providers] D --> H[HTML Processing] E --> H H --> I[Markdown Conversion] I --> J[Structured Output] F --> K[LLM Integration] K --> D K --> E ``` ## SDK Ecosystem Firecrawl provides official SDKs for multiple programming languages, enabling seamless integration across different technology stacks. 资料来源：[apps/python-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/python-sdk/README.md) ### SDK Comparison | Language | Package Name | Version | Min SDK/API Version | Installation | |----------|-------------|---------|---------------------|--------------| | Python | `firecrawl-sdk` | Latest | Python 3.8+ | `pip install firecrawl-sdk` | | JavaScript/TypeScript | `@mendable/firecrawl-js` | Latest | Node.js 18+ | `npm install @mendable/firecrawl-js` | | Go | `firecrawl` | v2 | Go 1.21+ | `go get github.com/firecrawl/firecrawl-go-sdk` | | Java | `firecrawl-java` | 1.6.0 | Java 11+ | Maven dependency | | .NET | `firecrawl-sdk` | Latest | .NET 6+ | `dotnet add package firecrawl-sdk` | | Ruby | `firecrawl` | Latest | Ruby 3.0+ | `gem install firecrawl` | 资料来源：[apps/python-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/python-sdk/README.md), [apps/js-sdk/firecrawl/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/js-sdk/firecrawl/README.md), [apps/go-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/go-sdk/README.md), [apps/java-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/java-sdk/README.md), [apps/dot-net-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/dot-net-sdk/README.md), [apps/ruby-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/ruby-sdk/README.md) ### Python SDK ```python from firecrawl import Firecrawl app = Firecrawl(api_key="fc-YOUR_API_KEY") result = app.scrape('https://firecrawl.dev', formats=['markdown', 'html']) ``` The Python SDK supports both synchronous and asynchronous operations, with v2 being the current major version and v1 available for legacy compatibility under `firecrawl.v1`. 资料来源：[apps/python-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/python-sdk/README.md) ### JavaScript/TypeScript SDK ```javascript import Firecrawl from '@mendable/firecrawl-js'; const app = new Firecrawl({ apiKey: "fc-YOUR_API_KEY" }); const result = await app.scrape('https://firecrawl.dev'); ``` 资料来源：[apps/js-sdk/firecrawl/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/js-sdk/firecrawl/README.md) ### Go SDK ```rust use firecrawl::{Client, ScrapeOptions, Format, CrawlOptions}; let client = Client::new("fc-YOUR_API_KEY")?; let document = client.scrape("https://firecrawl.dev", None).await?; ``` 资料来源：[apps/go-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/go-sdk/README.md) ### Java SDK ```java FirecrawlClient client = FirecrawlClient.builder() .apiKey("fc-your-api-key") .build(); Document doc = client.scrape("https://example.com", ScrapeOptions.builder() .formats(List.of("markdown")) .build()); ``` 资料来源：[apps/java-sdk/README.md](https://github.com/firecrawl/firecrawl/blob/main/apps/java-sdk/README.md) ### .NET SDK ```csharp var client = new FirecrawlClient("fc-your-api-key"); var doc = await client.ScrapeAsync("https://example.com", new ScrapeOptions { Formats = new List

Python Parse