Crawl a URL using Firecrawl

Overview

Firecrawl is a tool for crawling websites and extracting clean markdown that’s structured in an LLM-ready format.

Here are two examples of how to use Firecrawl with Trigger.dev:

Prerequisites

A project with Trigger.dev initialized
A Firecrawl account

Example 1: crawl an entire website with Firecrawl

This task crawls a website and returns the crawlResult object. You can set the limit parameter to control the number of URLs that are crawled.

trigger/firecrawl-url-crawl.ts

import FirecrawlApp from "@mendable/firecrawl-js";
import { task } from "@trigger.dev/sdk/v3";

// Initialize the Firecrawl client with your API key
const firecrawlClient = new FirecrawlApp({
  apiKey: process.env.FIRECRAWL_API_KEY, // Get this from your Firecrawl dashboard
});

export const firecrawlCrawl = task({
  id: "firecrawl-crawl",
  run: async (payload: { url: string }) => {
    const { url } = payload;

    // Crawl: scrapes all the URLs of a web page and return content in LLM-ready format
    const crawlResult = await firecrawlClient.crawlUrl(url, {
      limit: 100, // Limit the number of URLs to crawl
      scrapeOptions: {
        formats: ["markdown", "html"],
      },
    });

    if (!crawlResult.success) {
      throw new Error(`Failed to crawl: ${crawlResult.error}`);
    }

    return {
      data: crawlResult,
    };
  },
});

Testing your task

You can test your task by triggering it from the Trigger.dev dashboard.

"url": "<url-to-crawl>" // Replace with the URL you want to crawl

Example 2: scrape a single URL with Firecrawl

This task scrapes a single URL and returns the scrapeResult object.

trigger/firecrawl-url-scrape.ts

import FirecrawlApp, { ScrapeResponse } from "@mendable/firecrawl-js";
import { task } from "@trigger.dev/sdk/v3";

// Initialize the Firecrawl client with your API key
const firecrawlClient = new FirecrawlApp({
  apiKey: process.env.FIRECRAWL_API_KEY, // Get this from your Firecrawl dashboard
});

export const firecrawlScrape = task({
  id: "firecrawl-scrape",
  run: async (payload: { url: string }) => {
    const { url } = payload;

    // Scrape: scrapes a URL and get its content in LLM-ready format (markdown, structured data via LLM Extract, screenshot, html)
    const scrapeResult = (await firecrawlClient.scrapeUrl(url, {
      formats: ["markdown", "html"],
    })) as ScrapeResponse;

    if (!scrapeResult.success) {
      throw new Error(`Failed to scrape: ${scrapeResult.error}`);
    }

    return {
      data: scrapeResult,
    };
  },
});

Testing your task

You can test your task by triggering it from the Trigger.dev dashboard.

"url": "<url-to-scrape>" // Replace with the URL you want to scrape

LibreOffice PDF conversionThis example demonstrates how to convert documents to PDF using LibreOffice with Trigger.dev.

On this page

Overview
Prerequisites
Example 1: crawl an entire website with Firecrawl
Testing your task
Example 2: scrape a single URL with Firecrawl
Testing your task

Introduction

Frameworks

Guides

Example projects

Python guides

Example tasks

Migration guides

Community packages

Crawl a URL using Firecrawl

Overview

Prerequisites

Example 1: crawl an entire website with Firecrawl

Testing your task

Example 2: scrape a single URL with Firecrawl

Testing your task

Introduction

Frameworks

Guides

Example projects

Python guides

Example tasks

Migration guides

Community packages

​Overview

​Prerequisites

​Example 1: crawl an entire website with Firecrawl

​Testing your task

​Example 2: scrape a single URL with Firecrawl

​Testing your task

Overview

Prerequisites

Example 1: crawl an entire website with Firecrawl

Testing your task

Example 2: scrape a single URL with Firecrawl

Testing your task