Back to Crawlee

HTTP crawler

website/versioned_docs/version-3.12/examples/http_crawler.mdx

3.16.0538 B
Original Source

import RunnableCodeBlock from '@site/src/components/RunnableCodeBlock'; import ApiLink from '@site/src/components/ApiLink'; import HttpCrawlerSource from '!!raw-loader!roa-loader!./http_crawler.ts';

This example demonstrates how to use <ApiLink to="http-crawler/class/HttpCrawler">HttpCrawler</ApiLink> to build a HTML crawler that crawls a list of URLs from an external file, load each URL using a plain HTTP request, and save HTML.

<RunnableCodeBlock className="language-js" type="cheerio"> {HttpCrawlerSource} </RunnableCodeBlock>