docs/guides/dataset/add_data_source/add_rss.md
Integrate an RSS feed as a data source.
This guide explains how to add an RSS feed as a data source to your dataset in RAGFlow.
RSS (Really Simple Syndication) is a standardized web feed format used to publish frequently updated content, such as blog entries, news headlines, and podcasts. By connecting an RSS feed to RAGFlow, you can automatically ingest new content from a website shortly after it is published.
Integrating an RSS data source provides the following advantages:
Before adding the data source, you need the direct URL of the RSS feed you want to monitor. You can typically find this in a few different ways:
- Try a common feed path on the site (for example, https://example.com/rss, https://example.com/feed, or https://example.com/atom.xml).
- View the page's HTML source and press Ctrl+F (or Cmd+F) to search for rss or application/rss+xml. The feed URL is listed in the href attribute of the matching tag.

To add an RSS feed to your dataset, follow these steps:
Provide the RSS feed URL (for example, https://news.ycombinator.com/rss).

Once configured, RAGFlow's background task executors poll the RSS feed automatically, with no manual scheduling on your part. The system continuously downloads new entries for parsing and chunking and, concurrently, runs the deleted-file sync to remove files that are no longer present in the source feed.
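
To make the sync behavior more concrete, here is a minimal Python sketch of the comparison that one poll cycle implies: which entries are new, and which have disappeared from the feed. This is an illustration only, not RAGFlow's implementation. The function names `entry_links` and `diff_poll` and the sample feeds are hypothetical, and the sketch assumes a plain RSS 2.0 document whose `item` and `link` elements are not namespaced (an Atom feed would need different element names).

```python
import xml.etree.ElementTree as ET


def entry_links(rss_xml: str) -> set[str]:
    """Return the <link> text of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    links = set()
    for item in root.iter("item"):
        link = item.findtext("link")
        if link:
            links.add(link.strip())
    return links


def diff_poll(known: set[str], current: set[str]) -> tuple[set[str], set[str]]:
    """Compare two polls and return (entries to ingest, entries to remove)."""
    return current - known, known - current


# Hypothetical feed contents from two consecutive polls.
FIRST_POLL = """<rss version="2.0"><channel><title>Example</title>
<item><title>A</title><link>https://example.com/a</link></item>
<item><title>B</title><link>https://example.com/b</link></item>
</channel></rss>"""

SECOND_POLL = """<rss version="2.0"><channel><title>Example</title>
<item><title>B</title><link>https://example.com/b</link></item>
<item><title>C</title><link>https://example.com/c</link></item>
</channel></rss>"""

known = entry_links(FIRST_POLL)
added, removed = diff_poll(known, entry_links(SECOND_POLL))
print(sorted(added))    # ['https://example.com/c']
print(sorted(removed))  # ['https://example.com/a']
```

In this sketch, entries are identified by their link. A real connector may key on a different field, such as the entry's guid.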