This module provides an HTML document reader for the Spring AI project. It leverages the JSoup library to parse HTML content and extract text and metadata, making it suitable for use in AI applications.
- Extracts text from the `<body>` of an HTML document.
- Extracts content from `<meta>` tags (e.g., description, keywords). You can specify which meta tags to extract.
- Extracts links (`<a href="...">`) within the document.
- Works with Spring's `Resource` abstraction, allowing you to read HTML from files, classpath resources, URLs, and even in-memory byte arrays.

To build the module, run `./mvnw -pl document-readers/jsoup-reader clean install`.
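
To illustrate the kind of extraction described above (body text, selected `<meta>` tags, and link URLs), here is a minimal sketch that calls the jsoup library directly. It is not this module's reader API. The class name `HtmlExtractionSketch`, its helper methods, the sample HTML, and the `https://example.com/` base URI are all assumptions made for the example, and resolving links to absolute URLs is a choice of this sketch rather than documented behavior.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical illustration class; not part of the jsoup-reader module.
public class HtmlExtractionSketch {

    // Sample input, invented for this example.
    static final String SAMPLE_HTML =
        "<html><head><title>Demo</title>"
        + "<meta name=\"description\" content=\"A demo page\">"
        + "<meta name=\"keywords\" content=\"spring, ai\">"
        + "</head><body>"
        + "<p>Hello <b>world</b>. See the <a href=\"/docs\">docs</a>.</p>"
        + "</body></html>";

    // Parse the sample with an assumed base URI so relative links can be resolved.
    static Document parseSample() {
        return Jsoup.parse(SAMPLE_HTML, "https://example.com/");
    }

    // Text of the <body>, with tags stripped and whitespace normalized by jsoup.
    static String bodyText(Document doc) {
        return doc.body().text();
    }

    // Content of the requested <meta name="..."> tags; names that are absent are skipped.
    static Map<String, String> metaTags(Document doc, List<String> names) {
        Map<String, String> result = new LinkedHashMap<>();
        for (String name : names) {
            Element meta = doc.selectFirst("meta[name=" + name + "]");
            if (meta != null) {
                result.put(name, meta.attr("content"));
            }
        }
        return result;
    }

    // URLs of all <a href="..."> links, resolved against the document's base URI.
    static List<String> linkUrls(Document doc) {
        List<String> urls = new ArrayList<>();
        for (Element link : doc.select("a[href]")) {
            urls.add(link.absUrl("href"));
        }
        return urls;
    }

    public static void main(String[] args) {
        Document doc = parseSample();
        System.out.println(bodyText(doc));
        System.out.println(metaTags(doc, List.of("description", "keywords")));
        System.out.println(linkUrls(doc));
    }
}
```

Running this requires jsoup on the classpath. Passing a base URI to `Jsoup.parse` is what lets `absUrl` resolve a relative link such as `/docs`; without one, `absUrl` returns an empty string for relative links.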