HTML
HTML processing enables you to parse HTML content effectively. Below you can find guidelines on using the HTML-related APIs available in GenAIScript.
Overview
HTML processing functions allow you to convert HTML content to text or markdown, aiding in content extraction and manipulation for various automation tasks.
convertToText
Converts HTML content into plain text. This is useful for extracting readable text from web pages.
convertToMarkdown
Converts HTML into Markdown format. This function is handy for content migration projects or when integrating web content into markdown-based systems.
By default, the converter produces GitHub-flavored markdown. You can disable this behavior by setting the disableGfm
parameter to true
.
convertTablesToJSON
This function specializes in extracting tables from HTML content and converting them into JSON format. It is useful for data extraction tasks on web pages.