HTML

pip install pineflow-readers-file
class HTMLReader

Load a HTML file and extract text from a specific tag.

Parameters:

tag (str) – HTML tag to extract. Defaults to section.

load_data(input_file)

Loads data from the specified file.

Parameters:

input_file (str) – File path to load.

Returns:

A list of Document objects loaded from the file.

Return type:

List[Document]