Data Resource - HTML

Undraw Environment

Data Resource - HTML

About

The html files are supported. Tabulify will extract by default the first html table (ie tabular file data resource).

If you want the whole content, you can set the resource type to text.

If there is no table element in the html page, the created resource has one cell with the whole content.

Example

How to read an HTML table?

Attributes

The data attributes for an HTML file are:

Name Default Value Description
TABLE_SELECTOR table The Css selector to obtain the table element
HEADER_SELECTOR th The Css selector to obtain the headers elements in the table
ROW_SELECTOR tr The Css selector to obtain the rows in the table
DATA_SELECTOR th, td The Css selector to obtain the data cell in the row

Because a html file is also a text file, you can also set text properties such as the character set.

FAQ

What happens if there is no table selected ?

If there is no table selected or present, the html file is considered a text file.




Related HowTo
Undraw Environment
How to load an HTLM table in a database?

This howto shows you how to load an HTML table in a SQL Table. Because the HTML resource is a file resource, it can be used as any source in any type of transfer. Example with the upsert operation...
Undraw Environment
How to read an HTML table?

HTML This Howto shows you how to read a HTML table from a html page. By default, an HTML resource will return the first HTML table found. Example: the first table of the Eurovision...
Undraw Environment
How to use the define operation?

This howto shows you how to use the define data operation to create data resources in the pipeline file (ie inline data resources) This operation is only available as step in a pipeline (ie not in a tabul...

Task Runner