Data Resource - Attribute (Property)

Undraw Environment

Data Resource - Attribute (Property)

About

A resource attribute is a attribute for a data resources.

Type

They may be:

  • derived: value computed such as:
  • literal: value set such as:
    • the header-row-id attribute of a csv that determines if there is or not a header

Management

See

You can see this attributes with the tabul data info command

tabul data info characters.csv@howto
Information about the data resource (characters.csv@howto)
attribute             value                                                                                              description
-------------------   ------------------------------------------------------------------------------------------------   ----------------------------------------------------------------------
CHARACTER_SET         UTF-8                                                                                              The character set of the file
COLUMN_NAME           lines                                                                                              The name of the column when the text content is returned on one column
COMMENT_CHARACTER     #                                                                                                  The comment character
DELIMITER_CHARACTER   ,                                                                                                  The delimiter character
END_OF_RECORD         \n, \r, \r\n                                                                                       The End Of Record string sequence
ESCAPE_CHARACTER                                                                                                         The escape character
HEADER_ROW_ID         1                                                                                                  The id of the header row
IGNORE_EMPTY_LINE     true                                                                                               Ignore empty line
QUOTE_CHARACTER       "                                                                                                  The quote character
URI                   file:///opt/tabulify/resources/howto/characters.csv                                                The URI of the file
ABSOLUTE_PATH         /opt/tabulify/resources/howto/characters.csv                                                       The absolute path on the data system
ACCESS_TIME           2025-11-10 15:34:45.519261734                                                                      The access time (access time)
COMMENT                                                                                                                  A comment
CONNECTION            howto                                                                                              The connection name
COUNT                 5                                                                                                  The number of records
CREATION_TIME         2025-11-10 15:34:41.449090141                                                                      The creation time (birth time)
DATA_URI              characters.csv@howto                                                                               The data uri
KIND                  csv                                                                                                The kind of media
LOGICAL_NAME          characters                                                                                         The logical name
MD5                   d0d7f020d834cde13d0a14733520d4d6                                                                   The Md5 hash
MEDIA_SUBTYPE         csv                                                                                                The media subType
MEDIA_TYPE            text/csv                                                                                           The media type
NAME                  characters.csv                                                                                     The name of the data resource
PARENT                                                                                                                   The parent
PATH                  characters.csv                                                                                     The relative path to the default connection path
SHA384                6e96667d1306abb445b5608ec36d193a6dc775a6adc74844531906bf4691eee5f32ce7c423e38c9c072692f459c57d5f   The Sha384 hash
SHA384_INTEGRITY      sha384-bpZmfRMGq7RFtWCOw20ZOm3Hdaatx0hEUxkGv0aR7uXzLOfEI+OMnAcmkvRZxX1f                            The sha384 value used in the html integrity attribute
SIZE                  432                                                                                                The number of byte
TABULAR_TYPE          data                                                                                               The tabular type
UPDATE_TIME           2025-11-10 15:34:41.449090141                                                                      The last update time (modify time)

List

You can get them in a tabular format with the tabul data list command

tabul data list -a count -a size *@howto
path                        media_type        count   size
-------------------------   ---------------   -----   ----
archive                     inode/directory       2       
books--data-def.yml         text/yaml             1    285
books-semicolon.csv         text/csv              6    385
books.csv                   text/csv              6    386
books_template.json         text/json             1    112
characters.csv              text/csv              5    432
command                     inode/directory       6       
create_foo.sql              text/sql              2     70
create_view_foo.sql         text/sql              1     32
dataset                     inode/directory       1       
diff                        inode/directory       2       
email                       inode/directory       2       
generator                   inode/directory       7       
json                        inode/directory       1       
mysql                       inode/directory       2       
oracle                      inode/directory       3       
pebble                      inode/directory       2       
pipeline                    inode/directory       8       
postgres                    inode/directory      15       
README.md                   text/plain            3    132
recursive                   inode/directory       1       
select.yml                  text/yaml             1    332
sql_query_ddl_country.sql   text/sql              8    559
sqlite                      inode/directory       4       
sqlserver                   inode/directory       3       
warehouse_delete.csv        text/csv              1     17
warehouse_delete.sql        text/sql              1     65
warehouse_original.csv      text/csv              1    301
warehouse_update.sql        text/sql              1    230
warehouse_upsert.csv        text/csv              2    420
xml                         inode/directory       1       
yaml                        inode/directory       1

Set

You can set the attributes via:

Add as virtual column

You can add the data resource attribute as extra-column via the virtual columns.

Type

Identifier

  • dataUri - the unique identifier of this data resource
  • type of resource:
    • media-type - the resource media type (ie text/html)
    • media-subtype - the resource media subtype (ie html in text/html), generally the file extension
    • kind - kind is a single name representation of a media type used in a manifest

Common

Common attributes are attributes that you will find on every data resources.

The following attributes are common:

Time

  • creation-time - the creation time (when the resource was created)
  • access-time - the last access time (when the resource was last read)
  • update-time - the last update time (when the resource content was last changed, known also as modification time)

Note:

  • They may be null if the system does not support them
  • The in-memory pipeline data resource are set with a creation time at UTC.

Digest

  • md5 - the MD5 digest
  • sha384 - the SHA384 digest
  • sha384_integrity - the SHA384 integrity digest used in a integrity HTML attribute.

Specific

Specific attributes are attributes that are available only for specific type of data resource.

For instance, for a text file, you would have its character-set.




Related HowTo
Undraw Environment
Database HowTo - How to load JSON documents in a database

This howto will show you how to load json document (files) into your relational database.
Undraw Environment
How to load and analyze YAML documents in a database?

This howto will show you how to load and analyze yaml document (files) into a relational database. sqlite howto databaserelational database At its core a YAML file is just a JSON file styled in...
Undraw Environment
How to read and write an Excel file?

This howto will demonstrate you how to read and write to an Excel resource. To following this howto, you should have followed the getting started guide to have knowledge of: Resource creation...

Task Runner