unzip is a stream intermediate operation that will unzip/unpack:
The unzip operation is also available via the tabul data unzip command
| Argument | Default | Description |
|---|---|---|
| entry-selector | A list of glob pattern. If set, only the archive entries that match will be extracted | |
| strip-components | 0 | Number of parts striped from the entry path to calculate the destination relative path from the destination directory (equivalent to strip-components in tar) |
| target-data-uri | ${entry_path}@tmp | a template data uri that defines the destination directory where input and entry attributes may be used For instance ${input_logical_name}@tmp |
| Flow Property | ||
| output-type | results | The output - targets, the extracted entry - inputs, the archive inputs are passed - results, the results of the extraction is passed |
| stream-type | map | The type of stream operation * map will produce one output for one archive * split will produce one output by archive entry |
The target-data-uri defines the location of the extracted entry.
The path therefore is mandatory and needs to be unique (By default, the ${entry_path})
The following variables may be used in the template data uri
strip-components removes one or more names from the entry path used in the target_data_uri.
It is used generally to delete the root directory in the path.
For instance,
Note that if you knew that the root directory was called archive-name, you could also delete it with matched group backreference. ie:
If you set as output the value results, you will get a data resource with the following columns:
| Columns | Description |
|---|---|
| target_data_uri | the data uri of the extracted archive entry |
| entry_path | the archive entry path |
| entry_media_type | the archive entry media type |
| entry_media_size | the archive entry size |
| entry_update_time | the archive update time |
Example:
target_data_uri entry_path entry_media_type entry_size entry_update_time
-------------------------------- ------------- ---------------- ---------- ---------------------
world/empty.txt@tmp world/empty.txt text/plain 0 2025-08-19 08:57:53.0
world/foo.txt@tmp world/foo.txt text/plain 5 2025-08-19 08:57:42.0
The target path is a path where an entry is going to be extracted.
If this target path exists, the file is overwritten by the extracted entry.