Tabulify - TPC-DS (Benchmark)

Tpc

Tabulify - TPC-DS (Benchmark)

Tpc

About

Tabulify supports the Tpc-Ds database benchmark on the following points:

Operations

Schema Management

This section shows you how to manage the sub-schema of TPC-DS

All tables

tpcds - all TPC-DS tables

tabul data list *@tpcds
tabul data create *@tpcds @targetConnection
tabul data fill *@tpcds @targetConnection

Dwh

the data-warehouse tables - all tables without the tables that starts with a s (ie without the staging tables)

tabul data list [!s]*@tpcds
tabul data create [!s]*@tpcds @targetConnection
tabul data fill [!s]*@tpcds @targetConnection

Store Sales

The store-sales schema has the store_sales and store_return star schema (a data-warehouse schema).

tabul data list --with-dependencies store*@tpcds
path                     media_type
----------------------   ------------
customer                 sql/relation
customer_address         sql/relation
customer_demographics    sql/relation
date_dim                 sql/relation
household_demographics   sql/relation
income_band              sql/relation
item                     sql/relation
promotion                sql/relation
reason                   sql/relation
store                    sql/relation
store_returns            sql/relation
store_sales              sql/relation
time_dim                 sql/relation

tabul data create --with-dependencies store*@tpcds @targetConnection
tabul data copy --with-dependencies store*@tpcds @targetConnection

This article explains this technic: how to select a star schema

Note on the schema

The TPC-DS benchmark does not define the B column (business key) as unique key. Our implementation makes them all unique (except on the item table where the column is unique only with the start and end date)

Why ? Because when using TPC-DS as a sample schema, the data generator will then create data that is consistent with the queries.

For TPC-DS, a business key is neither a primary key nor a foreign key in the context of the data warehouse schema. It is only used to differentiate new data from update data of the source tables during the data maintenance operations.




Related HowTo
Undraw Circuit Board
Database HowTo - How to load your database with the TPCDS benchmark

This howto will show you how to load a relational database schema in order to create a benchmark with the data query command
Undraw Circuit Board
Learning Tabulify - Step 4 - How to select Data Resources

To select a data resources such as a file or a database table, Tabulify uses the concept of: and dependency (Do we select also the dependent data resources) This page goes through this concepts...
Undraw Circuit Board
Learning Tabulify - Step 6 - How to transfer Data Resources

In Tabulify, when you want to manipulate data, it's called a transfer. You want to move data, you transfer You want to copy data, you transfer You want to insert data, you transfer You want to...
Undraw Circuit Board
Learning Tabulify - Step 7 - How to execute a SQL Query

For Tabulify, a SQL Query is a runtime resource that when executed returns a data set in tabular format (known as the result set). The selector of a runtime is known as a runtime selector. This selector...
Undraw Circuit Board
Learning Tabulify - Step 8 - How to modify the content of a table

Tabulify learning guidetransfer step The transfer page should have made you familiar with the transfer concept where we have introduced the copy operation. The copy and move operation does not modify...
Undraw Circuit Board
Postgres - How to list SQL Tables

This howto will show you how to list SQL Tables
Undraw Circuit Board
SQL - How to execute and shows the result of a SQL SELECT Query

This howto will show you to execute and shows the result of a SQL Select Query against the tpcds data
Undraw Circuit Board
SQLite - How to fill a table with auto generated data

This how-to will show you how to fill a table with auto-generated data and the Tabul fill action.
Undraw Circuit Board
Tabul - How to copy the data definition of all tables from a star/snowflake schema

This howto will show you how to copy the data definition of all SQL tables from a star/snowflake schema.
Undraw Circuit Board
Tabul - How to copy the data definition of all tables from a star/snowflake schema

This howto will show you how to copy the data definition of all SQL tables from a star/snowflake schema.

Task Runner