TPC-D, version 2: Overview
Goal: define a workload to “take over” for TPC-D 1.x in time with its lifecycle (~2 year from now)
Address the known deficiencies of the 1.x specification
- Introduce data skew
- Require multi-user executions
- What number of streams is interesting?
- Should updates scale with users? with data volume?
- Broaden the scope of the query set and data set
- “Snowstorm” schema
- Larger query set
- Batch and Trickle update models