Data Integration (DI), also known as ‘ETL’, is the analysis, combination, and transformation of data from a variety of sources and formats into a unified data model representation. Data Integration is a key element of data warehousing, application integration, and business analytics solutions. The variety and volume of data is always increasing and performance of data integration systems is critical. However, there has been no industry standard for measuring and comparing the performance of DI systems.
The TPC has re-established the TPC-ETL subcommittee, renamed to TPC-DI to develop a DI benchmark. The TPC-DI benchmark subcommittee has been formed and is continuing the development of the specification and data generator. Two sample implementations have been created and initial runs performed.
|