Joy Mundy, co-author of Ralph Kimball’s best-selling books and former partner of the Kimball Group, will show you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.
Description
Introduction
The extract, transform, and load (ETL) phase of the data warehouse development lifecycle is the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. A solid ETL system is reliable, accurate and high performant.
Joy Mundy, co-author with Ralph Kimball of The Data Warehouse Lifecycle Toolkit and The Kimball Group Reader, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.
Joy will lead you through a rapid-fire one-day course on how to:
- Choose the appropriate architecture for your ETL system
- Plan and design your ETL system
- Build the suite of ETL processes
- Build a comprehensive data cleaning subsystem
- Tune the overall ETL process for optimum performance
- Determine the role of Big Data in your DW architecture
- And much more
Why attend
This vendor-neutral course helps you understand all the factors necessary for effectively designing the back room ETL system of your Kimball DW/BI environment. The course focuses on the overall architecture and design of the ETL system. We drill into the critical processes within the ETL system that are often overlooked. By the end of this course, you will understand how your data warehouse ETL system can be built to respond to ever-changing business requirements.
This is not a code-oriented implementation course; it is a vendor-neutral architecture course for the designer who must keep a broad perspective.
Who should attend
This course is designed for those responsible for building the back room ETL system of a data warehouse environment, including data warehouse team leads, ETL architects, ETL designers and developers, and data warehouse operational staff.
Prerequisites
This course assumes familiarity with the Kimball Approach to dimensional data warehousing. Students must have:
- Attended the Lifecycle and Dimensional Modeling courses, or
- Read The Data Warehouse Toolkit book, or
- A lot of work experience!