The Data Warehouse  ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data

By: Joe Caserta (author), Ralph Kimball (author)Paperback

Up to 2 WeeksUsually despatched within 2 weeks

Description

Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse Offers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality

About Author

RALPH KIMBALL, PhD, founder of the Kimball Group, has been a leading visionary in the data warehousing industry since 1982 and is one of today's best-known speakers and educators. He is the author of several bestselling titles published on data warehousing, including The Data Warehouse Toolkit (Wiley). JOE CASERTA is the founder of Caserta Concepts, LLC, a data warehousing consulting firm. He writes frequently for print and online magazines, and is an active contributor to DWList, the major online community for data warehousing professionals.

Contents

Acknowledgments. About the Authors. Introduction. Part I: Requirements, Realities, and Architecture. Chapter 1: Surrounding the Requirements. Chapter 2: ETL Data Structures. Part II: Data Flow. Chapter 3: Extracting. Chapter 4: Cleaning and Conforming. Chapter 5: Delivering Dimension Tables. Chapter 6: Delivering Fact Tables. Part III: Implementation and operations. Chapter 7: Development. Chapter 8: Operations. Chapter 9: Metadata. Chapter 10: Responsibilities. Part IV: Real Time Streaming ETL Systems. Chapter 11: Real-Time ETL Systems. Chapter 12: Conclusions. Index.

Product Details

  • ISBN13: 9780764567575
  • Format: Paperback
  • Number Of Pages: 528
  • ID: 9780764567575
  • weight: 712
  • ISBN10: 0764567578

Delivery Information

  • Saver Delivery: Yes
  • 1st Class Delivery: Yes
  • Courier Delivery: Yes
  • Store Delivery: Yes

Prices are for internet purchases only. Prices and availability in WHSmith Stores may vary significantly

Close