Engineering Specialist Advisor, NTT Data, Plano, Texas, USA
Title of the Talk :
Automating Data Warehouse Testing in the Cloud: Scalable Approaches for Accuracy, Performance, and Reliability
Abstract of Talk:
As enterprises migrate to cloud-based data warehouses, the scale, complexity, and velocity of ETL pipelines demand a shift from manual validation to fully automated testing frameworks. In cloud environments where elasticity, distributed processing, and frequent schema changes are the norm where automation is key to ensuring data accuracy, performance, and reliability without slowing delivery cycles. This session will present practical strategies for automating data warehouse testing across public, private, and hybrid cloud platforms. Attendees will learn how to design automated workflows for source-to-target reconciliation, transformation logic validation, regression testing, and performance benchmarking, all within CI/CD pipelines. The talk will also cover integration with cloud-native services such as AWS Glue, Azure Data Factory, and Google BigQuery, and the use of AI/ML techniques for anomaly detection. Real-world examples will illustrate how leading organizations achieve faster release cycles, reduced operational costs, and higher trust in analytics through intelligent, automated testing solutions.
