Advanced Data Lake Management and Governance with AWS Lake Formation
.MP4, AVC, 1152x720, 30 fps | English, AAC, 2 Ch | 2h 45m | 398 MB
Instructor: Dipali Kulshrestha
.MP4, AVC, 1152x720, 30 fps | English, AAC, 2 Ch | 2h 45m | 398 MB
Instructor: Dipali Kulshrestha
In this advanced course on AWS Lake Formation, AWS-certified software programmer Dipali Kulshrestha equips you with the knowledge and skills to build, manage, and govern secure data lakes at scale. Learn how to set up a data lake from scratch, configure fine-grained permissions, and implement robust data governance controls using both IAM and Lake Formation permissions. Delve into cross-account data sharing so that you can ensure secure data collaboration across different AWS environments. Plus, explore advanced data processing techniques using AWS services like Athena, Glue, and EMR. When you complete this course, you’ll know how to optimize and secure data lakes for real-world enterprise use cases and how to address complex governance and access challenges.
Learning objectives
- Set up and manage data lakes using AWS Lake Formation, implementing best practices for data ingestion, cataloging, and registration of data sources to ensure efficient and secure data storage.
- Master the ability to enforce fine-grained permissions and access control for different data roles (data analysts, scientists, administrators), leveraging both IAM and Lake Formation permissions to securely govern data access.
- Understand how to securely share data lake assets across AWS accounts using both IAM-based and Lake Formation-based permissions, ensuring compliance with enterprise data governance policies while facilitating collaboration.
- Integrate AWS Lake Formation with services like AWS Athena, Glue, and EMR to query, transform, and process data, enabling efficient and scalable analysis workflows within the governed data lake.
- Manage complex data access scenarios using Lake Formation's hybrid access models, ensuring that both IAM and Lake Formation permissions work harmoniously to address data access challenges.