Data Engineering
Modern Data Engineering In The Cloud
data engineering modern cloud 26.08.2020 - 28.08.2020
MAMPU, Cyberjaya
Data
engineering is the crucial part to enable and operationalize big data analytics and cloud
applications in the big data ecosystem Modern data engineering ensures fast, secure, and high
quality implementation of new systems that streamline operations and reduce costs with
minimal workforce interruption It provides an extensible, highly scalable set of tools to access,
transform, and integrate data from any business system This course is designed for anyone who
wants to perform data integration and management tasks Participants work on projects to
monitor the process and database changes In addition, participants would also learn about how
cloud technology is able to help IT to reduce hardware dependencies, software management etc
Course Objective
- Memahami dan melaksanakan konsep Extract, Transform and Load (ETL).
- Melakukan proses integrasi data dan menyiapkan tugasan yang diberi.
- Memantau proses dan perubahan pangkalan data.
- Memahami kepentingan teknologi Cloud.
Course Outcomes
- Understand and execute the concept of Extract, Transform and Load (ETL)
- Perform data integration process and manage the tasks given
- Monitor the process and database changes
- Understand the importance of cloud technology
Course Outline
- What is Data Engineering?
- Overview
- Data Engineering Skillset
- Data Engineering Roles
- What is Cloud Data Engineering?
- Different Types of Cloud Offering
- Traditional ETL vs Modern Data Engineering in Cloud
- Why on Cloud?
- Data Engineering Execution
- Working with files
- iBasictransformation (Join, Filter, Expression Editor)
- Using context variables
- Error Handling
- Working with Databases
- Working with web services
- Master Job
- Documenting a Job
- Parallel Execution
- Joblets
- Change Data Capture (CDC)
- Introduction to Cloud
- Overview architecture
- Connecting to Talend Cloud
- Publishing job
- Cloud Engine and Remote Engine
- Job Execution
- Job Scheduling