Data Engineer Syllabus
Data Engineer Syllabus
o
rg
DATA ENGINEERING SYLLABUS KEY
POINTS
Module 1: Introduction to Data and Opportunities
What is data? (Structured, Semi-structured, Unstructured)
The Data Lifecycle (Capture, Store, Process, Analyze,
Visualize) Big Data and its characteristics (Volume, Variety,
Velocity)
Career paths in Data Engineering
Real-world use cases of Data Engineering
Module 5: MongoDB
Introduction to MongoDB (a popular NoSQL document
database) JSON data format and working with documents
CRUD operations (Create, Read, Update, Delete) in
MongoDB Querying data using MongoDB Query Language
Hands-on Labs with MongoDB Compass
Apache Airflow: A workflow orchestration tool for scheduling and managing data
pipelines. (High-Level overview)
Hive: A data warehouse software framework for reading, writing, and managing
large datasets stored in distributed storage systems like Hadoop.
www.learnomate.o
rg
Module 10: Data Visualization with Power BI
Introduction to Power BI for data visualization
Connecting Power BI to data sources (Azure Synapse,
etc.) Creating reports and dashboards with interactive
visuals Sharing insights with stakeholders