Data Engineering Tech_ Interview.
DB vs DWH :
Databases used in tradi onal way of storing and manage data for transac onal
applica ons.
Data warehouse is a type of database it was the place where the huge amount of
data is stored .
Provide meaningful business insight .
( )هو تيكنيك بستخدمه ع اقدر اجمع الداتا من كذ ا سورس )مكان( واستخدمها ع حسب احتياج البيزنس
Star Schema : a rela onal database schema that organizes data into a central
fact table surrounded by mul ple dimension tables, resembling a star in shape
Most used -- denormalized – need less join.
Galaxy Schema : a data warehouse structure where mul ple fact tables share
some or all of the same dimension tables.
Snowflake Schema : consists of one fact table that is connected to many
dimension tables, which can be connected to other dimension tables through a
many-to-one rela onship
Normalized – need more joins .
What is the different between Fact-T & Dimension-T :
Fact tables store quan ta ve data, like sales figures or transac on counts, while
dimension tables store descrip ve informa on about the facts, such as customer
demographics or product
OLAP vs OLTP :
OLTP systems handle real- me, transac onal data processing like order entry or
online banking, while OLAP systems focus on analyzing aggregated, historical data
for repor ng and business intelligence.
ETL vs ELT :
ETL :
1. Extract: Data is gathered from different source systems.
2. Transform: Data is then transformed into a standardized format. The
transforma on can include cleansing, aggrega on, enrichment, and other
processes to make the data fit for its purpose.
3. Load: The transformed data is loaded into a target data warehouse or
another repository.
Data sources are of a smaller scale, and transforma ons are complex.
ETL :
1. Extract: Just as with ETL, data is collected from different sources.
2. Load: Instead of transforming it immediately, raw data is directly loaded into
the target system.
3. Transform: Transformations take place within the data warehouse
Suitability for large datasets
** Business Type: A startup might priori ze flexibility (ELT), while a healthcare
provider might priori ze data security (ETL).
Stored procedure :
A stored procedure is a prepared SQL code that you can save, so the code can be
reused over and over again.
CTEs
Are used to simplify complex joins and run subqueries. They help make SQL
queries more readable and maintainable.
displays all students with Science majors and grade A:
Having vs Group By vs Where :
Where : Used to filter a record .
Having : It is extension from where with using aggrega on func ons.
Group By : groups rows that have the same values into summary rows & used
with aggregate func ons .
Delete vs Truncate :
Both DELETE and TRUNCATE are used to remove data from a table, but they differ
in their func onality and impact.
DELETE allows you to remove specific rows based on a WHERE clause, and it's part
of Data Manipula on Language (DML).
TRUNCATE removes all rows from a table but doesn't use a WHERE clause, and it's
a Data Defini on Language (DDL) command.
Email : [email protected]
Liked in : www.linkedin.com/in/diaa-magdi-41b350221