Advanced Interview QA ADF Databricks PowerBI

The document contains advanced interview questions and answers related to Azure Data Factory, Databricks, and Power BI, focusing on scenario-based and technical questions. It covers troubleshooting strategies, delta load designs, integration runtime usage, performance optimization, and data modeling best practices. Each section provides concise answers to help candidates prepare for technical interviews in data engineering and analytics roles.


Advanced Interview Questions and Answers

Azure Data Factory (ADF)


Scenario-Based Questions:

1. Q: You have a pipeline that loads millions of records daily from an on-prem SQL Server to Azure SQL Database. One day, the copy activity fails without any code changes. How would you troubleshoot and ensure minimal downtime?

A: - Check the pipeline run history and error details in ADF Monitor.
- Review the integration runtime status.
- Inspect source and target connectivity.
- Retry the run manually or rely on the activity's retry policy.
- Maintain checkpoints and use data partitioning so a rerun does not reload everything.
- Enable alerts using Azure Monitor.

2. Q: Your client wants to implement a CDC-based delta load from SAP to Azure SQL via ADF, but SAP only provides full extracts. How would you design a delta load strategy with minimal load time?

A: - Use watermark columns or hash-diff logic to identify changed rows (see the sketch below).
- Store the last load value in pipeline variables or a metadata table.
- Filter the SAP extract using this watermark.
- Use snapshot-diff logic if no timestamp column is available.
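Because SAP delivers only full extracts, the diff has to be computed somewhere; one option is a Databricks notebook (or mapping data flow) invoked from the ADF pipeline. The following is a minimal PySpark sketch of the hash-diff idea, assuming hypothetical tables staging.sap_extract (today's full extract) and dw.customer_target (the previously loaded data); the key and attribute columns are illustrative.

```python
# Hash-diff sketch: derive a delta from a full extract (illustrative names throughout).
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

key_cols = ["customer_id"]                    # business key in the SAP extract
attr_cols = ["name", "city", "credit_limit"]  # columns that define "changed"

def with_hash(df):
    # Stable row hash over the tracked attributes; any attribute change flips the hash.
    return df.withColumn("row_hash", F.sha2(F.concat_ws("||", *attr_cols), 256))

source = with_hash(spark.table("staging.sap_extract"))
target = with_hash(spark.table("dw.customer_target"))

# Keep rows whose key is new, or whose hash differs from the previously loaded version.
delta = (source.alias("s")
         .join(target.alias("t"), key_cols, "left")
         .where(F.col("t.row_hash").isNull() | (F.col("s.row_hash") != F.col("t.row_hash")))
         .select("s.*"))

delta.write.mode("overwrite").saveAsTable("staging.sap_delta")  # ADF copies only this delta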

Technical-Based Questions:

1. Q: Explain how integration runtime works in ADF. When would you use self-hosted IR over Azure IR?

A: - Azure IR for cloud data movement.
- Self-hosted IR for on-prem/private network resources.
- Use self-hosted IR when accessing on-prem SQL Server.

2. Q: How can you parameterize Linked Services and Datasets for reusability in ADF across multiple environments (dev/test/prod)?

A: - Use global parameters or dynamic content.
- Define parameters for ServerName, DatabaseName, etc.
- Helps in CI/CD deployment and reuse.

3. Q: How does ADF handle retry policies, and what are the best practices for configuring them in mission-critical pipelines?

A: - Retry options: count and interval.
- Enable retries for transient faults.
- Avoid retrying logic-based failures.
- Use timeouts and fail-fast logic.

Databricks
Scenario-Based Questions:

1. Q: You are implementing a CDC pipeline using Delta Lake. The source system provides both insert and delete records. How would you design the pipeline in Databricks to handle this efficiently using DLT or Auto Loader?

A: - Use apply_changes() in DLT with apply_as_deletes (see the sketch below).
- For Auto Loader, use MERGE with DELETE logic.
- Maintain a high watermark using the change timestamp.
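A minimal DLT sketch of the apply_changes() approach, assuming the raw CDC feed lands as JSON files readable by Auto Loader; the path, table names, and the op/event_ts metadata columns are illustrative assumptions, not fixed names.

```python
import dlt
from pyspark.sql import functions as F

# Bronze: ingest the raw CDC feed with Auto Loader (path and format are illustrative).
@dlt.table
def cdc_raw():
    return (spark.readStream.format("cloudFiles")
            .option("cloudFiles.format", "json")
            .load("/mnt/raw/customers_cdc/"))

# Silver: apply inserts, updates, and deletes into the target table.
dlt.create_streaming_table("customers")

dlt.apply_changes(
    target="customers",
    source="cdc_raw",
    keys=["customer_id"],                    # business key (assumed)
    sequence_by=F.col("event_ts"),           # ordering column, i.e. the high watermark
    apply_as_deletes=F.expr("op = 'D'"),     # rows flagged as deletes remove the key
    except_column_list=["op", "event_ts"],   # CDC metadata not persisted in the target
)
```

With plain Auto Loader (no DLT), the equivalent is a foreachBatch MERGE that updates or inserts matched keys and uses whenMatchedDelete for rows flagged as deletes.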

2. Q: A data team complains that a notebook job is running slower after new columns were added to the Delta table. How would you investigate and optimize it?

A: - Check for file skew and small files.
- Run OPTIMIZE and ZORDER.
- Consider schema evolution impact.
- Use Photon runtime and cache hot tables.

Technical-Based Questions:

1. Q: Explain the difference between OPTIMIZE, VACUUM, and ZORDER BY in Delta Lake. When and how should each be used?

A: - OPTIMIZE: compacts small files into larger ones.
- ZORDER BY: co-locates data by the specified columns to speed up filtering.
- VACUUM: removes obsolete data files no longer referenced by the table.
- ZORDER is applied as a clause of the OPTIMIZE command (see the sketch below).
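For illustration, all three maintenance commands can be issued from a notebook via spark.sql; the table name sales.orders and the ZORDER column are placeholders.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Compact small files and co-locate data by a frequently filtered column.
# Note that ZORDER BY is part of the OPTIMIZE statement, not a separate command.
spark.sql("OPTIMIZE sales.orders ZORDER BY (customer_id)")

# Remove data files that are no longer referenced and are older than the
# retention window (the default retention is 7 days, i.e. 168 hours).
spark.sql("VACUUM sales.orders RETAIN 168 HOURS")
```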

2. Q: How would you implement a slowly changing dimension Type 2 (SCD2) logic using PySpark in Databricks?

A: - Join the source with the target on the business keys.
- Detect changes and expire the old records (set end_date / is_current).
- Insert a new version with an updated start_date.
- Use MERGE INTO on the Delta table (see the sketch below).
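A condensed PySpark sketch of one common SCD2 pattern using the Delta Lake MERGE API. The dimension dw.dim_customer, the staging table, the business key customer_id, and the tracked columns are assumptions for illustration.

```python
from delta.tables import DeltaTable
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

updates = spark.table("staging.customer_updates")     # incoming snapshot (illustrative)
dim = DeltaTable.forName(spark, "dw.dim_customer")    # SCD2 dimension (illustrative)

tracked = ["name", "city", "tier"]
changed = " OR ".join(f"t.{c} <> s.{c}" for c in tracked)

# Step 1: expire the current version of any row whose tracked attributes changed.
(dim.alias("t")
 .merge(updates.alias("s"), "t.customer_id = s.customer_id AND t.is_current = true")
 .whenMatchedUpdate(condition=changed,
                    set={"is_current": "false", "end_date": "current_date()"})
 .execute())

# Step 2: append a new current version for new keys and for the rows expired above.
current_keys = (spark.table("dw.dim_customer")
                .where("is_current = true")
                .select("customer_id"))
new_versions = (updates.join(current_keys, "customer_id", "left_anti")
                .withColumn("start_date", F.current_date())
                .withColumn("end_date", F.lit(None).cast("date"))
                .withColumn("is_current", F.lit(True)))
new_versions.write.format("delta").mode("append").saveAsTable("dw.dim_customer")
```

The anti-join in step 2 picks up both brand-new keys and the keys whose current row was just expired, so each change produces exactly one new current version.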

3. Q: What are the pros and cons of using Delta Live Tables (DLT) over traditional notebooks for data pipeline orchestration?

A: - Pros: declarative, auto-lineage, CDC support.
- Cons: less flexible, more resource intensive.
- Best for production-grade pipelines.

Power BI
Scenario-Based Questions:

1. Q: Your report is slow when filtering on slicers and visuals take 10+ seconds to render. How would you go about identifying and resolving performance bottlenecks?

A: - Use Performance Analyzer.
- Optimize DAX and reduce model size.
- Use summary/aggregation tables.
- Disable auto-date/time.

2. Q: A user requests row-level security (RLS) based on department and region. Departments may span multiple regions. How do you implement this dynamic RLS in Power BI?

A: - Create a user-department-region mapping table.
- Apply USERPRINCIPALNAME() in the DAX filter.
- Set up roles and test with 'View as Role'.

Technical-Based Questions:

1. Q: Explain the differences between Import, DirectQuery, and Composite models. When should each be used and why?

A: - Import: fastest performance, full DAX support.
- DirectQuery: near real-time data, but limited features.
- Composite: a mix of both.
- Use Composite when large DirectQuery fact tables are combined with imported aggregations for fast KPIs.

2. Q: How do you handle circular dependency errors in complex DAX measures or calculated columns?

A: - Use variables.
- Break logic into steps.
- Avoid calculated columns depending on measures.

3. Q: What are the best practices for designing a Power BI data model for large-scale datasets (e.g., over 1 billion rows)?

A: - Use star schema.
- Apply aggregations.
- Use surrogate keys.
- Apply incremental refresh.
