0% found this document useful (0 votes)
83 views91 pages

Ilovepdf Merged

Uploaded by

sawantcha.raja
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
83 views91 pages

Ilovepdf Merged

Uploaded by

sawantcha.raja
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 91

THE COMPLETE

DATA ANALYST ROADMAP

Go From Zero to a Data Analyst in 12 Months

Mosh Hamedani
2

Hi! I am Mosh Hamedani, a software engineer with over 20


years of experience.

Over the past 10 years, I ve had the privilege of teaching


millions of people how to code and become professional
software engineers through my YouTube channel and online
courses.

It s my mission to make software engineering accessible to


everyone. Join me on this journey and unlock your potential in
the world of coding!

https://codewithmosh.com

Copyright 2024 Code with Mosh codewithmosh.com




Data Analyst Roadmap 3

Table of Content

Introduction 4
Target Audience 4
Resources 4
Roadmap Overview 5
Mathematics and Statistics 6
Excel 8
SQL 10
Python 12
Version Control (Git) 14
Data Collection and Preparation 15
Data Visualization 17
Machine Learning Optional) 19
Big Data Optional) 20

Copyright 2024 Code with Mosh codewithmosh.com


(
(
Data Analyst Roadmap 4

Introduction
This guide is designed to help you navigate the essential skills needed to become
a successful data analyst. Whether you're just starting out or looking to enhance
your existing skills, this roadmap will provide a clear and structured path.

Target Audience
This guide is for:

• Beginners who want to know what they need to learn to land a data analyst job.

• Experienced individuals looking to level up their skills and fill in the gaps in
their knowledge.

Resources
For detailed tutorials and full courses, check out the following resources:

• YouTube Channel: https://www.youtube.com/c/programmingwithmosh

• Full Courses: https://codewithmosh.com

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 5

Roadmap Overview
Below is a comprehensive table listing all the essential skills needed to become a
proficient data analyst, along with the estimated time required to learn each skill.

Keep in mind that the time needed to learn each skill can vary for everyone. These
estimates are based on dedicating 3 to 5 hours of study every day.

Use this roadmap to guide your learning journey and track your progress as you
build a strong foundation in data analysis.

Skill Est. Time Learning Phase


Mathematics and Statistics 1 2 months Beginner
Excel 2 3 weeks Beginner
SQL 1 2 months Beginner
Programming Python) 1 2 months Beginner
Version Control (Git) 1 2 weeks Beginner
Data Collection and Preparation 1 2 months Intermediate
Data Visualization 1 2 months Intermediate
Machine Learning Optional) 1 2 months Advanced
Big Data Optional) 1 2 months Advanced
Total 8 16 months

Copyright 2024 Code with Mosh codewithmosh.com


-
-
-
-
-
-
-
-
-
-
(
(
(
Data Analyst Roadmap 6

Mathematics and Statistics


A strong foundation in mathematics and statistics is essential for data analysis.
Concepts such as probability, statistical analysis, and algebra provide the
theoretical underpinnings for understanding and interpreting data.

Estimated Time: 1 2 months

Essential Concepts
• Basic Algebra

• Basic algebraic operations

• Solving equations

• Calculus

• Basic concepts of differentiation and integration

• Understanding of limits and functions

• Linear Algebra

• Vectors and matrices

• Matrix operations

• Eigenvalues and eigenvectors

• Probability

• Basic probability concepts

• Probability distributions (normal, binomial, poisson)

• Random variables

• Bayes' theorem

Copyright 2024 Code with Mosh codewithmosh.com


-
Data Analyst Roadmap 7

• Descriptive Statistics

• Mean, median, mode

• Variance and standard deviation

• Skewness and kurtosis

• Quartiles and percentiles

• Inferential Statistics

• Hypothesis testing

• Confidence intervals

• p-values

• T-tests and chi-square tests

• Analysis of variance ANOVA

• Regression Analysis

• Simple linear regression

• Multiple linear regression

• Logistic regression

Copyright 2024 Code with Mosh codewithmosh.com


(
)
Data Analyst Roadmap 8

Excel
Excel is a powerful tool for data analysis and manipulation. It s widely used for its
simplicity, accessibility, and a broad range of built-in functions that make data
analysis straightforward.

Estimated Time: 2 3 weeks

Essential Concepts
• Basics

• Creating and managing workbooks

• Navigating and selecting cells

• Cell formatting (number, text, date, currency)

• Formulas and Functions

• Basic arithmetic operations (+, -, *, /)

• Common functions: SUM, AVERAGE, COUNT, MAX, MIN

• Logical functions: IF, AND, OR, NOT

• Text functions: CONCATENATE, LEFT, RIGHT, MID, TRIM

• Date functions: TODAY, NOW, DATE, DATEDIF, YEAR, MONTH, DAY

• Lookup functions: VLOOKUP, HLOOKUP, INDEX MATCH

• Basic Data Analysis

• Sorting data

• Filtering data

• Conditional formatting

Copyright 2024 Code with Mosh codewithmosh.com


-
-

Data Analyst Roadmap 9

• Charts

• Creating charts (bar charts, line charts, pie charts, scatter plots)

• Customizing chart elements (titles, labels, legends)

• Pivot Tables

• Creating pivot tables

• Customizing pivot tables

• Advanced Formulas

• IF

• SUMIF

• COUNTIF

• IFERROR

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 10

SQL
SQL Structured Query Language) is essential for querying and managing data in
relational databases. It s a fundamental skill for any data analyst working with
structured data.

Estimated Time: 1 2 months

Learning resources: YouTube Tutorial | Full Course

Essential Concepts
• Basic Operations

• Querying data SELECT

• Modifying data INSERT, UPDATE, DELETE

• Filtering data WHERE, IN, BETWEEN, LIKE, IS NULL, REGEXP

• Logical operators AND, OR, NOT

• Sorting and limiting data ORDER BY, LIMIT

• Complex Queries

• Joins INNER, OUTER, SELF, NATURAL, CROSS

• Aggregate functions MAX, MIN, AVG, SUM, COUNT

• Grouping data GROUP BY, HAVING, ROLLUP

• Subqueries

• Views

• Stored Procedures and Functions

• Triggers and Events

Copyright 2024 Code with Mosh codewithmosh.com


(
(
(
(
(
(
-
(
(

)
(
)
)
)
)
)
)
)
Data Analyst Roadmap 11

• Transactions

• Transaction isolation levels

• BEGIN, COMMIT, ROLLBACK

• Database Design

• Normalization

• Database integrity with primary keys, foreign keys, and constraints

• Indexes

• Security and Permissions

• Managing users and privileges

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 12

Python
Python is a versatile and widely-used programming language in data analysis due
to its simplicity and extensive library support.

Estimated Time: 1 2 months

Learning resources: YouTube Tutorial | Full Course

Essential Concepts
• Python Fundamentals

• Variables and data types

• Loops (for, while) and conditional statements (if, elif, else)

• Functions and scope

• Data Structures

• Arrays, lists, tuples and sets

• Stacks and queues

• Dictionaries

• Comprehensions

• Generator expressions

• Exception Handling

• Handling exceptions with try/except

• Raising exceptions

• Functional Programming

• Lambda functions

• Map, reduce, filter

Copyright 2024 Code with Mosh codewithmosh.com


-
Data Analyst Roadmap 13

• Object-oriented Programming

• Classes and objects

• Inheritance and polymorphism

• Modules and packages

• Creating modules

• Managing packages with pip and pipenv

• Virtual environments

• Python Standard Library

• Working with paths, files, and directories

• Working with CSV and JSON files

• Working with Date/time

• Generating random values

• Data Analysis Libraries

• Pandas

• NumPy

• SciPy

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 14

Version Control (Git)


Git is a version control system that is crucial for managing code and collaboration.
It allows you to track changes, collaborate with others, and maintain the integrity
of your codebase, making it an essential tool for any data analyst

Estimated Time: 1 2 weeks

Learning resources: YouTube Tutorial | Full Course

Essential Concepts
• Setup and Configuration: init, clone, config

• Staging: status, add, rm, mv, commit, reset

• Inspect and Compare: log, diff, show

• Branching: branch, checkout, merge

• Remote Repositories: remote, fetch, pull, push

• Temporary Commits: stash

• GitHub: fork, pull request, code review

Copyright 2024 Code with Mosh codewithmosh.com


-
Data Analyst Roadmap 15

Data Collection and Preparation


Data collection, cleaning, and preparation are critical steps in data analysis. This
involves gathering raw data, transforming it into a usable format, and ensuring its
quality and accuracy for reliable analysis.

Estimated Time: 1 2 months

Essential Concepts
• Python Libraries

• Numpy (numerical computing)

• Pandas (data manipulation)

• BeautifulSoup (web scraping)

• Scrapy (web scraping)

• Data Collection

• Importing data from CSV, Excel, and JSON files

• Connecting to databases

• Using APIs to collect data

• Web scraping

• Data Cleaning

• Handling missing values

• Removing duplicates

• Finding outliers

• Data transformation

• Converting data types (text to numbers, dates, etc)

Copyright 2024 Code with Mosh codewithmosh.com


-
Data Analyst Roadmap 16

• Parsing and splitting data (text to columns)

• Standardizing data formats (uppercase, lowercase, date formats)

• Data Integration

• Combining data from multiple sources

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 17

Data Visualization
Data visualization involves creating graphical representations of data to identify
trends, patterns, and insights. It is a crucial skill for communicating results
effectively.

Estimated Time: 1 2 months

Essential Concepts
• Python Libraries

• Matplotlib

• Seaborn

• Visualization Tools

• Basics of Tableau or Power BI (growing in popularity)

• Charts and Graphs

• Bar charts

• Line charts

• Scatter plots

• Funnel charts

• Histograms

• Stacked charts

• Heatmaps

• Pie charts

• Dashboards

• Creating interactive dashboards

Copyright 2024 Code with Mosh codewithmosh.com


-
Data Analyst Roadmap 18

• Dynamic dashboards

• Storytelling with Data

• Creating narratives

• Insights through visualization

Copyright 2024 Code with Mosh codewithmosh.com


Data Analyst Roadmap 19

Machine Learning Optional)


Machine learning provides data analysts with powerful tools to create predictive
models and uncover deeper insights from data. Understanding the basics of
machine learning can significantly enhance your ability to analyze complex
datasets and provide actionable insights.

Estimated Time: 1 2 months

Essential Concepts
• Python Libraries

• Scikit-learn

• Supervised Learning

• Regression algorithms (e.g., linear regression, logistic regression)


• Classification algorithms (e.g., decision trees, k-nearest neighbors,
support vector machines)

• Unsupervised Learning

• Clustering algorithms (e.g., K-means, hierarchical clustering)


• Dimensionality reduction techniques (e.g., PCA, LDA
• Model Evaluation

• Confusion matrix
• Precision
• Recall
• F1 score
• ROC curves

Copyright 2024 Code with Mosh codewithmosh.com


-
(
)
Data Analyst Roadmap 20

Big Data Optional)


Big data technologies enable data analysts to handle and process vast amounts of
data efficiently. Understanding big data concepts and tools is essential for
working with large datasets and gaining insights from them.

Estimated Time: 1 2 months

Essential Concepts
• Big Data Frameworks

• Hadoop

• Apache Spark

• Data Storage

• HDFS

• NoSQL databases Cassandra, MongoDB

• Data Processing

• MapReduce programming model

• Batch processing with Spark

• Real-time processing with Spark Streaming

• Data Ingestion

• Data collection tools Kafka, Flume)

Copyright 2024 Code with Mosh codewithmosh.com


(
-
(
(
)
Data Analyst Roadmap 21

Learning to code is a journey. Be patient with yourself and


stay persistent, even when things get tough.

- Mosh

Copyright 2024 Code with Mosh codewithmosh.com


Master Data Analysis
A Step-by-Step Roadmap to Land a Job in 2025
with AI

This roadmap is designed for freshers, career changers, and non-IT


professionals. It includes FREE learning resources for technical skills.

This guide will equip you with the must-have technical skills, actionable
strategies (ATS-friendly resumes, LinkedIn optimization, interview
prep), and hands-on project ideas to build your portfolio.

As we approach 2025, Machine Learning (ML), AI tools, and Prompt


Engineering are rapidly becoming integral to the data analyst’s skill set.

Machine learning, AI tools, and prompt engineering are no longer just


advanced skills for data scientists. In 2025, these technologies will not
only enhance productivity but also enable analysts to deliver more
accurate, faster, and actionable insights, positioning them at the
forefront of the data revolution.

Doubts about Will AI replace Data Analyst Jobs?


Watch this: https://www.youtube.com/watch?v=19bGdeau_bM&t=247s

Total Duration: 5 months

● Freshers or non-IT professionals: 5-6 months


● Career changers with some transferable skills: 4-5 months

Daily Time Commitment:

● Part-time learners (e.g., working professionals): 2-3 hours daily


● Full-time learners: 4-6 hours daily

Timing Requirement:
● Morning learners: Early hours are great for focused learning (e.g.,
technical skills).
● Evening learners: Post-work hours can be used for lighter tasks like
soft skills, networking, practicing technical skills and revising.

5 Month Learning Plan:

Learning Technical Skills


1. Statistics

What to learn:

● Descriptive Statistics:
Measures of central tendency (Mean, Median and Mode)
Measures of dispersion (Range, Variance, Standard Deviation, Percentiles
and Quartiles)
Frequency, Relative and Cumulative frequency
Graphical Representations - Boxplots, Histograms, Scatterplots
Outliers, how to identify and remove outliers
Correlation and Covariance

● Probability: Bayes Theorem, Probability Distributions, Standard Normal


Distribution, Empirical Rule

● Inferential Statistics: Confidence Interval (Z / T distribution), Hypothesis


testing, Level of significance and p values, Types of tests (z test, t test,
ANOVA, Chi square etc)

Real Life Applications: A/B Testing, Regression Analysis, Time Series Analysis
and Forecasting

Projects and Tutorials:

● Statistics Crash Course Tutorial:


https://www.youtube.com/watch?v=S7LvZZNq4ys&t=8317s

● A/B Testing and Regression Analysis Project:


https://www.youtube.com/watch?v=iCj4lT5KvJk&t=1057s

● Time Series Analysis Crash Course Tutorial:


https://www.youtube.com/watch?v=A3fowDMo8mM&t=8995s

● Stock Price Prediction Project:


https://www.youtube.com/watch?v=IY8HZ2Z_sn4
2. Python

What to learn:

● Basics: Data Types, Data Structures (list, dictionary, set, tuple),


Conditional Statements, Loops, Control Statements, Functions, Error
handling, Modules and Packages, OOPs (Optional)

● Libraries for Data Analysis: Pandas, Numpy, Matplotlib, Seaborn, Scipy


and Statsmodels

Real Life Applications:

● Exploratory Data Analysis: Summary Statistics, Formulating Research


Questions and answering using Data Analysis & Visualization, Identifying
Outliers, Identifying anomalies in data, feature engineering

● Data Cleaning and Preparation: Handling missing data, removing


inconsistencies and duplicacy, removing outliers, feature selection

● Data Transformation: Pivot, groupby, merge, join, scaling and


normalization

● Working with databases: using sqlite/ psycopg2 like databases libraries


to fetch data from database in python for advance analysis

● Time Series Analysis: Stationarity, Resampling, Rolling window analysis,


Forecasting

● Statistical Analysis: Applying all statistical tests using python

Projects and Tutorials:

● Python Playlist:
https://www.youtube.com/watch?v=bPrmA1SEN2k&list=PLZoTAELRMXV
NUL99R4bDlVYsncUNvwUBB

● Python for Data Analysis:


https://www.youtube.com/watch?v=wUSDVGivd-8

● Data Analysis End-to-End Project using Python:


https://www.youtube.com/watch?v=obJZ1rB7TKc&t=2074s
● Complete python project:
https://www.youtube.com/watch?v=KgCgpCIOkIs&t=1226s&pp=ygUOcHl0
aG9uIHByb2plY3Q%3D

Practice Websites:
● https://www.analyticsvidhya.com/blog/2024/05/python-coding-interview-q
uestion s-for-beginners/
● https://www.hackerrank.com/domains/python
● https://leetcode.com/problemset/

3. SQL

What to learn:

● Data Definition Language (DDL): Creating, Altering, Dropping tables


● Constraints, Keys: Primary Key, Foreign Key, Unique Key, NOT Null
● Normalization: 1NF, 2NF, 3NF, BCNF
● Data Manipulation: Inserting, updating, deleting records
● Data Types: Numeric, String, Date and Time
● Query Structure: Select, filtering, sorting and limiting data
● Joins: Inner, Left, Right, Full Outer, Cross, Self
● Grouping and Aggregation
● Subqueries: Single, Multiple, Correlated subqueries
● Indexing: Unique, Composite, Full text
● Transactions: ACID properties, Begin transaction, commit, rollback
● Performance Optimization: Indexing, Optimizing Joins, Caching
● Window function, CTEs, Stored Procedure and Functions, Triggers

Real Life Applications: Writing queries for creating reports, Leveraging Python
with SQL to solve complex business problems

Projects and Tutorials:

● SQL Crash Course:


https://www.youtube.com/watch?v=On9eSN3F8w0

● SQL and Python End-to-End Project:


https://www.youtube.com/watch?v=2VMAdlzNuTw&t=1350s
● Airlines Analysis using SQL and Python Project:
https://www.youtube.com/watch?v=LcMjsqZiSjY&t=115s

Practice Websites:

● https://leetcode.com/problemset/database/
● https://leetcode.com/studyplan/top-sql-50/
● https://www.hackerrank.com/domains/sql

Consistent practice is the key to mastering SQL, as it not only helps reinforce your
understanding but also enables you to solve real-world problems efficiently, making you
a more confident and capable data analyst.

So practice as you learn (Will help in sql coding round in interviews as well)

4. Excel
What to learn:

● Excel Functions/ Formulas: Mathematical, Text, Logical functions


● Data Cleaning and Preparation: Handling missing data, text-to-columns,
removing duplicates, data validation
● Data Analysis with Formulas: Aggregate, lookup, date and time,
financial, Statistical functions
● Pivot tables and Charts: Creating and formatting pivot tables, creating
pivot charts, dynamic filtering with slicers and timelines
● What-If Analysis: Goal seek, data tables (one and two variable), scenario
manager
● Conditional Formatting: Highlight cells, data bars, color scales, icon
sets, conditional formatting with formulas
● Data Analysis with Power Query: Connecting to different data sources,
Importing and transforming data

Real Life Applications: Budget analysis, KPI dashboards, and quick data
cleaning, analyze small datasets, automate tasks with formulas, and create
dashboards.

Projects and Tutorials:


● Excel Playlist:
https://www.youtube.com/playlist?list=PLUaB-1hjhk8Hyd5NiPQ9CND82v
NodlFF5

● Sales Dashboard using Excel:


https://www.youtube.com/watch?v=6OMR81faW54&t=58s

● End-to-End Excel Project:


https://www.youtube.com/watch?v=gTK5rNhWJyA&t=5s&pp=ygUaZXhjZ
WwgZGF0IGFuYWx5c2lzIHByb2plY3Q%3D

● Data Analysis Excel Project:


https://www.youtube.com/watch?v=Rthh_bK5xUs

Practice Websites:
● https://www.excelpracticeonline.com/
● https://www.excel-easy.com/

5. BI Tools: Power BI/ Tableau


What to learn:

● Power BI
1. Basic Table transformation
2. Text, Number and Date tools
3. Index and Conditional Columns
4. Grouping and Aggregating Data
5. Pivoting and Unpivoting
6. Merging, Modifying and Appending Queries
7. Connecting to Folders
8. Defining Hierarchies and Categories
9. Best Practices of Query Editing and Power BI
10.Data Model
11. Database Normalization
12.Creating Table Relationships
13.Table Schemas
14.Connecting Multiple Data Tables
15.Filter
16.DAX
17.Creating Interactive Reports and Dashboards

● Tableau
1. Basics of Data Pane
2. Quick Visualizations
3. Marks and its Properties
4. Menu and Toolbar
5. Data Types, Sorting and Grouping
6. Filtering
7. Aggregations
8. Table Calculations
9. Formatting
10.Action Filters
11. Dashboard Layout
12.Stories
13.Distributing and Publishing
14.Joins
15.Relationships
16.Data Models
17.Types of Relationships
18.Pivot
19.Interactivity
20.Trend Lines
21. Clustering and Forecasting
22.Nested LODs and Mapping Functions
23.Dynamic Designs, Extensions and Tooltip Visualizations

Projects and Tutorials:

● Power BI Tutorial:
https://www.youtube.com/watch?v=bQ-HTp-tx40&pp=ygUJcG93ZXIgYmk
g

● Tableau Tutorial:
https://www.youtube.com/watch?v=j8FSP8XuFyk&pp=ygUIdGFibGVhdSA
%3D

● HR Analytics Dashboard using Power BI:


https://www.youtube.com/watch?v=6H4afhQeewU&t=6s
● Blinkit Real Time Dashboard using Power BI:
https://www.youtube.com/watch?v=mmxVCFceQgU&t=30s&pp=ygURcG9
3ZXIgYmkgcHJvamVjdHM%3D

● End-to-End Data Analyst Project using Power BI:


https://www.youtube.com/watch?v=tT4V7zguCnc&pp=ygURcG93ZXIgYm
kgcHJvamVjdHM%3D

● Tableau Project:
https://www.youtube.com/watch?v=KlAKAarfLRQ&pp=ygUPdGFibGVhdS
Bwcm9qZWN0

● Sales dashboard using Tableau:


https://www.youtube.com/watch?v=dahrmqT5GD4&pp=ygUPdGFibGVhd
SBwcm9qZWN0

6. Machine Learning

What to learn:

● Supervised learning: Regression (Linear, Ridge, Lasso), Classification


(Logistic, Decision Trees, Random Forests, Support Vector Machines,
Gradient Boosting)

● Unsupervised learning: K-Means Clustering, DBSCAN, PCA

● NLP: Tokenization, Text Cleaning, Bag of Words, Word Embeddings,


Topic Modeling, Named Entity Recognition, Part of Speech Tagging

● Overview of pipelines: Data preparation, Feature Engineering, Modeling,


Evaluation and Deployment

● Python libraries: scikit-learn, nltk,

Real Life Applications: Predictive analytics, Classification, Clustering and


Segmentation, Sentiment Analysis
Projects and Tutorials:

● Machine learning Tutorial:


https://www.youtube.com/watch?v=JxgmHe2NyeY&pp=ygUbbWFjaGluZS
BsZWFybmluZyBrcmlzaCBuYWlr

● Machine learning Playlist Tutorial:


https://www.youtube.com/watch?v=ZftI2fEz0Fw&list=PLKnIA16_Rmvbr7z
KYQuBfsVkjoLcJgxHH

● End-to-End Machine Learning Project:


https://www.youtube.com/watch?v=S_F_c9e2bz4&list=PLZoTAELRMXVP
S-dOaVbAux22vzqdgoGhG

7. Prompt Engineering and AI Tools

What to learn:

● Anatomy of a good prompt: Clarity, context, and specificity, using


keywords, command, and examples to guide outputs

● Types of prompts: Instructional, Conversation, Zero-shot, Few-shot,


Chained prompts

● Crafting Prompts for AI: Creating structured, specific instructions for


analytics tasks.

Real Life Applications: Generating SQL queries, summarizing reports, or


building Python scripts with AI assistance, coding logic, formulating research
questions

Prompt Engineering Tutorials:


https://www.youtube.com/watch?v=5i2Hn8OG94o&pp=ygUScHJvbXB0IGVuZ2lu
ZWVyaW5n
Other Skills to focus on

Version Control: Basics of Git/GitHub for tracking and sharing projects.


Big Data Tools (Optional): Basics of Spark or Hadoop for analyzing large datasets.

Projects

How many projects should you include?


3-5 High Impact Projects: showcasing different domains, tools, and techniques. Aim
for quality over quantity.

Guide to make data analysis projects:


https://www.youtube.com/watch?v=X-GRMfxNfrE&t=70s

End-to-End Analytics Projects:

● Solve any business problem using your projects tailored to industries like finance,
healthcare, retail, or marketing

● Projects that start from data collection from database or web scraping and end
with actionable insights or dashboards. Include multiple tools for creating
end-to-end solutions.

● Create reports in pdf or ppt with all insights and visualizations of the project
explaining the steps how you solved the problem statement and present your
data driven decisions based on extracted insights. Include a README file or
case study explaining the problem, solution, and impact.

● Use machine learning for forecasting/ decision making/ classification/ regression


tasks with advanced analysis and presenting data or predictions in dashboards
or web portals.

Linkedin Optimization, ATS Friendly Resume, Cover


Letter, Portfolio, Job Preparation and Interview Guide:

LinkedIn is not just a platform for networking—it's your online portfolio, resume, and
professional brand. Start building your profile from day one of your preparation and use
it strategically to showcase your journey, projects, and skills. Recruiters often scout
LinkedIn for talent, and having an optimized profile will give you a competitive edge.

1. Create and Optimize your linkedin profile


2. Post regularly to build your personal brand: learning journey, valuable content
and projects
3. Interact with others posts to expand your network.
4. Connect with recruiters and industry professionals working in the data analyst
domain.
5. Update your profile regularly like adding new certifications, projects, skills as you
learn them.

Develop Soft Skills

Soft skills are just as important as technical expertise in landing and succeeding in a
data analyst role. These skills enable you to communicate your findings effectively,
collaborate with teams, and solve business problems. Here's what to focus on:

● Communication skills:
Practice presenting your projects to friends or mentors.
Use storytelling techniques to explain data insights.

● Problem Solving and Critical Thinking:


Work on case studies and business problems.

● Storytelling with data:


Practice creating presentations or reports that include visuals and key insights.

● Negotiation and Persuasion


Learn to back your arguments with data and explain the “why” behind them,

Prepare for Interviews and Apply for Jobs


● Understand the Interview Structure
● Technical Round:
○ Expect questions on SQL, Python, statistics, data visualization, and
sometimes case studies.
○ Be prepared to solve coding problems and write SQL queries live.
● Scenario-Based Questions:
○ Be ready to explain how you'd approach real-world problems like
improving sales, reducing churn, or optimizing processes.
● Behavioral Round:
○ These questions assess your soft skills, teamwork, and
problem-solving abilities. Use the STAR method (Situation, Task,
Action, Result) to answer.

● Data analyst interview preparation might be difficult, but with the appropriate
approach, you can improve your chances of success. Here are some pointers for
getting ready for data analyst interviews:

● Review your projects and case studies: Review your portfolio's projects and case
studies, and be prepared to go into detail about each. Prepare to describe the
issue you set out to address, the techniques you employed, the outcomes you
attained, and the significance of your work.

● Practice your technical abilities, including any programming languages you are
knowledgeable with, such SQL, Python, R, and others. During the interview, be
ready to use a whiteboard to solve issues or create code.

● Study the company: Do your homework about the organization and its sector,
and be ready to explain how your qualifications fit with the company's goals and
core principles and how you can help it succeed.

● Be prepared for behavioral questions, such as "Tell me about a time when you
had to deal with a tough team member" or "Describe a circumstance when you
had to make a judgment with minimal evidence." Be prepared to describe your
approach to the circumstance with specific examples.

Strategically Applying for Jobs

● Build an ATS-Friendly Resume


○ Keywords: Add role-specific keywords like Data Cleaning, Data
Visualization, Dashboarding, Reporting, etc.
○ Customization: Tailor your resume for each job application by
emphasizing the skills and projects relevant to the job description.
● Write a Compelling Cover Letter
● Highlight why you’re a great fit for the role and how your skills align with
the company's needs.
● Use specific examples from your projects to demonstrate your value.

c) Use Job Platforms Effectively

● Platforms: LinkedIn, Indeed, Glassdoor, and company websites.


● Leverage LinkedIn:
○ Turn on the “Open to Work” feature.
○ Actively engage with job posts by commenting or liking.
○ Reach out to recruiters with personalized messages.

● Writing Cover Letter:


https://www.youtube.com/watch?v=a0ATCc6ytyw&t=29s

● Avoid Data Analyst Mistakes:


https://www.youtube.com/watch?v=W--TWiZPztU&t=85s

● Finding jobs on linkedin:


https://www.youtube.com/watch?v=NgdtWKtes6A

● Write Resume with no experience:


https://www.youtube.com/watch?v=EXyO1WiVuZw

From my experience and Industry Standards, I have


created an eBook Guide.

What you will find inside the Guide:

1. Crafting a Winning Resume (ATS Friendly)


1.1 What is ATS?
1.2 Common reasons, why the ATS rejects a resume, even if the candidate is
well qualified for the job
1.3 Essential Sections and What to Include
1.4 ATS Friendly Tips and Tricks
1.5 Do’s and Don’ts for resume
1.6 ATS Friendly Resume Template:
1.7 Top 5 websites to build great resume
1.8 Example Experience Section
1.9 Example Project Section

2. Building a Standout Portfolio


2.1 Importance of Portfolio
2.2 Websites for creating a portfolio
2.3 Different Ways to create a portfolio
2.4 What to include in a portfolio?
2.5 Explaining projects in a portfolio
2.6 Sections to include in project descriptions
2.7 Example project descriptions

3. Writing an Effective Cover Letter


3.1 Cover Letter Sections
3.2 Example Cover Letter
3.3 Tips for writing an effective cover letter
3.4 Common mistakes to avoid

4. LinkedIn Optimization
4.1 Importance of Linkedin Profile Optimization
4.2 Sections to complete
4.3 Mistakes to avoid
4.4 Do’s and Don’ts on your Linkedin
4.5 Networking through linkedin
4.6 How to reach out to recruiters on Linkedin? (Message Template)
4.7 How to announce your new job on linkedin?
4.8 How to ask for a referral?
4.9 Polite follow-up message for when you don’t hear back after asking for
referral

5. Interview Preparation
5.1 Steps to take before interview
5.2 Common Interview Questions
5.3 Questions to ask the interviewer
5.4 Different ways you can answer, Why did you leave your last job?
5.5 Tell me about yourself
5.6 How to explain the career gap?
5.7 Job application email template
5.8 How to follow up on a job application?
5.9 Thank you/ Follow up email after the interview
5.10 How to write a job acceptance email?
5.11 How to decline a job offer?
5.12 How to ask about salary before the interview?
5.13 How to respond to a job rejection email?
5.14 How to write a counter offer email?
5.15 How to respond to an offer with a low salary?
5.16 List of Strengths and Weakness for job interview
5.17 Do’s and Don’ts to say in interviews

6. ChatGPT Prompts
6.1 ChatGPT Roles
6.2 ChatGPT Prompts to help in Job Preparation, Customizing Resumes based
on Job Description, Cover Letters, Linkedin Optimization.

Whether you're a recent graduate, career changer, or aiming for that next big
opportunity, this eBook is designed to be your go-to resource. It’s more than just
advice, it’s a step-by-step guide to help you land your dream job. Here's what you'll find
inside:

● Craft a Standout Resume: Learn how to create a resume that grabs attention,
with insights and how to showcase your skills effectively.
● Master the Art of Cover Letters & Portfolios: Understand how to write
impactful cover letters and build portfolios that make you stand out. These skills
are crucial for making a memorable first impression.
● Optimize Your LinkedIn & Network Like a Pro: From optimizing your LinkedIn
profile to mastering the art of networking, you’ll learn how to connect with the
right people and open doors to new opportunities.
● Ace Interviews with Ease: Prepare for every type of interview, whether it's
behavioral, technical, or case-based. Get ready for the toughest questions with
proven techniques and practical examples.
● ChatGPT Prompts for Success: Gain access to specially designed prompts
that help you practice and refine your interview responses, making preparation
easy and efficient.

Why This Guide?


● Comprehensive & Actionable: Every section is packed with actionable steps
that you can start using right away.
● Tailored for Every Stage: Whether you're a beginner or looking to switch
careers, this guide has something for everyone.
● Boost Your Confidence: Equip yourself with the knowledge and skills needed to
approach every stage of the job search with confidence.

Ready to transform your job search and achieve career success? This eBook is the key
to making it happen. Get started now and take the first step toward landing your
dream DATA ANALYST job!

eBook Link: https://topmate.io/ayushi_mishra/842027


Most Effective
Data Analytics Roadmap For A
Top Analytics Job

Created By:
Abhay Bhagat
Data Delivery Analyst @ McKinsey | Ex-Deloitte
4 Things To Focus

Technical Skills Resume & Good


Soft Skills
and Business Linkedin Certifications
Understanding
Timeline - 4-5 Months
(16-20 weeks)
Technical Skills and Business
Understanding
Technical Skills :
SQL
Excel, Maths & Statistics
Python
Power BI
AI
Business Understanding :
Business case studies
First 2 Months
SQL, Excel, Resume and
Linkedin profile
Technical Skill 1 : SQL
Topics to cover
What is a Relational Database / RDBMS?
SQL Data Types - Varchar, text, int, number, date, float, boolean.
SQL commands - select, where, like, distinct, between, group by, having, order by, insert into,
case when, update, truncate, delete, commit, rollback (basically all the DDL, DML, DCL, TCL
commands in SQL).
Integrity Constraints - Primary key, foreign key, not null, unique.
Operators arithmetic, logical, and comparison operations.
Use of distinct, order by, limit, and top.
Use of union and union all.
Joins in SQL inner, left, right, outer, self, full outer, cross join.
Normalization in SQL
Aggregate, date, and string functions
Sub-Queries
CTE table / with clause
In-built SQL functions
Window functions
Views
Technical Skill 2 : Excel, Maths & Statistics
Topics to cover
Understanding structure of an Excel workbook.
Working with numeric and date values
Basic formulas - Sum, count, min, max, average, mean, median
Import/export data in Excel
Insert and delete columns
Common Excel shortcut keys
Advanced functions - Vlookup, index, match, if, countif, sumif, and, or
Sorting data
Handling duplicate and missing values in Excel
Filters and slicers
Formatting data (conditional formatting, table creation, highlighting columns etc.)
Data visualization in Excel (basic charts)
Pivot tables and pivot charts
Basics of Microsoft Copilot (Use of AI in Excel)
Statistics ▪ Mean, Median, Standard Deviation, Normal Distribution, Percentile
Basic Math: Arithmetic, Weighted average, Cumulative sum, Percentile
Resume & Linkedin profile

Create an optimized linkedin profile


ATS-Friendly Resume
Free Resources
How to make Ultimate Resume ?
https://youtu.be/y3R9e2L8I9E?si=qQOq_n4oqSX2MWTV
How to create a Great LinkedIn Profile in 2024 | for College Students
https://youtu.be/lzuiuRgwwrc?si=s287S-ta7Ms4r4rq
Next 2 Months
Python, Power BI, Case
Studies and Soft Skills
Technical Skill 3 : Python
Topics to cover
Variables and Data types
Lists, Tuples, Dictionaries & Sets
Python statements - For, while, if-else, list comprehension
Python functions
Comparison Operators
Objects oriented programming
Pandas Library (Most important data processing library for data analysts) :
Importing and exporting different file types (CSV, excel, etc.)
Dataframes
Data cleaning, processing, indexing, sorting, and filtering in pandas.
Group by, merge, concatenate and join.
Seaborn and Matplotlb libraries (for data visualization)
Technical Skill 4 : Power BI
Topics to cover
Understanding structure of Power BI tool and navigating the interface.
 Importing data from different sources (Excel, csv, SQL server etc.)
 Data types
 Power query editor
 Data preparation: Numeric operations, text operations, dates, edit/filter rows and columns,
filter pane etc.
 Data model fundamentals and relationships
 Data visualization – Line chart, pie chat, scatter plot, cards, slicers, maps, stacked
columnchart etc.
Technical Skills : Free Resources
Topics to cover
SQL :
https://youtu.be/SycDH3NSJUU?si=MxrUPXUbO3UoSXzP
https://youtu.be/hlGoQC332VM?si=OfySq6SVkJ_ggIcV
https://www.youtube.com/live/q_JsgpiuY98?si=JNK_ROXC5z9DCFjN
Excel :
https://youtu.be/OX-iyb-21tk?si=YMY4_63aicTahTCz
https://youtu.be/27dxBp0EgCc?si=BA6ds7EZ9xEM9koK
Business Maths & Statistics :
https://youtu.be/npgbI8KYvN8?si=QAe006kGaUJN4OOU
Power BI :
https://youtu.be/bQ-HTp-tx40?si=rj3b64r28FlZG8L2
https://youtu.be/UXhGRVTndQA?si=uxJ5yl3SEnHzVqT0
Python :
https://youtu.be/ERCMXc8x7mc?si=B1uTsPZ7Tb8Qx0Eq
Common Interview Questions

Free Resources
SQL- https://youtu.be/y3R9e2L8I9E?si=qQOq_n4oqSX2MWTV
Python-https://youtu.be/lEC7nuPtHp0?si=tUKBVM6PgpExsLzi
Excel - https://youtu.be/8rh2RpAjenQ?si=0c0S4ioSktzGDKZL
Power BI - https://youtu.be/aZihsRsEYTE?si=P4MVjnetb-jpzEMl
Google Data Analytics
Certifications Certificate
Link

In analytics companies, IBM Data Analyst


Professional Certificate Link
competition is super high.
So, along with good technical
Microsoft Power BI Data
skills, certifications play a key
Analyst Professional Link
role in helping you stand out. Certification
I have mentioned some great
Meta Data Analyst
certifications here: Professional Certificate
Link

Career Essentials in Data


Link
Analysis by Microsoft
Business Understanding : Business Case
Studies
All the below mentioned business case studies can be done from ThinkSchool youtube
channel’s free playlist
https://youtube.com/playlist?
list=PLGwmAEmjn4fmL_kCTORN4fXOlXvLa8dG&si=5FpEioizD1lRip9e
Mentioning a few good case studies below even though any case study in the playlist can be
referred to practice-

Amul Business Case Study


Zerodha Business Case Study
Coco-cola Business Case Study
IKEA Business Business Case Study
Zomato Business Case Study
Netflix Business Case Study
Indigo Airlines Business Case Study
Soft Skills

Verbal & Written Communication

Free Resources
How to Master Communication Skills? -
https://youtu.be/P4TJ1T3g7mo?si=VKOCgTBWTiMqoqax
10 Tips to Boost your Communication Skills -
https://youtu.be/vULoIGxBYA4?si=GPm0ug-GIyi8IlKG
The 90-day English learning challenge! -
https://youtu.be/3NMXtItuwtU?si=jDPitq4etOKENCPz
AI for data analysts

Free Resources
ChatGPT for Data Analytics - Full Tutorial
https://youtu.be/8qWtU51lxpM?si=Z9QxgG5vmmSNOhl3
Additional helpful videos

How to get a Data Analyst internship/job.


How to get off campus opportunities
Free Resources
https://youtu.be/6QdJGBkMn6s?si=Ln6TLuLywystUPf6
https://youtu.be/rWBvRFwq4as?si=vLhGeDBiQK1K3dLe
https://youtu.be/2aHnI2171l0?si=337oFWvVU3tZMnf5
That’s It!
All The Best!!

To succeed in your mission, you must have single-minded devotion to your goal. - A. P. J. Abdul Kalam
DATA ANALYST ROADMAP
This is the Ultimate RoadMap to become a Data Analyst, one needs to learn the following things. I have added the resource
links of all important things in this PDF.

DO YOU NEED A COLLEGE DEGREE?

With basic understanding of Maths, you can start. Even if you are not doing B Tech, Basic BSC Degree with Maths or some
other equivalent will suffice.

REQUIREMENTS

• Excel
o Basic Formulas: SUM, AVERAGE, MEAN, MEDIAN, SUMPRODUCT, CONCATENATE
o Advanced Formulas: VLOOKUP, INDEX, MATCH, IF, COUNTIF, SUMIF
o Remove Duplicates and Conditional Formatting
o Charts, Filters, Sort, and Slicers
o Pivot Tables and Pivot Charts
o Exclude VBA, Macros, etc.
o This course on Excel is amazing!
o This course on overall MS Office is amazing too!
• Statistics + Maths
o Arithmetic, Weighted average, Cumulative sum, Percentile
o Linear Algebra Notes (Amazing Resource by Queen Mary University of London)
o Learn the basics of Mean, median, mode, dy/dx. This quick video can help you get started.
o Buy a copy of Hines Book (Probability and Statistics in Engineering by William Hines).
o Focus a bit more on Normal Distribution
o Learn basics of Optimization and Gradient Descent. You can watch this series I created long back.
o Get this amazing book on Graphs (Play with Graphs Book – Amit Aggarwal)
• Programming
o If confused choose Python as your first programming language
▪ Python in Hindi – 100 Days of Code by CodeWithHarry
▪ For English Lovers, there is this awesome course on Udemy
• Now once you have a basic understanding of Python, dive in deeper!
o Learn Basics – Start from this free book or buy it on Amazon
o Learn NumPy from here
o Learn Pandas from 10 Minutes of Pandas here
• Data Visualization Tools
o Power BI (Recommended for Microsoft product integration)
o Tableau (Recommended for advanced dashboard capabilities)
• Database – Learn Basic CRUD Operations and depending upon how you are fetching your data, pick from these
technologies.
o MySQL + PhpMyAdmin
o PostgreSQL (Optional/ Depends on your work)
o MS SQL (Optional/ Depends on your work)
• Optional Tools that you can learn depending upon your requirements.
o AWS – Create an account and get started for Free. It will take you a long time to master it
o Learn about cronjobs from this video
o Learn about BeautifulSoup for Web Scraping using Python
o Linux
‭ULTIMATE DATA ANALYST ROADMAP -2025‬

‭ isclaimer - In case you are a fresher, preparing for a Data field interview, or an experienced‬
D
‭person who wants to transition their field then you can follow this Roadmap.‬

‭Definition of Data Analyst‬

‭ Data Analyst in 2025 is a professional who collects, processes, and interprets data to help‬
A
‭organizations make informed decisions. With advancements in AI, automation, and cloud‬
‭computing, the role has evolved beyond traditional reporting to include predictive analytics,‬
‭AI-powered insights, and real-time data storytelling.‬

‭How is a Data Analyst Different in 2025?‬

‭‬ A
● ‭ I-Powered Analytics‬‭– Analysts rely on AI-driven‬‭tools to generate insights faster.‬
‭●‬ ‭Cloud-First Approach‬‭– Data is processed & analyzed‬‭in cloud environments for‬
‭scalability.‬
‭●‬ ‭DataOps & Automation‬‭– Manual tasks are reduced with‬‭automated pipelines.‬
‭●‬ ‭Self-Service Analytics‬‭– Business users can generate‬‭insights using AI chatbots.‬
‭●‬ ‭AI & ML Awareness‬‭– While not Data Scientists, Analysts‬‭use low-code ML tools for‬
‭predictions.‬

‭Tech-stack Required‬

‭ .‬
1 ‭ asic Maths & Statistics‬
B
‭2.‬ ‭Excel‬
‭3.‬ ‭SQL & DBMS Knowledge‬
‭4.‬ ‭Visualisation Tools - Power BI/Tableau‬
‭5.‬ ‭Python & EDA‬
‭6.‬ ‭Cloud knowledge‬
‭7.‬ ‭Basic Machine Learning Algorithms‬
‭1.‬ ‭BASIC MATHS & STATISTICS‬‭(Week-1)‬

‭ ven if you haven’t touched maths in the last 5-6 years, don’t worry you can begin now. It's‬
E
‭not a Rocket-science.‬

‭Topics to cover‬

‭ ‬‭Basic Maths :- Average, Arithmetic, Weighted average,‬‭Cumulative‬



‭Sum, Percentile vs Percentage‬
‭●‬‭Statististics :- Mean, Median, Mode, Standard Deviation,‬‭Normal‬
‭Distribution‬

‭Resources‬

‭ ‬ Introduction | Mathematics and statistics for data science and machine learning

‭●‬ Complete Statistics For Data Science In 6 hours By Krish Naik
‭●‬ Starter Roadmap For Learning Statistics For Data Analyst & Data Science In Hindi ‭-‬
‭For Hindi Speaking People‬
‭●‬‭https://www.khanacademy.org/math/statistics-probability/analyzing-categorical-data‬

‭Goal‬

I‭nitially just focus on conceptual understanding of each term. You should be‬
‭able to differentiate between different terms of stats.‬
‭2.‬ ‭EXCEL‬‭(Week-2)‬

‭Topics to Cover‬

‭ Basic formulas: SUM, DIFF, AVERAGE, MEAN, MEDIAN, CONCATENATE‬



‭● Advance formulas: VLOOKUP, INDEX, MATCH, IF, COUNTIF, SUMIF‬
‭● REMOVE duplicates and conditional formatting‬
‭● Charts, filters, sort and slicers‬
‭● Pivot tables and pivot charts‬
‭● VBA, Macros, etc‬

‭Resources‬

‭‬
● Complete Excel Tutorial for Data Analysis in 4 Hours (with FREE Files)
‭●‬ Pivot Tables in Excel | Excel Tutorials for Beginners

‭ esources to practice‬
R
‭●‬‭https://www.excel-easy.com‬
‭●‬‭https://exceljet.net‬
‭●‬‭https://www.excelpracticeonline.com‬

‭Projects in Excel - Optional‬

‭‬
● Full Project in Excel with Interactive Dashboard | Excel Project | Excel Project f…
‭●‬ Excel Project | Data Analyst Portfolio Project | Finance Domain | Start to End | …

‭Goal‬

‭ earn how to manipulate data using Excel/Google Sheets, including‬


L
‭formulas, pivot tables, and basic data visualization.‬
‭3.‬ ‭SQL‬‭(Week-3,4,5,6)‬

‭Topics to Cover‬

‭● Basic Queries:- CREATE, INSERT, UPDATE, ALTER, DELETE, DROP,‬


‭TRUNCATE.‬
‭● Must Know Topics:- SELECT, WHERE, DISTINCT, LIKE, BETWEEN,‬
‭ORDER BY, LIMIT, GROUP BY, HAVING CLAUSE, IMPORT, DATA TYPES.‬
‭● Advance Queries:- Date time function, Window function, Sub query, Case‬
‭statement, CTE, Query Optimisation‬
‭● JOINS:- Self, Inner, Outer, Left, Right‬

‭Resources‬

‭ ‬‭https://www.w3schools.com/sql/default.asp‬‭- For‬‭Theory‬

‭●‬ SQL for Data Analytics - Learn SQL in 4 Hours
‭●‬ SQL - Complete Course in 3 Hours | SQL One Shot using MySQL

‭Resources to practice‬

‭ ‬‭https://www.hackerrank.com/domains/sql‬

‭●‬‭https://leetcode.com/studyplan/top-sql-50/‬
‭●‬‭https://datalemur.com/questions‬

‭Goal‬

‭ You should be able to visualize tables, joining tables and what you are‬

‭extracting from tables.‬
‭● Tip:- Try making rough tables if stuck with any sql question.‬
‭● You should be able to query basic to intermediate problem statements.‬
‭4.‬ ‭Power BI/ Tableau‬‭(Week-7,8,9)‬

‭Topics to Cover‬

‭ For Power BI -‬

‭https://medium.com/@AnweshaB/18-important-topics-to-cover-in-power-bi-e225c97c1ba1‬
‭● For Tableau -‬
‭https://medium.com/@shravan1998/important-concepts-in-tableau-you-should-know-114075‬
‭a2f4ee‬

‭Resources for Power BI‬

‭‬
● Power BI Full Course for FREE with Practical Projects [3 Hours] | Power BI Tutorial …
‭●‬ Complete Power BI in 10 Hours | PowerBI For Data Analysis (Hindi) #powerbi #data…
‭●‬ Power BI Tutorial For Beginners 2025 | Power BI Dashboard Project | Power BI Tuto…

‭Resources for Tableau‬

‭‬
● Tableau Full Course - in 3 Hours | Become a Data Visualization Rockstar | Beginner …
‭●‬ Tableau Full Course with Project – Master Data Visualization in 3 Hours (Beginner L…

‭Goal‬

‭Create advanced visualizations and interactive dashboards.‬


‭5.‬ ‭Python & EDA‬ ‭(Week-10,11,12,13)‬

‭Topics to Cover‬

‭ Introduction to Python:‬

‭- Variables, data types, and basic operations.‬
‭- Control structures (if statements, loops).‬
‭- Functions and modules.‬
‭- Working with Data Structures (List, Tuples, Dictionaries, Sets)‬
‭● NumPy:‬
‭- Array creation and manipulation.‬
‭- Mathematical operations on arrays.‬
‭- Indexing and slicing.‬
‭● Pandas:‬
‭- Series and DataFrame basics.‬
‭- Data cleaning and manipulation.‬
‭- Grouping and aggregation.‬
‭● Matplotlib:‬
‭- Creating Basic Plots‬
‭- Working with Figures & Axes‬
‭- Subplots & Multiple Plots‬
‭● Seaborn:‬
‭- Basic Plots in Seaborn‬
‭- Statistical Data Visualization‬
‭- Working with Categorical Data‬
‭● Data Cleaning and Preprocessing:‬
‭- Handling missing data.‬
‭- Removing duplicates.‬
‭- Data normalization and scaling.‬
‭● EDA (Exploratory Data Analysis)‬

‭Resources‬

‭ ‬ Python Tutorial For Beginners in Hindi | Complete Python Course 🔥



‭●‬ Python Tutorial for Beginners - Full Course (with Notes & Practice Questions)
‭●‬ Python Pandas Tutorial 2: Dataframe Basics
‭●‬ numpy tutorial - introduction | numpy array vs python list
‭●‬ Matplotlib Tutorial 1 - Introduction and Installation
‭●‬ Python SEABORN Tutorial [HINDI] | Learn Seaborn in 3 Hours - Complete Course
‭●‬‭https://courses.analyticsvidhya.com/courses/pandas-for-data-analysis-in-python‬
‭●‬‭https://www.w3schools.com/python/default.asp‬‭- For‬‭Theory‬
‭●‬ Learn Exploratory Data Analysis (EDA) from Scratch | EDA in 5 hours | Satyajit Patt…

‭ esources to Practice‬
R
‭●‬‭https://pynative.com/python-exercises-with-solutions/‬
‭●‬‭https://www.hackerrank.com/domains/python‬
‭●‬‭Leetcode Weekly Contest‬

‭ omplete at least 3-4 case study from below playlists‬


C
‭●‬‭https://youtube.com/playlist?list=PL_1pt6K-CLoDMEbYy2PcZuITWEjqMfyoA‬

‭ ython with EDA Project (Optional)‬


P
‭●‬ Python Project for Data Analysis- Exploratory Data Analysis | Data Analyst Project
‭●‬ New York Airbnb EDA Project with Python | Data Analytics Python Resume Project | …
‭●‬ Python Project For Data Analysis- Exploratory Data Analysis (EDA) End-to-End Proj…
‭6.‬ ‭Cloud Knowledge‬ ‭(Week-14,15)‬

‭Topics to Cover‬

‭ Cloud Storage & Data Warehousing‬



‭● Data Processing & ETL (Extract, Transform, Load)‬
‭● Business Intelligence & Data Visualization‬
‭- Power BI (Azure), Tableau (AWS/GCP), Google Looker Studio‬
‭- Connecting BI tools to cloud databases‬
‭● Cloud Security & Access Management‬

‭ esources‬
R
‭●‬ Azure Full Course - Learn Microsoft Azure in 8 Hours | Azure Tutorial For Beginners …
‭●‬ ‭https://www.youtube.com/live/m6ozQnqit50?si=WrEmpcaNUz1QR7F8‬
‭●‬ AWS Tutorial For Beginners | AWS Full Course - Learn AWS In 10 Hours | AWS Trai…

🔗
‭ ‬‭Free Courses & Guides:‬
‭●‬‭https://cloud.google.com/learn/certification/cloud-digital-leader‬
‭●‬‭https://aws.amazon.com/training/classroom/aws-cloud-practitioner-essentials/‬
‭●‬‭Microsoft Learn - Azure Fundamentals‬
‭7.‬ ‭Machine Learning Basics‬‭(Week-16,17)‬

‭ opics to Cover‬
T
‭● Supervised Learning (Regression & Classification)‬
‭● Unsupervised Learning (Clustering & Dimensionality Reduction)‬
‭● Feature Engineering & Data Preprocessing‬
‭● Model Evaluation & Performance Metrics‬
‭● Time Series Analysis (For Forecasting)‬

‭ esources‬
R
‭●‬ Complete Machine Learning In 6 Hours| Krish Naik
‭●‬ Python Machine Learning Tutorial (Data Science)
‭●‬ Complete ML Machine Learning in One Shot (5 Hours) | Semester Exam | In Hindi

‭Goal‬

‭ s a Data Analyst, focus on practical ML techniques like Regression, Classification,‬


A
‭Clustering, and Time Series Analysis, along with strong feature engineering and model‬
‭evaluation skills.‬
‭Must-Do 20 Data Analyst Interview Questions (2025 Edition)‬

I‭f you're preparing for a Data Analyst interview, these are the most important questions‬
‭across SQL, Excel, Python, Statistics, and Business Intelligence.‬

‭SQL Interview Questions‬

‭ . What is the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN?‬
1
‭2. How do you find duplicate records in a table? (Write an SQL query)‬
‭3. How do you rank rows without using the ROW_NUMBER() function?‬
‭4. What is the difference between WHERE and HAVING clauses in SQL?‬
‭5. Write an SQL query to calculate the running total of a sales column.‬

‭Python for Data Analysis‬

‭ . How do you handle missing values in a dataset using Python?‬


6
‭7. What is the difference between apply(), map(), and lambda functions in Pandas?‬
‭8. How do you merge two datasets in Pandas? (Explain different types of joins)‬
‭9. Explain the difference between a list, tuple, and dictionary in Python.‬
‭10. How do you group data in Pandas and perform aggregate functions?‬

‭Statistics & Probability‬

‭ 1. What is the difference between correlation and causation?‬


1
‭12. What is p-value, and how is it used in hypothesis testing?‬
‭13. What is standard deviation, and why is it important in data analysis?‬
‭14. Explain the Central Limit Theorem (CLT) and its significance.‬
‭15. What is A/B testing, and how do you determine if a test is successful?‬

‭Excel & Data Visualization‬

‭ 6. What is a Pivot Table, and how is it used in data analysis?‬


1
‭17. How do you create a dynamic dashboard in Excel using Power Query and Pivot Tables?‬
‭18. What are different types of charts in data visualization, and when should you use them?‬
‭19. How do you remove duplicates and clean data in Excel?‬

‭Business Intelligence & Problem-Solving‬

‭ 0. You are given a dataset with missing values, duplicates, and inconsistent formats. How‬
2
‭would you clean and prepare it for analysis?‬

‭Next Steps‬

‭ Practice SQL Queries on real datasets (LeetCode SQL)‬



‭● Work on Data Projects (e.g., customer churn, sales forecasting)‬
‭● Revise Statistics Concepts (Khan Academy, StatQuest YouTube)‬
‭● Practice with Real-World Datasets (Kaggle Datasets)‬
‭Important Note‬

‭ nce you are done with 60-70% of your syllabus, prepare your resume & start‬
O
‭applying along-side.‬

‭ALL THE VERY BEST FOR YOUR PREPARATION!!!‬

‭Follow on Youtube -‬‭Noodle Brain‬

‭Follow on Instagram -‬‭Noodle Brain - Instagram‬

‭Follow on Linkedin -‬‭https://www.linkedin.com/in/mansi-r-6a4115169/‬

‭Topmate -‬‭Book 1:1 session with me‬


DATA ANALYST ROADMAP
What is Data Analytics?
Data Analytics is about examining data to find useful information. It helps businesses
make smart decisions, improve their operations, and discover new opportunities by
cleaning, transforming, and modeling data.

What Does a Data Analyst Do?


A Data Analyst collects, processes, and analyzes data to find trends and insights. They
help organizations make data-driven decisions.

Steps in Data Analysis:


Define the Objective:

● Understand the business problem and set clear goals for what you want to achieve
with the analysis.

Data Collection:

● Identify where to get the data from and collect data from the identified sources.

Data Cleaning and Preprocessing:

● Remove duplicates, fix errors, and handle missing data and transform the data into
a usable format.

Exploratory Data Analysis (EDA):

● Look at the data to find patterns and trends and use summaries and visualizations
to understand the data better.

Data Modeling:

● Apply statistical & basic machine learning (Optional) models, aggregation to


analyze the data and validate the models to ensure they meet the objectives.

Data Visualization:
● Create visual representations like charts and graphs using tools like Excel,
Tableau, or Power BI.

Reporting and Interpretation:

● Summarize the results and Provide insights and recommendations based on the
analysis.

Communicating Results:

● Present the findings to stakeholders in a clear and understandable way and use
simple storytelling techniques to make the data insights relatable.

Let’s Start with our Roadmap !!

Syllabus:

- Statistics & Mathematics


- SQL
- MS Excel
- Python
- Power BI / Tableau
- Projects
- Pro Tips

1. Maths & Statistics (𝐖𝐞𝐞𝐤 𝟏 ):


Statistics & Maths Syllabus:

▪ Basic Statistics: Mean, Median, Mode, Standard deviation, Normal distribution,


Measure of dispersion with Variance And SD, Percentiles and Quartiles, Probability

▪ Basic Math: Arithmetic, Weighted average, Cumulative sum


Resources

NOTE: watch only above mentioned topics from any one of the below mentioned
youtube video:

https://www.youtube.com/watch?v=LZzq1zSL1bs

https://www.simplilearn.com/tutorials/statistics-tutorial

Web Resource:

https://news.lunartech.ai/fundamentals-of-statistics-for-data-scientists-and-data-analysts-
69d93a05aae7

2. SQL (𝐖𝐞𝐞𝐤 2 to 5):


SQL Syllabus:

- CREATE, INSERT, UPDATE, ALTER, DELETE, DROP, TRUNCATE &


DATA TYPES in SQL (WEEK 2)
- SELECT, DISTINCT, WHERE, LIKE, ORDER BY, LIMIT, TOP, AND,
OR, NOT, IN, BETWEEN (WEEK 2)

(After Completing above topics from below mentioned resources, Start


practicing Easy level questions on Hackerrank. (links of practice websites
are also mentioned below))

- SUM, MAX, MIN, COUNT, AVG , GROUP BY, HAVING (WEEK 3)


- JOINS - INNER JOIN, RIGHT JOIN, LEFT JOIN, OUTER JOIN & SELF
JOIN (WEEK 3)

(After Completing above topics from below mentioned resources, Start


practicing Medium level questions on Hackerrank, Leetcode, DataLemur &
StrataScratch. (links of practice websites are also mentioned below))

- EXISTS, UNION, UNION ALL, DATE TIME FUNCTIONS, CTE,


SUBQUERIES (WEEK 4)
- CASE WHEN, WINDOW FUNCTIONS (ROW_NUMBER, RANK,
DENSE_RANK, LEAD, LAG, NTILE, FIRST_VALUE, LAST VALUE)
(WEEK 4)

- AGGREGATE FUNCTIONS AS WINDOW FUNCTIONS (WEEK 4)

(After Completing above topics from below mentioned resources, Start


practicing Medium level to Hard level questions on Hackerrank, Leetcode,
DataLemur & StrataScratch. (links of practice websites are also mentioned
below))

(WEEK 5) - Put your SQL knowledge to the test on DataLemur , Hackerrank,


Leetcode & StrataScratch by practicing the real SQL interview questions asked by
companies like Facebook & Google. Use Below mentioned Websites for Practice to
practice SQL questions.

SQL Project Link (It is Optional, you can do it for learning):


https://www.youtube.com/watch?v=SAWiIV12sU4

RESOURCES:

Websites:

1. https://www.w3schools.com/sql/
2. https://sqlbolt.com/

Youtube Playlist:

https://youtube.com/playlist?list=PLavw5C92dz9Ef4E-1Zi9KfCTXS_IN8gXZ&si=XCw
pStf9zZ0YISN8

This above playlist contains the complete tutorial video of SQL with all the required
topics in English.
And if you want to learn in Hindi, then you can follow this below playlist:

https://youtube.com/playlist?list=PLdOKnrf8EcP17p05q13WXbHO5Z_JfXNpw&si=8m
4E9IGf-2MR9ZKA
Websites for Practice:

https://datalemur.com/questions?category=SQL

https://leetcode.com/problemset/database/

https://leetcode.com/studyplan/top-sql-50/

https://www.hackerrank.com/domains/sql

https://platform.stratascratch.com/coding?code_type=3

NOTE: Learning by doing is the key to mastering anything, especially for interviews !!
So, please focus more on practicing while learning.

NOTE: While learning SQL, create a professional LinkedIn account if you don't have one
already, and start sharing your learning experiences there on a daily basis. Try to build
relevant connections in the data analytics industry and aim to reach at least 2,000+
connections.

3. MS Excel (𝐖𝐞𝐞𝐤 6 to 7):


Excel Syllabus:

​ Data Management & Cleaning (Week 6)



​ - Removing Duplicates, Text to Columns, Data Validation, Flash Fill


Formula Mastery (Week 6)

- SUM, COUNT, AVERAGE, SUMIFS, COUNTIFS, AVERAGEIFS, VLOOKUP,


HLOOKUP, XLOOKUP, INDEX, MATCH, INDEX & MATCH, IF, IFERROR,
AND, OR, NOT, Nested Functions, ARRAY Formulas, LET, SUMPRODUCT,
INDIRECT, CHOOSE, OFFSET, LEFT, RIGHT

​ Data Analysis & Reporting (Week 6)

- Pivot Tables & Pivot Charts, Data Sorting and Filtering, Subtotals, Data Tables,
Scenarios (What-If Analysis), Goal Seek and Solver

Visualization Expertise (Week 7)

- Conditional Formatting, Basic to Advanced Charting, Creating Dynamic


Dashboards

​ Efficiency Enhancers (Week 7)

- Keyboard Shortcuts (You can get it from ChatGPT), Data Consolidation


Techniques, Error Checking

Advanced Excel Capabilities (Week 7)

- Advanced Filter, Slicers and Timelines in Pivot TableS

Start learning Excel with the YouTube playlist provided below -

https://www.youtube.com/playlist?list=PLUaB-1hjhk8Hyd5NiPQ9CND82vNodlFF5

NOTE: If you don't find a specific topic from the syllabus in the playlist above, you can
use any YouTube video or web article to understand the concept of that topic.

Websites for Practicing Excel:

1. https://www.excel-easy.com/

2. https://exceljet.net/

3. https://www.excelpracticeonline.com/

And, then Complete below project in Excel -


https://www.youtube.com/watch?v=m13o5aqeCbM

https://www.youtube.com/watch?v=opJgMj1IUrc

NOTE: By now, you have already completed 50% of the Data Analytics syllabus. After
this, you can start leveraging LinkedIn to ask for referrals and apply to relevant jobs.
Simultaneously, use Naukri.com for job applications. If you want to learn how to
effectively use these portals, you can watch my YouTube video linked below.

Youtube video Link: https://youtu.be/KfVkKtncLYE?si=_9rUbHAx7KIn32oi

NOTE: Additionally, you should create an ATS-friendly resume for job applications. If
you want to learn how to create an ATS-friendly resume, you can watch my YouTube
video linked below.

Youtube video Link: https://youtu.be/IIGWpw1FXhk?si=u1jvQj6JAnI34_z3

4. Python (𝐖𝐞𝐞𝐤 8 to 10):


Python Programming Syllabus (Week 8) :

- Understanding syntax, variables, and data types like integers, floats, strings,
booleans

- Control structures: if-else, Loops (for, while)

- Core data structures: lists, dictionaries, sets, tuples

- Functions , Error handling, lambda functions & try-except

- Using modules and packages

- OOP (Object Oriented Programming) : This you can learn in optional

First, complete the above Python programming Basics using the YouTube video
mentioned below.
- https://www.youtube.com/watch?v=kqtD5dpn9C8&t=1786s

- https://www.w3schools.com/python/default.asp
(Alternatively, use this above website to learn Python and become familiar with
Python syntax by doing basic hands-on exercises.)

Then, try to solve the top 30 Python coding questions below in your system environment
to gain hands-on experience and start with Python programming -

- https://www.analyticsvidhya.com/blog/2024/05/python-coding-interview-question
s-for-beginners/

Then start practicing Python from the websites mentioned below. Focus only on
solving basic to medium-level questions from the topics mentioned above. Avoid DSA
programming questions (Week 9):

- https://www.hackerrank.com/domains/python

- https://leetcode.com/problemset/

Python Data Analysis Libraries Syllabus (Week 9):

Pandas: What is Pandas?, Installing Pandas, Importing Pandas, Pandas Data


Structures (Series, DataFrame, Index)

Working with DataFrames: Creating DataFrames, Accessing Data in


DataFrames, Filtering and Selecting Data, Adding and Removing Columns,
Merging and Joining DataFrames, Grouping and Aggregating Data, Pivot Tables

Data Cleaning and Preparation: Handling Missing Values, Handling Duplicates,


Data Formatting, Data Transformation, Data Normalization

Data Visualization with Pandas: Line Plots, Bar Plots, Scatter Plots, Histograms,
Box Plots, Heatmaps

File Handling in Python: Reading and Writing Text Files, Reading and Writing
Binary Files, Working with CSV Files, Working with JSON Files
Numpy: What is NumPy?, Installing NumPy, Importing NumPy, NumPy Arrays

NumPy Array Operations: Creating Arrays, Accessing Array Elements, Slicing


and Indexing, Reshaping Arrays, Combining Arrays, Splitting Arrays, Arithmetic
Operations, Broadcasting, Mathematical Functions, Statistical Functions, Linear
Algebra Operations

Working with Data in NumPy: Reading and Writing Data with NumPy, Filtering
and Sorting Data, Data Manipulation with NumPy, Window Functions

NumPy with Other Libraries: Matplotlib, Pandas

Complete below course of python data analysis using pandas, numpy, matplotlib
(optional) and seaborn (optional) (Week 9):

- https://www.youtube.com/watch?v=r-uOLxNrNk8&t=683s

Complete at least 3-4 case study from below playlists (Week 10)

- https://youtube.com/playlist?list=PL_1pt6K-CLoDMEbYy2PcZuITWEjqMfyoA

Python Project (Optional) - (Week 10)

- https://www.youtube.com/watch?v=iwUli5gIcU0

NOTE: As a beginner, choose either Power BI or Tableau. After getting into a job, you
can switch tools as needed.

5. Power BI (𝐖𝐞𝐞𝐤 11 to 12)

Tutorial Playlist (Week - 11):


https://youtube.com/playlist?list=PLmejDGrsgFyDMME3o2CamamZ8w9NxSWWo&si=
w5hJyBq35bXOMt-v

End to End Dashboarding Project for understanding (Week - 12):

https://www.youtube.com/watch?v=mmxVCFceQgU

https://www.youtube.com/watch?v=pixlHHe_lNQ&list=PLUaB-1hjhk8H48Pj32z4GZgG
Wyylqv85f&index=11

NOTE: After completing this, if you have more time, you can work on as many
projects as you like from youtube.

5. Tableau (𝐖𝐞𝐞𝐤 11 to 12)

Tutorial Video (Week - 11):

https://www.youtube.com/watch?v=K3pXnbniUcM

End to End Dashboarding Projects for understanding (Week - 12):

https://www.youtube.com/watch?v=dahrmqT5GD4&t=4366s

https://www.youtube.com/watch?v=oAIubTqg-Kw (Part - 1)

https://www.youtube.com/watch?v=oTyCZVnNVZA (Part - 2)

NOTE: After completing this, if you have more time, you can work on as many
projects as you like from youtube.

6. Projects (𝐖𝐞𝐞𝐤 13 to 14):


NOTE: The projects mentioned below are end-to-end guided projects. After completing
them, you can download any dataset from Kaggle and start experimenting on your own.

Power BI Dashboarding Projects :-

1. https://youtube.com/playlist?list=PLeo1K3hjS3uva8pk1FI3iK9kCOKQdz1I9&si=
9AbP-H2sbnIiDTQO
2. https://www.youtube.com/watch?v=tT4V7zguCnc&list=PLeo1K3hjS3utcb9nKtan
hcn8jd2E0Hp9b&index=27
3. https://www.youtube.com/watch?v=JC66t9eM10s&list=PLeo1K3hjS3utcb9nKtan
hcn8jd2E0Hp9b&index=25

Project Using Web Scraping, Python, Pandas and Power BI:-

1. https://www.youtube.com/watch?v=4QkYy1wANXA&list=PLeo1K3hjS3utcb9nK
tanhcn8jd2E0Hp9b

Project using SQL & Power BI:-

1. https://www.youtube.com/watch?v=V-s8c6jMRN0

Tableau Dashboarding Projects:

1. https://youtube.com/playlist?list=PLeo1K3hjS3usDI9XeUgjNZs6VnE0meBrL&si
=Tq1iZ-sTeMuxFUOI
2. https://www.youtube.com/watch?v=UcGF09Awm4Y

End to End Data Analytics Project (Python + SQL)

1. https://www.youtube.com/watch?v=uL0-6kfiH3g

7. Pro Tips - Soft Skills in Data Analytics (Week 15):-


In data analytics, soft skills are as crucial as technical skills. They enable data analysts to
bridge the gap between raw data and actionable insights, making them indispensable in
decision-making processes.

1. Communication Skills:

- Why: Effectively conveying complex data insights to non-technical


stakeholders.

- How to Improve: Write concise reports, engage in public speaking, and


participate in group discussions.

2. Analytical Thinking:

- Why: To view data from multiple perspectives and draw meaningful


conclusions.

- How to Improve: Practice critical thinking exercises and problem-solving


scenarios.

3. Problem-Solving Skills:

- Why: To navigate ambiguous challenges and find innovative solutions.

- How to Improve: Tackle real-world data challenges and collaborate on projects.

4. Storytelling with Data:

- Why: To transform data into compelling narratives that drive action.

- How to Improve: Create data visualizations that tell a story, and practice
presenting insights as narratives.

5. Business Understanding:

- Why: To align data insights with business goals and strategies.

- How to Improve: Stay updated with industry trends, and read business case
studies.

Resources to Develop Soft Skills


- Blogs and Articles: Stay updated with platforms like Towards Data Science and
LinkedIn Learning to enhance communication and business understanding.

- Podcasts and YouTube: Watch interviews and industry projects to see soft skills
in action.

- Social Media Sharing: Share your learnings on LinkedIn to refine your


communication and storytelling abilities.

Mastering these soft skills can significantly enhance your effectiveness as a data
analyst, making you a more versatile and valuable asset to any organization.

ALL THE BEST !! FOR YOUR JOB SEARCH.

Connect With Me:


YouTube:

https://youtube.com/@shakrashamim?si=ucGSJ3mkKv8Lk7MQ

Instagram:

https://www.instagram.com/shakra.shamim/?igshid=OTJlNzQ0NWM%3D

LinkedIn:

https://in.linkedin.com/in/shakra-shamim-8ab3a1233

Telegram:

t.me/Data_geeks_by_Shakra_Shamim
DATA ANALYST ROADMAP - 2023

Start your career to data science in just 3 months, this roadmap will help you to
learn data science skills from scratch. This is a structured and detailed roadmap
which includes:
tech skills + resources, projects, soft skills + resources, resume template + tips
and interview preparation guide

➢ Role of a Data Analyst

A data analyst collects, cleans, and interprets data sets in order to answer a question
or solve a problem

Six steps of data analysis:


1. Business Question: Define what problem you want to solve
2. Get Data: Collect the data required for analysis
3. Explore Data: Explore data with visual exploration to understand what is in a
dataset
4. Prepare Data: Data cleaning, calculated fields and data validation
5. Analyze Data: Use data analysis techniques to understand, interpret, and
derive conclusions based on the requirements
6. Present Findings: Share insights with stakeholders

< In just 3 months you can START your career to data science >

Let’s Start!
Week – 1, 2

1. Excel

o Topics
▪ Basic formulas: SUM, AVERAGE, MEAN, MEDIAN, SUMPRODUCT, CONCATENATE
▪ Advance formulas: VLOOKUP, INDEX, MATCH, IF, COUNTIF, SUMIF
▪ Remove duplicates and conditional formatting
▪ Charts, filters, sort and slicers
▪ Pivot tables and pivot charts
▪ Ignore VBA, Macros, etc

o Resources
▪ Complete Excel Tutorial in one video: https://www.youtube.com/watch?v=OX-iyb-21tk
▪ Excel for beginner’s playlist:
https://www.youtube.com/playlist?list=PLdOKnrf8EcP1Y1XRUVSUc0g-WfeiBigbd

▪ Google Template Gallery: https://docs.google.com/spreadsheets/u/0/?ftv=1


▪ Full Excel Project: https://www.youtube.com/watch?v=gTK5rNhWJyA

2. Math and Statistics

o Topics
▪ Basic Math: Arithmetic, Weighted average, Cumulative sum, Percentile
▪ Basic Statistics: Mean, Median, Mode, Standard deviation, Normal distribution

o Resources
▪ Note: just learn basics, initially don’t go for phd
▪ Complete Statistics For Data Science In 6 hours By Krish Naik (watch only above mentioned
topics): https://www.youtube.com/watch?v=LZzq1zSL1bs
▪ Statistics Tutorial for Beginners by Simplilearn:
https://www.simplilearn.com/tutorials/statistics-tutorial

Rishabh Mishra
NEXT STEPs
o Create a professional LinkedIn profile
▪ Add professional photo, headline, summary and educational details
o Make full Excel project and add to your resume/LinkedIn profile
o Optional: create a GitHub account

Week – 3, 4, 5

3. SQL

o Topics
▪ Basic Queries: SELECT, WHERE, DISTINCT, LIKE, BETWEEN, ORDER BY, LIMIT, GROUP BY,
HAVING CLAUSE, INSERT, UPDATE, ALTER, IMPORT, Data types
▪ Advance Queries: Date time function, Window function, Sub query, Case statement, CTE,
query optimisation
▪ JOINS: Inner, Outer, Left, Right

o Resources to learn SQL


▪ SQL tutorial in one video: https://www.youtube.com/watch?v=On9eSN3F8w0
▪ SQL for beginner’s playlist:
https://www.youtube.com/playlist?list=PLdOKnrf8EcP17p05q13WXbHO5Z_JfXNpw
▪ W3schools website to learn SQL: https://www.w3schools.com/sql/
▪ SQL Data Analysis Portfolio Project: https://www.youtube.com/watch?v=VFIuIjswMKM
▪ SQL interview question’s playlist:
https://www.youtube.com/playlist?list=PLdOKnrf8EcP1y_LPEv7uBpzoRmlATjCVr

Rishabh Mishra
o Resources to practice SQL
▪ W3schools: https://www.w3schools.com/sql/
▪ Hacker rank sql: https://www.hackerrank.com/domains/sql
▪ 8-week sql challenge- case study: https://8weeksqlchallenge.com/
▪ Data lemur: https://datalemur.com/
▪ Leetcode: https://leetcode.com/problemset/database/

NEXT STEP
o Make full SQL projects and add them to your resume/ LinkedIn/ GitHub profile
o Connect with people on LinkedIn, who are working in data science industry

Week – 6, 7

4. BI Tools (Power BI or Tableau)

Note: If you are a beginner, my personal suggestion will be to learn Power BI instead of Tableau- as its high
in demand and feels similar to MS Excel

Power BI
o Resources to learn Power BI
▪ Power Bi Tutorial + Project Beginners: https://www.youtube.com/watch?v=6cV3OwFrOkk
▪ Power BI Roadmap: https://www.youtube.com/watch?v=ZBCAR5Rs7wk
▪ Avi Singh YouTube channel: https://www.youtube.com/watch?v=AGrl-H87pRU
▪ Power BI Full Course by edureka: https://www.youtube.com/watch?v=3u7MQz1EyPY
▪ Power BI Udemy course: https://www.udemy.com/course/powerbi-complete-introduction/

Rishabh Mishra
o Power BI Projects
▪ Note: Below are just examples, do a search on Google and YouTube for more projects-
thoda khud bhi mehnat karo
▪ Power BI dashboard project End-to-end: https://www.youtube.com/watch?v=j4xlVLgsmNQ
▪ Power BI dashboard by End-to-End: https://www.youtube.com/watch?v=et8tAUTwcvY
▪ Power BI report by Data With Decision: https://www.youtube.com/watch?v=0BKlUySopU4
▪ End to End Project by KSR datavizon: https://www.youtube.com/watch?v=aXNhtcQ4nEU

Tableau
o Resources to learn Tableau
▪ Tableau Basics For Beginners In Hindi by Great Learning:
https://www.youtube.com/watch?v=6RZEaEH9ZsQ
▪ Tableau by Edureka: https://www.youtube.com/watch?v=aHaOIvR00So
▪ Tableau Tutorial by Simplilearn: https://www.youtube.com/watch?v=fO7g0pnWaRA
▪ Tableau Udemy course: https://www.udemy.com/course/tableau10/

o Tableau Projects
▪ Note: Below are just examples, do a search on Google/YouTube for more projects- thoda
khud bhi mehnat karo
▪ Practice With Examples by Simplilearn: https://www.youtube.com/watch?v=5uzB4z4iN0g
▪ Sales insights by codebasics:
https://www.youtube.com/watch?v=CCNd2fUfFkk&list=PLeo1K3hjS3usDI9XeUgjNZs6VnE0
meBrL
▪ Customer Analysis n Dashboard by Stanley:
https://www.youtube.com/watch?v=_qReGTOrKTk

Rishabh Mishra
Week – 8, 9, 10

5. Programming- Python

Note: If you are a beginner, my personal suggestion will be to learn Python instead of R- as its high in
demand and beginner friendly. Also, it will help to solve Machine Learning problems

As a beginner learn programming language to an intermediate level, don’t waste time to master it

o Topics in Python:
▪ Variables, Data types, Lists, Tuples, Dictionaries, Sets, Conditional expressions, Modules,
Functions, Operators, if statements, Loops, classes and objects
▪ Python libraries: Pandas and Matplotlib
▪ Pandas: read/write csv, excel and JSON files, work with dataframe, data manipulation and
analysis- Group by, Concatenate, Merge
▪ Matplotlib: creating static, animated, and interactive visualizations in Python

o Resources to learn Python


▪ Python Roadmap: https://www.youtube.com/watch?v=AyMBPxxQRtA
▪ Python Tutorial For Beginners by CodeWithHarry:
https://www.youtube.com/watch?v=gfDE2a7MKjA
▪ Python Course by Udemy: https://www.udemy.com/course/complete-python-bootcamp/
▪ W3schools website: https://www.w3schools.com/python/
▪ Python for data science by freecodecamp: https://www.youtube.com/watch?v=LHBE6Q9XlzI
▪ Python Tutorial by Mosh: https://www.youtube.com/watch?v=_uQrJ0TkZlc

o Resources to learn Pandas and Matplotlib


▪ Python for data science course by Udemy: https://www.udemy.com/course/python-for-
data-science-and-machine-learning-bootcamp/
▪ Pandas Tutorial by CodeWithHarry: https://www.youtube.com/watch?v=RhEjmHeDNoA
▪ Pandas Tutorials by Krish Naik: https://www.youtube.com/watch?v=BN0nnnadFl0
▪ Matplotlib Tutorial: https://www.youtube.com/watch?v=VFsRLjSc8GA
▪ Project- Cricket Data Analytics by Codebasics:
https://www.youtube.com/watch?v=4QkYy1wANXA
▪ Project- Covid Analysis by Simplilearn: https://www.youtube.com/watch?v=G9NmACvXh8w

Rishabh Mishra
▪ Solving real world data science tasks with Python Pandas by Keith Galli:
https://www.youtube.com/watch?v=eMOA1pPVUc4
▪ Kaggle free dataset: https://www.kaggle.com/datasets

NEXT STEPs
o Make a full project end-to-end using Python, above are just for example you can search more
projects on GitHub, Kaggle, YT, Google

Week – 11 & 12

6. Soft Skills
I. Communication skill
II. Analytical skill
III. Problem solving skill
IV. Story telling
V. Business understanding

o Resources to work on soft skills


▪ Read latest blogs on data science news, helps you keep updated and improve
communication, business understand and storytelling skill
▪ Example: towards data science website, news sites, LinkedIn learning, etc
▪ Watch podcasts, interviews, industry level projects on YouTube
▪ Share your learnings on LinkedIn or other social media platforms

Rishabh Mishra
7. Resume & Interview Prep

o Resume preparation
▪ Note: There is nothing called PERFECT resume, so keep learning and updating!
▪ Prepare one page resume and use professional template
▪ Based on above learnings and projects update your resume
▪ Also, if you have done any courses/certificates do add them as well
▪ Tailor your resume based on the role/company you’re applying
▪ Free resume template sites:
✓ novo resume: https://novoresume.com/resume-templates
✓ resume io: https://resume.io/resume-templates
✓ canva resume: https://www.canva.com/resumes/templates/

o Interview preparation
▪ Note: Once you have completed all the above steps, just start applying for related jobs.
Giving interview is also a part of your learning
▪ Be thorough with your resume, even with minute details
▪ Again, watch podcasts and interview experience shared on YouTube
▪ Read interview questions available on sites like:
✓ Glassdoor
✓ LinkedIn

Rishabh Mishra
Let’s connect now

YouTube: https://www.youtube.com/@RishabhMishraOfficial

Instagram: https://www.instagram.com/rishabhnmishra/

LinkedIn: https://www.linkedin.com/in/rishabhnmishra/

Twitter: https://twitter.com/rishabhnmishra

Rishabh Mishra

You might also like