Data Analytics Portfolio Project
ANALYTICS PORTFOLIO
BY ERIC LOBO
PROFESSIONAL BACKGROUND
HEY THERE, MY NAME IS ERIC LOBO. I RECENTLY GRADUATED WITH A
BACHELOR OF COMMERCE WITH A CGPA OF 7.30, AND I AM NOW PURSUING A
PROFESSIONAL COURSE, CERTIFIED MANAGEMENT ACCOUNTANT, CMA (USA).
I AM SEMI-QUALIFIED: I HAVE PASSED PART 1 WITH A SCORE OF 370, AND
PART 2 IS IN PROGRESS.
I AM A FRESHER, AND I WANTED TO LEARN DATA ANALYTICS SO THAT I CAN
UPGRADE MY SKILLS AT AN EARLY STAGE IN THE NEW TECHNOLOGICAL
ENVIRONMENT.
BELOW I PRESENT ALL OF THE PROJECTS THAT I HAVE COMPLETED
SUCCESSFULLY UNDER TRAINITY.
TABLE OF CONTENTS
1. DATA ANALYTICS PROCESS
2. INSTAGRAM USER ANALYTICS
3. OPERATION & METRICS ANALYTICS
4. HIRING PROCESS ANALYTICS
5. IMDB PROCESS ANALYTICS
6. BANK LOAN CASE STUDY
7. IMPACT OF CAR FEATURES ON PRICE AND PROFITABILITY
8. ABC CALL VOLUME TRENDS
9. CONCLUSION
PROJECT 1:- DATA ANALYTICS PROCESS
BY ERIC LOBO
THE 6 STEPS OF THE DATA ANALYTICS PROCESS
EXAMPLE 1:- CRICKET, E.G. DREAM 11
• PLAN: Before going to Croma, we plan and decide what we need: a phone, a laptop, a TV, etc.
• PREPARE: After deciding (for example, I decided to buy a phone), I would collect all the data
about which type of phone is well suited, which brand, which specs, etc.
• PROCESS: After all the data has been collected, I need to work out which phones are in
our budget, which phones have high specs, and which brands are reliable.
• ANALYZE: Now it is time to choose which phone is best suited for us. If I want a phone for
daily use, I will not need a very expensive phone, but if I need a phone for
gaming or vlogging, then I will need an expensive phone (like Apple).
• SHARE: Now I will communicate my analysis to the Croma salesperson, and he/she will
help me find the phone that is most suitable for me.
• ACT: If I like the phone shown by the salesperson, I will finally buy it.
PROJECT 2:- INSTAGRAM USER ANALYTICS
BY ERIC LOBO
PROJECT DESCRIPTION
This project is about user analysis. User analysis is used to
derive business insights so that the company can come up
with the marketing efforts that consumers would prefer.
The project is handled in MySQL (as recommended).
What I want to find out during the project is how I can be
helpful to the marketing team by providing them with insights
that make their job easier.
APPROACH
First of all, I downloaded MySQL on my laptop and went through all the
learning materials in the dashboard about Instagram user analytics, because
I did not have any prior knowledge of SQL. Then I went on to write
the code (given with the datasets provided) in MySQL.
I also watched the YouTube video on MySQL Workbench provided in the SQL
installation resources.
After writing the code, I went on to write SQL queries to get insights into
the dataset. I only had basic knowledge of SQL, so I was not confident
writing the queries, and I took more help from some short YouTube
videos. That really helped me to write and understand the queries and to
get the correct output.
THE TECH STACK USED IS MYSQL WORKBENCH
The purpose of using SQL is to write the code and to write
queries that draw insights from the data.
MICROSOFT EXCEL
The purpose of using Excel is just to create some tables of
the insights I found.
INSIGHT NO 1: REWARDING THE MOST LOYAL USERS
Here we need to find the top 5 oldest users of Instagram from the data provided. The top 5
users are as follows.
As a social media company, we need to keep rewarding users or bring out exciting updates, so
that users stay engaged with the app.
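A minimal sketch of how the five oldest accounts could be queried. It uses Python's built-in sqlite3 module instead of MySQL so that it runs anywhere, and the users table, its columns, and the sample rows are all hypothetical, not the project's actual data.

```python
import sqlite3

# Hypothetical schema and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT, created_at TEXT)"
)
conn.executemany(
    "INSERT INTO users (username, created_at) VALUES (?, ?)",
    [
        ("user_a", "2016-05-06 00:14:21"),
        ("user_b", "2017-02-16 18:22:11"),
        ("user_c", "2016-05-14 07:56:26"),
        ("user_d", "2016-06-08 17:48:08"),
        ("user_e", "2017-04-02 17:11:21"),
        ("user_f", "2016-05-09 05:44:31"),
        ("user_g", "2016-05-14 22:10:00"),
    ],
)


def oldest_users(connection, limit=5):
    """Return the `limit` earliest-registered users, oldest first."""
    query = "SELECT username, created_at FROM users ORDER BY created_at ASC LIMIT ?"
    return connection.execute(query, (limit,)).fetchall()


top_five = oldest_users(conn)
for username, created_at in top_five:
    print(username, created_at)
```

The same ORDER BY ... LIMIT pattern should carry over to MySQL, provided the registration date is stored in a sortable column.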
ANS:-
INSIGHT NO 2
Here we need to identify the bots that like every single
picture, which a normal user would not be able to do.
Bots are a real problem on any social media platform, so we
need to provide this data so that Instagram can make the
necessary changes.
ANS:- ON THE NEXT SLIDE
CHANGES WE CAN MAKE: when bot accounts are
detected, they should be deleted immediately, and a valid phone
number and email ID should be required to create an account.
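One possible way to flag such accounts is to compare each user's number of likes with the total number of photos. The sketch below uses Python's sqlite3 module with hypothetical users, photos, and likes tables; it assumes each (user, photo) like is stored at most once.

```python
import sqlite3

# Hypothetical tables and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT);
CREATE TABLE photos (id INTEGER PRIMARY KEY, user_id INTEGER);
CREATE TABLE likes (
    user_id INTEGER,
    photo_id INTEGER,
    PRIMARY KEY (user_id, photo_id)  -- a user can like a photo only once
);
INSERT INTO users (id, username) VALUES (1, 'alice'), (2, 'bob'), (3, 'bot_1');
INSERT INTO photos (id, user_id) VALUES (10, 1), (11, 1), (12, 2);
INSERT INTO likes (user_id, photo_id)
VALUES (1, 12), (2, 10), (3, 10), (3, 11), (3, 12);
""")


def suspected_bots(connection):
    """Return usernames whose like count equals the total number of photos."""
    query = """
        SELECT u.username
        FROM users AS u
        JOIN likes AS l ON l.user_id = u.id
        GROUP BY u.id, u.username
        HAVING COUNT(*) = (SELECT COUNT(*) FROM photos)
        ORDER BY u.username
    """
    return [row[0] for row in connection.execute(query)]


bots = suspected_bots(conn)
print(bots)
```

Liking every photo is only a signal, so accounts flagged this way would still need a manual check before any action is taken.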
RESULTS
Topic 3:- Operation and metrics analysis
By Eric Lobo
Project description
The project is divided into two parts: operation analytics and
investigating metric spikes.
Operation analytics looks at our customer-related data to see how we can
do even better and how we can give our customers the best satisfaction,
so that they do not turn to our competitors.
Investigating spikes is about answering important questions such as: in
which month did our sales go high or low? What was the reason for our sales
going high or low? Was it a seasonal effect or an unexpected trend? All of
these questions must be answered from time to time!
Approach and tech stack used
For case study 1, I created a database and, based on the sample provided, created 30-40
records of my own, then used MySQL to perform the analysis.
Case study 2, investigating spikes, was the harder one. The first problem I faced
was how to import the data, because the dataset was huge, so I had to learn how to
use LOAD DATA INFILE. After doing that, I used MySQL to perform the analysis. As I am
still not good at SQL, I had to take help from more videos to perform the analysis, and I
even searched Google for help with the queries. The best part is that I am
getting better at this, and hopefully by the end of the course I should be good at
SQL.
TECH STACK USED: MYSQL AND EXCEL TO PERFORM ANALYSIS
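The project itself used MySQL's LOAD DATA INFILE statement to import the large file. As a rough stand-in that runs without a MySQL server, the sketch below loads CSV rows into SQLite in batches using Python's csv and sqlite3 modules. The column names, the events table name, and the load_csv helper are hypothetical.

```python
import csv
import io
import sqlite3

# Stand-in for a large CSV file on disk; the columns are hypothetical.
csv_text = (
    "user_id,event_name,occurred_at\n"
    "1,login,2014-05-01\n"
    "2,signup,2014-05-02\n"
    "1,logout,2014-05-02\n"
)


def load_csv(connection, file_obj, table="events", batch_size=1000):
    """Create `table` from the CSV header and insert the rows in batches."""
    reader = csv.reader(file_obj)
    header = next(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    connection.execute(f'CREATE TABLE "{table}" ({columns})')
    insert_sql = f'INSERT INTO "{table}" VALUES ({placeholders})'

    total = 0
    batch = []
    for row in reader:
        batch.append(row)
        if len(batch) >= batch_size:
            connection.executemany(insert_sql, batch)
            total += len(batch)
            batch = []
    if batch:  # insert whatever is left after the last full batch
        connection.executemany(insert_sql, batch)
        total += len(batch)
    connection.commit()
    return total


conn = sqlite3.connect(":memory:")
rows_loaded = load_csv(conn, io.StringIO(csv_text), batch_size=2)
print(rows_loaded)
```

Inserting in batches keeps memory use flat for large files, which is the same concern a bulk-load statement addresses on the database side.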
INSIGHTS
Q4 Let’s say you see some duplicate rows in the data. How will
you display duplicates from the table?
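A common way to display duplicates is to group by every column and keep the groups that occur more than once. The sketch below shows this with Python's sqlite3 module; the job_data table, its columns, and the rows are hypothetical.

```python
import sqlite3

# Hypothetical table and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE job_data (job_id INTEGER, actor_id INTEGER, event TEXT);
INSERT INTO job_data VALUES
    (21, 1001, 'skip'),
    (22, 1006, 'transfer'),
    (21, 1001, 'skip'),
    (23, 1003, 'decision'),
    (21, 1001, 'skip');
""")


def duplicate_rows(connection):
    """Return each repeated row once, with the number of times it occurs."""
    query = """
        SELECT job_id, actor_id, event, COUNT(*) AS occurrences
        FROM job_data
        GROUP BY job_id, actor_id, event
        HAVING COUNT(*) > 1
    """
    return connection.execute(query).fetchall()


duplicates = duplicate_rows(conn)
print(duplicates)
```

Grouping on all columns treats two rows as duplicates only when every value matches; grouping on fewer columns would give a looser definition.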
Thank you, that's the end of project 3
Project 4:- Hiring process analytics
By Eric Lobo
Project description
THANK YOU, THAT'S THE END OF PROJECT 4
PROJECT 5
PROJECT DESCRIPTION
This project aims to answer questions about the top 250
movies, the best actor, the number of votes over the decades, the
best director, and many more. This project gives a broad
understanding of how data is handled in the real world, and
how it is cleaned and then used to derive insights.
IMDB TOP 250 MOVIES
THANK YOU, THAT'S THE END OF PROJECT 5
Project 6:- Bank loan case study
BY ERIC LOBO
PROJECT DESCRIPTION
Project 7:- Analyzing the impact of car features on price and profitability
Created by ERIC LOBO.
PROJECT DESCRIPTION
Least popular market categories:- "flex fuel, hybrid" and "exotic, luxury".
Task 2:- What is the relationship between a car's engine power
and its price?
Task 2: Create a scatter chart that plots engine power on the x-axis and price on
the y-axis. Add a trendline to the chart to visualize the relationship between these
variables.
I used the CORREL function to find how strong or weak the relationship between the two
variables is, and whether the two variables have a positive or negative
relationship.
The CORREL function gave me 0.661402, which indicates a positive relationship:
as the horsepower of the engine increases, the MSRP of the car tends to increase.
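Excel's CORREL returns the Pearson correlation coefficient. The sketch below computes the same quantity by hand in Python to show what the number means. The horsepower and MSRP values are made-up sample data, not the project's dataset, and pearson_correlation is a hypothetical helper name.

```python
from math import sqrt


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of the same length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations, and sums of squared deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Made-up sample values, for illustration only.
horsepower = [100, 150, 200, 300, 400]
msrp = [18000, 24000, 30000, 52000, 61000]

r = pearson_correlation(horsepower, msrp)
print(round(r, 4))
```

A value close to +1 means the two variables rise together, a value close to -1 means one falls as the other rises, and a value near 0 means there is little linear relationship.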
TASK 4:- How does the average price of a car vary across different
manufacturers?
Task 4.A: Create a pivot table that shows the average price of cars for each
manufacturer.
Task 4.B: Create a bar chart or a horizontal stacked bar chart that visualizes the
relationship between the manufacturer and the average price.
Results:- Bugatti has the highest average MSRP, probably because of the
market category it sells in, i.e., the exotic, high-performance category.
Plymouth has the lowest average MSRP, because it sells its cars at a very low
price compared to Bugatti. As a cost-focused company, it was made to serve common
people who cannot afford cars like a Bugatti.
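The pivot table in Task 4.A groups the cars by manufacturer and averages the price. A minimal Python sketch of that grouping is shown below. The make names and prices are made-up placeholders, and average_price_by_make is a hypothetical helper name.

```python
from collections import defaultdict

# Made-up (make, MSRP) pairs, for illustration only.
cars = [
    ("Make A", 20000),
    ("Make B", 1500000),
    ("Make A", 30000),
    ("Make C", 4000),
    ("Make B", 1700000),
    ("Make C", 2000),
]


def average_price_by_make(rows):
    """Return {make: average price}, like a pivot table with AVERAGE of MSRP."""
    totals = defaultdict(lambda: [0.0, 0])  # make -> [sum of prices, count]
    for make, price in rows:
        totals[make][0] += price
        totals[make][1] += 1
    return {make: total / count for make, (total, count) in totals.items()}


averages = average_price_by_make(cars)

# Sort from highest to lowest average, as a bar chart would usually show it.
for make, avg in sorted(averages.items(), key=lambda item: item[1], reverse=True):
    print(f"{make}: {avg:,.0f}")
```

The first and last entries of the sorted output correspond to the highest and lowest average price, which is how the two extremes in the results can be read off.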
TASK 5:- What is the relationship between fuel efficiency and the number of
cylinders in a car's engine?
Task 5.A: Create a scatter plot with the number of cylinders on the x-axis and highway
MPG on the y-axis. Then create a trendline on the scatter plot to visually estimate the
slope of the relationship and assess its significance.
Task 5.B: Calculate the correlation coefficient between the number of cylinders and
highway MPG to quantify the strength and direction of the relationship.
To calculate the correlation coefficient between the two variables, i.e., the number of cylinders and
highway MPG, I again used the CORREL function.
ANS:- -0.60095, which indicates a negative relationship: as the number of cylinders
increases, the estimated highway miles per gallon (MPG) of the car decreases.
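A linear trendline on a scatter plot is a least-squares straight line, and its slope gives the direction of the relationship asked for in Task 5.A. The sketch below fits such a line in Python. The cylinder and MPG values are made-up sample data, and fit_trendline is a hypothetical helper name.

```python
def fit_trendline(xs, ys):
    """Least-squares line y = slope * x + intercept; returns (slope, intercept)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of the same length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up sample values, for illustration only.
cylinders = [4, 4, 6, 6, 8, 8, 12]
highway_mpg = [34, 31, 27, 25, 22, 20, 15]

slope, intercept = fit_trendline(cylinders, highway_mpg)
print(f"highway MPG ~ {slope:.2f} * cylinders + {intercept:.2f}")
```

A negative slope here reads as "each extra cylinder is associated with fewer highway miles per gallon", which matches the sign of a negative correlation coefficient.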
Task 2: Which car brands have the highest and lowest average
MSRPs, and how does this vary by body style?
BY ERIC LOBO