Datamites Certified Data Analyst Brochure INDIA V9
PROGRAM BROCHURE
TABLE OF CONTENTS
DATAMITES® ACCOLADES
WHY DATAMITES®
PROGRAM STRUCTURE
REAL-TIME INTERNSHIP
JOB READY PROGRAM
PROGRAM CURRICULUM
ADMISSIONS AND CONTACTS
IBM linkedin.com/in/ashokveda/
KEY HIGHLIGHTS
1. Flexible Learning
Learners can repeat sessions, change batches, change learning modes, and join ad-hoc doubt sessions anytime.
5. Learning Community
Exclusive online learning community with thousands of active learners, mentors and alumni available for clarifying doubts and mentoring.
CERTIFICATIONS
• IABAC CERT
• COURSE COMPLETION
• INTERNSHIP CERT
• 3-MONTH DURATION
• PROJECT MENTORING
• 5+ CAPSTONE PROJECTS
• REAL-TIME INTERNSHIP
• 1 CLIENT/LIVE PROJECT
• 3-MONTH DURATION
• LIVE TRAINING
• 20 HOURS A WEEK
• COMPREHENSIVE SYLLABUS
• HANDS-ON PROJECTS
• EXPERT TRAINERS AND MENTORS
• PRE-COURSE SELF-STUDY
• HIGH-QUALITY VIDEOS WITH EASY LEARNING APPROACH
REAL-TIME INTERNSHIP
• INTAKE PROCESS
• WORK ON REAL-TIME PROJECTS
• PROJECT DELIVERABLES
• REAL-TIME DELIVERY
✔ INTERNSHIP CERTIFICATE
✔ EXPERIENCE LETTER
internship@datamites.com
PROJECT MENTORING
100 hours of live mentoring in industry projects
DOUBTS SESSIONS
Twice a week, live doubts session from mentors and experts
PLACEMENT ASSISTANCE TEAM
PLACEMENT PARTNERS
• The course is rigorously updated in line with industry requirements and fine-tuned to keep the learning process structured, enabling lean learning.
ORDER   COURSE   CODE   LEARNING HOURS
Important Note: The curriculum is subject to change as required by the global accreditation bodies to align with industry requirements. Please check with your counsellor or email care@datamites.com for the updated curriculum.
MODULE 1
DATA ANALYSIS FOUNDATION
• Data Analysis Introduction
• Data Preparation for Analysis
• Common Data Problems
• Various Tools for Data Analysis
• Evolution of Analytics domain
MODULE 2
CLASSIFICATION OF ANALYTICS
• Four types of Analytics
• Descriptive Analytics
• Diagnostic Analytics
• Predictive Analytics
• Prescriptive Analytics
• Human Input in Various Types of Analytics
MODULE 3
CRISP-DM MODEL
• Introduction to CRISP-DM Model
• Business Understanding
• Data Understanding
• Data Preparation
• Modeling
• Evaluation
• Deploying
• Monitoring
MODULE 4
UNIVARIATE DATA ANALYSIS
• Summary statistics - Determine the values’ center and spread.
• Measures of Central Tendency: Mean, Median and Mode
• Measures of Variability: Range, Interquartile range, Variance and Standard Deviation
• Frequency table - Shows how frequently various values occur.
• Charts - A visual representation of the distribution of values.
MODULE 5
DATA ANALYSIS WITH VISUAL CHARTS
• Line Chart
• Column/Bar Chart
• Waterfall Chart
• Tree Map Chart
• Box Plot
MODULE 6
BI-VARIATE DATA ANALYSIS
• Scatter Plots
• Regression Analysis
• Correlation Coefficients
TOOLS/PLATFORMS COVERED
MODULE 1
OVERVIEW OF STATISTICS
• Descriptive And Inferential Statistics
• Basic Terms Of Statistics
• Types Of Data
MODULE 2
HARNESSING DATA
• Random Sampling
• Sampling With Replacement And Without Replacement
• Cochran's Minimum Sample Size
• Simple Random Sampling
• Stratified Random Sampling
• Cluster Random Sampling
• Systematic Random Sampling
• Biased Random Sampling Methods
• Sampling Error
• Methods Of Collecting Data
MODULE 3
EXPLORATORY DATA ANALYSIS
• Exploratory Data Analysis Introduction
• Measures Of Central Tendency: Mean, Median And Mode
• Measures Of Variability: Range, Variance And Standard Deviation
• Data Distribution Plot: Histogram
• Normal Distribution
• Z Value / Standard Value
• Empirical Rule And Outliers
• Central Limit Theorem
• Normality Testing
• Skewness & Kurtosis
• Measures Of Distance: Euclidean, Manhattan And Minkowski Distance
MODULE 4
HYPOTHESIS TESTING
• Hypothesis Testing Introduction
• P-Value, Confidence Interval
• Parametric Hypothesis Testing Methods
• Hypothesis Testing Errors: Type I And Type II
• One Sample T-test
• Two Sample Independent T-test
• Two Sample Related T-test
• One Way ANOVA Test
MODULE 5
CORRELATION AND REGRESSION
• Correlation Introduction
• Direct/Positive Correlation
• Indirect/Negative Correlation
• Regression
• Choosing Right Method
MODULE 1
COMPARISON AND CORRELATION ANALYSIS
• Data Comparison Introduction
• Concept of Correlation
• Calculating Correlation with Excel
• Comparison vs Correlation
• Performing Comparison Analysis on Data
• Performing Correlation Analysis on Data
• Hands-on case study 1: Comparison Analysis
• Hands-on case study 2: Correlation Analysis
MODULE 2
VARIANCE AND FREQUENCY ANALYSIS
• Concept of Variability and Variance
• Data Preparation for Variance Analysis
• Business use cases for Variance and Frequency Analysis
• Performing Variance and Frequency Analysis
• Hands-on case study 1: Variance Analysis
• Hands-on case study 2: Frequency Analysis
MODULE 3
RANKING ANALYSIS
• Introduction to Ranking Analysis
• Data Preparation for Ranking Analysis
• Performing Ranking Analysis with Excel
• Insights for Ranking Analysis
• Hands-on Case Study: Ranking Analysis
MODULE 4
BREAK EVEN ANALYSIS
• Concept of Break Even Analysis
• Make or Buy Decision with Break Even
• Preparing Data for Break Even Analysis
• Hands-on Case Study: Procurement Decision with Break Even
MODULE 5
PARETO (80/20 RULE) ANALYSIS
• Pareto Rule Introduction
• Preparing Data for Pareto Analysis
• Insights on Optimizing Operations with Pareto Analysis
• Performing Pareto Analysis on Data
• Hands-on case study: Pareto Analysis
MODULE 6
TIME SERIES AND TREND ANALYSIS
• Introduction to Time Series Data
• Preparing Data for Time Series Analysis
• Types of Trends
• Trend Analysis of the Data with Excel
• Insights from Trend Analysis
• Hands-on Case Study: Trend Analysis
MODULE 7
DATA ANALYSIS BUSINESS REPORTING
• Management Information System Introduction
• Various Data Reporting formats
• Creating Data Analysis reports as per the requirements
• Presenting the reports
• Hands-on case study: Create Data Analysis Reports
TOOLS/PLATFORMS COVERED
MODULE 1
DATA ANALYTICS FOUNDATION
• Business Analytics Overview
• Application of Business Analytics
• Visual Perspective
• Benefits of Business Analytics
• Challenges
• Classification of Business Analytics
• Data Sources
• Data Reliability and Validity
• Business Analytics Model
MODULE 2
OPTIMIZATION MODELS
• Prescriptive Analytics with Low Uncertainty
• Mathematical Modeling and Decision Modeling
• Break Even Analysis
• Product Pricing with Prescriptive Modeling
• Building an Optimization Model
• Case Study 1: WonderZon Network Optimization
• Assignment 1: KERC Inc, Optimum Manufacturing Quantity
MODULE 3
PREDICTIVE ANALYTICS WITH REGRESSION
• Mathematics behind Linear Regression
• Hands-on: Regression Modeling in Excel
• Case Study 2: Sales Promotion Decision with Regression Analysis
• Assignment 2: Design Marketing Decision board for QuikMark Inc.
MODULE 4
DECISION MODELING
• Prescriptive Analytics with High Uncertainty
• Comparing Decisions in Uncertain Settings
• Decision Trees for Decision Modeling
• Case Study 3: Decision modeling of Internet Plans, Monte Carlo Simulation
• Case Study 4: Kickathlon Sports Retailer Supplier Decision Modeling
TOOLS/PLATFORMS COVERED
MODULE 1
MACHINE LEARNING INTRODUCTION
• What Is ML? ML Vs AI
• ML Workflow, Popular ML Algorithms
• Clustering, Classification And Regression
• Supervised Vs Unsupervised
MODULE 2
ML ALGO: LINEAR REGRESSION
• Introduction to Linear Regression
• How it works: Regression and Best Fit Line
• Hands-on Linear Regression with ML Tool
MODULE 3
ML ALGO: LOGISTIC REGRESSION
• Introduction to Logistic Regression
• How it works: Classification & Sigmoid Curve
• Hands-on Logistic Regression with ML Tool
MODULE 4
ML ALGO: KNN
• Introduction to KNN
• How It Works: Nearest Neighbor Concept
• Hands-on KNN with ML Tool
MODULE 5
ML ALGO: K MEANS CLUSTERING
• Understanding Clustering (Unsupervised)
• K Means Algorithm
• How it works: K Means theory
• Hands-on K Means Clustering with ML Tool
MODULE 6
ML ALGO: DECISION TREE
• Random Forest Ensemble technique
• How it works: Bagging Theory
• Hands-on Decision Tree with ML Tool
MODULE 7
ML ALGO: SUPPORT VECTOR MACHINE (SVM)
• Introduction to SVM
• How It Works: SVM Concept, Kernel Trick
• Modeling and Evaluation of SVM in Python
MODULE 8
ARTIFICIAL NEURAL NETWORK (ANN)
• Introduction to ANN
• How It Works: Backpropagation, Gradient Descent
• Modeling and Evaluation of ANN in Python
MODULE 9
PROJECT: PREDICTIVE ANALYTICS WITH ML
• Project Business requirements
• Data Modeling
• Building Predictive Model with ML Tool
• Evaluation and Deployment
• Project Documentation and Report
MODULE 1
DATABASE INTRODUCTION
• Database Overview
• Key concepts of database management
• CRUD Operations
• Relational Database Management System
• RDBMS vs No-SQL (Document DB)
MODULE 2
SQL BASICS
• Introduction to Databases
• Introduction to SQL
• SQL Commands
• MySQL Workbench installation
• Comments
• Import and export dataset
MODULE 3
DATA TYPES AND CONSTRAINTS
• Numeric, Character, date time data type
• Primary key, Foreign key, Not null
• Unique, Check, default, Auto increment
MODULE 4
DATABASES AND TABLES (MySQL)
• Create database
• Delete database
• Show and use databases
• Create table, Rename table
• Delete table, Delete table records
• Create new table from existing data types
• Insert into, Update records
• Alter table
MODULE 5
SQL JOINS
• Inner join
• Outer join
• Left join
• Right join
• Cross join
• Self join
MODULE 6
SQL COMMANDS AND CLAUSES
• Select, Select distinct
• Aliases, Where clause
• Relational operators, Logical operators
• Between, Order by, In
• Like, Limit, null/not null, group by
• Having, Sub queries
MODULE 7
DOCUMENT DB/NO-SQL DB
• Introduction to Document DB
• Document DB vs SQL DB
• Popular Document DBs
• MongoDB basics
• Data format and Key methods
• MongoDB data management
TOOLS/PLATFORMS COVERED
MODULE 1
GIT INTRODUCTION
• Purpose of Version Control
• Popular Version control tools
• Git Distributed Version Control
• Terminologies
• Git Workflow
• Git Architecture
MODULE 2
GIT REPOSITORY and GitHub
• Git Repo Introduction
• Create New Repo with Init command
• Copying existing repo
• Git user and remote node
• Git Status and rebase
• Review Repo History
• GitHub Cloud Remote Repo
MODULE 3
COMMITS, PULL, FETCH AND PUSH
• Code commits
• Pull, Fetch and conflict resolution
• Pushing to Remote Repo
MODULE 4
TAGGING, BRANCHING AND MERGING
• Organize code with branches
• Checkout branch
• Merge branches
MODULE 5
UNDOING CHANGES
• Editing Commits
• Commit command Amend flag
• Git reset and revert
MODULE 6
GIT WITH GITHUB AND BITBUCKET
• Creating GitHub Account
• Local and Remote Repo
• Collaborating with other developers
• Bitbucket Git account
TOOLS/PLATFORMS COVERED
MODULE 1
BIG DATA INTRODUCTION
• Big Data Overview
• Five Vs of Big Data
• What is Big Data and Hadoop
• Introduction to Hadoop
• Components of Hadoop Ecosystem
• Big Data Analytics Introduction
MODULE 2
HDFS AND MAP REDUCE
• HDFS – Big Data Storage
• Distributed Processing with Map Reduce
• Mapping and reducing stages concepts
• Key Terms: Output Format, Partitioners, Combiners, Shuffle, and Sort
• Hands-on Map Reduce task
MODULE 3
PYSPARK FOUNDATION
• PySpark Introduction
• Spark Configuration
• Resilient distributed datasets (RDD)
• Working with RDDs in PySpark
• Aggregating Data with Pair RDDs
MODULE 4
SPARK SQL and HADOOP HIVE
• Introducing Spark SQL
• Spark SQL vs Hadoop Hive
• Working with Spark SQL Query Language
MODULE 5
MACHINE LEARNING WITH SPARK ML
• Introduction to MLlib
• Various ML algorithms supported by MLlib
• ML model with Spark ML
• Linear regression
• Logistic regression
• Random forest
MODULE 6
KAFKA and Spark
• Kafka architecture
• Kafka workflow
• Configuring Kafka cluster
• Operations
TOOLS/PLATFORMS COVERED
MODULE 1
PYTHON BASICS
• Introduction to Python
• Installation of Python and IDE
• Python objects
• Python basic data types
• Number & Booleans, strings
• Arithmetic Operators
• Comparison Operators
• Assignment Operators
• Operator precedence and associativity
MODULE 2
PYTHON CONTROL STATEMENTS
• IF Conditional statement
• IF-ELSE
• NESTED IF
• Python Loops basics
• WHILE Statement
• FOR statements
• BREAK and CONTINUE statements
MODULE 3
PYTHON DATA STRUCTURES
• Basic data structures in Python
• String object basics and inbuilt methods
• List: Object, methods, comprehensions
• Tuple: Object, methods, comprehensions
• Sets: Object, methods, comprehensions
• Dictionary: Object, methods, comprehensions
MODULE 4
PYTHON FUNCTIONS
• Functions basics
• Function Parameter passing
• Iterators
• Generator functions
• Lambda functions
• Map, reduce, filter functions
MODULE 5
PYTHON NUMPY PACKAGE
• NumPy Introduction
• Array – Data Structure
• Core NumPy functions
• Matrix Operations
MODULE 6
PYTHON PANDAS PACKAGE
• Pandas functions
• Data Frame and Series – Data Structure
• Data munging with Pandas
• Imputation and outlier analysis
TOOLS/PLATFORMS COVERED
MODULE 1
BUSINESS INTELLIGENCE INTRODUCTION
• What Is Business Intelligence (BI)?
• Why BI Is The Core Of Business Decisions
• BI Evolution
• Business Intelligence Vs Business Analytics
• Data Driven Decisions With BI Tools
• The CRISP-DM Methodology
MODULE 2
BI WITH TABLEAU: INTRODUCTION
• The Tableau Interface
• Tableau Workbook, Sheets And Dashboards
• Filter Shelf, Rows And Columns
• Dimensions And Measures
• Distributing And Publishing
MODULE 3
TABLEAU: CONNECTING TO DATA SOURCE
• Connecting To Data Files, Database Servers
• Managing Fields
• Managing Extracts
• Saving And Publishing Data Sources
• Data Prep With Text And Excel Files
• Join Types With Union
• Cross-Database Joins
• Data Blending
• Connecting To PDFs
MODULE 4
TABLEAU: BUSINESS INSIGHTS
• Getting Started With Visual Analytics
• Drill Down And Hierarchies
• Sorting & Grouping
• Creating And Working With Sets
• Using The Filter Shelf
• Interactive Filters
• Parameters
• The Formatting Pane
• Trend Lines & Reference Lines
• Forecasting
• Clustering
MODULE 5
DASHBOARDS, STORIES AND PAGES
• Dashboards And Stories Introduction
• Building A Dashboard
• Dashboard Objects
• Dashboard Formatting
• Dashboard Interactivity Using Actions
• Story Points
• Animation With Pages
MODULE 6
BI WITH POWER-BI
• Power BI basics
• Basic Visualizations
• Business Insights with Power BI
TOOLS/PLATFORMS COVERED
DURATION : 6 MONTHS
LEARNING MODE : LIVE ONLINE / IN-PERSON CLASSROOM (SELECTED CITIES)
SELF LEARNING + INSTRUCTOR LED LIVE ONLINE LEARNING + PROJECTS & ASSESSMENTS