Course Title : Introduction to R in Business Applications : Project Work
Ram Mohan Dhara
IMTG/ PGDM-Ex/ 2018-2019
Session 15 & 16 : Project Work
Project Guidelines : General
1. It’s a group assignment and you are free to choose your project.
2. Project work will carry 50 marks.
3. Your choice must comply with the project selection guidelines. Remember that project selection carries 10 marks.
4. You must get the dataset vetted by the instructor before you start working on the project.
5. Your time limit for the presentation is 13 (+/- 2) minutes.
6. You can choose one or multiple presenters, but all group members must be present.
7. The evaluator will look for clarity of thought, amount of effort and presentation skill.
8. You will not be allowed to modify the project after the presentation.
9. Your project will have three broad sections:
A. Selection of datasets
B. Objectives and hypotheses
C. Data visualization, Analysis and Conclusion
10. Form a maximum of 5 groups with 4 members in each group.
Project Selection Guidelines
1. Any industry: healthcare, insurance, automotive, retail etc.
2. At least 1000 cases
3. At least 10-15 variables with a mix of continuous and categorical variables
4. The data should contain a substantial amount of ‘structure’ so that different visualization options can be applied.
5. The data should contain a substantial amount of ‘promise’ so that different analytical techniques can be applied.
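The measurable parts of these guidelines (number of cases, number of variables, mix of variable types) can be checked in R before the dataset goes to the instructor for vetting. The sketch below is only an illustration: `check_dataset` is a hypothetical helper name, and it assumes that numeric columns are continuous and that factor, character or logical columns are categorical. That assumption can be wrong, for example when categories are coded as numbers. ‘Structure’ and ‘promise’ still need your own judgment.

```r
# Hypothetical helper (not part of the course material): checks a data frame
# against the measurable project selection guidelines.
# Assumption: numeric columns are continuous; factor, character and logical
# columns are categorical.
check_dataset <- function(df, min_cases = 1000, min_vars = 10) {
  is_continuous <- vapply(df, is.numeric, logical(1))
  is_categorical <- vapply(
    df,
    function(x) is.factor(x) || is.character(x) || is.logical(x),
    logical(1)
  )
  c(
    enough_cases    = nrow(df) >= min_cases,  # guideline 2
    enough_vars     = ncol(df) >= min_vars,   # guideline 3 (lower bound)
    has_continuous  = any(is_continuous),     # guideline 3 (mix of types)
    has_categorical = any(is_categorical)
  )
}

# Example with a built-in dataset: mtcars has 32 rows and 11 numeric columns,
# so it passes only the variable-count and continuous-variable checks.
check_dataset(mtcars)
```

A dataset is worth proposing only if all four values are TRUE; `all(check_dataset(your_data))` gives a single yes/no answer, where `your_data` stands for whatever data frame you have loaded.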
Project Guidelines – Analytics
1. Develop hypotheses based on domain knowledge
2. Use appropriate techniques to prove/ disprove the hypotheses
3. Use analytical techniques to address project objectives
A. Regression
B. Classification
C. Clustering
D. Association
4. You must use “R” along with
• Excel
• SPSS
• Python
5. The number of slides should not exceed 15.
Project template and evaluation [50 marks]
1. Compliance with dataset selection guidelines [10]
2. Describe Business Context [5]
3. Define Project Objectives [5]
4. Study the dataset (EDA) [10]
5. Create some charts and graphs (Visualization) [5]
6. Briefly discuss the analytical techniques (Analytics) [10]
7. Write the key conclusions [5]
Project timelines
• Project initiation – 28th Aug 2019
• Project group formation – 30th Aug 2019
• Approval on dataset – 30th Aug 2019
• Project discussion – any time before the presentation
• Project presentation – 4th Sept 2019
Some Data Sources
1. [Link] (Kaggle)
2. [Link] (UCI Machine Learning Repository)
3. [Link] (GitHub)
4. [Link] (Census data)
5. [Link] (Election Commission of India)
6. [Link] (Govt. of India data sources, more than 300,000 data sources)
All the best