Question Part a and B New
Question Part a and B New
Assessment - 2
This Assignment leads you through a statistical analysis of census income data.
Assignment Data
The data for this project can be accessed from the Moodle – Assessment 2 – Project
data file. The data Set consists of the Age, work class, Gender, Income, working hours
per week and Education of different countries in different sheets.
Select the sample data according to the last digit of your Student ID
Your datasheet matches the Last digit of your ID. For example, if your student ID is
2024001 then copy the data of “1”.
Assignment Submission
• Each part of the project should be submitted as a SINGLE Word file with copied
Excel output.
• Design your own cover page (Student ID must be included in the Cover page). For
Part B mention the Level of significance and Confidence Interval on your cover
page
Due: Week 6
Marks: 30 Marks
Topics: 1, 2 and 3
Learning Outcomes: 3, 4
Weight: 15%
Introduction:
This analysis uses census income data, which is critical to understanding the economic
structure and supporting decision-making in a democratic government. The data,
gathered from the World Data Bank website, reflects key demographic and economic
variables.
• Find the Probability given that an individual is male, what is the probability that
they earn more than 50K?
• Find the probability if that an individual is female, what is the probability that
they earn more than 50K?
• What is the overall probability of selecting an individual with an income of more
than 50K from this sample?
• Use the probability value to advice about the income distribution
Marks Criterion Excellent (HD) Very Good (D) Good (C) Satisfactory (P) Poor (F) Unsatisfactory
5 Fill in the blanks All questions correct Four Question Correct Three Questions Two Question correct One question No Questions Correct
(5 marks) (4 marks) Correct (3 marks) (2 marks) corrects (0 Marks)
(1 marks)
10 Histogram / Constructed the Constructed the Constructed the Basic use of Excel and Poor use of Excel; No graph and frequency
Polygon Frequency Distribution Frequency Distribution Frequency frequency distribution Shape of the Distribution
properly and discussed with minimum error Distribution few is not present distribution
the shape of the and discussed the shape errors and The shape of the discussion is missing
distribution of the distribution discussed the distribution discussion
shape of the is missing
distribution
5 Descriptive All measures are Some of the measures Partially measured Very few measures are Measures are No any measures
measures measured accurately and are missing and were and partially measured and incorrect and not at
discussed properly in the not discussed in the discussed in the discussed all discussed in the
inference inference inference inference
10 Probability The contingency table is The contingency table The contingency The contingency table Incomplete or wrong No Table/ no calculation
properly filled. Shown all is properly filled with table is partially is wrong. Methodology contingency table
the working out of minimal error. Shown filled. Shown all was correct and probability value
probability and all the working out of the working out of
Explanation was probability and probability and No
exemplary Explanation was good Explanation
Due: Week 10
Marks: 30
Topics: 6 to 9
Weight: 15%
In response to Assignment Part B, you use the answers obtained in Part A and
techniques from statistical inference regression, and correlation to complete this
part.
Part B Submission
Statistical Inference
Choose a level of significance for any hypothesis tests and a level of confidence
for any confidence intervals.
Note: All the steps (Assumptions, H0, H1 etc.,) need to be carried out to complete
the question
• Test the significance of whether the individual who earns more than 50K has
an average age of above 40 years. (Question 3 Part A)
• Test whether there is a significant difference in the average hours worked per
week between people earning. (Question 3 Part A)
• Model the multiple regression for income based on predictors like age,
work hours, and gender.
o Frame an Equation
o Interpret the coefficients.
Marks Criterion Excellent (HD) Very Good (D) Good (C) Satisfactory (P) Unsatisfactory (F
6 Confidence An appropriate assumption An appropriate A partial assumption was made. An Assumption was not No Assumption, no proper
Interval was made. assumption was made. Select the correct formula based made properly. Formula, no Inference
Select the correct formula Select the correct formula on the assumption. The formula was not
based on the assumption. based on the assumption. A general conclusion was made. correct.
An exemplary logical An advanced logical No inference was made.
conclusion was made. conclusion was made.
6 One sample An appropriate assumption An appropriate A partial assumption was made. An Assumption was not No Assumption, no proper
Hypothesis testing was made. assumption was made. Select the correct formula based made properly. Formula, no Inference
Select the correct formula Select the correct formula on the assumption. The formula was not
based on the assumption. based on the assumption. A general conclusion was made. correct.
An exemplary logical An advanced logical No inference made
conclusion was made. conclusion was made.
8 Two sample An appropriate assumption An appropriate A partial assumption was made. An Assumption was not No Assumption, no proper
Hypothesis testing was made. assumption was made. Select the correct formula based made properly. Formula, no Inference
Select the correct formula Select the correct formula on the assumption. The formula was not
based on the assumption. based on the assumption. A general conclusion was made. correct.
An exemplary logical An advanced logical No inference made
conclusion was made. conclusion was made.
10 Simple linear Variables are properly Variables are partially Variables are not properly Variables and All the calculations are
regression addressed. addressed. addressed. Equations are properly missing
Equations are properly Equations are properly Equations are not properly structured.
structured. structured. structured. The inference was
The inference was made with The inference was made The inference was general. missing
exemplary logic in advanced logic