
Chapter One
Introduction to Statistics, Data and Statistical Thinking
What is Statistics?
• In common usage people think of statistics as
numerical data—the unemployment rate last
month, total government expenditure last year, and
so forth.
• Although there is nothing wrong with viewing
statistics in this way, we are going to take a deeper
approach.
• We will view statistics the way professional
statisticians view it—as a methodology for
collecting, classifying, summarizing, organizing,
presenting, analyzing and interpreting numerical
information.
The Use of Statistics in Economics and Other Social
Sciences
• Businesses use statistical methodology and thinking
to make decisions about which products to
produce, how much to spend advertising them,
how to evaluate their employees, and nearly every
aspect of running their operations.
• The motivation for using statistics in the study of
economics and other social sciences is somewhat
different.
• The object of the social sciences and of economics
in particular is to understand how the social and
economic system functions.
• Views and understandings of how things work are called theories.
• They are composed of two parts—a logical structure which is tautological (that is, true by definition), and a set of parameters in that logical structure which gives the theory empirical content (that is, an ability to be consistent or inconsistent with facts or data).
• The logical structure, being true by definition, is uninteresting, except insofar as
it enables us to construct testable propositions about how the economic system
works. If the facts turn out to be consistent with the testable implications of the
theory, then we accept the theory as true until new evidence inconsistent with
it is uncovered.
• A theory is valuable if it is logically consistent both within itself and with other theories established as “true” and is capable of being rejected by, but nevertheless consistent with, available evidence.
• Its logical structure is judged on two grounds—internal consistency and
usefulness as a framework for generating empirically testable propositions. To
illustrate this, consider the statement: “People maximize utility.” This statement is true by definition—behavior is defined as what people do and utility is defined as what people maximize when they choose to do one thing rather than something else.
• These definitions and the associated utility maximizing approach form a useful framework for generating empirically testable propositions.
• One can choose the parameters in this tautological utility maximization
structure so that the marginal utility of a good declines relative to the
marginal utility of other goods as the quantity of that good consumed
increases relative to the quantities of other goods consumed.
Downward sloping demand curves emerge, leading to the empirically
testable statement: “Demand curves slope downward.” This theory of
demand (which consists of both the utility maximization structure and
the proposition about how the individual’s marginal utilities behave)
can then be either supported or falsified by examining data on prices
and quantities and incomes for groups of individuals and commodities.
• The tautologies derived using the concept of utility maximization are valuable because they are internally consistent and generate
empirically testable propositions such as those represented by the
theory of demand. If it didn’t yield testable propositions about the real
world, the logical structure of utility maximization would be of little
interest.
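• To see what confronting such a proposition with data looks like in practice, here is a minimal sketch (in Python, with invented price and quantity figures, not real data) that estimates the slope of a least-squares line of quantity on price; a negative estimate is the kind of evidence consistent with “Demand curves slope downward”:

```python
# Hypothetical price-quantity observations for one commodity.
prices = [2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0]
quantities = [95, 88, 80, 74, 69, 61, 55]

# Least-squares slope of quantity on price:
# slope = sum((p - mean_p)(q - mean_q)) / sum((p - mean_p)^2)
n = len(prices)
mean_p = sum(prices) / n
mean_q = sum(quantities) / n
slope = (
    sum((p - mean_p) * (q - mean_q) for p, q in zip(prices, quantities))
    / sum((p - mean_p) ** 2 for p in prices)
)
print(f"estimated slope: {slope:.1f}")  # negative: consistent with the theory
```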
• Alternatively, consider the statement: “Canada is a wonderful country.”
This is not a testable proposition unless we define what we mean by
the adjective “wonderful”.
• Statistics is the methodology we use to confront theories like the theory
of demand and other testable propositions with the facts.
• It is the set of procedures and intellectual processes by which we decide
whether or not to accept a theory as true—the process by which we
decide what and what not to believe. In this sense, statistics is at the
root of all human knowledge.
• Unlike the logical propositions contained in them, theories are never
strictly true. They are merely accepted as true in the sense of being
consistent with the evidence available at a particular point in time and
more or less strongly accepted depending on how consistent they are
with that evidence.
• Given the degree of consistency of a theory with the evidence, it may or
may not be appropriate for governments and individuals to act as though
it were true. A crucial issue will be the costs of acting as if a theory is
true when it turns out to be false as opposed to the costs of acting as
though the theory were not true when it in fact is.
• As evidence against a theory accumulates, it is eventually rejected in favor of other “better” theories—that is, ones more consistent with the evidence.
• Statistics, being the set of analytical tools used to
test theories, is thus an essential part of the
scientific process.
• Theories are suggested either by casual
observation or as logical consequences of some
analytical structure that can be given empirical
content.
• Statistics is the systematic investigation of the
correspondence of these theories with the real
world. This leads either to a wider belief in the
‘truth’ of a particular theory or to its rejection as
inconsistent with the facts.
Descriptive and Inferential Statistics
• The application of statistical thinking involves two sets of processes. First,
there is the description and presentation of data. Second, there is the process
of using the data to make some inference about features of the environment
from which the data were selected or about the underlying mechanism that
generated the data.
• The first is called descriptive statistics and utilizes numerical and graphical methods to find patterns in the data, to summarize the information they reveal and to present that information in a meaningful way. The second, inferential statistics, uses data to make estimates, decisions, predictions, or other generalizations about the environment from which the data were obtained. The sketch below illustrates the contrast.
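• A minimal sketch of the distinction, assuming only Python's standard library and a made-up sample of hourly wages:

```python
import math
import statistics

# Hypothetical sample of hourly wages drawn from a large workforce.
wages = [18.5, 22.0, 19.75, 31.0, 24.5, 20.0, 27.25, 23.0, 21.5, 25.0]

# Descriptive statistics: summarize the data we actually have.
print("sample mean:  ", round(statistics.mean(wages), 2))
print("sample median:", statistics.median(wages))
print("sample stdev: ", round(statistics.stdev(wages), 2))

# Inferential statistics: use the sample to estimate a feature of the
# population it came from. A rough 95% interval for the population mean
# (normal approximation) is the sample mean +/- 1.96 standard errors.
se = statistics.stdev(wages) / math.sqrt(len(wages))
print("population mean estimate:",
      round(statistics.mean(wages), 2), "+/-", round(1.96 * se, 2))
```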
• Statistical inference essentially involves the attempt to acquire information
about a population or process by analyzing a sample of elements from that
population or process.
• A population includes the set of units that we are interested in learning about.
For example, we could be interested in the effects of schooling on earnings in
later life, in which case the relevant population would be all people working.
• A sample is a subset of the units comprising the population. Because it is costly
to examine most populations of interest, and impossible to examine the
entire output of a process, statisticians use samples from populations and
processes to make inferences about their characteristics.
• Obviously, our ability to make correct inferences about a population based
on a sample of elements from it depends on the sample being
representative of the population. So the manner in which a sample is
selected from a population is of extreme importance.
• An example of the importance of representative sampling occurred in the
1948 presidential election in the United States. The Democratic incumbent,
Harry Truman, was being challenged by Republican Governor Thomas Dewey
of New York. The polls predicted Dewey to be the winner but Truman in fact
won.
• To obtain their samples, the pollsters telephoned people at random,
forgetting to take into account that people too poor to own telephones also
vote. Since poor people tended to vote for the Democratic Party, a sufficient
fraction of Truman supporters were left out of the samples to make those
samples unrepresentative of the population. As a result, inferences about the
proportion of the population that would vote for Truman based on the
proportion of those sampled intending to vote for Truman were incorrect.
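• The sampling problem is easy to reproduce. The simulation below is a minimal sketch with invented support and phone-ownership rates (not the actual 1948 figures); it shows how polling only telephone owners can flip the predicted winner even though a random sample of the same size gets it right:

```python
import random

random.seed(1948)

# Hypothetical electorate: 55% support Truman overall, but telephone
# ownership is assumed to be concentrated among wealthier voters who
# lean Dewey. Each voter is a (supports_truman, owns_phone) pair.
electorate = []
for _ in range(100_000):
    supports_truman = random.random() < 0.55
    # Assumed phone-ownership rates: lower among (poorer) Truman voters.
    owns_phone = random.random() < (0.30 if supports_truman else 0.60)
    electorate.append((supports_truman, owns_phone))

def truman_share(sample):
    return sum(t for t, _ in sample) / len(sample)

# Biased sample: poll only people who own telephones.
phone_owners = [v for v in electorate if v[1]]
biased_poll = random.sample(phone_owners, 1000)

# Representative sample: poll voters at random from the whole electorate.
random_poll = random.sample(electorate, 1000)

print("true Truman share:      ", round(truman_share(electorate), 3))
print("telephone-poll estimate:", round(truman_share(biased_poll), 3))
print("random-poll estimate:   ", round(truman_share(random_poll), 3))
```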
• A process is a mechanism that produces output. We might
be interested in the effects of drinking on driving, in which
case the underlying process is the on-going generation of
car accidents as the society goes about its activities. Note
that a process is simply a mechanism which, if it remains
intact, eventually produces an infinite population.
• Finally, when we make inferences about the characteristics
of a population/process based on a sample, we need some
measure of the reliability of our method of inference.
• What are the odds that we could be wrong?
• We need not only a prediction as to the characteristic of the
population of interest (for example, the proportion by which
the salaries of college graduates exceed the salaries of those
that did not go to college) but some quantitative measure of
the degree of uncertainty associated with our inference.
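• One standard way to quantify that uncertainty is a confidence interval. The simulation below is a minimal sketch with an invented normal population: it draws many samples and checks how often a 95% interval built from each sample actually covers the true population mean.

```python
import math
import random
import statistics

random.seed(7)

POP_MEAN, POP_SD, N, TRIALS = 50.0, 10.0, 100, 1000

covered = 0
for _ in range(TRIALS):
    sample = [random.gauss(POP_MEAN, POP_SD) for _ in range(N)]
    mean = statistics.mean(sample)
    # Approximate 95% interval: mean +/- 1.96 standard errors.
    half_width = 1.96 * statistics.stdev(sample) / math.sqrt(N)
    if mean - half_width <= POP_MEAN <= mean + half_width:
        covered += 1

print(f"intervals covering the true mean: {covered / TRIALS:.1%}")
```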
Data sets
• Data are the facts and figures collected, analyzed, and
summarized for presentation and interpretation.
• All the data collected in a particular study are referred to
as the data set for the study.
• Elements are the entities on which data are collected.
• A variable is a characteristic of interest for the elements.
• Measurements collected on each variable for every
element in a study provide the data set. The set of
measurements obtained for a particular element is
called an observation.
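• These terms map onto a data set naturally in code. A minimal sketch with hypothetical workers as the elements:

```python
# Each dict is one observation: the set of measurements taken on one
# element. Elements here are (hypothetical) individual workers; the
# variables are name, years of education, and hourly wage.
data_set = [
    {"name": "Abebe",  "education_years": 12, "hourly_wage": 18.50},
    {"name": "Chaltu", "education_years": 16, "hourly_wage": 27.00},
    {"name": "Dawit",  "education_years": 10, "hourly_wage": 15.25},
]

variables = list(data_set[0].keys())  # the characteristics of interest
observation = data_set[1]             # all measurements for one element

print("variables:  ", variables)
print("observation:", observation)
```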
Scales of Measurement
• The scale of measurement determines the amount of information
contained in the data and indicates the most appropriate data
summarization and statistical analyses.
• When the data for a variable consist of labels or names used to identify an attribute of the element, the scale of measurement is considered a nominal scale (e.g., sex).
• The scale of measurement for a variable is called an ordinal scale if the data exhibit the properties of nominal data and the order or rank of the data is meaningful (e.g., a rating of service quality as poor, good, or excellent).
• The scale of measurement for a variable is an interval scale if the data have all the properties of ordinal data and the interval between values is expressed in terms of a fixed unit of measure. Interval data are always numeric (e.g., exam marks).
• The scale of measurement for a variable is a ratio scale if the data have all the properties of interval data and the ratio of two values is meaningful. This scale requires that a zero value be included to indicate that nothing exists for the variable at the zero point (e.g., distance or weight).
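• A minimal sketch (with invented values) of what each scale does and does not permit:

```python
# Hypothetical values illustrating the four scales of measurement.
nominal = "female"    # nominal: a label; counting is the main valid operation
ordinal = "good"      # ordinal: ordered labels ("poor" < "good" < "excellent")
interval_c = 20.0     # interval: fixed unit but arbitrary zero (Celsius)
ratio_kg = 5.0        # ratio: true zero, so ratios are meaningful

# Differences are meaningful on an interval scale...
print(25.0 - interval_c, "degrees warmer")   # valid
# ...but ratios are not: 20 C is not "twice as hot" as 10 C,
# because 0 C does not mean "no temperature".

# On a ratio scale both differences and ratios make sense:
print(ratio_kg * 2, "kg is twice as heavy")  # valid
```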
Categorical and Quantitative Data
• Data can be classified as either categorical or quantitative. Data that can be grouped by specific categories are referred to as categorical data. Categorical data use either the nominal or ordinal scale of measurement. Categorical (qualitative) data cannot be measured on a naturally occurring numerical scale but can only be classified into one of a group of categories.
• Data that use numeric values to indicate how much or how many are referred to as quantitative data. Quantitative data are obtained using either the interval or ratio scale of measurement.
• If the variable is categorical, the statistical analysis is limited. We can summarize categorical data by counting the number of observations in each category or by computing the proportion of the observations in each category. However, even when categorical data are identified by a numerical code, arithmetic operations such as addition, subtraction, multiplication, and division do not provide meaningful results. Arithmetic operations do provide meaningful results for quantitative variables.
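• The contrast shows up directly in the operations that make sense. A minimal sketch with invented survey responses, using only the standard library:

```python
import statistics
from collections import Counter

# Hypothetical survey responses.
major = ["econ", "econ", "stats", "math", "econ", "stats"]  # categorical
gpa = [3.1, 3.7, 2.9, 3.5, 3.3, 3.8]                        # quantitative

# Categorical data: counts and proportions are the meaningful summaries.
n = len(major)
for category, count in Counter(major).items():
    print(category, count, f"{count / n:.0%}")

# Quantitative data: arithmetic gives meaningful results.
print("mean GPA:", round(statistics.mean(gpa), 2))
# A "mean major" would be meaningless even if majors were coded 1, 2, 3.
```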
• There are three general kinds of data sets—cross-
sectional, time-series and panel.
• Cross-sectional data are data collected at the same
or approximately the same point in time.
• Time series data are data collected over several
time periods.
• Some data sets are both time-series and cross-sectional. Imagine, for example, a data set containing wage and gender data for a group of individuals in each of a series of years. These are called panel data.
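• A minimal sketch (with invented wage records) of how the three kinds of data sets relate: a panel follows a cross-section of individuals over time, so slicing it by year yields a cross-section and slicing it by person yields a time series.

```python
# Hypothetical wage records: (person_id, year, hourly_wage, gender).
panel = [
    ("p1", 2020, 18.0, "F"), ("p1", 2021, 19.0, "F"),
    ("p2", 2020, 21.5, "M"), ("p2", 2021, 22.0, "M"),
    ("p3", 2020, 17.0, "F"), ("p3", 2021, 17.8, "F"),
]

# Cross-section: every person observed at one point in time.
cross_section_2020 = [row for row in panel if row[1] == 2020]

# Time series: one person followed over several periods.
time_series_p1 = [row for row in panel if row[0] == "p1"]

print(len(cross_section_2020), "observations in the 2020 cross-section")
print(len(time_series_p1), "periods in p1's time series")
```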
Group Assignment

• descriptive statistics
• nominal
• numerical
