Second Lecture

Uploaded by

dinaelkordy


Design and Analysis of Experiments

"Second Lecture"
Dr. Suzan Abdel Rahman
Lecturer, Faculty of Graduate Studies for Statistical Research
Stages of a study: design of study → data collection → data analysis → interpretation and presentation
Population: the entire group for which information is wanted
Sample: a subset of the larger group (population) for which information is collected to learn about the larger group

Continuous data:
• Blood pressure
• Weight
• Height
• Age
• Income

Binary (dichotomous) data: takes on only two values, “yes” or “no”
• Having COVID-19: Yes/No
• Sex: Male/Female
• Smoking: Yes/No

Categorical data: an extension of binary data to include more than two possible values
• Nominal categorical data: the categories have no order [country of birth, marital status]
• Ordinal categorical data: the categories have a natural order [income level classified into four levels; degree of agreement in five categories from strongly disagree to strongly agree]
Describing continuous data
• Measures of the center of data: mean, median
• Measures of data variability: variance, standard deviation
• Other measures of location: percentiles
Measures of the center of data
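▪ Systolic blood pressure (mmHg) of 20 patients

$x_1 = 120$, $x_2 = 80$, $x_3 = 95$, $x_4 = 115$, $x_5 = 89$, $x_6 = 160$, $x_7 = 140$, $x_8 = 120$, $x_9 = 115$, $x_{10} = 120$

$x_{11} = 110$, $x_{12} = 115$, $x_{13} = 180$, $x_{14} = 190$, $x_{15} = 110$, $x_{16} = 105$, $x_{17} = 95$, $x_{18} = 80$, $x_{19} = 120$, $x_{20} = 115$

The sample mean:

$$\bar{x} = \frac{120 + 80 + 95 + 115 + 89 + \cdots + 115}{20} = \frac{2374}{20} = 118.7 \approx 119 \text{ mmHg}$$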
Sample average (arithmetic mean)

$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$$

▪ where $\sum_{i=1}^{n} x_i = x_1 + x_2 + x_3 + \cdots + x_n$

▪ The sample mean ($\bar{x}$) is different from the population mean ($\mu$)

▪ ► $\bar{x}$ is the best estimate of the population mean ($\mu$)

▪ Disadvantages

▪ Sensitive to extreme values (especially in smaller samples): changing the value of a single data point can change the sample mean noticeably. For example, replacing $x_{18} = 80$ with 220 gives
▪ $\bar{x} = 125.7$ mmHg instead of 119 mmHg
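The mean calculation and its sensitivity to one extreme value can be checked with a short sketch. It uses the 20 systolic blood pressure values listed above and Python's standard `statistics` module; the variable names are illustrative only and are not part of the lecture.

```python
from statistics import mean

# Systolic blood pressure (mmHg) of the 20 patients listed above.
sbp = [120, 80, 95, 115, 89, 160, 140, 120, 115, 120,
       110, 115, 180, 190, 110, 105, 95, 80, 120, 115]

sample_mean = mean(sbp)                    # sum of the values divided by n
print(sum(sbp), len(sbp), sample_mean)     # 2374 20 118.7

# Replace x18 = 80 with the extreme value 220.
# Python counts from 0, so x18 sits at index 17.
sbp_extreme = sbp.copy()
sbp_extreme[17] = 220
mean_extreme = mean(sbp_extreme)
print(mean_extreme)                        # 125.7
```

Changing a single observation out of 20 moves the mean by 7 mmHg, which is the sensitivity the slide describes.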
Median
▪ The median is the middle value in an ordered set of continuous data
▪ The median is also called the 50th percentile
▪ The median is not sensitive to the influence of extreme sample values
▪ The median value for five patients:
$x_1 = 120$, $x_2 = 80$, $x_3 = 95$, $x_4 = 115$, $x_5 = 89$
Ordered: 80 89 95 115 120, so the median is 95

▪ If we replace $x_1$ with 220: $x_1 = 220$, $x_2 = 80$, $x_3 = 95$, $x_4 = 115$, $x_5 = 89$

Ordered: 80 89 95 115 220, so the median is still 95
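As a quick check of the five-patient example, the sketch below compares the median and the mean before and after the replacement. It relies only on the standard `statistics` module; the list names are hypothetical.

```python
from statistics import mean, median

original = [120, 80, 95, 115, 89]
with_outlier = [220, 80, 95, 115, 89]   # x1 replaced by 220

for label, values in [("original", original), ("with outlier", with_outlier)]:
    # Sorting is shown only for display; median() orders the data itself.
    print(label, sorted(values), "median =", median(values), "mean =", mean(values))
```

The median is 95 in both cases, while the mean moves from 99.8 to 119.8 mmHg.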
The sample variance
▪ The sample variance is denoted $s^2$; the sample standard deviation is denoted $s$ (or SD)
▪ The sample variance is approximately the average of the squared deviations about the sample mean (the sum is divided by $n - 1$ rather than $n$).

$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$

▪ The sample standard deviation is the square root of the sample variance

$$s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$$

► s is the best estimate of the population standard deviation (σ)
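▪ Systolic blood pressures (mmHg), n = 5: 120, 80, 90, 110, 95. The mean $\bar{x}$ is 99 mmHg.
▪ ► The sample variance computation, numerator:
$$(120-99)^2 + (80-99)^2 + (90-99)^2 + (110-99)^2 + (95-99)^2 = 441 + 361 + 81 + 121 + 16 = 1020$$
▪ ► Dividing by $n - 1 = 4$ gives $s^2 = 1020/4 = 255$, so $s = \sqrt{255} \approx 16$ mmHg.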
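The computation for this five-value sample can be sketched step by step in Python. This is a minimal illustration using the n − 1 divisor defined above; the variable names are assumptions made for the example.

```python
from math import sqrt

sbp = [120, 80, 90, 110, 95]          # mmHg, n = 5
n = len(sbp)
x_bar = sum(sbp) / n                  # sample mean: 495 / 5 = 99.0

# Numerator: sum of squared deviations about the sample mean.
squared_devs = [(x - x_bar) ** 2 for x in sbp]
numerator = sum(squared_devs)         # 441 + 361 + 81 + 121 + 16 = 1020

s2 = numerator / (n - 1)              # sample variance: 1020 / 4 = 255
s = sqrt(s2)                          # sample standard deviation, roughly 16 mmHg

print(f"mean = {x_bar}, numerator = {numerator}, s^2 = {s2}, s = {s:.2f}")
```

The standard library functions `statistics.variance` and `statistics.stdev` use the same n − 1 divisor, so they can serve as a cross-check.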
The sample variance
The more variability there is in the sample of data, the larger the value of s
► s measures the variability (spread) of the individual sample values around the
sample mean
► s can equal 0 only if there is no variability (if all n sample observations have
the same value)
► The units of s are the same as the units of the data measurements in the
sample (for example, mmHg)
► Often abbreviated SD or sd
► $s^2$ is the best estimate from the sample of the population variance $\sigma^2$; $s$ is the best estimate of the population standard deviation $\sigma$
Female   Female   Male   Male
142      142      142    162
120      123      120    183
115      107      115    187
140      129      140    179
155      114      155    154
135      105      135    195
140      128      140    178
150      108      150    168

…………..

Estimate: …………

……………
Percentiles
Sample percentiles are used to describe the distribution of continuous data.
The $p$th sample percentile is the value such that $p$ percent of the sample values are less than or equal to it.
Percentiles can be computed by hand or by computer.
Systolic blood pressure (SBP) measurements from a random sample of 113 adult men
taken from a clinical population (based on results from a computer)
► The 10th percentile for these 113 blood pressure measurements is 107 mmHg,
meaning that approximately 10% of the men in the sample have SBP ≤ 107 mmHg, and
(100−10) = 90% of the men have SBP > 107 mmHg
► The 75th percentile for these 113 blood pressure measurements is 132 mmHg,
meaning that approximately 75% of the men in the sample have SBP ≤ 132 mmHg, and
(100−75) = 25% of the men have SBP > 132 mmHg
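The definition above can be turned into a small sketch. The 113 measurements behind the slide are not reproduced in the lecture, so the example reuses the 20 systolic blood pressure values from the sample mean example. It applies the "nearest-rank" rule, which is an assumption: software packages use several slightly different percentile conventions, and the lecture does not say which one produced its results. The function name `sample_percentile` is hypothetical.

```python
from math import ceil

def sample_percentile(values, p):
    """Smallest sample value such that at least p percent of the values are <= it.

    Nearest-rank convention (an assumption; other conventions interpolate).
    """
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(values)
    rank = ceil(p * len(ordered) / 100)   # 1-based position in the ordered sample
    return ordered[rank - 1]

sbp = [120, 80, 95, 115, 89, 160, 140, 120, 115, 120,
       110, 115, 180, 190, 110, 105, 95, 80, 120, 115]

for p in (10, 50, 75):
    value = sample_percentile(sbp, p)
    share = sum(x <= value for x in sbp) / len(sbp)
    print(f"{p}th percentile = {value} mmHg; {share:.0%} of values are <= {value}")
```

With tied values the share at or below a percentile can exceed p, which is why the slide's interpretation says "approximately".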
Continuous Data: Visual Displays
Utilize histograms and boxplots to visualize the distributions of samples of
continuous data

► Identify key summary statistics on the boxplot

► Name and describe basic characteristics of some common distribution shapes for continuous data
► Means, standard deviations, and percentile values do not tell the whole story
of data distributions

► Differences in shape of the distribution

Histograms are a way of displaying the distribution of a set of data by charting the
number (or percentage) of observations whose values fall within pre-defined
numerical ranges
Data on systolic blood pressure (SBP) from a random clinical sample of 113 men
► A histogram can be created by:
► Breaking the data (blood pressure) range into bins of equal width
► Counting the number of the 113 observations whose blood pressure values fall
within each bin
► Plotting the number (or relative frequency) of observations that fall within
each bin as a bar graph
Percentage of observations on the vertical axis, larger bin width
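The three histogram steps (equal-width bins, counting, plotting counts or percentages) can be sketched without a plotting library. The example below reuses the 20 systolic blood pressure values from the sample mean example, since the 113-man dataset is not listed in the lecture. The bin start, width, and number of bins are hypothetical choices, and `histogram_counts` is an illustrative helper name.

```python
def histogram_counts(values, start, width, n_bins):
    """Count observations in n_bins equal-width bins [start + k*width, start + (k+1)*width)."""
    counts = [0] * n_bins
    for x in values:
        k = int((x - start) // width)     # index of the bin that contains x
        if 0 <= k < n_bins:               # values outside the bin range are ignored
            counts[k] += 1
    return counts

sbp = [120, 80, 95, 115, 89, 160, 140, 120, 115, 120,
       110, 115, 180, 190, 110, 105, 95, 80, 120, 115]

start, width, n_bins = 80, 20, 6          # bins 80-99, 100-119, ..., 180-199
counts = histogram_counts(sbp, start, width, n_bins)

# Text "bar graph": one row per bin, with the count and the percentage.
for k, c in enumerate(counts):
    lo = start + k * width
    pct = 100 * c / len(sbp)
    print(f"{lo:3d}-{lo + width - 1:3d} | {'#' * c} {c} ({pct:.0f}%)")
```

Changing `width` shows how a larger bin width smooths the picture, and switching from counts to percentages changes only the vertical scale, not the shape.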
Boxplots are graphics that display key characteristics of a dataset. They are especially useful for visually comparing data from multiple samples.

Q1: lower quartile (25% of the data points are below Q1 when arranged in increasing order)
Q2: median (divides the data into two equal parts)
Q3: upper quartile (75% of the data points are below Q3 when arranged in increasing order)
Interquartile range (IQR) = Q3 − Q1
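The quartiles and IQR that a boxplot displays can be computed with the standard library's `statistics.quantiles`. This sketch reuses the 20 systolic blood pressure values from the sample mean example. Note that quartile conventions differ between textbooks and software; the call below uses the function's default method, which may not match the rule behind the lecture's figures.

```python
from statistics import quantiles

sbp = [120, 80, 95, 115, 89, 160, 140, 120, 115, 120,
       110, 115, 180, 190, 110, 105, 95, 80, 120, 115]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = quantiles(sbp, n=4)
iqr = q3 - q1                      # width of the box in a boxplot

print(f"Q1 = {q1}, median (Q2) = {q2}, Q3 = {q3}, IQR = {iqr}")
```

The box of a boxplot spans Q1 to Q3, with a line at the median.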
[Figure panels: left (negatively) skewed distribution; right (positively) skewed distribution]
▪ Histograms and boxplots are useful visual tools for characterizing the shape of a data
distribution above and beyond the information given by summary statistics.
▪ Relatively common shapes for samples of continuous data measures include symmetric and
“bell” shaped, right skewed, left skewed, and uniform
▪ Suggest graphical approaches to comparing distributions of continuous data between two or
more samples
► Explain why a difference in sample means can be used to quantify, in a single number summary,
differences in distributions of continuous data.
Such comparisons can be used to investigate questions, such as:
► How does weight change differ between those who are on a low-fat diet compared to those on
a low-carbohydrate diet?
► How do salaries differ between males and females?
► How do cholesterol levels differ across weight groups?
Common numerical comparison: difference in means

► On average, male children weigh more than female children by 0.7 kg

The direction of the comparison does not change the conclusion: "on average, female children weigh less than male children by 0.7 kg" is the same as stating "on average, male children weigh more than female children by 0.7 kg".
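A difference in means is a single subtraction, and reversing the order of the groups only flips its sign. The sketch below illustrates this with made-up weights: the lecture does not give the children's data, so both lists are hypothetical and were chosen only so that the difference comes out at 0.7 kg.

```python
from statistics import mean

# Hypothetical weights in kg (not from the lecture), for illustration only.
male_weights = [15.0, 16.0, 17.0, 14.8]
female_weights = [14.0, 15.5, 16.0, 14.5]

diff_male_minus_female = mean(male_weights) - mean(female_weights)
diff_female_minus_male = mean(female_weights) - mean(male_weights)

print(f"male - female: {diff_male_minus_female:+.1f} kg")
print(f"female - male: {diff_female_minus_male:+.1f} kg")
```

Both numbers describe the same comparison; only the reference group changes.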
Normal distribution

▪ The normal distribution is a theoretical probability distribution that is perfectly symmetric about its mean (and median and mode)
▪ ► A “bell”-like shape
▪ Normal distributions are uniquely defined by two quantities: a
mean (µ) and standard deviation (σ)
▪ All normal distributions, regardless of mean and standard
deviation values, have the same structural properties:
► Mean = median (= mode)
► Values are symmetrically distributed around the mean
► Values “closer” to the mean are more frequent than values
“farther” from the mean
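These structural properties can be checked numerically with `statistics.NormalDist` from the Python standard library. The mean and standard deviation below (119 and 16) are hypothetical values chosen only for illustration; any other pair would show the same behaviour.

```python
from statistics import NormalDist

mu, sigma = 119, 16                 # hypothetical parameters, for illustration only
dist = NormalDist(mu, sigma)

# Mean = median = mode.
print(dist.mean, dist.median, dist.mode)

# Symmetry: the density is equal at the same distance on either side of the mean.
for d in (5, 16, 32):
    print(d, dist.pdf(mu - d), dist.pdf(mu + d))

# Values closer to the mean are more frequent (higher density) than values farther away.
densities = [dist.pdf(mu + d) for d in (0, 5, 16, 32)]
print(densities == sorted(densities, reverse=True))
```

Because the whole curve is fixed by these two numbers, changing `mu` shifts it and changing `sigma` stretches or narrows it without altering the bell shape.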
