Choosing The Right Elementary Statistical Test: Type of Question Level of Data / Assumptions Examples Statistical Test
Choosing The Right Elementary Statistical Test: Type of Question Level of Data / Assumptions Examples Statistical Test
The first step in determining what statistical test to use is to determine the type of research question to be answered by the statistical analysis. In elementary courses, the two basic types of questions are: 1. the degree of relationship or dependence among variables (H0 = there is no relationship or dependence, and the statistical test answers the question as to whether any relationship or dependence found is sufficiently different from zero that it can be co nsidered statistically significant.) 2. the significance of group differences (H0 = there is no difference between groups. The statistical test answers the question as to whether an observed difference is probably due just to random factors, or is large enough to be considered statistically significant and due to the treatment factor.) The next step is to determine the nature(s) of the variables under discussion, and whether they meet the assumptions of a particular test (e.g. the data are normally distributed). Types/levels of data: nominal = unordered categories (e.g. religion; country of birth; etc.) ordinal = ordered categories (e.g. level of agreement on an opinion survey; proficiency level at a martial art as measured by the col our of ones belt) For nominal and ordinal data, what is usually recorded is the number of occurrences of a particular result (e.g. number of Christians, number of Buddhists etc. but these numbers are not the values of the variable. In this case, variable = religion, values = Christian, Buddhist, and the numbers are the number of occurrences of a particular value.) interval = ordinal + distance between values is of constant size (e.g. temperature) ratio = interval + (i) there is a meaningful zero and (ii) the ratio between two numbers is meaningful (e.g. weight, distance, number of children) ratio and interval data can be either discrete (i.e. there are gaps between values, e.g. number of children) or continuous (i.e. there are no gaps between values (e.g. weight, height).
Examples
Statistical Test
(i) Is eye colour independent of hair colour? (ii) Is opinion about a government policy independent of political affiliation? (i) Does GPA depend on IQ? (ii) Does weight depend on height?
Contingency table
interval, ratio
t-test for correlations using Pearsons correlation coefficient t-test for correlations using Spearmans rs
ordinal
(i) How strong is the relationship between the starting grid position of a race driver and their finishing position?
(i) Are the absentee rates the same for each day of the week? (ii) Are the number of workplace accidents the same for each hour of the day? (iii) Is the observed set of scores normally distributed?
Goodness-of-fit
(i) Is the mean weight of soup in a can the same as stated on the label? (ii) Are 90% of pizza orders delivered within 30 minutes? (Success = delivered within 30 minutes, failure = delivered over 30 minutes.)
If samples are independent, two-sample t-test (small samples) or ztest (large samples), related sample t-test if samples are dependent. Mann-Whitney
Are the means of three or more populations (as estimated by samples) the same? As above As above, but more than one independent variable.
H0:
==
ANOVA Kruskal-Wallis Factorial ANOVA (Looking for main and interaction effects) Linear contrast
>