Statistics
2. What is a sample?
• A sample is a subset of a population, selected to draw conclusions about the whole
population.
• The mean is the average of a set of numbers, calculated by adding them up and dividing
by the count.
• The median is the middle value in a list of numbers sorted in ascending or descending
order. When the list has an even number of values, the median is the average of the two
middle values.
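The two definitions above can be sketched in a few lines of Python. The data values are made
up for illustration, and the list has an even count so the median is the average of the two
middle values. The standard-library statistics module is used only as a cross-check.

```python
import statistics

data = [4, 8, 15, 16, 23, 42]  # made-up values for illustration

# Mean: add the values up and divide by the count.
mean_value = sum(data) / len(data)

# Median: middle value of the sorted list; with an even count,
# average the two middle values.
ordered = sorted(data)
n = len(ordered)
mid = n // 2
if n % 2 == 1:
    median_value = ordered[mid]
else:
    median_value = (ordered[mid - 1] + ordered[mid]) / 2

print(mean_value, median_value)  # 18.0 15.5

# Cross-check against the standard library.
assert mean_value == statistics.mean(data)
assert median_value == statistics.median(data)
```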
6. What is range?
• The range is the difference between the highest and lowest values in a dataset.
7. What is standard deviation?
• Standard deviation measures how spread out the values in a dataset are around the mean;
it is the square root of the variance.
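A minimal sketch of both measures, using made-up data. It assumes the distinction between
the population form (divide by n) and the sample form (divide by n - 1), which the answers
above do not spell out.

```python
import math
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up values for illustration

# Range: highest value minus lowest value.
data_range = max(data) - min(data)  # 7

# Standard deviation: square root of the average squared distance from the mean.
mean_value = sum(data) / len(data)  # 5.0
squared_devs = [(x - mean_value) ** 2 for x in data]

# Population form divides by n; sample form divides by n - 1.
pop_std = math.sqrt(sum(squared_devs) / len(data))
sample_std = math.sqrt(sum(squared_devs) / (len(data) - 1))

print(data_range, pop_std, round(sample_std, 3))
```

The standard library offers the same two calculations as statistics.pstdev and
statistics.stdev.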
8. What is variance?
• Variance is the average of the squared differences between each value and the mean.
9. What is a histogram?
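• A histogram is a bar graph that represents the frequency distribution of numerical data:
the values are grouped into intervals (bins), and each bar's height shows how many values
fall in that bin.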
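As a rough illustration, the counting behind a histogram can be sketched as a text chart.
The values and the bin width of 5 are assumptions chosen for the example.

```python
from collections import Counter

values = [1, 2, 2, 3, 3, 3, 7, 8, 8, 12]  # made-up values for illustration
bin_width = 5                              # assumed bin width

# Map each value to a bin index; bin k covers [k * bin_width, (k + 1) * bin_width).
counts = Counter(v // bin_width for v in values)

# Print one bar per bin, one '#' per value in the bin.
for k in sorted(counts):
    low, high = k * bin_width, (k + 1) * bin_width
    print(f"[{low:>2}, {high:>2}) {'#' * counts[k]}")
```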
• A scatter plot is a graph used to study the relationship between two variables.
• Correlation measures the strength and direction of the linear relationship between two
variables. The correlation coefficient ranges from -1 to 1.
• Positive correlation means that as one variable increases, the other tends to increase.
Negative correlation means that as one variable increases, the other tends to decrease.
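A minimal sketch of the Pearson correlation coefficient, one common way to measure
correlation. The function name pearson_r and the three small datasets are made up for
illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How the two variables move together, and how much each moves on its own.
    co_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sum((x - mean_x) ** 2 for x in xs)
    dev_y = sum((y - mean_y) ** 2 for y in ys)
    return co_dev / math.sqrt(dev_x * dev_y)

hours = [1, 2, 3, 4, 5]
score = [52, 55, 61, 64, 70]   # rises as hours rise
errors = [9, 8, 6, 5, 2]       # falls as hours rise

print(round(pearson_r(hours, score), 3))   # close to +1: positive correlation
print(round(pearson_r(hours, errors), 3))  # close to -1: negative correlation
```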
• Regression analysis is used to predict the value of a dependent variable based on the
value of one or more independent variables.
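The simplest case, one independent variable and a straight line fitted by least squares, can
be sketched as follows. The helper name fit_line and the data are hypothetical; the points
are chosen to lie exactly on y = 2x + 1 so the result is easy to check by hand.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [1, 2, 3, 4, 5]      # independent variable
ys = [3, 5, 7, 9, 11]     # dependent variable, exactly 2x + 1

slope, intercept = fit_line(xs, ys)
predicted = slope * 6 + intercept  # predict y for a new x = 6

print(slope, intercept, predicted)  # 2.0 1.0 13.0
```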
• Correlation means that two variables tend to change together, while causation means that
a change in one variable directly produces a change in the other. Correlation alone does
not prove causation.
• A p-value is the probability of obtaining results at least as extreme as the observed
results, assuming the null hypothesis is true.
• A Type I error (false positive) occurs when we reject the null hypothesis when it is
actually true.
• A Type II error (false negative) occurs when we fail to reject the null hypothesis when it
is actually false.
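One way to make the p-value and the two error types concrete is a one-sided coin-flip test.
Everything specific here is an assumption for illustration: 10 flips, 9 observed heads, a
significance level of 0.05, and a hypothetical biased coin that lands heads 80% of the time.

```python
from math import comb

def binom_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

n_flips = 10
observed_heads = 9
alpha = 0.05  # assumed significance level

# p-value: probability of 9 or more heads if the coin is fair (null hypothesis).
p_value = binom_tail(n_flips, observed_heads, 0.5)
reject_null = p_value < alpha

# Smallest head count whose tail probability under the null is below alpha.
critical = min(k for k in range(n_flips + 1) if binom_tail(n_flips, k, 0.5) < alpha)

# Type I error rate: chance of rejecting the null when the coin really is fair.
type_1_rate = binom_tail(n_flips, critical, 0.5)

# Type II error rate: chance of failing to reject when the coin is actually
# biased (hypothetical true probability of heads: 0.8).
type_2_rate = 1 - binom_tail(n_flips, critical, 0.8)

print(round(p_value, 4), reject_null, critical)
print(round(type_1_rate, 4), round(type_2_rate, 4))
```

With so few flips the test rarely commits a Type I error but often commits a Type II error
against this biased coin, which shows the usual trade-off between the two.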
ADVANCED QUESTIONS
21. What is ANOVA?
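• ANOVA (Analysis of Variance) is used to compare the means of three or more groups. It
tests whether all group means are equal by comparing the variation between groups with the
variation within groups.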
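A minimal sketch of the one-way ANOVA F statistic, the ratio of between-group to within-group
variation. The function name one_way_f and the three groups are made up for illustration.
Turning F into a p-value requires the F distribution, which this sketch leaves out.

```python
def one_way_f(groups):
    """F statistic for a one-way ANOVA over a list of groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of the values around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within

groups = [[4, 5, 6], [6, 7, 8], [9, 10, 11]]  # made-up data, three groups
f_stat = one_way_f(groups)
print(round(f_stat, 3))  # 19.0
```

A large F value means the group means differ by more than the spread inside the groups
would suggest.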
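• The Central Limit Theorem states that the distribution of the sample mean approaches a
normal distribution as the sample size increases, regardless of the shape of the population
distribution, provided the observations are independent and the population has finite
variance.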
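The theorem can be seen in a small simulation. This sketch assumes a strongly skewed
population (an exponential distribution with mean 1), 2000 repeated samples per sample size,
and a fixed random seed; all three are choices made for illustration. As the sample size
grows, the sample means cluster more tightly around 1 and their skewness moves toward 0, the
skewness of a normal distribution.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

def sample_means(sample_size, n_samples=2000):
    """Means of n_samples samples drawn from an exponential distribution."""
    return [
        statistics.mean([random.expovariate(1.0) for _ in range(sample_size)])
        for _ in range(n_samples)
    ]

def skewness(values):
    """Average cubed z-score; 0 for a perfectly symmetric distribution."""
    m = statistics.mean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)

for size in (1, 5, 50):
    means = sample_means(size)
    print(
        size,
        round(statistics.mean(means), 3),
        round(statistics.pstdev(means), 3),
        round(skewness(means), 3),
    )
```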
• A residual is the difference between the observed value and the predicted value of the
dependent variable.
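As a short illustration, residuals are computed value by value. Both lists below are made
up; the predictions stand in for the output of some hypothetical fitted model.

```python
observed = [2.0, 4.1, 5.9, 8.2]    # made-up measurements
predicted = [2.0, 4.0, 6.0, 8.0]   # hypothetical model predictions

# Residual = observed value minus predicted value.
residuals = [obs - pred for obs, pred in zip(observed, predicted)]

# The sum of squared residuals is a common measure of overall fit.
sum_squared_residuals = sum(r ** 2 for r in residuals)

for obs, pred, res in zip(observed, predicted, residuals):
    print(f"observed={obs:.1f} predicted={pred:.1f} residual={res:+.1f}")
print(round(sum_squared_residuals, 2))  # 0.06
```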