PCP 404 Bio Res Methods
PCP 404 Bio Res Methods
• Data presentation;
• inferential statistics:
• Hypothesis testing.
• Parametric and non-parametric tests:
• Student T test (paired and independent); Z score test; ANOVA; Chi square test and their non-
parametric equivalent.
• Correlation and Regression.
Introduction
Biostatistics and Research Methodology are essential components of the pharmacy curriculum,
equipping students with the skills to design studies, collecting, organizing, summarizing,
presenting, analyze data, and make evidence-based decisions in pharmaceutical practice and
research.
Integrating biostatistics with research methodology enables pharmacy students to not only
conduct and analyze research effectively but also to apply these findings in clinical settings,
thereby enhancing patient outcomes and advancing the field of pharmacy.
Data Presentation
o Example: "The study observed that 60% of participants responded positively to the
treatment, while 40% showed no significant improvement."
2. Tabular Presentation:
o Utilizes tables to organize data into rows and columns, allowing for systematic
arrangement and easy comparison.
o Effective for presenting large datasets or complex information succinctly.
3. Graphical Presentation:
o Employs visual elements like charts and graphs to represent data, making patterns
and trends more discernible.
o Common types include:
▪ Line Graphs: Display data points connected by lines, useful for showing
trends over time.
• Relevance: Include only pertinent data that supports the study's objectives.
• Consistency: Maintain uniformity in formatting, scales, and labeling across all data
presentations.
Inferential statistics
Inferential statistics is a branch of statistics that allows researchers to make conclusions and
predictions about a population based on data collected from a sample. This approach enables
analysts to extend findings beyond the immediate dataset, providing insights that inform decisions
about larger groups or phenomena.
2. Sampling Methods:
3. Estimation:
o Interval Estimation: Offers a range of values (confidence interval) within which the
population parameter is expected to lie.
4. Hypothesis Testing:
5. Regression Analysis:
Hypothesis testing
Hypothesis testing is a fundamental statistical method used to make inferences about a population
based on sample data. It involves formulating two opposing hypotheses—the null hypothesis (H₀)
and the alternative hypothesis (H₁)—and using sample data to determine which hypothesis is
supported by the evidence.
Key Components:
1. Null Hypothesis (H₀): This is the default assumption that there is no effect or difference in
the population. For example, stating that a new drug has no effect on blood pressure
compared to a placebo.
2. Alternative Hypothesis (H₁ or Ha): This hypothesis suggests that there is an effect or
difference. Continuing the example, it would posit that the new drug does affect blood
pressure.
3. Select the Appropriate Test: Depending on the data and research question, choose a
statistical test (e.g., t-test, chi-square test).
4. Calculate the Test Statistic: Compute the statistic that summarizes the data in the context
of the chosen test.
5. Determine the p-value: The p-value indicates the probability of obtaining results at least as
extreme as the observed ones, assuming H₀ is true.
6. Make a Decision: If the p-value is less than or equal to α, reject H₀; otherwise, do not reject
H₀.
Example:
Suppose researchers want to test if a new medication lowers blood pressure more effectively than
an existing one.
• H₀: There is no difference in effectiveness between the new and existing medications.
• H₁: The new medication is more effective than the existing one.
After conducting a study and analyzing the data, they calculate a p-value of 0.03. With a
significance level of 0.05, they reject H₀ and conclude that the new medication is statistically
significantly more effective.
Considerations:
• Type I Error (False Positive): Rejecting H₀ when it is true.
Hypothesis testing provides a structured framework for making data-driven decisions and
assessing the strength of evidence in research findings.
Parametric and non-parametric tests are two broad categories of statistical tests used to analyze
data. The choice between them depends on the nature of the data, distribution assumptions, and
sample size.
1. Parametric Tests
These tests assume that the data follows a specific distribution, typically a normal distribution.
They require that the data meets certain conditions, such as homogeneity of variance and interval
or ratio level of measurement.
• Pearson’s Correlation Coefficient: Measures the strength and direction of the linear
relationship between two variables.
2. Non-Parametric Tests
These tests do not assume a specific distribution for the data, making them useful when normality
cannot be assumed or when dealing with ordinal or non-normally distributed interval data.
• Kruskal-Wallis Test: Alternative to ANOVA, used for comparing three or more groups.
• Chi-Square Test: Used for categorical data to test associations between variables.
Student’s t-test (paired and independent), Z-score test, ANOVA, Chi-square test, and their
non-parametric equivalents:
The t-test is a parametric test used to compare the means of two groups. It assumes that the data
follows a normal distribution and that the samples have equal variances.
• Example: Comparing the average blood pressure of two different groups of patients (e.g.,
hypertensive vs. non-hypertensive).
• Assumptions:
• Example: Measuring patients’ blood pressure before and after a drug intervention.
• Assumptions:
Non-Parametric Equivalent:
2. Z-score Test
• Purpose: Used to compare sample and population means when the population standard
deviation (σ\sigma) is known.
• Example: Testing if the average cholesterol level in a sample differs from a known
population mean.
• Assumptions:
o Data is normally distributed or has a large sample size (n>30n > 30).
Non-Parametric Equivalent:
• The Z-test doesn’t have a direct non-parametric equivalent, but Sign Test can be used for
single-sample comparisons.
ANOVA is a parametric test that compares means across three or more groups.
• Purpose: Tests if there are significant differences between the means of three or more
independent groups.
• Example: Studying the effect of drug type (A, B, C) and age group (Young, Old) on blood
pressure.
Non-Parametric Equivalent:
• Assumptions:
o Data is categorical.
• Example: Testing if the distribution of blood types (A, B, AB, O) in a sample matches the
general population distribution.
Non-Parametric Equivalent:
Chi-square is already non-parametric, but for small sample sizes, Fisher’s Exact Test is used.
test groups
Two-way ANOVA Compare multiple groups with two factors Friedman Test
Practical examples of applying these statistical tests in pharmaceutical and clinical research:
Scenario: A study compares the effect of Drug A vs. Drug B on reducing blood pressure.
• Application: If the mean systolic blood pressure reduction differs significantly between
the two groups, the independent t-test will determine if the difference is statistically
significant.
Used when blood pressure reduction values are not normally distributed.
Scenario: A clinical trial tests the effect of a new diabetes drug on fasting blood glucose levels.
• Application: A paired t-test will assess if the difference in glucose levels before and after
treatment is statistically significant.
3. Z-score Test
Example: Comparing a Sample to Population Data
• Application: A Z-test determines if the mean LDL cholesterol level in the study sample
significantly differs from the general population.
4. One-Way ANOVA
Scenario: A pharmaceutical company develops three formulations of a pain relief drug and wants
to compare their effectiveness.
• Application: ANOVA determines if the mean pain reduction score significantly differs
among the three groups.
5. Two-Way ANOVA
Scenario: Researchers study how a hypertension drug’s effectiveness varies based on:
• Application: Two-way ANOVA assesses both factors and their interaction to see if they
influence blood pressure reduction.
Scenario: A hospital collects data on adverse effects from a new antibiotic and categorizes them
as:
1. Present (Yes/No)
Scenario: A genetic study tests whether the observed distribution of a drug metabolism gene
variant in a population matches the expected distribution.
Conclusion
Relationship between
Chi-square test Adverse drug reactions and gender
categorical variables
Step-by-step guides on how to perform these statistical tests in SPSS for pharmaceutical and
clinical research data analysis.
Scenario: Comparing the reduction in blood pressure between Drug A and Drug B.
SPSS Procedure:
1. Load Data:
o Ensure data is structured with one column for Blood Pressure Reduction and one
for Drug Group (A/B).
3. Select:
4. Click OK.
5. Interpret Output:
o Look at the p-value (Sig. 2-tailed) under Levene's Test for Equality of Variances.
SPSS Procedure:
2. Select:
3. Click OK.
4. Interpret Output:
SPSS Procedure:
SPSS Procedure:
2. Select:
3. Click OK.
4. Interpret Output:
o Look at Sig. value under ANOVA → If p < 0.05, there is a significant difference.
Scenario: Studying how drug dosage and age group affect blood pressure.
SPSS Procedure:
2. Select:
3. Click OK.
4. Interpret Output:
o Check Sig. values for main effects (Dosage & Age) and interaction effect (Dosage *
Age).
SPSS Procedure:
1. Go to: Analyze → Descriptive Statistics → Crosstabs.
2. Select:
4. Interpret Output:
o Look at Asymp. Sig. under Chi-Square Tests. If p < 0.05, the association is
significant.
SPSS Procedure:
4. Click OK.
5. Interpret Output:
o Check Sig. under Chi-Square Tests. If p < 0.05, the sample distribution differs
significantly.
Final Thoughts
Test SPSS
1. Correlation Analysis
Correlation measures the strength and direction of a relationship between two variables. It does
not imply causation.
Example in Pharmaceutical Research
• Scenario: A study investigates the correlation between dosage of a painkiller and pain
relief score.
• Expected Outcome: A positive correlation means higher doses provide greater pain relief.
Pearson’s Correlation When both variables are normally Measures linear relationship
(r) distributed and continuous (values range from -1 to +1)
4. Interpret Output:
2. Regression Analysis
Types of Regression:
Simple Linear Predict one outcome based on Predict blood sugar levels based on insulin
Regression one predictor dose
Regression Type Use Case Example
Multiple Linear Predict one outcome using Predict cholesterol reduction using diet,
Regression multiple predictors exercise, and medication
2. Select:
3. Click OK.
4. Interpret Output: