BRM Notes
BRM Notes
In statistics, the sample size is the measure of the number of individual samples
used in an experiment. For example, if we are testing 50 samples of people who
watch TV in a city, then the sample size is 50. We can also term it Sample Statistics.
Population data
Sample data
Population data is a large amount of data that includes the whole area of study,
which is termed as population. A population consists of all the elements that are
studied for the research.
On the other hand, sample data is a part of the population. Usually, it is quite clumsy
and difficult to compute the whole population. In this case, a representative sample is
selected from the population. This sample is termed sample data.
The sample size is defined as the number of observations used for determining the
estimations of a given population. The size of the sample has been drawn from the
population. Sampling is the process of selection of a subset of individuals from the
population to estimate the characteristics of the whole population. The number of
entities in a subset of a population is selected for analysis.
Sometimes the sample size can be very small. When the sample size is small (n <
30), we use the t distribution in place of the normal distribution. If the population
variance is unknown and the sample size is small, then we use the t statistic to test
the null hypothesis with both one-tailed and two-tailed, where
Where,
Generate for more accurate estimates but large sample size might cause difficulties
in interpreting the usual tests of significance, and the same problem may arise in
case of very small sample size. Thus, neither too large nor too small sample sizes
help research projects.
Formula
The sample size formula for the infinite population is given by:
Where,
SS = Sample Size
Z = Z -Value
P = Percentage of Population
C = Confidence interval
When the sample input or data is obtained, and the sample mean x̄ is calculated, the
sample mean obtained is different from the population mean μ. This difference
between the population mean and the sample mean can be considered as an error
E, which is the maximum difference between the observed sample mean and the
true value of the population mean.
The above-given formula can be solved for n, which can be used to determine the
minimum sample size.
Therefore, the formula to find the minimum sample size is given by
A = n / (1 + (n – 1)/P)
Where, A represents the adjusted sample size, n stands for the sample size, P
denotes the population size. For a population of infinite size, the formula is
expressed concerning the z-value and error margin:
Solved Example
Question 1:
Assuming the heights of students in a college campus are normally distributed with a
standard deviation = 5 in, find the minimum size required to construct a 95%
confidence interval for mean with a maximum error = 0.5 in.
Solution:
Now, substitute the given values in the sample size formula, we get
Therefore, rounding this value up to the next integer, the minimum sample space
required is 385.
Example 2: Determine the adjusted sample size for a sample size of 150 from a
population of 30,000.
Solution: Given: n=150 ,P=30,000
A= 1.004967 150
A≈149.254
Example 3: Calculate the adjusted sample size for a sample of 50 from a population
of 5,000.
A= 1.0098 50
A≈49.55
Example 4: Determine the population size if the adjusted sample size is 85.6, given
a sample size of 90.
1+ 89 / P = 90 / 85.6
89 / P = 90 / 85.6 −1
89/ P ≈1.0491
P≈ 1.0491 89
P≈84.78
2. Stages of Research: A Breakdown
Research is a systematic process of inquiry that involves collecting data, analyzing it, and
drawing conclusions to answer a specific question or solve a problem. Here's a breakdown of
the key stages involved:
Remember, research is an iterative process. You may need to revisit previous stages as you
analyze your data and refine your understanding of the topic.
Here are some solved examples to help you understand how to interpret SPSS output for
ANOVA and Regression analysis:
Scenario: A researcher wants to investigate the impact of different fertilizer types (3 groups)
on plant growth (measured in centimeters).
SPSS Output:
Interpretation:
F-value (5.24): This value indicates a statistically significant difference (Sig. = 0.012, which
is less than the typical significance level of 0.05) between at least two fertilizer types in their
impact on plant growth.
Mean Squares (MS): The MS Between Groups (62.725) is larger than the MS Within
Groups (6.584), suggesting a greater variance in plant growth between the fertilizer groups
compared to the variation within each group.
Further Analysis: Since the ANOVA suggests a significant difference, you might need to
conduct post-hoc tests (e.g., Tukey's HSD) to identify which specific fertilizer types differ in
their effect on plant growth.
2. Linear Regression:
SPSS Output:
Interpretation:
R-squared (0.624): This value indicates that 62.4% of the variance in brand awareness is
explained by the advertising spending variable.
Standardized Coefficient (Beta) for Advertising Spending (0.789): This positive value
suggests a positive relationship between advertising spending and brand awareness. For every
one-unit increase in advertising spending, brand awareness is expected to increase by 0.789
units (on the standardized scale).
P-value (Sig.) for Advertising Spending (0.000): This value is less than the significance
level (usually 0.05), indicating a statistically significant relationship between advertising
spending and brand awareness.
Overall, the regression analysis suggests that advertising spending has a positive and
statistically significant impact on brand awareness.
4. Solved Examples of F & P-value
Scenario: A bakery is testing the effect of different flour types (3 groups) on the rise time of
bread dough. They suspect a difference in rise time based on the flour type.
Null Hypothesis (H₀): There is no significant difference in the average rise time between the
three flour types.
Alternative Hypothesis (H₁): There is a significant difference in the average rise time
between the three flour types.
SPSS Output:
Explanation:
F-value (10.80): A relatively high F-value suggests a potential difference in rise time
between the flour groups.
P-value (0.002): This value is less than the significance level (usually 0.05). This indicates
we can reject the null hypothesis and conclude that there is a statistically significant
difference in rise time between at least two flour types.
2. Linear Regression:
Scenario: A company is investigating the relationship between customer age and purchase
amount.
SPSS Output:
Explanation:
P-value (Sig.) for Customer Age (0.042): This value is less than the significance level,
indicating a statistically significant relationship between customer age and purchase amount.
F-value (not shown in regression output, but can be calculated): While regression doesn't
directly display F-value, it can be calculated from the p-value. A low p-value suggests a high
F-value, which again points to a statistically significant relationship.
Note: While F-value is often associated with ANOVA, it can also be used in regression
analysis to test the overall significance of the model (i.e., whether the independent variables
together explain a significant portion of the variance in the dependent variable).
In both examples, a low P-value (along with a high F-value in the ANOVA case)
suggests that the observed results are unlikely due to chance and provide evidence to
reject the null hypothesis. This allows researchers to conclude that there is a statistically
significant effect or relationship.
5. Solved Examples: Identifying Variables and Management Problems
in a Case
Here are two examples showcasing how to identify variables and discuss management
problems in a case study:
Scenario: A clothing store experiences a sudden and significant decline in sales compared to
the previous year. The store manager is concerned and wants to identify the cause of the
decline.
Variables:
Management Problems:
Identifying the root cause: Without a clear understanding of the cause (e.g., change in
customer preferences, new competitor), effective solutions are difficult to implement.
Adapting to changing market dynamics: The store might not be adequately responding to
shifts in customer needs or fashion trends.
Optimizing internal operations: Inefficiencies in areas like pricing, customer service, or
marketing could be contributing to the decline.
Monitoring external factors: The store might not be effectively tracking external factors
like economic changes or competitor activity.
Scenario: A young tech startup experiences a drop in employee morale, leading to increased
turnover and a decrease in productivity.
Variables:
Management Problems:
Employee engagement: The company might not be effectively engaging employees and
fostering a positive work environment.
Work-life balance: Excessive workload or lack of flexibility could be leading to burnout and
dissatisfaction.
Compensation and benefits: Uncompetitive pay or inadequate benefits might be causing
employees to seek opportunities elsewhere.
Communication breakdown: Employees might feel uninformed or unheard, leading to
frustration and a disconnect with the company.
Research, while crucial for advancing knowledge and solving problems, must be conducted
ethically. Here are some key principles to ensure responsible research practices:
Informed Consent: Participants must be fully informed about the research objectives,
methods, potential risks, and their right to withdraw from the study at any time. They should
provide their voluntary consent before participating.
Confidentiality: Maintain the privacy of participants and ensure their data is kept
confidential. Anonymize data whenever possible and store it securely.
Anonymity: When feasible, collect data anonymously to further protect participant privacy.
This is especially important when dealing with sensitive topics.
Avoiding Harm: Design your research to minimize any potential risks or harm to
participants. This includes psychological, physical, or social risks. Ensure appropriate
safeguards are in place to mitigate any potential negative consequences.
Honesty: Conduct your research with integrity and report findings honestly and objectively.
Do not falsify or fabricate data, and acknowledge any limitations of your study.
Justice: Ensure fair treatment for all participants regardless of background or characteristics.
Strive to recruit a diverse sample that reflects the population you are studying.
Scientific Merit: Research should be well-designed and have a clear scientific purpose. It
should contribute to existing knowledge and not be conducted solely for personal gain or
commercial interests.
Animal Welfare: If your research involves animals, ensure they are treated humanely and in
accordance with established ethical guidelines.
Data Sharing: Consider sharing your data with other researchers, following ethical
guidelines and participant consent agreements, to promote transparency and collaboration.
Additional Considerations:
Research with Vulnerable Populations: Extra caution is needed when conducting research
with vulnerable populations (e.g., children, prisoners) who may have limited autonomy or
ability to provide informed consent.
Institutional Review Boards (IRBs): Many institutions have IRB committees that review
research proposals to ensure they adhere to ethical guidelines. Gaining IRB approval might
be necessary before initiating your research.
By following these ethical principles, researchers can ensure their work benefits society
without compromising the rights and well-being of participants.
7. Sampling Methods with Examples
Choosing the right sampling method is crucial for ensuring your research results are
generalizable to the population you're interested in. Here's an overview of common sampling
methods along with illustrative examples:
1. Probability Sampling:
Every member of the population has a known chance of being selected. This allows for
statistical inferences about the population based on the sample.
Simple Random Sampling: Each member has an equal chance of being selected.
o Example: A researcher studying political opinions randomly selects phone numbers from a
phone directory to conduct a survey.
o Example: A grocery store wants to survey customers about their experience. Every 10th
customer leaving the store on a specific day is selected for the survey.
Stratified Sampling: The population is divided into subgroups (strata) based on relevant
characteristics, and a random sample is drawn from each subgroup proportionally.
o Example: A university wants to understand student satisfaction with online learning. They
divide the student body by year (freshman, sophomore, etc.) and randomly select a
proportional sample from each year group.
Cluster Sampling: The population is divided into groups (clusters), some clusters are
randomly selected, and all members within those clusters are included.
o Example: A researcher wants to study healthcare access in rural areas. They randomly select
villages (clusters) and then survey all residents within those villages.
2. Non-Probability Sampling:
Members of the population don't have a known chance of being selected. These methods are
useful for exploratory research or when obtaining a random sample is difficult. However,
generalizability to the population is limited.
Convenience Sampling: Participants are selected based on their availability and ease of
access.
Quota Sampling: Researchers set quotas for specific subgroups (e.g., age, gender) and select
participants until the quotas are filled.
o Example: A market research firm wants to conduct a focus group discussion with teenagers.
They recruit a specific number of teenagers from different age groups and genders to ensure a
diverse group.
Snowball Sampling: Initial participants are recruited, and they are asked to refer others who
meet the study criteria.
o Example: A researcher is studying the experiences of people with a rare disease. They recruit
some initial participants through a support group and then ask them to refer others with the
same condition.
The best sampling method depends on your research question, population characteristics, and
resource constraints. Probability sampling methods are preferred for generalizable results,
while non-probability methods can be valuable for initial exploration or when random
sampling is impractical.
8. Solved Example:
Hypotheses:
Null Hypothesis (H₀): There is no significant difference between the current average wait
time for takeout orders and the desired wait time of 15 minutes.
Alternative Hypothesis (H₁): The current average wait time for takeout orders is
significantly greater than the desired wait time of 15 minutes.
Objectives:
1. Identify the factors contributing to the increase in wait times for takeout orders. This
might involve analyzing kitchen operations, order processing procedures, staffing levels
during peak hours, or equipment limitations.
2. Evaluate customer perception of wait times. A survey or focus group could be conducted
to understand how long customers perceive they are waiting compared to the actual wait
time.
3. Analyze the impact of wait times on customer satisfaction. This could involve reviewing
online reviews or conducting customer satisfaction surveys to see if longer wait times
correlate with a decline in satisfaction.
4. Develop and implement strategies to reduce wait times for takeout orders. Based on the
identified factors, the restaurant owner might explore solutions like optimizing kitchen
workflow, improving order processing systems, or adjusting staffing during peak hours.
Explanation:
The hypotheses frame the research question statistically. We want to test if the observed
increase in complaints reflects a real difference in wait times compared to the desired
standard.
The objectives outline the specific steps the restaurant owner can take to investigate the
problem and develop solutions. By addressing the factors contributing to delays, the owner
can aim to reduce wait times and improve customer satisfaction.
Hypotheses:
Objectives:
1. Conduct user research to understand user experience (UX) issues with the current
website. This might involve user interviews, usability testing, or surveys to identify pain
points users encounter while navigating the site.
2. Develop a new website design based on user feedback and industry best practices. The
new design should address identified UX issues and aim to improve user experience and
information accessibility.
3. Conduct A/B testing to compare user engagement with the current and redesigned
websites. A portion of website visitors would see the original design (control group), while
others would see the new design (test group). User engagement metrics (time spent, page
views) would be compared between the groups.
4. Analyze the results of the A/B testing and decide on website implementation. If the
redesigned website shows a statistically significant improvement in user engagement, the
company might choose to implement it permanently.
Explanation:
These hypotheses focus on the potential impact of a website redesign on user engagement.
The objectives outline a user-centered approach to website improvement. By understanding
user needs and preferences, the company can develop a more engaging website that
encourages users to explore and interact with the content. The A/B testing allows for a data-
driven decision before fully implementing the redesigned website.
9. Sample Questioner
Welcome!
This short survey is designed to understand your preferences for movie recommendation
algorithms used by streaming services. Your feedback will help us improve the way movies
are suggested to you. The survey should take no more than 5 minutes to complete.
1. Demographics:
Age
Gender
Occupation
2. Streaming Habits
When considering a movie suggestion, which factors are most important to you? Please rank
the following in order of importance, with 1 being most important:
o Genre
o Director
o Actors
o Critical reception
o User ratings
o Similar movies you've watched
o Other (Please specify)
Are you open to trying movies from genres you don't typically watch?
o Yes
o No
If you answered yes to the previous question, how would you like movie recommendations to
introduce you to new genres? (Select all that apply)
o Suggesting highly-rated movies from unfamiliar genres
o Providing brief summaries of movies from new genres
o Offering trailers for movies from new genres
o Other (Please specify)
6. Additional Feedback
Is there anything else you would like to share about your experience with movie
recommendations on streaming services? (Open-ended)
Clear and concise title: The title accurately reflects the survey topic.
Welcome message: A brief introduction sets the context and thanks participants for their
time.
Estimated completion time: Providing an estimated time frame can improve response rates.
Logical question flow: The questions progress from general streaming habits to specific
preferences regarding recommendations.
Question format variety: A mix of question formats keeps the survey engaging and gathers
diverse data.
Thank you message: Expressing gratitude for participation shows appreciation.