INTRODUCTION TO ITEM ANALYSIS: EVALUATING AND IMPROVING MULTIPLE CHOICE QUESTIONS
WHAT IS ITEM ANALYSIS?
WHAT IS ITEM ANALYSIS
‣Consider the following ...
‣We use the overall score on the exam to assess the student’s aptitude/ability
‣We want to know who understands the material and who doesn’t
‣We want to make sure that the student’s score is stable
‣Question: How do we know we are assessing the student’s ability as well as we can?
‣Item analysis helps you ensure that your items are effectively evaluating student ability
‣Examples:
‣Item Difficulty
‣Item Discrimination
‣Internal Consistency
‣Differential Item Functioning
ITEM ANALYSIS PROCESS
BENEFITS
PURPOSE OF ITEM ANALYSIS
‣How well do the questions distinguish the students who knew the material from those who did not? (The more such questions, the more precisely your exam can measure ability)
APPLICATIONS
APPLICATIONS OF ITEM ANALYSIS
‣Item Analysis can answer many questions (“Am I able to tell who has ability and who does not?”, “Am I able to get a fine-grained view of ability?”, “How consistent are scores?”)
‣Many fields are concerned with knowing the answers to these questions:
‣Employee selection: The purpose is to select the employees with the highest ability in the workplace
PRELIMINARY PREPARATION
BEFORE BEGINNING ITEM ANALYSIS
‣Cells = Whether the examinee got the question right (1) or wrong (0)
[Example score matrix: one row per examinee, e.g. Chelsea, with one 1/0 cell per question]
2) The activity sheet contains five students’ responses to five different multiple choice questions
c) Each cell indicates whether the student answered the question correctly (1) or incorrectly (0)
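As a minimal sketch of this row-per-examinee, column-per-question layout in Python (the names and responses here are hypothetical, not the actual activity data):

```python
# Hypothetical score matrix: one row per examinee, one column per question,
# with 1 = answered correctly and 0 = answered incorrectly.
responses = {
    "Chelsea": [1, 1, 1, 1, 0],
    "Jordan":  [1, 0, 1, 1, 1],
    "Priya":   [0, 0, 1, 0, 1],
    "Marcus":  [1, 1, 0, 1, 0],
    "Aiko":    [0, 1, 1, 0, 0],
}

# Each examinee's total score is just their row sum.
for name, row in responses.items():
    print(f"{name}: {sum(row)} / {len(row)} correct")
```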
ITEM DIFFICULTY
ITEM DIFFICULTY: WHAT IS IT?
‣Examples
‣If everyone got the question right, the item is considered “easy”
‣If half the people got the question right, then the item is somewhere
between easy and hard.
ITEM DIFFICULTY: WHY DOES IT MATTER?
‣If you don’t pay attention to Item Difficulty, you don’t get a precise measure
of ability
‣If true ability has a bell-shaped distribution, then your estimated ability
should have a bell-shaped distribution
ITEM DIFFICULTY: WHY DOES IT MATTER?
‣If a question is too easy -> everyone gets the question right -> you don’t know who is on the lower end of ability
[Example: Bob 100%, Cindy 100% (too easy - cannot tell who has more/less ability); Cindy 0% (too hard - cannot tell who has more/less ability)]
‣Therefore, it is important to pay attention to item difficulty to have it be just right
ITEM DIFFICULTY: WHY DOES IT MATTER?
‣If item difficulty is too high or too low, then scores will be truncated (preventing a symmetric distribution)

ITEM DIFFICULTY: EXAMPLE
‣Imagine a test with THREE questions and FIVE examinees

Examinee Q1 Q2 Q3
Alice    1  0  1
Bob      1  0  1
Cindy    1  0  1
Dan      1  0  1
Erin     1  0  0
Total    5  0  4

‣We can total up how many people got each question correct by taking the sum for each question
‣Question 1 = 5
‣Question 2 = 0
‣Question 3 = 4
‣Divide the total number of people who got the question correct by the total number of people who took the test, and you have the item’s difficulty

Question | Total correct / Total test takers | Item Difficulty
Q1       | 5 / 5                             | 1.00
Q2       | 0 / 5                             | 0.00
Q3       | 4 / 5                             | 0.80
‣Item Difficulty = P / N
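A minimal sketch of the P / N computation, reusing the three-question, five-examinee data from the example above:

```python
# Item difficulty = P / N: the proportion of test takers answering correctly.
# Data reproduces the three-question, five-examinee example above.
scores = {
    "Alice": [1, 0, 1],
    "Bob":   [1, 0, 1],
    "Cindy": [1, 0, 1],
    "Dan":   [1, 0, 1],
    "Erin":  [1, 0, 0],
}

n = len(scores)                        # N = 5 test takers
columns = list(zip(*scores.values()))  # one tuple of 0/1 scores per question

for q, col in enumerate(columns, start=1):
    print(f"Q{q}: difficulty = {sum(col) / n:.2f}")  # P / N
# Prints 1.00 (too easy), 0.00 (too hard), and 0.80.
```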
ITEM DIFFICULTY: HOW TO INTERPRET IT
‣How to Interpret Item Difficulty
‣Think of it as “Item Easiness”
‣If the value is at an ideal “sweet spot”, then your test can better separate high ability people from low ability people (discussed in the next section)
‣If Item Difficulty is too low / the item is too hard (< .25 or .30):
‣The item may be too challenging relative to the overall level of ability of the class
‣Find where people are being confused and clarify the question
‣Not meaningful if:
‣The test had a short time limit (“speed test”) - later items seem difficult regardless of content
‣The recommended values (.60-.75) assume you want to assess people’s ability relative to others
‣If you are concerned with content mastery instead, you want all items answered correctly
ACTIVITY
‣Remember: Item Difficulty = percentage of people answering the question correctly
‣Larger values = Easier
‣Smaller values = Harder
3) Think of a question you can ask your fellow attendees that would probably have a difficulty value close to your group’s assigned value
Example: If you are assigned a difficulty value of .25, what is a question you could ask that only 25% of the attendees would know?
4) Think of 4 multiple choice options to go with your question, one of which is right
5) When you are ready, have one member of the group go up to the presenter and share the group’s question and answers
6) WHEN TOLD THE SURVEY IS READY BY THE PRESENTER: Complete the combined survey online (link will be provided) - Skip your question
7) Access the spreadsheet link provided, and compute the difficulty for each question
8) How close was the actual difficulty to the difficulty you were assigned?
ITEM DISCRIMINATION
ITEM DISCRIMINATION: WHAT IS IT?
‣People who studied get the question right, people who didn’t study get the question
wrong
ITEM DISCRIMINATION
‣Imagine a test with THREE questions and FIVE students

Examinee Q1 Q2 Q3
Alice    1  1  1
Bob      1  1  1
Cindy    0  0  1
Dan      0  1  0
Erin     0  0  1
ITEM DISCRIMINATION

Examinee Q1 Q2 Q3 Total
Alice    1  1  1  100%
Bob      1  1  1  100%
Cindy    0  0  1  33%
Dan      0  1  0  33%
Erin     0  0  1  33%

‣Alice and Bob did the best (perfect scores)
‣Cindy, Erin, and Dan did the worst (33%)
‣Which question’s performance best predicts who will score high on the exam (Alice and Bob) and who will score low (Cindy, Dan, and Erin)?
ITEM DISCRIMINATION
‣Make one column that has whether people got the question right (1) or wrong (0) = Question Scores
‣Make another column that has people’s total score on the exam (0 to 100%) = Total Scores
‣Correlate the Question Scores with the Total Scores; this correlation (the point-biserial) is the item’s discrimination
‣Often “corrected” by removing the item’s score from the total score (see the sketch below)
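A minimal sketch of the corrected item-total (point-biserial) correlation, reusing the five-student data above; the plain Pearson helper is written out so the snippet stays self-contained:

```python
# Corrected item-total correlation: correlate each question's 0/1 column
# with the total scores computed WITHOUT that question.

def pearson(x, y):
    """Plain Pearson correlation; assumes both sequences vary."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

scores = {
    "Alice": [1, 1, 1],
    "Bob":   [1, 1, 1],
    "Cindy": [0, 0, 1],
    "Dan":   [0, 1, 0],
    "Erin":  [0, 0, 1],
}

rows = list(scores.values())
for q in range(len(rows[0])):
    item = [r[q] for r in rows]
    rest = [sum(r) - r[q] for r in rows]  # total with the item removed
    print(f"Q{q + 1}: discrimination = {pearson(item, rest):+.2f}")
# Q1 (+1.00) perfectly separates the top scorers from the rest;
# Q3 (+0.00) does not discriminate at all.
```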
ITEM DISCRIMINATION: HOW TO CALCULATE IT
‣If above .20, item is useful for describing people’s overall ability
ITEM DISCRIMINATION: DIAGNOSTICS
‣Low item discrimination is problematic: it suggests that people who know the concepts really well overall were not any more likely to understand the specific concept in the question
‣The item may be too easy or too difficult (everyone got the question right or wrong)
ITEM DISCRIMINATION: IMPROVEMENTS
‣Ensure that difficulty is at the ideal level, given the number of response options
ITEM DISCRIMINATION: PRECAUTIONS
‣Not meaningful if
‣Partial credit for answers (some answers are less wrong than others)
ACTIVITY: ITEM DISCRIMINATION
1) Think to yourself about a multiple choice exam you might give in your respective field for a specific topic
3) Which of those questions, if answered correctly, would indicate that this person understands the topic well as a whole?
a) This question has good discrimination
b) It can tell you who likely has high knowledge and who has lower knowledge
RELIABILITY
TEST RELIABILITY: WHAT IS IT?
‣What is Test Reliability?
‣True ability = what you actually know across the entire topic
‣If a test is 100% reliable, then the score a person receives is their true score, and they get the same score each time they retake the exam
‣If our test is reliable, then a student’s ability is reflected in the score received
‣If a test is not 100% reliable, then the score a person receives may be either higher or lower than their actual true score, and their next score might be different
‣If our test is unreliable, then a student’s ability is not reflected in the score received
TEST RELIABILITY: WHAT IS IT?
‣Parallel forms reliability: Consistency between one exam form and another
‣Internal consistency: How consistent each item is with the other items
‣If you know an exam’s internal consistency, then you know a lower bound (worst case) for its reliability
‣Good internal consistency makes it more likely the students’ scores are stable
INTERNAL CONSISTENCY: HOW TO INTERPRET IT
‣How to Interpret Internal Consistency
‣< .60 = Poor
INTERNAL CONSISTENCY: HOW TO CALCULATE IT
‣How to Calculate Internal Consistency
‣For each question, make a column that has whether people got the question right (1) or wrong (0)
‣Calculate the correlation between each right/wrong column and every other right/wrong column
‣If K questions, then K * (K - 1) / 2 comparisons
‣If 20 questions, then 20 * 19 / 2 = 190 comparisons
‣If 40 questions, then 40 * 39 / 2 = 780 comparisons
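A minimal sketch of one common internal-consistency statistic, standardized Cronbach’s alpha, built from exactly those K * (K - 1) / 2 pairwise correlations (this is one formula among several; D2L and SPSS report the closely related Cronbach’s alpha). Requires Python 3.10+ for statistics.correlation:

```python
from itertools import combinations
from statistics import correlation

def standardized_alpha(rows):
    """rows: one list of 0/1 question scores per examinee.
    Assumes every question's scores vary (no all-right/all-wrong items)."""
    items = list(zip(*rows))                 # one column of scores per question
    k = len(items)
    pairs = list(combinations(range(k), 2))  # K * (K - 1) / 2 comparisons
    r_bar = sum(correlation(items[i], items[j]) for i, j in pairs) / len(pairs)
    # Alpha rises with both test length K and average correlation r_bar.
    return (k * r_bar) / (1 + (k - 1) * r_bar)
```

With real data you would pass one row of 0/1 scores per student, exactly the matrix described in the preparation section.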
‣Test length: The more questions on the test, the more reliable the test will be
‣Average inter-item correlation: The more the questions address a single common domain, the more reliable the test will be
- All questions pertain to the same topic area = higher average correlation between question scores
- All questions pertain to disparate topic areas = lower average correlation between question scores
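‣These two factors combine in the standardized-alpha (Spearman-Brown) form computed in the sketch above: alpha = (K * r) / (1 + (K - 1) * r), where K is the number of questions and r is the average inter-item correlation; increasing either one increases reliability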
EXERCISE
2) What additional question could we ask that would probably INCREASE the average inter-item correlation?
‣Internal consistency suffers when items represent too many distinct dimensions (too many concepts being asked)
IMPLEMENTING ITEM ANALYSIS
IMPLEMENTING ITEM ANALYSIS
‣D2L
‣Excel
‣SPSS
IMPLEMENTING ITEM ANALYSIS: D2L
1) Go to “Quizzes”
2) Click “Statistics”
3) Click on “Question Stats” to view Item Analysis
4) Each item has its own analysis statistics
IMPLEMENTING ITEM ANALYSIS: D2L - THE OUTPUT
Point Biserial = Item Discrimination
Average Grade = Item Difficulty
IMPLEMENTING ITEM ANALYSIS: EXCEL
Cronbach’s alpha = Internal Consistency
IMPLEMENTING ITEM ANALYSIS: SPSS
‣You can calculate item analysis with SPSS (or R, Minitab, Stata)
‣Access SPSS for free via DePaul Virtual Labs
‣At the top menu, go to Analyze -> Scale -> Reliability Analysis
‣Click “Statistics” and ask for “item,” “scale,” and “scale if item deleted” statistics
‣Click “OK”
IMPLEMENTING ITEM ANALYSIS: SPSS - STEP 1 - CHOOSE ANALYSIS
IMPLEMENTING ITEM ANALYSIS: SPSS - STEP 2 - SELECT VARIABLES
IMPLEMENTING ITEM ANALYSIS: SPSS - STEP 3 - INTERPRETATION
Cronbach’s alpha = Internal consistency
SUMMARY
‣Items can perform poorly due to wording ambiguity, lack of ability in that domain, miscoding, lack of conceptual relevance, or instructional issues
‣Item Response Theory: estimates people’s ability while taking into account the difficulty and discrimination of the items they answered correctly/incorrectly
Q&A