Methodological Studies

Item Response Theory Models for Difference-in-Difference Estimates (and Whether They Are Worth the Trouble)

Pages 391-421 | Received 25 Aug 2022, Accepted 09 Mar 2023, Published online: 24 Apr 2023

Abstract

When randomized controlled trials are not possible, quasi-experimental methods often represent the gold standard. One such method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often run fairly exhaustive robustness checks to ensure that the assumptions of the design are met. However, far less attention is typically paid to how item responses from the outcome measure are scored. For example, surveys are often scored by summing the item responses to produce sum scores, and achievement tests often rely on scores produced by test vendors, which frequently employ a unidimensional item response theory (IRT) scoring model that implicitly assumes control and treatment participants are exchangeable (i.e., that they come from the same distribution). In this study, several IRT models that parallel the DiD design in terms of groups and timepoints are presented, and their performance is examined. Results indicate that using a scoring approach that parallels the DiD study design reduces bias and improves power, though these approaches can also lead to inflated Type I error rates.
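
For reference, the canonical two-group, two-period form of the estimator described above (standard textbook notation, not reproduced from the article) is

\hat{\tau}_{DiD} = (\bar{Y}_{T,post} - \bar{Y}_{T,pre}) - (\bar{Y}_{C,post} - \bar{Y}_{C,pre}),

where \bar{Y}_{G,t} is the mean outcome (e.g., a sum score or IRT score) for the treatment (T) or control (C) group at each timepoint. Under the parallel-trends assumption, \hat{\tau}_{DiD} identifies the average treatment effect on the treated; the article's question is how the scoring model used to produce the Y values affects this estimate.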

Notes

1 Another reason is that more complex, multidimensional models often require a Bayesian scoring approach because maximum likelihood estimation (MLE) is less feasible (Vector Psychometric Group, 2021).

2 Readers may wonder why we did not also fit the Rasch model, which assumes all discrimination parameters are equal and, given how the data were generated, could also represent a measurement model misspecification. The main reason is that sum scores violate the same assumption, namely that all items should receive the same weight. This similarity is one reason the sum score is a sufficient statistic for the Rasch model. While a Rasch model used in conjunction with EAP scoring would introduce a shrinkage issue (discussed later in the study) that could affect results differently than sum scores do, similar shrinkage occurs under a unidimensional 2PL model. Thus, the Rasch model's contribution to the present study is not sufficiently different from that of sum scores, or of the 2PL model, to merit inclusion.
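
To make the equal-weighting point concrete (standard IRT notation, not drawn from the article): the 2PL model specifies

P(X_{ij} = 1 | \theta_i) = \frac{\exp\{a_j(\theta_i - b_j)\}}{1 + \exp\{a_j(\theta_i - b_j)\}},

where a_j and b_j are the discrimination and difficulty of item j and \theta_i is the latent trait of person i. The Rasch model constrains a_j to be equal across items (commonly a_j = 1), in which case the raw sum \sum_j X_{ij} is a sufficient statistic for \theta_i; an unweighted sum score implicitly imposes the same equal-weighting constraint.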

3 This equation was modified slightly depending on how different timepoints and groups were treated in the IRT model.

4 For parsimony, the figure shows only the consistent discrimination parameter and variable difficulty conditions at a sample size of 1,000; in general, treatment effect recovery was not sensitive to those conditions.

5 In the figure, one may observe that the Type I error rate with a true gain of 0 SDs and true scores is only 1%, rather than the nominal 5%. To investigate this issue, additional simulations were conducted: 10,000 true-score replications with a sample size of 5,000. When the post-test means based on true scores, as well as sum scores, are compared using a t-test (with a true difference of zero), the Type I error rate is almost exactly .05.
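
As an illustration of the check described in this note, a minimal simulation along these lines might look as follows (a sketch in Python; the even sample-size split and seed are assumptions, and this is not the study's actual code):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(2023)
    n_reps = 10_000        # replications, as in the note
    n_per_group = 2_500    # assumes the 5,000 sample is split evenly
    alpha = 0.05
    rejections = 0

    for _ in range(n_reps):
        # Post-test "true scores" for two groups with a true difference of zero
        control = rng.normal(loc=0.0, scale=1.0, size=n_per_group)
        treatment = rng.normal(loc=0.0, scale=1.0, size=n_per_group)
        _, p_value = stats.ttest_ind(treatment, control)
        rejections += p_value < alpha

    # With a true difference of zero, the rejection rate should sit near .05
    print(f"Empirical Type I error rate: {rejections / n_reps:.3f}")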

6 While the present study used 50 students per cluster, schools in ECLS-K would often have as few as 20 students per cluster.

7 Technically, Whitney and Candelaria (2017) used mean scores, not sum scores.
