References
- Abedi, J. (2006). Language issues in item development. In S. M. Downing & T. M. Haladyna (Eds.), Handbook of test development (pp. 377–398). Lawrence Erlbaum. doi:10.4324/9780203874776.ch17
- Abedi, J. (2007). Language factors in the assessment of English language learners: Theory and principles underlying the linguistic modification approach. Paper developed for the U.S. Department of Education LEP Partnership. http://www.ncela.gwu.edu/files/uploads/11/abedi_sato.pdf
- Abedi, J. (2008). Utilizing accommodations in assessment. In N. H. Hornberger (Ed.), Encyclopedia of language and education (pp. 341–347). Springer. doi:10.1007/978-0-387-30424-3_185
- Abedi, J., & Ewers, N. (2013, February). Accommodations for English language learners and students with disabilities: A research-based decision algorithm. Smarter Balanced Assessment Consortium. https://portal.smarterbalanced.org/library/en/accommodations-for-english-language-learners-and-students-with-disabilities-a-research-based-decision-algorithm.pdf
- American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. https://www.aera.net/Publications/Books/Standards-for-Educational-Psychological-Testing-2014-Edition
- Bejar, I. I. (2017). A historical survey of research regarding constructed-response formats. In R. E. Bennett & M. von Davier (Eds.), Advancing human assessment, methodology of educational measurement and assessment (pp. 565–633). Springer International Publishing.
- Butler, F. A., & Stevens, R. (1997). Accommodation strategies for English language learners on large-scale assessments: Student characteristics and other considerations (CSE Technical Report 448). https://cresst.org/publications/cresst-publication-2820/?_sf_s=Butler&_sft_publicationscategories=assessment
- Buxton, C. A., Allexsaht-Snider, M., Suriel, R., Kayumova, S., Choi, Y., Bouton, B., & Baker, B. (2013). Using educative assessments to support science teaching for middle school English-language learners. Journal of Science Teacher Education, 24(2), 347–399. doi:10.1007/s10972-012-9329-5
- Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. doi:10.18637/jss.v048.i06
- Cole, M. (1999). Culture-free versus culture-based measures of cognition. In R. J. Sternberg (Ed.), The nature of cognition (pp. 645–664). Cambridge, MA: MIT Press.
- Council of Chief State School Officers (CCSSO). (2016). Major provisions of Every Student Succeeds Act (ESSA) related to the education of English learners. Washington, DC: CCSSO. https://ccsso.org/resource-library/major-provisions-every-student-succeeds-act-related-education-english-learners
- DaSilva Iddings, A. C., & Moll, L. C. (2010). Special issue on second and foreign language learning and teaching: An introduction. Mind, Culture, and Activity, 17(4), 308–311. doi:10.1080/10749030903434308
- Dixon, L. Q., Zhao, J., Shin, J.-Y., Wu, S., Su, J.-H., … Snow, C. (2012). What we know about second language acquisition: A synthesis from four perspectives. Review of Educational Research, 82(1), 5–60. doi:10.3102/0034654311433587
- Eberle, F. (2014). Next generation science assessments. Policy Update, 21(4), National Association of State Boards of Education. https://csaa.wested.org/resource/next-generation-science-assessments/
- Fang, Z. (2006). The language demands of science reading in middle school. International Journal of Science Education, 28(5), 491–520. doi:10.1080/09500690500339092
- García, E. E. (2005). Teaching and learning in two languages. New York: Teachers College.
- Haladyna, T. M., & Downing, S. M. (2004). Construct-irrelevant variance in high-stakes testing. Educational Measurement: Issues & Practice, 23(1), 17–27. doi:10.1111/j.1745-3992.2004.tb00149.x
- Hill, C., & Larsen, E. (2000). Children and reading tests. United States: Bloomsbury Academic: Ablex.
- Ilich, M. O. (2013). Differential item functioning (DIF) among Spanish-speaking English language learners (ELLs) in state science tests [Unpublished doctoral dissertation]. University of Washington.
- Kachchaf, R., Noble, T., Rosebery, A., Warren, B., O’Connor, C., & Wang, Y. (2016). A closer look at linguistic complexity: Pinpointing individual linguistic features of science multiple-choice items associated with English language learner performance. Bilingual Research Journal, 39(2), 152–166. doi:10.1080/15235882.2016.1169455
- Kieffer, M. J., Lesaux, N. K., Rivera, M., & Francis, D. J. (2009). Accommodations for English language learners taking large-scale assessments: A meta-analysis on effectiveness and validity. Review of Educational Research, 79(3), 1168–1201. doi:10.3102/0034654309332490
- Kieffer, M. J., Rivera, M., & Francis, D. J. (2006). Practical guidelines for the education of English language learners: Research-based recommendations for the use of accommodations in large-scale assessments. RMC Research Corporation, Center on Instruction. https://www.carnegie.org/publications/practical-guidelines-for-the-education-of-english-language-learners-research-based-recommendations-for-instruction-and-academic-interventions/
- Kieffer, M. J., & Thompson, K. (2018). Hidden progress of multilingual students on NAEP. Educational Researcher, 47(6), 391–398. doi:10.3102/0013189X18777740
- Kopriva, R. J., & Sexton, U. (1999). Guide to scoring LEP student responses to open-ended science items. Washington, DC: Council of Chief State School Officers.
- Li, H., & Suen, H. K. (2012). The effects of test accommodations for English language learners: A meta-analysis. Applied Measurement in Education, 25(4), 327–346. doi:10.1080/08957347.2012.714690
- Linquanti, R., & Cook, H. G. (2013). Toward a “common definition of English learner”: Guidance for states and state assessment consortia in defining and addressing policy and technical issues and options. Council of Chief State School Officers. http://www.ccsso.org/sites/default/files/2017-10/MoreCommonDefinition-Final_0.pdf
- Liu, O. L., Lee, H. S., & Linn, M. C. (2011). Measuring knowledge integration: Validation of four-year assessments. Journal of Research in Science Teaching, 48(9), 1079–1107. doi:10.1002/tea.20441
- Livingston, S. (2009, September). Constructed-response test questions: Why we use them; how we score them. R&D Connections, 11, 1–8. Educational Testing Service. https://www.ets.org/research/policy_research_reports/publications/periodical/2009/hkap
- Llosa, L., Lee, O., Jiang, F., Haas, A., O’Connor, C., Van Booven, C. D., & Kieffer, M. J. (2016). Impact of a large-scale science intervention focused on English language learners. American Educational Research Journal, 53(2), 395–424.
- Loewus, L. (2017). Next-generation science tests slowly take shape. Education Week, 36(32), 15–18. https://www.edweek.org/ew/articles/2017/05/24/next-generation-science-tests-slowly-take-shape.html
- Luykx, A., Lee, O., Mahotiere, M., Lester, B., Hart, J., & Deaktor, R. (2007). Cultural and home language influences on children’s responses to science assessments. Teachers College Record, 109(4), 897–926. doi:10.1177/016146810710900403
- MA DESE. (2012a). 2012 grade 5 science and technology/engineering scoring guide and sample student work. http://www.doe.mass.edu/mcas/student/2012/question.aspx?GradeID=5&SubjectCode=sci&QuestionID=23620
- MA DESE. (2012b). Massachusetts comprehensive assessment system: Test questions. http://www.doe.mass.edu/mcas/testitems.html
- MA DESE. (2016). Massachusetts curriculum framework for science and technology/engineering. https://www.doe.mass.edu/stem/ste/standards.html
- MA DESE. (2019). Guidance on English learner education program development and evaluation. http://www.doe.mass.edu/ele/resources/program-dev-eval.html
- MA DOE. (2006). Massachusetts science and technology/engineering curriculum framework. http://www.doe.mass.edu/frameworks/archive.html
- Martiniello, M. (2008). Language and the performance of English language learners in math word problems. Harvard Educational Review, 78(2), 333–368.
- Martiniello, M. (2009). Linguistic complexity, schematic representations, and differential item functioning for English language learners in math tests. Educational Assessment, 14(3–4), 160–179. doi:10.1080/10627190903422906
- Millsap, R. E. (2011). Statistical approaches to measurement invariance. Routledge, Inc. doi:10.4324/9780203821961
- Myers, B., & Kopriva, R. (2015). Decision trees linking individual student need to large-scale accommodations for English learners. Wisconsin Center for Education Research, University of Wisconsin. http://iiassessment.wceruw.org/projects/STELLA%20white%20paper%2025Aug2015.pdf
- Nasir, N. S., Rosebery, A., Warren, B., & Lee, C. (2006/2014). Learning as a cultural process: Achieving equity through diversity. In K. Sawyer (Ed.), The Cambridge handbook of the learning sciences (pp. 489–504). Cambridge, UK: Cambridge University.
- National Center for Education Statistics. (2021). English language learners in public schools. Condition of education. U.S. Department of Education, Institute of Education Sciences. Retrieved [date], from https://nces.ed.gov/programs/coe/indicator/cgf
- National Research Council. (2012). A framework for K-12 science education: Practices, crosscutting concepts, and core ideas. The National Academies Press. doi:10.17226/13165
- National Research Council. (2014). Developing assessments for the next generation science standards. The National Academies Press. doi:10.17226/18409
- National Science Teaching Association. (2022, October 18). About the Next Generation Science Standards. https://ngss.nsta.org/About.aspx
- Nelson-Barber, S., Huang, C., Trumbull, E., Johnson, Z., & Sexton, U. (2008, March). Elicitory test design: A novel approach to understanding the relationship between test item features and student performance on large-scale assessments. Paper presented at the annual meeting of the American Educational Research Association, New York, NY.
- NGSS Lead States. (2013a). Next generation science standards: For states, by states (Vol. 1). The National Academies Press. https://www.nextgenscience.org/search-standards
- NGSS Lead States. (2013b). Next generation science standards: For states, by states (Vol. 2). The National Academies Press.
- NGSS Lead States. (2013c). Next generation science standards: All standards, all students. Case study 4: English language learners and the next generation science standards. https://www.nextgenscience.org/sites/default/files/%284%29%20Case%20Study%20ELL%206-14-13.pdf
- Noble, T., Kachchaf, R. R., & Rosebery, A. S. (2018). Perspectives from research on the linguistic features of mathematics and science test items and the performance of English learners. In D. L. Baker, D. L. Basaraba, & C. Richards-Tutor (Eds.), Second language acquisition: Methods, perspectives and challenges (pp. 209–236). New York: Nova Science.
- Noble, T., Rosebery, A. S., Suarez, C., Warren, B., & O’Connor, M. C. (2014). Science assessments and English language learners: Validity evidence based on response processes. Applied Measurement in Education, 27(4), 248–260. doi:10.1080/08957347.2014.944309
- Noble, T., Sireci, S. G., Wells, C. S., Kachchaf, R. R., Rosebery, A. S., & Wang, Y. C. (2020). Targeted linguistic simplification of science test items for English learners. American Educational Research Journal, 57(5), 2175–2209. doi:10.3102/0002831220905562
- Noble, T., Suarez, C., Rosebery, A., O’Connor, M. C., Warren, B., & Hudicourt-Barnes, J. (2012). “I never thought of it as freezing”: How students answer questions on large-scale science tests and what they know about science. Journal of Research in Science Teaching, 49(6), 778–803. doi:10.1002/tea.21026
- NORC. (2018). Grade 4 or 5 science assessment item types, 2017-18: Percentage of points on assessment. Retrieved 2020, from http://stem-assessment.org/table/pages/table23.aspx
- Pennock-Roman, M., & Rivera, C. (2011). Mean effects of test accommodations for ELLs and non-ELLs: A meta-analysis of experimental studies. Educational Measurement: Issues & Practice, 30(3), 10–28. doi:10.1111/j.1745-3992.2011.00207.x
- Pennock-Roman, M., & Rivera, C. (2012). Summary of literature on empirical studies of the validity and effectiveness of test accommodations for ELLs: 2005-2012. Smarter Balanced Assessment Consortium. https://portal.smarterbalanced.org/library/en/summary-of-literature-on-empirical-studies-of-the-validity-and-effectiveness-of-test-accommodations-for-ells-2005-2012.pdf
- Rios, J. A., Ihlenfeldt, S. D., & Chavez, C. (2020). Are accommodations for English learners on state accountability assessments evidence-based? A multi-study systematic review and meta-analysis. Educational Measurement: Issues & Practice, 39(4), 65–75. doi:10.1111/emip.12337
- Saunders, W. M., & Marcelletti, D. J. (2013). The gap that can’t go away: The catch-22 of reclassification in monitoring the progress of English learners. Educational Evaluation and Policy Analysis, 35(2), 139–156. doi:10.3102/0162373712461849
- Shealy, R., & Stout, W. (1993). An item response theory model for test bias and differential item functioning. In P. Holland & H. Wainer (Eds.), Differential item functioning (pp. 197–240). Hillsdale, NJ: Erlbaum.
- Simpson, C. (2021, July 9). How COVID taught America about inequity in education. Harvard Gazette. https://news.harvard.edu/gazette/story/2021/07/how-covid-taught-america-about-inequity-in-education/
- Sireci, S. G., & Faulkner-Bond, M. (2015). Promoting validity in the assessment of English learners. Review of Research in Education, 39(1), 215–252. doi:10.3102/0091732x14557003
- Sireci, S. G., Wells, C., & Hu, H. (2014). Using internal structure validity evidence to evaluate test accommodations. Paper presented at the annual meeting of the National Council on Measurement in Education, Philadelphia, PA.
- Solano-Flores, G. (2006). Language, dialect, and register: Sociolinguistics and the estimation of measurement error in the testing of English language learners. Teachers College Record, 108(11), 2354–2379. doi:10.1111/j.1467-9620.2006.00785.x
- Solano-Flores, G. (2008). Who is given tests in what language by whom, when, and where? The need for probabilistic views of language in the testing of English language learners. Educational Researcher, 37(4), 189–199. doi:10.3102/0013189x08319569
- Solano-Flores, G. (2011a). Assessing the cultural validity of assessment practices: An introduction. In M. Basterra, E. Trumbull, & G. Solano-Flores (Eds.), Cultural validity in assessment (pp. 3–21). Routledge. doi:10.4324/9780203850954
- Solano-Flores, G. (2011b). Development of illustrations as image supports for English language learners in large-scale testing: A report on the procedure for designing vignette illustrations. Paper presented at the annual meeting of the American Educational Research Association, New Orleans, LA.
- Solano-Flores, G., Chia, M., & Kachchaf, R. (2019). Design and use of pop-up illustration glossaries as accessibility resources for second language learners in computer-administered tests in a large-scale assessment system. International Multilingual Research Journal, 13(4), 277–293. doi:10.1080/19313152.2019.1611338
- Solano-Flores, G., & Nelson-Barber, S. (2001). On the cultural validity of science assessments. Journal of Research in Science Teaching, 38(5), 553–573. doi:10.1002/tea.1018
- Tankersley, K. (2007). Tests that teach. Alexandria, VA: Association for Supervision and Curriculum Development.
- Turkan, S., & Liu, O. L. (2012). Differential performance by English language learners on an inquiry-based science assessment. International Journal of Science Education, 34(15), 2343–2369. doi:10.1080/09500693.2012.705046
- Turkan, S., & Lopez, A. A. (2017). Helping English language learners access the language and content of science through the integration of culturally and linguistically valid assessment practices. In L. C. de Oliveira & K. C. Wilcox (Eds.), Teaching science to English language learners (pp. 163–190). Springer International Publishing.
- Wells, C. S. (2018). Analyses of differential item functioning on MCAS science open-response items. TERC. https://external-wiki.terc.edu/display/CKC/Publications?preview=%2F36896820%2F102498312%2FMCAS+STE+OR+DIF+Report+Final.pdf
- WIDA Consortium. (2014). ACCESS for English Language Learners test. Wisconsin Center for Education Research. https://wida.wisc.edu/assess/access/tests
- WIDA Consortium. (2016). The WIDA can do descriptors, key uses edition, grade 4–5. Wisconsin Center for Education Research. https://wida.wisc.edu/sites/default/files/resource/2012-ELD-Standards.pdf
- WIDA Consortium. (2019). ACCESS for ELLs paper: Sample items user guide. Retrieved 2020, from https://wida.wisc.edu/sites/default/files/resource/ACCESS-Paper-Sample-Items-User-Guide.pdf
- Wolf, M. K., & Leon, S. (2009). An investigation of the language demands in content assessments for English language learners. Educational Assessment, 14(3–4), 139–159. doi:10.1080/10627190903425883
- Zenisky, A. L., Hambleton, R. K., & Robin, F. (2003). Detection of differential item functioning in large-scale state assessments: A study evaluating a two-stage approach. Educational and Psychological Measurement, 63(1), 51–64. doi:10.1177/0013164402239316