What Do the Measures of Utterance Fluency Employed in Automatic Speech Evaluation (ASE) Tell Us About Oral Proficiency?
