811
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Affect in Infant-Directed Speech of Swedish-Speaking Mothers and Fathers to 3-, 6-, 9-, and 12-Month-Old Infants

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon

ABSTRACT

Infant-directed speech (IDS) is characterized by a range of register-typical characteristics. Many of those can be objectively measured, such as acoustic-prosodic and structural-linguistic modifications. Perceived vocal affect, however, is a socio-emotional IDS characteristic and is subjectively assessed. Vocal affect goes beyond acoustic-prosodic and structural-linguistic IDS features and includes a perceptive-subjective component in the listener. This study describes vocal affect valence in Swedish IDS during the first year of life, and compares vocal affect between mothers’ and fathers’ IDS and their adult-directed speech. Adult native speakers of Swedish (N = 16) rated affect valence in low-pass filtered IDS samples from free play interactions of mothers with infants at 3, 6, 9, and 12 months (N = 10) and fathers with their 12-month-olds (N = 6). Across the first year, the trajectory of mothers’ positive affect in Swedish IDS shows the highest affect scores toward the youngest infants and the lowest at 9 months of age. This follows a pattern comparable to that in Australian English IDS, showing that mothers express vocal affect similarly across different languages and cultures. Both mothers’ and fathers’ IDS to 12-month-olds have higher positive vocal affect valence than their adult-directed speech. There was no difference in positive affect valence between mothers’ and fathers’ IDS, that is, mothers and fathers express vocal affect to the same extent when talking to their infants. In conclusion, the findings of this study indicate that high positive vocal affect characterizes IDS across different languages and speakers.

Introduction

Infant-directed speech (IDS) is the speech register typically used when communicating with infants, regardless of language (e.g., Fernald & Simon, Citation1984; Kitamura et al., Citation2002; Papoušek et al., Citation1991; Werker et al., Citation1994). It differs from adult-directed speech (ADS) both in acoustic-prosodic and structural-linguistic regards. Acoustic-prosodic modifications in IDS are – in comparison to ADS – higher fundamental frequency (fo),Footnote1 wider fo excursions, more distinctive fo contours (e.g., Fernald & Mazzie, Citation1991; Fernald & Simon, Citation1984; Kitamura & Lam, Citation2009), steeper spectral tilt (Shinya et al., Citation2009; Wang et al., Citation2016), slower speech rate (Cristià, Citation2010; Fernald & Simon, Citation1984), shorter utterance and pause duration (U. Marklund et al., Citation2015), more pronounced syllable-lengthening (Albin & Echols, Citation1996; Martin et al., Citation2016), and more hyperarticulated vowels (Burnham et al., Citation2002; Kalashnikova & Burnham, Citation2018; Kuhl et al., Citation1997; E. Marklund & Gustavsson, Citation2020; Odijk & Gillis, Citation2022). Structural-linguistic modifications include shorter utterance length, utterance-final positioning of target words (Aslin et al., Citation1996; Fernald et al., Citation1989) and more repetition than in ADS (Bortfeld & Morgan, Citation1999; Fernald & Morikawa, Citation1993). Some IDS modifications remain stable across development, for example, fo variability and degree of hyperarticulation, while others vary with infant age, for example, mean fo, articulation rate, and vowel duration (Cox et al., Citation2022). All these modifications can be measured objectively when analyzing speech recordings and transcriptions. However, IDS also exaggerates positive vocal affect, a modification that cannot be directly accessed from the speech signal as it manifests across several prosodic features and is quantified by subjective listener ratings.

Relationship between prosody and vocal affect

Vocal affect manifests in through prosodic means, although the two components are distinct. Prosody can be directly accessed in the speech signal and objectively measured, whereas vocal affect requires subjective ratings for quantification, even though it can be approximated by a number of acoustic-prosodic characteristics. Positive vocal affect is predominantly expressed by higher fo, wider fo range and higher intensity (Johnstone & Scherer, Citation2000), and associated with higher frequencies across all formants, likely due to the articulatory impact of smiling on the acoustic signal (Benders, Citation2013; Dodane & Al-Tamimi, Citation2007; Englund & Behne, Citation2005; Green et al., Citation2010). The predominant perceptual cue to positive vocal affect is exaggerated prosody (Banse & Scherer, Citation1996).

It has been suggested that some of the acoustic-prosodic modifications in IDS relative to ADS are in fact a result of the high amount of positive affect in IDS, for example, that observed hyperarticulation could be a consequence of a shortened vocal tract when smiling (Benders, Citation2013). In line with this, only minor differences in mean fo and rhythmic tempo remained between IDS and ADS when highly affective ADS was elicited with the help of imaginary scenarios in fixed carrier sentences (Trainor et al., Citation2000).

Positive vocal affect in IDS

IDS conveys more positive vocal affect than ADS (Kalashnikova & Burnham, Citation2018; Katz et al., Citation1996; Kitamura & Burnham, Citation2003). A study on Australian English IDS reports an inverted S-shaped pattern of positive vocal affect across the first year of life: In IDS to newborns, affect valence is relatively low (although higher than for ADS), increases to peak in IDS at around 6 months, and then declines at 9 months with a slight recovery at 12 months (Kitamura & Burnham, Citation2003). Mothers show different trajectories of vocal affect depending on the sex of the infant. Specifically, positive vocal affect increases with age more in speech to girls than to boys between birth and 12 months (Kitamura & Burnham, Citation2003).

There are differences between languages in how and to what extent specific IDS characteristics are realized (e.g., Fernald et al., Citation1989; Takada, Citation2005). This is likely the case for affect, as the expression of vocal affect via prosodic means is restricted in tone languages, where fo modulations are used to convey lexical distinctions (Kitamura et al., Citation2002; Ross et al., Citation1986). However, to the best of our knowledge, vocal affect in tone language IDS has as of yet not been studied. In Japanese, a pitch-accent language, where lexical information is conveyed by fo modulations in a limited number of cases (Kubozono, Citation2012), affect in IDS has only been studied in relation to linguistic content, not prosody (Fernald & Morikawa, Citation1993; Toda et al., Citation1990).

Functions of IDS prosody and vocal affect

The functions of IDS are threefold: i) it arouses infant attention (Werker & McLeod, Citation1989), ii) facilitates language learning (Kalashnikova & Burnham, Citation2018; Kalashnikova et al., Citation2017) and iii) creates socio-emotional bonds (Golinkoff et al., Citation2015; Takada, Citation2005). For the present study, the first two of these functions are of interest.

IDS captures and maintains infant attention. From birth, infants display a behavioral preference for IDS over ADS (Frank et al., Citation2020; Dunst et al., Citation2012; Newman & Hussain, Citation2006; Panneton Cooper & Aslin, Citation1990; Panneton Cooper et al., Citation1997; Pegg et al., Citation1992). This well-established preference may be driven by the high positive affect typical for IDS rather than the dynamic modulation of fo. Infants prefer speech with high positive affect over speech with low positive affect when fo modifications were controlled for (Kitamura & Burnham, Citation1998), and IDS preference is not present when it is contrasted with high affect ADS (Singh, Morgan, et al., Citation2002). Infants like happy speech so much that they will prefer happy ADS to neutral IDS, even though IDS contains the higher fo (Singh, Morgan, et al., Citation2002). A preference for positive vocal affect has been shown in infants throughout the first year of life (Panneton et al., Citation2006; Singh et al., Citation2004; Singh, Morgan, et al., Citation2002).

IDS has a facilitative effect on a range of language learning tasks, for example, word-learning (Foursha-Stevenson et al., Citation2017; Graf Estes & Hurley, Citation2013; Ma et al., Citation2011), word-recognition (Singh et al., Citation2009; Singh, Bortfeld, et al., Citation2002; Song et al., Citation2010) and statistical learning (Bosseler et al., Citation2016; Thiessen et al., Citation2005). Prosodic IDS modifications in particular play an essential role in infant language development, as they have a positive impact on speech segmentation (Thiessen et al., Citation2005), finding stressed words in sentences (Fernald & Mazzie, Citation1991), discriminating multi-syllabic words (Karzon, Citation1985), and parsing clauses (Kemler Nelson et al., Citation1989).Footnote2 No association between positive vocal affect and language outcomes has been shown in the few studies on the subject. Specifically, no relationship was found between vocal affect in IDS at 7, 9, 11, 15, 19 months and vocabulary at 15 and 19 months (Kalashnikova & Burnham, Citation2018), and no impact of positive vocal affect was found on word-learning in 14-month-olds (Schwarz et al., Citation2019; Schwarz et al., in preparation).

The present study

While no study so far has linked vocal affect in IDS to language outcomes, there are reasons to believe that it nevertheless contributes to language learning. Specifically, positive vocal affect in IDS ensures that infants find speech input interesting (Panneton et al., Citation2006; Singh et al., Citation2004; Singh, Morgan, et al., Citation2002). It is likely that this attention-catching trait of IDS underlies its positive impact on language learning (Nencheva & Lew-Williams, Citation2022; Saint-Georges et al., Citation2013). In line with this, infants of depressed mothers – who exhibit less positive vocal affect in their IDS than non-depressed mothers (Lam-Cassettari & Kohlhoff., Citation2020) – do not learn associations as readily from IDS produced by females, but they do learn from male speakers (Kaplan et al., Citation2004, Citation2010). More positive vocal affect on the other hand is produced by mothers when talking to their infants in a simulated hearing loss condition compared to a normal hearing situation, possibly trying to keep infants attentive during interaction (Lopez & Lam-Cassettari, Citation2018). However, despite its potential relevance for language learning, vocal affect in IDS has been only sparsely studied (Kalashnikova & Burnham, Citation2018; Katz et al., Citation1996; Kitamura & Burnham, Citation2003). Therefore, studying vocal affect in IDS in languages other than English is highly warranted, both descriptively and in relation to language development.

Vocal affect in IDS has so far only been studied in mothers. When it comes to prosodic modifications in IDS, fathers display the same patterns of increased fo and greater fo variability in IDS compared to ADS as do mothers (Fernald et al., Citation1989; Gergely et al., Citation2017; Niwano & Sugai, Citation2003; Papoušek et al., Citation1987). However, the results are not consistent across all languages (Benders et al., Citation2021; Broesch & Bryant, Citation2017; Shute & Wheldall, Citation1999). To access the full picture on vocal affect in IDS, it needs to be studied in both mothers and fathers.

In the present study, perceived vocal affect is mapped out in Swedish IDS, spoken by mothers to 3-, 6-, 9- and 12-month-old and by fathers to 12-month-old boys and girls. We use the established practice of asking naïve listeners to rate vocal affect in low-pass filtered recordings of speech (Burnham et al., Citation2002; Kalashnikova & Burnham, Citation2018; Kitamura & Burnham, Citation2003). Vocal affect in Swedish IDS is of particular interest for two reasons: i) in Sweden, parental leave is relatively equally divided between mothers and fathers (Almqvist & Duvander, Citation2014), and it is therefore both possible and relevant to study Swedish fathers’ IDS as well as mothers’; and ii) Swedish is a pitch-accent language, which means that fo occasionally carries lexical information, and vocal affect has not yet been studied in a language where lexical information is conveyed by prosodic modification.

Method

Participants

Participants were Swedish-speaking mothers and fathers who took part with their children in a longitudinal study at Stockholm Babylab, in which play-sessions were recorded (MINT-Project; MAW 2011.0070; Gerholm et al., Citation2015). Included in the current study were 10 mother-infant dyads (7 boys, 3 girls) recorded when the infants were 3, 6, 9, and 12 months, and 6 father-infant dyads (2 boys, 4 girls) recorded at 12 months.Footnote3 Criteria for inclusion was that recordings existed for the desired ages and for an ADS session consisting of a conversation with the experimenter. All families spoke Swedish at home. Parents had a minimum of 12 years of education, and the majority had a university degree. All infants were born full-time and healthy. The families, living in Stockholm County, had originally responded to an invitation to participate in the study, sent to addresses obtained from the Swedish Tax Agency. The study was approved by the Regional Ethics Review Board in Stockholm (2015/63–31).

Speech material

Recording of the parent-infant interaction sessions took place at Stockholm Babylab in a sound-attenuated room with three wall-mounted cameras (Canon HDMI XA10). In addition, the parent wore a Go Pro Hero 3 camera strapped across the chest. Parent and infant were each fitted with a Sennheiser eW 100 G2 microphone. A condenser microphone (AKG SE 300 B) in the room served as backup and was used to synchronize the recordings. The parent was instructed to play freely with the infant on the floor. Besides the free play part of the session from which the IDS recordings used in this study originated, the parent was also asked to perform interactions such as clapping and picture book reading, but these parts were excluded as stimulus material. The play session lasted about 10–15 min.

We followed Kitamura and Burnham’s method to condense the speech material to be rated for vocal affect (Kitamura & Burnham, Citation2003). Three-minute long audio clips were cut out at approximately 1 minute into each recording, respecting utterance boundaries. Recordings were edited to 25 s clips for ratings and acoustic analyses, by reducing pauses longer than 600 ms to 300 ms, extending shorter pauses to 300 ms, and leaving pauses between 300 ms and 600 ms as they were. Utterances with overlaps of parent speech and infant vocalization were excluded, as were instances of laughter, singing, coughing, and non-speech sounds such as toy noise. To remove semantic content, all IDS and ADS clips were low-pass filtered at 400 Hz. Sound files were then normalized for intensity. Acoustic manipulations were performed in Praat 6.0.21 and 6.0.28 (Boersma & Weenink, Citation2010). This resulted in a total of 62 clips (40 IDS and 10 ADS samples from mothers; 6 IDS and 6 ADS samples from fathers) to be used as stimuli in the rating experiment.

Prosodic analyses of the stimuli clips were performed on utterance level within each clip after intensity normalization and low-pass filtering. Utterances were defined as speech segments in the clips, separated by a pause of at least 300 ms. The total number of utterances in the clips was 1046 (845 from mothers and 201 from fathers). We included fo mean, fo median, fo minimum, fo maximum and fo range (difference between max and min) in the prosodic analyses (see ), all measured in Hz using Praat 6.0.40 (Boersma & Weenink, Citation2010). We chose fo median in addition to fo mean as this measure is less prone to be affected by pitch tracking errors which are more likely to occur when analyzing IDS compared to ADS (see Gergely et al., Citation2017). To further minimize potential pitch tracking errors, a two-step pitch extraction was carried out (De Looze & Hirst, Citation2010). In a first step, pitch extraction (autocorrelation) was run with the Praat default settings (floor set to 60 Hz, ceiling to 600 Hz) to create a pitch object. In the second step, the pitch object was used to define the pitch floor and the pitch ceiling in another pitch extraction. Pitch floor and pitch ceiling were set to q15 × 0.83 and q65 × 1.92 with q15 and q65 being the 15th and 65th percentile of the original pitch object.

Table 1. The table lists the prosodic measures fo mean, fo median, fo min, fo max, and fo range (in Hz) for mothers’ ADS and IDS across the first year of life and in fathers’ ADS and IDS to 12-month-olds. Calculations were performed on utterance level.

Rating experiment

Sixteen raters (8 female, mean age = 32 years, SD = 8.9, age range 19–55 years) assessed the vocal affect in the recordings. Raters were recruited via an online recruitment site for research participation. All raters were Swedish native speakers, reported normal hearing, and had at least 9 years of formal education (mandatory basic education in Sweden). Nine participants had university or college level education. For their participation, raters received vouchers for an online book store. An additional 12 raters had to be excluded for either not having Swedish as a native language, not completing the rating study and/or not attending to the stimuli as instructed.

The rating experiment was conducted remotely. Raters were asked to sit in a quiet place with a stable internet connection and to use headphones. Prior to the experiment, participants gave informed consent and provided background information. They were informed about the low-pass filtered nature of the stimuli and instructed to listen to the entire stimulus clip before rating it.

The stimuli clips were presented in random order using online questionnaire software (LimeSurvey 2.05; Schmitz, Citation2015). Each trial consisted of one stimuli clip being presented, and raters were asked to estimate the affect valence and communicative intent of the speech in that clip. Only affect valence is reported in the present study. Affect valence was rated on a nine-point Likert-scale ranging from −4 to + 4 (Kitamura & Burnham, Citation2003), with + 4 meaning perceived high positive affect and −4 perceived high negative affect. Raters controlled the pace of the experiment and could listen to each clip as many times as they wanted before moving on to the next. The time spent on each stimuli clip was checked after completion of the experiment and raters who moved on to the next trial faster than the 25 s stimulus duration were excluded from the results.

Results

Statistical analyses were performed in R 4.0.2 (R Core Team, Citation2021) and the package effectsize (Lakens, Citation2013).

Correlations between affect valence and prosodic measures

To show the relationship between subjective affect ratings and objectively measured prosodic parameters, correlational analyses were conducted. Pearson product-moment correlation coefficients between affect ratings and the prosodic measures fo mean, fo median, fo min, fo max, and fo range were all significant and moderate to strong (α =.05, , ).

Figure 1. Significant correlations of moderate size between affect valence ratings and the prosodic measures fo median, fo min, fo max and fo range. The fo median is chosen to represent overall prosody in this illustration, rather than fo mean, as it is less prone to pitch-tracking errors.

Figure 1. Significant correlations of moderate size between affect valence ratings and the prosodic measures fo median, fo min, fo max and fo range. The fo median is chosen to represent overall prosody in this illustration, rather than fo mean, as it is less prone to pitch-tracking errors.

Table 2. Significance test results for the correlations between affect valence ratings and prosodic measures.

Affect valence in mothers’ IDS

Preliminary analyses of mothers’ affect valence in IDS towards their infants at 3, 6, 9 and 12 months showed only a main effect of infant age, no effect of infant sex and no interaction between infant age and infant sex.

To investigate the effect of infant age on affect valence, we looked at mothers’ speech to infants at 3, 6, 9 and 12 months of age in a within-subjects design. Alpha was set to .05. A one-way repeated-measures ANOVA with infant age as independent variable and affect score as dependent variable revealed a significant difference for infant age with a medium-sized effect size (F(3, 13) = 6.09; p = .001; η2 = .07; see , left panel). Affect valence rating means for mothers’ IDS were then tested for linear, quadratic, and cubic contrasts over infant age. Vocal affect is rated the most positive for 3-month-olds, decreasing to 6-month-olds and further decreasing to 9-month-olds, while rising again for 12-month-olds as the significant quadratic trend shows (F(1,10) = 5.301, p = .023). The linear and cubic trends across all four ages were not significant.

Figure 2. Left: Mean affect valence ratings for mothers’ IDS addressing infants at 3, 6, 9 and 12 months of age (box plots depict the mid quartiles of the distribution). Right: Mean affect valence ratings for mothers’ (blue) and fathers’ (red) IDS towards 12-month-olds and their ADS (box plots depict the mid quartiles of the distribution).

Figure 2. Left: Mean affect valence ratings for mothers’ IDS addressing infants at 3, 6, 9 and 12 months of age (box plots depict the mid quartiles of the distribution). Right: Mean affect valence ratings for mothers’ (blue) and fathers’ (red) IDS towards 12-month-olds and their ADS (box plots depict the mid quartiles of the distribution).

Affect valence in mothers’ and fathers’ IDS

To investigate the difference in affect between mothers’ and father’s IDS and ADS, a 2 × 2 ANOVA was conducted with speaker (mother vs. father) and speech register (IDS to 12-month-olds vs. ADS) as within-subject variables. Due to the small sample size, infant sex was not included as a variable. The test showed a significant effect of speech register with a large effect size (F(1, 12) = 54.4; p < .001; η2 = .34). No main effect was found for speaker (F(1, 12) = .319; p = .58) nor was the interaction between speech register and speaker significant (F(1, 12) = 3.736; p = .072; see , right panel). Both mothers’ and fathers’ IDS is clearly distinctive from their ADS, and mothers and fathers do not differ in degree of vocal affect valence in speech to their 12-month-old infants.

Discussion

The present study shows that perceived vocal affect correlates positively with the prosodic measures fo mean, fo median, fo min, fo max, and fo range. This is in line with previous findings (Johnstone & Scherer, Citation2000), and illustrates the close relationship between subjectively perceived affect and objectively measured acoustic parameters.

The trajectory of vocal affect valence in Swedish mothers’ IDS to 3- to 12-month-old infants follows a quadratic trend, showing a decrease from 3 to 9 months while remaining on the positive part of the scale, increasing again at 12 months. This is similar but not identical to the developmental pattern of vocal affect in IDS in Australian English (Kitamura & Burnham, Citation2003). The score average of 2 on the Swedish affect valence scale for IDS toward 3-month-olds compares well to the average score of 1.8 for Australian English IDS toward 3-month-olds in the graph of the original Australian study (Kitamura & Burnham, Citation2003). However, in that study, Australian English IDS has the highest positive affect valence towards 6-month-olds, whereas the Swedish scores decrease linearly from 3 months onwards. IDS in both languages show a decrease in affect valence towards a dip at 9 months (average affect scores: Australian English 1.5; Swedish 1.1) and a recovery at 12 months (average affect scores: Australian English 2.2; Swedish 1.7). Kalashnikova and Burnham (Citation2018) plotted the developmental trajectory of vocal affect IDS to infants from 7 to 19 months, finding the usual high positive affect in IDS compared to ADS, but no effect of infant age. However, their vocal affect scores were based on an average across several different affect types rather than on affect valence only (Kalashnikova & Burnham, Citation2018) and are thus not directly comparable to the findings of the present study.

The present study is the first to report that vocal affect in fathers’ IDS is more positive than in their ADS, following the same pattern as for mothers. Despite the small sample size, and the fact that only one age group was included, this study gives a first indication of that vocal affect in fathers’ IDS is equal to affect in mothers’ IDS. Given the increased presence of fathers in infants’ early life and the importance of speech input for infant language development (Ferjan Ramírez, Citation2022), both vocal affect and prosodic modifications in IDS need to be studied further in fathers.

To conclude, high vocal affect catches infant attention, thereby potentially contributing to the positive impact of IDS on language learning (Nencheva & Lew-Williams, Citation2022). Despite this, vocal affect in IDS remains understudied. The present study contributes with two points of information on vocal affect in Swedish IDS. It shows that vocal affect valence in mothers’ IDS follows a U-shaped pattern over the first year of infancy, and that mothers’ IDS and fathers’ IDS convey similar degrees of vocal affect valence.

Contributions

Author contributions are specified according to the CRediT system (ANSI/NISO Z39.104–2022). Conceptualization: ICS (equal), EM (equal), CLC (equal); Data curation: EM (lead), LG (supporting); Formal analysis: EM (lead), ICS (supporting), LG (supporting); Funding acquisition: ICS (lead), EM (supporting), LG (supporting), CLC (supporting); Methodology: ICS (equal), EM (equal), CLC (equal); Project administration: EM (lead), ICS (supporting); Resources: ICS (equal), EM (equal), LG (equal), UM (equal); Software: EM (lead), LG (supporting); Supervision: ICS (lead), EM (support); Validation: ICS (equal), EM (equal), LG (equal); Visualization: EM (lead), ICS (supporting), LG (supporting); Writing original draft: ICS (lead), EM (supporting), LG (supporting); Writing review and editing: ICS (lead), EM (lead), LG (lead), UM (supporting), CLC (supporting).

Acknowledgments

The authors thank the MINT project (MAW 2011.0070; PI Tove Gerholm) for providing the parent-infant recordings and the MINT families for their participation. We thank the MARCS Institute for Brain, Behaviour and Development and specifically Christine Kitamura for her input. Thanks also to Klara Hjerpe for stimuli creation and data collection.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was funded by the Marcus and Amalia Wallenberg Foundation (MAW 2013.0056, PI Iris-Corinna Schwarz, co-applicant Christa Lam-Cassettari; MAW 2019.0030, PI Ellen Marklund) and Riksbankens Jubileumsfond (RJ P17-0175, PI Lisa Gustavsson).

Notes

1 Fundamental frequency is abbreviated as fo, which stands for frequency oscillation, in accordance with abbreviation standards in Phonetics (Titze et al., Citation2015).

2 But see also Spinelli et al. (Citation2017).

3 As many fathers took over parental leave toward the end of the first year and therefore started coming to the play sessions at the lab with their infants, father-infant recordings could only be included in this study at infant age 12 months.

References

  • Albin, D. D., & Echols, C. H. (1996). Stressed and word-final syllables in infant-directed speech. Infant Behavior and Development, 19(4), 401–418. https://doi.org/10.1016/S0163-6383(96)90002-8
  • Almqvist, A.-L., & Duvander, A.-Z. E. (2014). Changes in gender equality? Swedish fathers’ parental leave, division of childcare and housework 1. Journal of Family Studies, 20(1), 19–27. https://doi.org/10.5172/jfs.2014.20.1.19
  • Aslin, R. N., Woodward, J. Z., LaMendola, N. P., & Bever, T. G. (1996). Models of word segmentation in fluent maternal speech to infants. In J. L. Morgan & K. Demuth (Eds.), Signal to syntax: Bootstrapping from speech to grammar in early acquisition (pp. 117–134). Lawrence Earlbaum Associates.
  • Banse, R., & Scherer, K. R. (1996). Acoustic profiles in vocal emotion expression. Journal of Personality and Social Psychology, 70(3), 614–636. https://doi.org/10.1037/0022-3514.70.3.614
  • Benders, T. (2013). Mommy is only happy! Dutch mothers’ realisation of speech sounds in infant-directed speech expresses emotion, not didactic intent. Infant Behavior and Development, 36(4), 847–862. https://doi.org/10.1016/j.infbeh.2013.09.001
  • Benders, T., StGeorge, J., & Fletcher, R. (2021). Infant-directed speech by Dutch fathers: Increased pitch variability within and across utterances. Language Learning and Development, 17(3), 292–325. https://doi.org/10.1080/15475441.2021.1876698
  • Boersma, P., & Weenink, D. (2010). Praat: Doing Phonetics by Computer [Computer program],Version 6.0. 40. Retrieved May 28, 2018 from http://www.praat.org/
  • Bortfeld, H., & Morgan, J. (1999). Disentangling multiple sources of stress in infant-directed speech. In A. Greenhill, H. Littlefield, & C. Tano (Eds.), Proceedings of the 23rd Annual Boston University Conference. Somerville, MA: Cascadilla.
  • Bosseler, A. N., Teinonen, T., Tervaniemi, M., Huotilainen, M., & Vicario, D. S. (2016). Infant directed speech enhances statistical learning in newborn infants: An ERP study. PLOS One, 11(9), e0162177. https://doi.org/10.1371/journal.pone.0162177
  • Broesch, T. L., & Bryant, G. A. (2017). Fathers’ infant-directed speech in a small-scale society. Child Development, 89(2), e29–e41. https://doi.org/10.1111/cdev.12768
  • Burnham, D., Kitamura, C., & Vollmer-Conna, U. (2002). What’s new, pussycat? On talking to babies and animals. Science: Advanced Materials and Devices, 296(5572), 1435. https://doi.org/10.1126/science.1069587
  • Cox, C., Bergmann, C., Fowler, E., Keren-Portnoy, T., Roepstorff, A., Bryant, G., & Fusaroli, R. (2022). A systematic review and Bayesian meta-analysis of the acoustic features of infant-directed speech. Nature Human Behaviour, 1–20. https://doi.org/10.1038/s41562-022-01452-1
  • Cristià, A. (2010). Phonetic enhancement of sibilants in infant-directed speech. The Journal of the Acoustical Society of America, 128(1), 424–434. https://doi.org/10.1121/1.3436529
  • De Looze, C., & Hirst, D. (2010). Integrating changes of register into automatic intonation analysis. Paper presented at the Speech Prosody 2010 Conference, paper 209, Chicago, IL.
  • Dodane, C., & Al-Tamimi, J. (2007). An acoustic comparison of vowel systems in adult-directed speech and child-directed speech: Evidence from French, English & Japanese. Paper presented at the 16th International Congress of Phonetic Sciences, Saarbruecken, Germany.
  • Dunst, C. J., Gorman, E., & Hamby, D. W. (2012). Preference for infant-directed speech in preverbal young children. Centre for Early Literacy Learning: Reviews, 5(1), 1–13.
  • Englund, K. T., & Behne, D. (2005). Infant-directed speech in natural interaction - Norwegian vowel quantity and quality. Journal of Psycholinguistic Research, 34(3), 259–280. https://doi.org/10.1007/s10936-005-3640-7
  • Ferjan Ramírez, N. (2022). Fathers’ infant‐directed speech and its effects on child language development. Language and Linguistics Compass, 16(1), e12448. https://doi.org/10.1111/lnc3.12448
  • Fernald, A., & Mazzie, C. (1991). Prosody and focus in speech to infants and adults. Developmental Psychology, 27(2), 209–221. https://doi.org/10.1037/0012-1649.27.2.209
  • Fernald, A., & Morikawa, H. (1993). Common themes and cultural variations in Japanese and American mothers’ speech to infants. Child Development, 64(3), 637–656. https://doi.org/10.2307/1131208
  • Fernald, A., & Simon, T. (1984). Expanded intonation contours in mothers’ speech to newborns. Developmental Psychology, 20(1), 104–113. https://doi.org/10.1037/0012-1649.20.1.104
  • Fernald, A., Taeschner, T., Dunn, J., Papoušek, M., de Boysson-Bardies, B., & Fukui, I. (1989). A cross-language study of prosodic modifications in mothers’ and fathers’ speech to preverbal infants. Journal of Child Language, 16(3), 477–501. https://doi.org/10.1017/S0305000900010679
  • Foursha-Stevenson, C., Schembri, T., Nikoladis, E., & Eriksen, C. (2017). The influence of child-directed speech on word learning and comprehension. Journal of Psycholinguistic Research, 46(2), 329–343. https://doi.org/10.1007/s10936-016-9441-3
  • Frank, M. C., Alcock, K. J., Arias-Trejo, N., Aschersleben, G., Baldwin, D., Barbu, S., Bergelson, E., Bergmann, C., Black, A. K., Blything, R., Böhland, M. P., Bolitho, P., Borovsky, A., Brady, S. M., Braun, B., Brown, A., Byers-Heinlein, K., Campbell, L. E. … The ManyBabies Consortium. (2020). Quantifying sources of variability in infancy research using the infant-directed-speech preference. Advances in Methods and Practices in Psychological Science, 3(1), 24–52. https://doi.org/10.1177/2515245919900809
  • Gergely, A., Faragó, T., Galambos, A., & Topál, J. (2017). Differential effects of speech situations on mothers’ and fathers’ infant-directed and dog-directed speech: An acoustic analysis. Scientific Reports, 7(1), 13739. https://doi.org/10.1038/s41598-017-13883-2
  • Gerholm, T., Gustavsson, L., Salomão, G. L., & Schwarz, I.-C. (2015). The Swedish MINT project: Modelling infant language acquisition from parent-child interaction. Paper presented at the Workshop on Extensive and Intensive Recordings of Children’s Language Environment WEIRCLE, Paris, France.
  • Golinkoff, R. M., Can, D. D., Soderstrom, M., & Hirsh-Pasek, K. (2015). (Baby)talk to me: The social context of infant-directed speech and its effects on early language acquisition. Current Directions in Psychological Science, 24(5), 339–344. https://doi.org/10.1177/0963721415595345
  • Graf Estes, K., & Hurley, K. (2013). Infant-directed prosody helps infants map sounds to meanings. Infancy, 18(5), 797–824. https://doi.org/10.1111/infa.12006
  • Green, J., Nip, I., Wilson, E., Mefferd, A., & Yunusova, Y. (2010). Lip movement exaggerations during infant-directed speech. Journal of Speech, Language, & Hearing Research, 53(6), 1529–1542. https://doi.org/10.1044/1092-4388(2010/09-0005)
  • Johnstone, T., & Scherer, K. R. (2000). Vocal communication of emotion. In M. Lewis & J. Haviland (Eds.), The Handbook of Emotion (pp. 220–235). Guilford.
  • Kalashnikova, M., & Burnham, D. (2018). Infant-directed speech from seven to nineteen months has similar acoustic properties but different functions. Journal of Child Language, 45(5), 1035–1053. https://doi.org/10.1017/S0305000917000629
  • Kalashnikova, M., Carignan, C., & Burnham, D. (2017). The origins of babytalk: Smiling, teaching or social convergence? Royal Society Open Science, 4(8), 170306. https://doi.org/10.1098/rsos.170306
  • Kaplan, P. S., Danko, C. M., & Diaz, A. (2010). A privileged status for male infant-directed speech in infants of depressed mothers? Role of father involvement. Infancy, 15(2), 151–175. https://doi.org/10.1111/j.1532-7078.2009.00010.x
  • Kaplan, P. S., Dungan, J. K., & Zinser, M. C. (2004). Infants of chronically depressed mothers learn in response to male, but not female, infant-directed speech. Developmental Psychology, 40(2), 140–148. https://doi.org/10.1037/0012-1649.40.2.140
  • Karzon, R. G. (1985). Discrimination of polysyllabic sequences by one- to four-month-old infants. Journal of Experimental Child Psychology, 39(2), 326–342. https://doi.org/10.1016/0022-0965(85)90044-X
  • Katz, G. S., Cohn, J. F., & Moore, C. A. (1996). A combination of vocal F0 dynamic and summary features discriminates between three pragmatic categories of infant-directed speech. Child Development, 67(1), 205–217. https://doi.org/10.1111/j.1467-8624.1996.tb01729.x
  • Kemler Nelson, D. G., Hirsh-Pasek, K., Jusczyk, P. W., & Wright Cassidy, K. (1989). How the prosodic cues in motherese may assist language learning. Journal of Child Language, 16(1), 55–68. https://doi.org/10.1017/S030500090001343X
  • Kitamura, C., & Burnham, D. (1998). The infant’s response to maternal vocal affect. In C. Rovee-Collier, L. P. Lipsitt, & H. Hayne (Eds.), Advances in Infancy Research (pp. 221–236). Ablex Publishing Corporation.
  • Kitamura, C., & Burnham, D. (2003). Pitch and communicative intent in mother’s speech: Adjustments for age and sex in the first year. Infancy, 4(1), 85–110. https://doi.org/10.1207/S15327078IN0401_5
  • Kitamura, C., & Lam, C. (2009). Age-specific preferences for infant-directed affective intent. Infancy, 14(1), 77–100. https://doi.org/10.1080/15250000802569777
  • Kitamura, C., Thanavishuth, C., Burnham, D., & Luksaneeyanawin, S. (2002). Universality and specificity in infant-directed speech: Pitch modification as a function of infant age and sex in a tonal and non-tonal language. Infant Behavior and Development, 24(4), 372–392. https://doi.org/10.1016/S0163-6383(02)00086-3
  • Kubozono, H. (2012). Varieties of pitch accent systems in Japanese. Lingua, 122(13), 1395–1414. https://doi.org/10.1016/j.lingua.2012.08.001
  • Kuhl, P. K., Andruski, J. E., Chistovich, I. A., Chistovich, E. I., Kozhevnikova, E. V., Ryskina, V. L., Stolyarova, E. I., Sundberg, U., & Lacerda, F. (1997). Cross-language analysis of phonetic units in language addressed to infants. Science: Advanced Materials and Devices, 277(5326), 684–686. https://doi.org/10.1126/science.277.5326.684
  • Lakens, D. (2013). Calculating and reporting effect sizes to facilitate cumulative science: A practical primer for t-tests and ANOVAs. Frontiers in Psychology, 4(863). https://doi.org/10.3389/fpsyg.2013.00863
  • Lam-Cassettari, C., Kohlhoff, J. (2020). Effect of maternal depression on infant-directed speech to prelinguistic infants: Implications for language development. PloS One, 15(7), e0236787. https://doi.org/10.1371/journal.pone.0236787
  • Lopez, I., Lam-Cassettari, C. (2018). Expression of affect in infant-directed speech to hearing and hearing impaired infants. Proceedings of the 17th Australasian International Conference on Speech Science and Technology (pp. 69–72), Sydney, Australia
  • Ma, W., Golinkoff, R. M., Houston, D. M., & Hirsh-Pasek, K. (2011). Word learning in infant-and adult-directed speech. Language Learning and Development, 7(3), 185–201. https://doi.org/10.1080/15475441.2011.579839
  • Marklund, E., & Gustavsson, L. (2020). The dynamics of vowel hypo- and hyperarticulation in Swedish infant-directed speech to 12-month-olds. Frontiers in Communication, 5, 523768. https://doi.org/10.3389/fcomm.2020.523768
  • Marklund, U., Marklund, E., Lacerda, F., & Schwarz, I. C. (2015). Pause and utterance duration in child-directed speech in relation to child vocabulary size. Journal of Child Language, 42(5), 1158–1171. https://doi.org/10.1017/S0305000914000609
  • Martin, A., Igarashi, Y., Jincho, N., & Mazuka, R. (2016). Utterances in infant-directed speech are shorter, not slower. Cognition, 156, 52–59. https://doi.org/10.1016/j.cognition.2016.07.015
  • Nencheva, M. L., & Lew-Williams, C. (2022). Understanding why infant-directed speech supports learning: A dynamic attention perspective. Developmental Review, 66, 101047. https://doi.org/10.1016/j.dr.2022.101047
  • Newman, R. S., & Hussain, I. (2006). Changes in preference for infant-directed speech in low and moderate noise by 5- to 13-month-olds. Infancy, 10(1), 61–76. https://doi.org/10.1207/s15327078in1001_4
  • Niwano, K., & Sugai, K. (2003). Pitch characteristics of speech during mother-infant and father-infant vocal interactions. Japanese Journal of Special Education, 40(6), 663–674. https://doi.org/10.6033/tokkyou.40.663
  • Odijk, L., & Gillis, S. (2022). Parents tune their vowels to the emergence of children’s words. Journal of Child Language, First View, 1–20. https://doi.org/10.1017/S0305000922000289
  • Panneton Cooper, R., Abraham, J., Berman, S., & Staska, M. (1997). The development of infants’ preference for motherese. Infant Behavior and Development, 20(4), 477–488. https://doi.org/10.1016/S0163-6383(97)90037-0
  • Panneton Cooper, R., & Aslin, R. N. (1990). Preference for infant-directed speech in the first month after birth. Child Development, 61(5), 1584–1595. https://doi.org/10.2307/1130766
  • Panneton, R., Kitamura, C., Mattock, K., & Burnham, D. (2006). Slow speech enhances younger but not older infants’ perception of vocal emotion. Research in Human Development: The Ecology of Emotion in Parenting Relationships: Perspectives Across the Lifespan, 3, 7–19. https://doi.org/10.1207/s15427617rhd0301_2
  • Papoušek, M., Papoušek, H., & Haekel, M. (1987). Didactic adjustments in fathers’ and mothers’ speech to their 3-month-old infants. Journal of Psycholinguistic Research, 16(5), 491–516. https://doi.org/10.1007/BF01073274
  • Papoušek, M., Papoušek, H., & Symmes, D. (1991). The meanings of melodies in motherese in tone and stress languages. Infant Behavior and Development, 14(4), 415–440. https://doi.org/10.1016/0163-6383(91)90031-M
  • Pegg, J. E., Werker, J. F., & McLeod, P. J. (1992). Preference for infant-directed over adult-directed speech: Evidence from 7-week-old infants. Infant Behavior and Development, 15(3), 325–345. https://doi.org/10.1016/0163-6383(92)80003-D
  • R Core Team. (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing. URL. https://www.R-project.org/
  • Ross, E. D., Edmondson, J. A., & Seibert, G. B. (1986). The effect of affect on various acoustic measures of prosody in tone and non-tone languages: A comparison based on computer analysis of voice. Journal of Phonetics, 14(2), 283–302. https://doi.org/10.1016/S0095-4470(19)30669-2
  • Saint-Georges, C., Chetouani, M., Cassel, R., Apicella, F., Madhaoui, A., Muratori, F., Laznik, M.-C., Cohen, D., & Senju, A. (2013). Motherese in interaction: At the cross-road of emotion and cognition? A systematic review. PLOS One, 8(10), e78103. https://doi.org/10.1371/journal.pone.0078103
  • Schmitz, C. (2015). LimeSurvey: An Open Source Survey Tool [Version 2.05]. LimeSurvey GmbH, Survey Services and Consulting, Retrieved from https://www.limesurvey.org
  • Schwarz, I.-C., Lam-Cassettari, C., Marklund, E., & Gustavsson, L. (2019). Does positive affect promote word learning in Australian English learning and Swedish learning 16-month-olds?. Poster presented at the Workshop on Infant Language Development (WILD 2019), Potsdam, Germany.
  • Shinya, T., Nishikawa, K., Igarashi, Y., Kitahara, M., Tanaka, K., & Mazuka, R. (2009). Vowel quality and voice quality in infant-directed speech: Hyperarticulated but soft voices of IDS vowels. In 4th International Workshop on the Interface between Prosody and Informational Structure (pp. 10–11), Komorebi (Shiga), Japan.
  • Shute, B., & Wheldall, K. (1999). Fundamental frequency and temporal modifications in the speech of British fathers to their children. Educational Psychology, 19(2), 221–233. https://doi.org/10.1080/0144341990190208
  • Singh, L., Bortfeld, H., & Morgan, J. (2002). Effects of variability on infant word recognition. In A. H. J. Do, L. Dominguez, & A. Johansen (Eds.), Proceedings of the 26th Annual Boston University Conference on Language Development BUCLD (pp. 608–619). Somerville, MA: Cascadilla.
  • Singh, L., Morgan, J. L., & Best, C. T. (2002). Infants’ listening preferences: Baby talk or happy talk? Infancy, 3(3), 365–394. https://doi.org/10.1207/S15327078IN0303_5
  • Singh, L., Morgan, J. L., & White, K. S. (2004). Preference and processing: The role of speech affect in early spoken word recognition. Journal of Memory and Language, 51(2), 173–189. https://doi.org/10.1016/j.jml.2004.04.004
  • Singh, L., Nestor, S., Parikh, C., & Yull, A. (2009). Influences of infant-directed speech on early word recognition. Infancy, 14(6), 654–666. https://doi.org/10.1080/15250000903263973
  • Song, J. Y., Demuth, K., & Morgan, J. (2010). Effects of the acoustic properties of infant-directed speech on infant word recognition. The Journal of the Acoustical Society of America, 128(1), 389–400. https://doi.org/10.1121/1.3419786
  • Spinelli, M., Fasolo, M., & Mesman, J. (2017). Does prosody make the difference? A meta-analysis on relations between prosodic aspects of infant-directed speech and infant outcomes. Developmental Review, 44, 1–18. https://doi.org/10.1016/j.dr.2016.12.001
  • Takada, A. (2005). Early vocal communication and social institution: Appellation and infant verse addressing among the Central Kalahari San. Crossroads of Language, Interaction and Culture, 6, 80–108.
  • Thiessen, E. D., Hill, D. S., & Saffran, J. R. (2005). Infant-directed speech facilitates word segmentation. Infancy, 7(1), 53–71. https://doi.org/10.1207/s15327078in0701_5
  • Titze, I. R., Baken, R. J., Bozeman, K. W., Granqvist, S., Henrich, N., Herbst, C. T., Howard, D. M., Hunter, E. J., Kaelin, D., Kent, R. D., Kreiman, J., Kob, M., Löfqvist, A., McCoy, S., Miller, D. G., Noé, H., Scherer, R. C., Smith, J. R., Wolfe, J. (2015). Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization. The Journal of the Acoustical Society of America, 137(5), 3005–3007. https://doi.org/10.1121/1.4919349
  • Toda, S., Fogel, A., & Kawai, M. (1990). Maternal speech to three-month-old infants in the United States and Japan. Journal of Child Language, 17(2), 279–294. https://doi.org/10.1017/S0305000900013775
  • Trainor, L. J., Austin, C. M., & Desjardins, R. N. (2000). Is infant-directed speech prosody a result of the vocal expression of emotion? Psychological Science, 11(3), 188–195. https://doi.org/10.1111/1467-9280.00240
  • Wang, Y., Lee, C. S., & Houston, D. M. (2016). Infant-directed speech reduces English-learning infants’ preference for trochaic words. The Journal of the Acoustical Society of America, 140(6), 4101–4110. https://doi.org/10.1121/1.4968793
  • Werker, J. F., & McLeod, P. J. (1989). Infant preference for both male and female infant-directed talk: A developmental study of attentional and affective responsiveness. Canadian Journal of Psychology / Revue Canadienne de Psychologie, 43(2), 230–246. https://doi.org/10.1037/h0084224
  • Werker, J. F., Pegg, J. E., & McLeod, P. J. (1994). A cross-language investigation of infant-preference for infant-directed communication. Infant Behavior and Development, 17(3), 323–333. https://doi.org/10.1016/0163-6383(94)90012-4