Skip Navigation

 A Longitudinal Study of Language Adaptation 

at Multiple Timescales in Native- and Non-Native Speakers



An essential resource for STEM (science, technology, engineering, and mathematics) education in the U.S. is the large workforce of talented graduate students who come to U.S. universities to study, conduct research, and teach American undergraduates. It is essential that native-English-speaking undergraduates and non-native English-speaking international graduate students who serve as teaching assistants (ITAs) learn to adapt to and understand one another.  Researchers at Stony Brook University in the Psychology, Linguistics, and Asian & Asian-American Studies Departments are conducting an NSF-funded longitudinal study of language development and communication among STEM students. 

The data presented here were recorded using multiple methods and tasks, from over 100 speakers. ITAs whose first language is not English were followed for two years, during which they completed assessments and provided recordings at regular intervals, resulting in a variety of types of data measured repeatedly for individuals. Here we present audio data for a small sample to exemplify the larger data set.  The complete data set consists of 3 waves of longitudinal data, each beginning when the ITA first comes to campus from their home country, and spanning 4 semesters. The complete data set, including behavioral measures from undergraduates, will be shared with the research community as the different waves of data collection are completed.

This portal includes data from the following tasks:



Data Format


Reading from a word list

sound files (.wav)


Focus dialogues

sound files (.wav)


Spontaneous speech

sound files (.wav)

Data obtained from these tasks can be downloaded from this portal. Click on the tabs above for more information about individual tasks and data available for download.

Login to Edit