These files are part of our longitudinal study, Study of Environmental Effects on Developing Linguistic Skills (SEEDLingS). This volume only includes recordings taken at 6 months of age.
The recordings in this volume were analyzed for the Bergelson & Aslin citation above, alongside eyetracking data.
(the code and eyetracking data for the paper will be shared via github link below once PNAS embargo is lifted).
The broader project is described below:
SEEDLingS is a project exploring how infants' early linguistic and environmental input plays a role in their learning. We focus on understanding how babies learn words between 6 and 18 months of age from the visual, social, and linguistic world around them. By looking at the complex environment that babies are exposed to, from their perspective, we can attempt to decode how the developing mind interprets and organizes the objects and words it faces. SEEDLingS is unique in that it combines well-controlled studies in the lab that assess what words infants know, with in-the-home audio and video recordings of what words infants hear, and what they see when they hear these words.
Video and audio recordings were generated in the home every month, from 6 to 17 months of age, for a set of 44 infants. The goal of this study is to assess infants' language growth over this time period, particularly in the word learning domain. Every two months, infants came into the lab for an eye-tracking study to test their word comprehension (and for older infants, their word production).
This volume includes the audio and video recordings from 6 month home visits. Corresponding test dates for each audio and video recording are included as a supplementary spreadsheet, which can be accessed in the materials folder of this volume.
The day-long audio recordings were generated using child-perspective LENA recorders (LENA Research Foundation, Boulder, Colorado, United States) worn by the infant. The audio recordings are generated from one single LENA audio recording, converted from LENA's propriety algorithmic output (.its) for annotation in CHA format.
The hour-long video recordings show a composite view of infants' typical lives with 1-4 camera feeds. In the standard setup, infants are equipped with 2 headcams, and a centralized camcorder that captures the entire room. The precise arrangement and number of cameras varies per video, as a function of whether the child would wear the hat with the cameras, and whether the cameras' files became corrupt during the recordings.
Shared files have been scrubbed for certain personal information (e.g. full names, addresses, etc.); this leads to some silent periods on the audio track and some black-out periods on the video track. Only sections of the files that have been verified to contain no extremely personal content by human listeners (or from which such info has been scrubbed) are shared here. If you notice anything that you believe we may have missed in terms of personal information, please contact us as soon as possible so we can rectify the issue.
Infants in this sample are from the upstate New York area. The sample is generally middle class, with a range of income and an above-average maternal education level. The sample is predominantly white. All infants heard majority English at home (>75%) and had no known vision or hearing issues at birth.
Please contact Elika Bergelson directly to discuss further aspects of the sample design, annotation, and analysis at firstname.lastname@example.org
These data were collected at the University of Rochester, and continue to be analyzed presently at Duke University.
Further details of the project are available on our website, wiki, and GitHub repo, linked below.