HuMS | Download

Please visit the GitHub Page to access the corpus. Alternatively, you can download different versions of the corpus directly from this page. These include the full set of recordings (HuMS), as well as smaller subsets of recordings (HuMS-A1, HuMS-A2) that have been validated by a larger number of raters.

Full Corpus (HuMS)

This release is for the full corpus (i.e., all 4,834 recordings). In addition to the recordings, the release contains spectrograms of each recording and a spreadsheet (HuMS.csv) with the acoustic, linguistic, and perceptual analyses of each recording. The perceptual ratings come from a set (n = 5) of internal raters, who rated all stimuli in the corpus.

Abridged Version 01 (HuMS-A1)

Download (45.5 MB)

The HuMS-A1 contains 160 recordings - the top 20 and bottom 20 recordings in terms of perceived musicality for the categories of talker sex (male, female) and intended audience (child-directed, adult-directed). The download includes the 160 recordings, as well as a data file (HuMS-A1.csv) containing the acoustic, linguistic, and perceptual ratings from both the smaller set of trusted raters (n = 5) as well as a larger group of raters (total n = 100; n = 50 per talker sex).

Abridged Version 02 (HuMS-A2)

Download (115 MB)

The HuMS-A2 contains 420 recordings - the top 15 and bottom 15 recordings in terms of perceived musicality for each unqiue talker in the corpus (n = 14). The download includes the 420 recordings, as well as a data file (HuMS-A2.csv) containing the acoustic, linguistic, and perceptual ratings from both the smaller set of trusted raters (n = 5) as well as a larger group of raters (total n = 320; n = 18 to 27 per talker).

Download

Huron Musical Speech (HuMS) Corpus

Full Corpus (HuMS)

Abridged Version 01 (HuMS-A1)

Abridged Version 02 (HuMS-A2)

The development of this corpus was supported by a SSHRC Insight Development Grant (2024).

© 2026 Stephen Van Hedger

Full Corpus (HuMS)

Abridged Version 01 (HuMS-A1)

Abridged Version 02 (HuMS-A2)

The development of this corpus was supported by a SSHRC Insight Development Grant (2024). © 2026 Stephen Van Hedger

The development of this corpus was supported by a SSHRC Insight Development Grant (2024).

© 2026 Stephen Van Hedger