Authors:
- Zaid Sheikh (Carnegie Mellon University, USA)
- Shuichiro Shimizu (Kyoto University, Japan)
- Siddhant Arora (Carnegie Mellon University, USA)
- Jiatong Shi (Carnegie Mellon University, USA)
- Samuele Cornell (Carnegie Mellon University, USA)
- Xinjian Li (Carnegie Mellon University, USA)
- Shinji Watanabe (Carnegie Mellon University, USA)
This paper introduces the Scalable Spontaneous Speech Dataset (SSSD) project, comprising 727 hours of spontaneous English conversations between pairs of randomly matched, anonymous participants on the Amazon Mechanical Turk (MTurk) crowdsourcing platform. Conversations average 25-30 minutes and cover a wide range of everyday topics. A key innovation of this work is our approach to maximizing the number of MTurk workers concurrently participating in our task, which enables more effective randomized matching and live two-person conversations. Data quality is ensured through a two-tiered task structure: a qualification round to select reliable workers, followed by the main recording sessions. We detail our methodology for collecting and recording spontaneous voice conversations, present analyses of the dataset's conversational content and speech quality in comparison to other datasets, and discuss potential uses.[1]
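The randomized matching described above can be sketched as a simple pairing step over the pool of concurrently active workers. This is an illustrative sketch only, not the authors' implementation; the function name and odd-worker-out handling are assumptions.

```python
import random


def match_workers(active_workers, seed=None):
    """Randomly pair concurrently active workers into two-person sessions.

    Hypothetical sketch of randomized matching: shuffle the current pool,
    pair adjacent workers, and leave any odd worker out unmatched until
    the next matching round.
    """
    rng = random.Random(seed)
    pool = list(active_workers)
    rng.shuffle(pool)
    # Pair off shuffled workers two at a time.
    pairs = [(pool[i], pool[i + 1]) for i in range(0, len(pool) - 1, 2)]
    # With an odd pool size, one worker waits for the next round.
    unmatched = pool[len(pool) - len(pool) % 2:]
    return pairs, unmatched
```

Because pairing is done over whichever workers are active at the same moment, keeping many workers concurrently in the task directly increases the chance that any two strangers can be matched live.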
Footnotes
1. This website template was adapted from eliahuhorwitz/Academic-project-page-templat