Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Part of Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks 2021), Round 1

Authors

Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab HEBA, Titouan Parcollet

Abstract

This paper introduces Timers and Such, a new open-source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with several ASR-based and end-to-end baseline models, the code for which is available as part of the SpeechBrain toolkit.