MLCommons and Hugging Face team up to release massive speech dataset for AI research

MLCommons, a nonprofit AI safety working group, has teamed up with AI dev platform Hugging Face to release one of the world’s largest collections of public domain voice recordings for AI research. The dataset, called Unsupervised People’s Speech, contains more than a million hours of audio spanning at least 89 different languages. MLCommons says it […] © 2024 TechCrunch. All rights reserved. For personal use only.

MLCommons and Hugging Face team up to release massive speech dataset for AI research
MLCommons, a nonprofit AI safety working group, has teamed up with AI dev platform Hugging Face to release one of the world’s largest collections of public domain voice recordings for AI research. The dataset, called Unsupervised People’s Speech, contains more than a million hours of audio spanning at least 89 different languages. MLCommons says it […] © 2024 TechCrunch. All rights reserved. For personal use only.