Allison Koenecke, Information Science
Dialectal Fairness in Korean Speech-to-Text Technology
Using a Korean corpus of five regional dialects plus “standard” Korean speech, we address the problem of speech-to-text fairness in commercial technology. Will non-standard dialects have worse error rates, and what are the drivers and remedies for disparities? Comprehensive linguistic analysis of Korean dialects follows.