Validation of In-Vehicle Speech Recognition Using Synthetic Mixing 2017-01-1693
This paper describes a method to validate in-vehicle speech recognition by combining synthetically mixed speech and noise samples with batch speech recognition. Vehicle cabin noises are prerecorded along with the impulse response from the driver's mouth location to the cabin microphone location. These signals are combined with a catalog of speech utterances to generate a noisy speech corpus. Several factors were examined to measure their relative importance on speech recognition robustness. These include road surface and vehicle speed, climate control blower noise, and driver's seat position. A summary of the main effects from these experiments are provided with the most significant factors coming from climate control noise. Additionally, a Signal to Noise Ratio (SNR) experiment was conducted highlighting the inverse relationship with speech recognition performance.