Refine Your Search

Search Results

Viewing 1 to 2 of 2
Journal Article

The Impact of Microphone Location and Beamforming on In-Vehicle Speech Recognition

2017-03-28
2017-01-1692
This paper describes two case studies in which multiple microphone processing (beamforming) and microphone location were evaluated to determine their impact on improving embedded automatic speech recognition (ASR) in a vehicle hands-free environment. While each of these case studies was performed using slightly different evaluation set-ups, some specific and general conclusions can be drawn to help guide engineers in selecting the proper microphone location and configuration in a vehicle for the improvement of ASR. There were some outcomes that were common to both dual microphone solutions. When considering both solutions, neither was equally effective across all background noise sources. Both systems appear to be far more effective for noise conditions in which higher frequency energy is present, such as that due to high levels of wind noise and/or HVAC (heating, ventilation and air conditioning) blower noise.
Journal Article

Validation of In-Vehicle Speech Recognition Using Synthetic Mixing

2017-03-28
2017-01-1693
This paper describes a method to validate in-vehicle speech recognition by combining synthetically mixed speech and noise samples with batch speech recognition. Vehicle cabin noises are prerecorded along with the impulse response from the driver's mouth location to the cabin microphone location. These signals are combined with a catalog of speech utterances to generate a noisy speech corpus. Several factors were examined to measure their relative importance on speech recognition robustness. These include road surface and vehicle speed, climate control blower noise, and driver's seat position. A summary of the main effects from these experiments are provided with the most significant factors coming from climate control noise. Additionally, a Signal to Noise Ratio (SNR) experiment was conducted highlighting the inverse relationship with speech recognition performance.
X