Improving Statistical Model-Based Speech Enhancement with Deep Neural Networks

Systems and methods improving the performance of statistical model-based single-channel speech enhancement systems using a deep neural network (DNN) are disclosed. Embodiments include a DNN-trained system to predict speech presence in the input signal, and this information can be used to create frameworks for tracking noise and conducting a priori signal to-noise ratio estimation. Example frameworks provide increased flexibility for various aspects of system design, such as gain estimation. Examples include training a DNN to detect speech in the presence of both noise and reverberation, enabling joint suppression of additive noise and reverberation. Example frameworks provide significant improvements in objective speech quality metrics relative to baseline systems.

Departments: Lincoln Laboratory
Technology Areas: Artificial Intelligence (AI) and Machine Learning (ML) / Biotechnology: Biomedical Devices & Systems / Communication Systems: Wireless / Computer Science: Networking & Signals / Sensing & Imaging: Acoustics

  • systems and methods for improving model-based speech enhancement with neural networks
    United States of America | Granted | 11,227,586

License this technology

Interested in this technology? Connect with our experienced licensing team to initiate the process.

Sign up for technology updates

Sign up now to receive the latest updates on cutting-edge technologies and innovations.

More Technologies