A Collection of Scripts for Training Deep Neural Network Models to Associate Visual Images with Speech Audio

Researchers

David Harwath / James Glass

Departments: Computer Science & Artificial Intelligence Lab
Technology Areas: Artificial Intelligence (AI) and Machine Learning (ML) / Sensing & Imaging: Acoustics, Optical Sensing

