




I am Quan Wang, a Senior Staff Software Engineer at Google DeepMind, New York City, NY.
I lead the Hotword Modeling team and the Speaker, Voice & Language team at Google DeepMind. The teams deliver a diverse set of server-side and on-device speech models to Google's product ecosystem, including “Hey Google” spoken keyword spotting, voice match, language recognition, spoofed speech detection, speech enhancement, speaker diarization, and multilingual speech recognition. Our server-side models power numerous speech features in Google Search, YouTube, Google Cloud, and Google Assistant, used by billions. Our on-device models are deployed on billions of Android phones, tablets, Chromebooks, cars, and wearables across the globe.
I am an IEEE Senior Member, and was previously a Machine Learning Scientist at Amazon in Boston.
I completed my Ph.D. in Computer & Systems Engineering at Rensselaer Polytechnic Institute, advised by Prof. Kim L. Boyer.
I completed my B.Eng. in Automation at Tsinghua University in 2010, advised by Prof. Qionghai Dai.
Awards I received include:
Contact me:
We are hosting the SANE 2025 workshop at Google NYC this year. Please submit your posters!
Learn about Speaker Recognition and Speaker Diarization with me on Udemy.
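You are welcome to enroll in my Chinese-language course on Speaker Recognition (声纹识别). Enrollment link:
You are welcome to purchase the textbook I wrote: 《声纹技术:从核心算法到工程实践》 (Voiceprint Technology: From Core Algorithms to Engineering Practice) (award-winning)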