Senior Researcher – Voice AI

Huawei Canada has an immediate 12-month contract opening for a Senior Researcher.

About the team:

The Huawei Human-Machine Interaction Lab unites global researchers, engineers, and designers to redefine human-technology relationships through user-centered, hands-on research. We focus on agentic AI and multimodal interaction (voice, touch, vision, gesture) across smartphones, wearables, and emerging devices, advancing agentic workflows, multi-agent orchestration, and intuitive human-AI interfaces. By tightly integrating sensing, algorithms, and systems, our prototype-driven work ships directly to products, enabling seamless task delegation and human-AI collaboration at scale.

About the job:

  • Conduct advanced research and rapid prototyping in speech and audio AI, including speech enhancement, separation, recognition, speaker modeling, and audio-language/vision models.

  • Design, implement, and evaluate state-of-the-art deep learning architectures for speech and audio understanding.

  • Contribute to Huawei’s next-generation intelligent products, including smartphones, earbuds, wearables, and smart glasses, by developing innovative audio AI capabilities.

  • Collaborate closely with research scientists, software engineers, and product teams to translate research outcomes into deployable systems.

  • Stay current with emerging technologies in audio, multimodal, and large foundation models, and contribute to publications, patents, or product features.

  • Present research progress and findings to internal and external audiences.

The total target annual compensation (based on 2,080 hours per year) ranges from $127,000 to $225,000 depending on education, experience, and demonstrated expertise.

About the ideal candidate:

  • PhD or Master's degree in Electrical Engineering, Computer Science, Speech and Audio Processing, Machine Learning, or a related field.

  • Strong background in speech/audio signal processing, including time–frequency analysis, speech enhancement, and feature extraction.

  • Hands-on experience developing and training deep learning models for speech, audio, or multimodal applications using PyTorch, TensorFlow, or JAX.

  • Experience with speech foundation models, self-supervised audio pretraining, or multimodal learning (audio-language, audio-vision).

  • Proficiency in Python and solid experience in implementing, debugging, and optimizing research code for experiments and deployment.

  • Strong ability to prototype quickly, conduct comprehensive evaluations, and iterate based on experimental results.

  • Experience deploying AI models into real-time or embedded systems for mobile or wearable devices.

  • Familiarity with datasets, benchmarks, and evaluation metrics commonly used in speech processing and audio-language tasks.

  • Proven research record, demonstrated by first-authored papers in top-tier venues (e.g., ICASSP, INTERSPEECH, NeurIPS, ICLR, ICML, ACL), patents, or released systems.

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team; we do not use artificial intelligence tools to screen or select candidates.
