Conformer is All You Need for Visual Speech Recognition

Published in ICASSP 2024 IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

A study on applying the Conformer architecture to visual speech recognition.

Citation: O Chang, H Liao, D Serdyuk, A Shah, O Siohan, “Conformer is All You Need for Visual Speech Recognition,” in ICASSP 2024 IEEE International Conference on Acoustics, Speech and Signal Processing, 2024.
