Streaming non-autoregressive model for accent conversion and pronunciation improvement

Audio examples

Input with Arabic accent Output audio for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model Synthetic Ground-Truth
Input with Chinese accent Output audio for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model Synthetic Ground-Truth
Input with Vietnamse accent Output audio for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model Synthetic Ground-Truth
Input with Indian accent Output audio for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model Synthetic Ground-Truth
Input with Korean accent Output audio for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model Synthetic Ground-Truth

Video examples

Input Video with Indian accent Output Video for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model


Input Video with Chinese accent Output Video for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model




Input Video with Vietnamse accent Output Video for Accent conversion and Improving Pronunciation
Non-Streaming model Our streaming model