The high cost of data acquisition makes Automatic Speech Recognition (ASR) model training problematic for most existing languages, including languages that do not even have a written script, or which the phone inventories remain unknown. Past works explored multilingual training, transfer learning, as well zero-shot learning in order to build ASR systems these low-resource languages. While it h...