Turns out it is great!
GitHub - nari-labs/dia: A TTS model capable of generating ultra-realistic dialogue in one pass.