VALL-E
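VALL-E is a zero-shot text-to-speech (TTS) language model based on residual vector quantization (RVQ): it generates speech by predicting the discrete RVQ codes of a neural audio codec. See the paper "Neural Codec Language Models are Zero-Shot Text-to-Speech Synthesizers".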
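To make the RVQ idea concrete, here is a minimal sketch of residual vector quantization on a single 2-D vector. It is a toy illustration only, not VALL-E's codec: the hand-written codebooks and the helper names (`nearest`, `rvq_encode`, `rvq_decode`) are assumptions made for this example, whereas a real codec learns its codebooks from audio. Each stage quantizes whatever the previous stages left over, so one vector becomes a short list of code indices.

```python
def nearest(codebook, vec):
    """Return the index of the codeword closest to vec (squared distance)."""
    def sq_dist(codeword):
        return sum((c - v) ** 2 for c, v in zip(codeword, vec))
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i]))


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; return one code index per codebook."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        i = nearest(codebook, residual)
        codes.append(i)
        # The next stage only sees what this stage failed to capture.
        residual = [r - c for r, c in zip(residual, codebook[i])]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct the vector by summing the chosen codeword of every stage."""
    out = [0.0] * len(codebooks[0][0])
    for i, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[i])]
    return out


# Hypothetical toy codebooks: the same four directions at halving scales.
signs = [(-1, -1), (-1, 1), (1, -1), (1, 1)]
codebooks = [[(s * a, s * b) for a, b in signs] for s in (1.0, 0.5, 0.25)]

x = (0.8, -0.3)
codes = rvq_encode(x, codebooks)
recon = rvq_decode(codes, codebooks)
print(codes, recon)  # [2, 1, 3] [0.75, -0.25]
```

With these codebooks, each added stage brings the reconstruction closer to `x`. A token-based model such as VALL-E works with index lists like `codes` rather than with the raw waveform.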