In 2025, I spent another year working like a scientist studying tour pros like lab rats making their way around a maze.
Abstract: We present SegINR, a novel approach to neural Text-to-Speech (TTS) that eliminates the need for either an auxiliary duration predictor or autoregressive (AR) sequence modeling for alignment.