Publications

(April 2021) DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
(April 2022) SNRi Target Training for Joint Speech Enhancement and Recognition
(Oct 2022) WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
(Mar 2023) Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
(May 2023) LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
(Mar 2025) Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
(Mar 2025) ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
(Mar 2025) Source Separation by Flow Matching