Publications
(April 2021)
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
(April 2022)
SNRi Target Training for Joint Speech Enhancement and Recognition
(Oct 2022)
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
(Mar 2023)
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
(May 2023)
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
(Mar 2025)
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
(Mar 2025)
ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
(Mar 2025)
Source Separation by Flow Matching