Publications

. Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech. NAACL, 2025.

Preprint PDF Project Slides Ref Audio Examples

. Learning the joint distribution of two sequences using little or no paired data. ICML SpiGM Workshop, 2023.

Preprint PDF

. Speaker Generation. ICASSP, 2022.

Preprint PDF Project Slides Ref Audio Examples

. Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis. ICASSP, 2021.

Preprint PDF Project Slides Ref Audio Examples

. Non-Saturating GAN Training as Divergence Minimization. arXiv, 2020.

Preprint PDF

. Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis. ICASSP, 2019.

Preprint PDF Project Ref Audio Examples

. Semi-Supervised Generative Modeling for Controllable Speech Synthesis. ICLR, 2019.

Preprint PDF Project Ref Audio Examples

. Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis. arXiv, 2019.

Preprint PDF Project Audio Examples

. Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. ICML, 2018.

Preprint PDF Project Poster Slides Video Ref Audio Examples Blog Post

. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. ICML, 2018.

Preprint PDF Project Source Document Ref Audio Examples Blog Post

. Uncovering Latent Style Factors for Expressive Speech Synthesis. NIPS ML4Audio Workshop, 2017.

Preprint PDF Project Poster Audio Examples Workshop

. Exploring Neural Transducers for End-to-End Speech Recognition. ASRU, 2017.

Preprint PDF Project Ref

. Reducing Bias in Production Speech Models. arXiv, 2017.

Preprint PDF Project

. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. ICML, 2016.

Preprint PDF Project Slides Ref

. Lasagne: First Release. GitHub, 2015.

Code Project 0.1 Ref

. LibROSA: Audio and Music Signal Analysis in Python. SciPy, 2015.

PDF Code Project 0.5.0 Ref

. Scalable Multimedia Content Analysis on Parallel Platforms Using Python. TOMCCAP, 2014.

PDF Project Ref

. Real-Time Musical Applications on an Experimental Operating System for Multi-Core Processors. ICMC, 2011.

PDF Project

. Advances in the Parallelization of Music and Audio Applications. ICMC, 2010.

PDF Project

. Optimizing Hearing Aids for Music Listening. ICA, 2007.

PDF