📝 Publications
🎙 Speech Translation and Synthesis
TASLP
RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference LeveragingICLR 2024
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. | ProjectICLR 2023
Bag of Tricks for Unsupervised Text-to-Speech | ProjectICASSP 2021
Denoising Text to Speech with Frame-Level Noise Modeling | ProjectAAAI 2021
UWSpeech: Speech to Speech Translation for Unwritten Languages | ProjectINTERSPEECH 2023
EE-TTS: Emphatic Expressive TTS with Linguistic Information | ProjectACL 2020
SimulSpeech: End-to-End Simultaneous Speech to Text TranslationIJCAI 2020
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
🎼 Music Generation and Retrieval
TMM
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation | ProjectTAFFC
REMAST: Real-time Emotion-based Music Arrangement with Soft TransitionACM-MM 2022
ReLyMe: Improving Lyric-to-Melody Generation by Incorporating Lyric-Melody Relationships | ProjectACM-MM 2022
SongDriver: Real-time Music Accompaniment Generation without Logical Latency nor Exposure BiasISMIR 2022
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics TranscriptionACL 2022
Automatic Song Translation for Tonal Languages | ProjectICASSP 2022
S3T: Self-Supervised Pre-training with Swin Transformer for Music ClassificationEMNLP 2022
TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method | Project
🧑🎨 Multi-modal Learning
NeurIPS 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in few minutes | Code.
ICLR 2024
Real3d-portrait: One-shot realistic 3d talking portrait synthesis | Project | Code.
ICML 2023 Workshop
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | VideoNeurIPS 2022
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object LocalizationACM-MM 2020
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-FireIJCAI 2019
Discriminative and Correlative Partial Multi-Label Learning