Chen Zhang (章晨)

FAIR, Meta

📝 Publications

🎙 Speech Translation and Synthesis

TASLP RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging
ICLR 2024 Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. | Project
ICLR 2023 Bag of Tricks for Unsupervised Text-to-Speech | Project
ICASSP 2021 Denoising Text to Speech with Frame-Level Noise Modeling | Project
AAAI 2021 UWSpeech: Speech to Speech Translation for Unwritten Languages | Project
INTERSPEECH 2023 EE-TTS: Emphatic Expressive TTS with Linguistic Information | Project
ACL 2020 SimulSpeech: End-to-End Simultaneous Speech to Text Translation
IJCAI 2020 Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation

🎼 Music Generation and Retrieval

NeurIPS 2024 MimicTalk: Mimicking a personalized and expressive 3D talking face in few minutes | Code .
ICLR 2024 Real3d-portrait: One-shot realistic 3d talking portrait synthesis | Project | Code .
ICML 2023 Workshop Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | Video
NeurIPS 2022 Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization
ACM-MM 2020 FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
IJCAI 2019 Discriminative and Correlative Partial Multi-Label Learning