Michele Mancusi

Senior Research Scientist, PhD

Sony

Hello world! 👋

I’m Michele Mancusi, a Senior Research Scientist at Sony. My work focuses on deep learning for generative models in speech, audio, and music, using Large Language Models (LLMs) and diffusion models to push the boundaries of what’s possible in audio technology.

Before joining Sony, I gained valuable experience as an intern at Microsoft and Musixmatch. At Microsoft, I worked on deep learning for unsupervised speech separation, and at Musixmatch, I focused on deep learning for singing voice detection.

I earned my Ph.D. from Sapienza University of Rome under the supervision of Prof. Emanuele RodolĂ  as a member of the Gladia research group. My doctoral research centered on music generation, source separation, and Natural Language Processing (NLP), contributing to advancements in the field of generative AI.

Interests
  • Deep Learning
  • Signal Processing
  • Generative AI
  • Music Generation
  • Source Separation
  • NLP
  • Speech Synthesis
Education

  • Ph.D., Sapienza University of Rome
Work Experience

Sony
Senior Research Scientist
Apr 2024 – Present · Stuttgart, Germany
  • Research on deep learning generative models for speech and audio, using LLMs and diffusion models.

Sony
Visiting Research Scientist
Nov 2023 – Jan 2024 · Stuttgart, Germany
  • Worked with Dr. Stefan Uhlich in the AI, Speech and Sound Group. Conducted research on deep learning for effects removal and timbre transfer with diffusion models.

Microsoft
Research Scientist Intern
Jun 2023 – Sep 2023 · Redmond, Washington, USA
  • Worked with Dr. Sebastian Braun in the Audio and Acoustics Research Group. Conducted research on deep learning for unsupervised speech separation.

Musixmatch
Data Scientist Intern
Sep 2022 – Mar 2023 · Bologna, Italy
  • Worked with Dr. Loreto Parisi in the AI Team. Conducted research on deep learning for singing voice detection.

Publications

Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer
ICASSP 2025
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
ICASSP 2025
High-Resolution Speech Restoration with Latent Diffusion Model
ICASSP 2025
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Oral (top 1%) @ ICLR 2024
Accelerating transformer inference for translation via parallel decoding
ACL 2023

Achievements

  • First-author paper selected among the top 1% submissions for an oral presentation at ICLR 2024
  • Awarded €50,000 in AWS credits for the best generative AI research project
  • Awarded €20,000 for the best machine translation research project
  • Recognized as one of the top doctoral research projects and awarded research funding

Academic Experience

  • Lectured in the Deep Learning and Applied AI MSc course and mentored students on their Master’s theses
  • Delivered a guest lecture on the Latent Autoregressive Source Separation paper at the invitation of Prof. Ronald Coifman

Attended

AAAI 2023: The 37th AAAI Conference on Artificial Intelligence
4th International Summer School on AI and Games
IRDTA: 4th International School on Deep Learning
ACDL: 3rd Advanced Course on Data Science & Machine Learning

Skills

  • Lightning
  • Weights & Biases
  • PyTorch
  • Hydra
  • AWS SageMaker
  • Azure ML
  • Slurm
  • HTCondor

Contact