stefanlattner.bsky.social
Research Leader @ Sony CSL Paris
13 posts 55 followers 41 following

🤩 From our series "@ieeeICASSP paper released", we announce that "Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures" is online! 📜 Paper: arxiv.org/pdf/2411.19806 Thx to my colleagues Alain Riou, Geoffroy Peeters, Gaetan Hadjeres and Antonin Gagneré! 🎶 SonyCSLMusic 🎶

Our #ICASSP paper "Hybrid Losses for Hierarchical Embedding Learning" by Haokun Tian et al. is now online! 💫 We assess the organization of a hierarchical embedding space using different (combinations of) losses and improve on the SOTA. 📜 Paper: arxiv.org/pdf/2501.12796 #SonyCSLParis

Recently, I had the honour of giving a keynote speech on Audio Representation Learning and Generation at the DMRN+ workshop at @c4dm at Queen Mary University. 💫 🎬🎙️ Recording: echo360.org.uk/media/f037dc... 🎶 More Info: www.qmul.ac.uk/dmrn/dmrn19/

Our #ICASSP paper "Estimating Musical Surprisal in Audio" is now online. 😯 <- surprised 😁 Great work by Mathias Bjare and Giorgia Cantisani! 👏 We use an autoregressive transformer and Gaussian mixture models to estimate the information content in music2latent representations. 🧵👇

🎶✨ New Paper Announcement! ✨🎶 We present "Improving Musical Accompaniment Co-creation via Diffusion Transformers" 🎹🎸, a study advancing our Diff-A-Riff stem generator through improved quality, efficiency, and control. 📜 Read the full paper here: arxiv.org/pdf/2410.23005 🧵👇

πŸ§‘β€πŸŽ“ Our #ISMIR Conference Tutorial "Deep Learning 101 for Audio-based MIR" provides a broad introduction to music audio processing, analysis, and generation. πŸ“˜ The book and jupyter notebooks: geoffroypeeters.github.io/deeplearning... πŸŽ₯ The recording of the tutorial: us02web.zoom.us/rec/share/Qz...

😃 Accepted #ICASSP papers of Sony CSL Music Team:
Accompaniment Prompt Adherence: A Measure for Evaluating Music Accompaniment Systems (M. Grachten, J. Nistal)
Estimating Musical Surprisal in Audio (M. Bjare, G. Cantisani, S. Lattner and G. Widmer)