Profile avatar
umpedronosapato.bsky.social
AI & Music postdoctoral researcher @c4dm
28 posts 220 followers 295 following
Prolific Poster
Conversation Starter

I love how DiffRhythm keeps changing time signatures à la Dream Theater (ie, seemingly random). The vocals are in a quite deep uncanny valley, but the music sounds super good. And the audio prompting works really well! And all open source! Great job, titans <3 huggingface.co/spaces/ASLP-...

They're out 🤘

Video of @stefanlattner.bsky.social 's talk at DMRN+19 is finally online: "Models of Musical Signals: Representation, Learning & Generation" @c4dm.bsky.social www.youtube.com/watch?v=ixHf...

Great interview with @jascha.sohldickstein.com about diffusion models! This is the first in a series: similar interviews with Yang Song and yours truly will follow soon. (One of these is not like the others -- both of them basically invented the field, and I occasionally write a blog post 🥲)

exitpoints.bandcamp.com/album/you-ar... Grab some albums on bandcamp today, support independent artists and Musicares!

Very excited to share our latest work, the GigaMIDI dataset with > 1.4M files, published at #TISMIR 🤘 It was a huge pleasure to collaborate with such a team of titans transactions.ismir.net/articles/10....

From the 25th February to 4th March 2025, two C4DM researchers will participate at the 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025). More info at: www.c4dm.eecs.qmul.ac.uk/news/2025-02...

this is pricelessly sad and great at the same time 🤘 Courtney LaPlante is such a titan

Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU. It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵 Full Report: assets.publishing.service.gov.uk/media/679a0c... 1/21

Another banger 🤘

Following up on the release of open source models that are shaking the AI status quo: YuE (乐) 🎵 - full music generation - demo: map-yue.github.io - conditioned on lyrics (even does vocal fry and growls 🤘) - Non-commercial license Super impressive and disruptive work! github.com/multimodal-a...

We are proudly engaged in improving transparency both for artists and users relative to the spread of AI generated music on our platform. Based on months of research we're deploying a large scale detector and aim to remove such content from our recommendations: newsroom-deezer.com/2025/01/deez...

📢 Call for contributions: First AES International Conference on Artificial Intelligence and Machine Learning for Audio (AIMLA 2025), London, Sept. 8-10, 2025. More info: aes2.org/contribution... @c4dm.bsky.social

🎉 Follow-up: Thrilled to share that this tutorial has been accepted to #ICLR2025 in the blog posts track!

AI-generated music detection achieved 99.8% accuracy using classifiers trained on real and artificial music. No details on methods or dataset size are provided.

🎶✨ New Paper Announcement! ✨🎶 We present "Improving Musical Accompaniment Co-creation via Diffusion Transformers" 🎹🎸—a study advancing our Diff-A-Riff stem generator through improved quality, efficiency, and control. 📜Read the full paper here: arxiv.org/pdf/2410.23005 🧵👇

First Bsky post, first lab paper of 2025! "On mapping as a technoscientific practice in digital musical instruments" -- a dive on the history and critical implications of mapping theory, with speculation on possible futures. Forthcoming in JNMR: instrumentslab.org/data/andrew/...

Let's go 🎸

Russolo’s intonarumori are in the Guardian. We are so back. www.theguardian.com/music/2025/j...

Help us with our research to: ☝️ - Develop a perceptual similarity metric for audio effects ✌️ - Advance the state of the art in audio effects modelling Take our listening test (<15min): mcomunita.github.io/mushra-front... Use: 💻 + 🎧 🙏

🧑‍🎓 Our #ISMIR Conference Tutorial "Deep Learning 101 for Audio-based MIR" provides a broad introduction to music audio processing, analysis, and generation. 📘 The book and jupyter notebooks: geoffroypeeters.github.io/deeplearning... 🎥 The recording of the tutorial: us02web.zoom.us/rec/share/Qz...

👀

😃 Accepted #ICASSP papers of Sony CSL Music Team: Accompaniment Prompt Adherence: A Measure for Evaluating Music Accompaniment Systems M. Grachten, J. Nistal Estimating Musical Surprisal in Audio M. Bjare, G. Cantisani, S. Lattner and G. Widmer

Weights are out! 🤗 Tokenizing 16kHz speech at very low bitrates. Inference code: github.com/Stability-AI... Model code: github.com/Stability-AI... Model weights: huggingface.co/stabilityai/... arXiv: arxiv.org/abs/2411.19842 Audio demos: stability-ai.github.io/stable-codec...

By far, one of the best things of 2024: www.youtube.com/watch?v=5hTM...

This is such an inspirational insight from Jorge Luis Borges 🙇‍♂️ super relevant today in our quest for more and more and more data

We just released the Helium-1 model , a 2B multi-lingual LLM which @exgrv.bsky.social and @lmazare.bsky.social have been crafting for us! Best model so far under 2.17B params on multi-lingual benchmarks 🇬🇧🇮🇹🇪🇸🇵🇹🇫🇷🇩🇪 On HF, under CC-BY licence: huggingface.co/kyutai/heliu...

What an astonishingly tone deaf take. He misses the entire point of creative practice. Accessibility and mastery are not opposed, they're two ends of the same personal creative journey.

I am organizing a session on "Advancements in Bird Communication Studies" at Forum Acusticum / Euronoise, to be held in Málaga on June 23–26, 2025. Please submit your abstract (max. 200 words) onto the conference portal before January 19th www.fa-euronoise2025.org/abstract-sub...

fixed - ismir2024 papers are accessible now! along with the reviews too, sometimes. ismir2024program.ismir.net

Interesting attempt to build an AI agent-based research assistant to automate machine learning paper writing by acting as PhDs, post-docs, & professors working in a typical lab. It doesn't autonomously produce high-level work but it looks promising as a copilot for researchers cutting cost & effort.

🤖 If you missed Prof. Shalom Lappin's insightful lectures series on the core ideas of his forthcoming book, 'Understanding #AI: Neither Catastrophe nor Redemption', you can catch up and watch the recordings on our Youtube channel: www.youtube.com/@QMEECS/videos

Awesome DMRN @c4dm.bsky.social workshop today, with a great keynote by titan @stefanlattner.bsky.social and looooads of research around guitar 🤘🎸 @jackjamesloth.bsky.social

another banger work by titan @hugofloresgarcia.bsky.social 🤘

The latest Science-in-Parallel episode dropped, in which I talk of this epochal moment in human history (the coming of LLMs), the 2024 NobelPrize for Hinton and Hopfield, and the history of neural networks, besides the writing of WHY MACHINES LEARN. scienceinparallel.org/2024/12/anil...

A deep learning pipeline uses spectrogram masking and the MuseScore API to separate instrument stems from music audio, convert them to MIDI, and transcribe them into sheet music.

The tasks for DCASE challenge 2025 have been announced. dcase.community/articles/cha... Stay tuned for more details.

What? Linear algebra and calculus and machine learning for the holidays! Might a math-y book be a good gift for the holidays? I hope so :-) “A masterpiece.”-Geoff Hinton “A masterful work.”-Melanie Mitchell US www.penguinrandomhouse.com/books/677608... UK www.penguin.co.uk/books/446849...