ADASP at ICASSP 2025: 12 papers & a keynote!

We are excited to share that the ADASP group will be presenting 12 papers and 1 keynote at the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) in Hyderabad on April 6-11th!

Please check out the papers either in ICASSP proccedings or on HAL/Arxiv:

  • Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures
    Alain Riou, Antonin Gagneré, Gaëtan Hadjeres, Stefan Lattner, Geoffroy Peeters
  • Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
    Aurian Quelennec, Pierre Chouteau, Geoffroy Peeters, Slim Essid
  • Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping
    Clémentine Berger, Roland Badeau, Slim Essid
  • Multiple Choice Learning for Efficient Speech Separation with Many Speakers
    David Perera, François Derrida, Théo Mariotte, Gaël Richard, Slim Essid
  • O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization
    Elio Gruttadauria, Mathieu Fontaine, Jonathan Le Roux, Slim Essid
  • Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges
    Geoffroy Peeters, Zafar Rafii, Magdalena Fuentes, Zhiyao Duan, Emmanouil Benetos, Juhan Nam, Yuki Mitsufuji
  • A Hybrid Model for Weakly-Supervised Speech Dereverberation
    Louis Bahrman, Mathieu Fontaine, Gaël Richard
  • F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation
    Manvi Agarwal, Changhong Wang, Gaël Richard
  • AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
    Samir Sadok, Simon Leglaive, Laurent Girin, Gaël Richard, Xavier Alameda-Pineda
  • Contrastive Knowledge Distillation for Embedding Refinement in Personalized Speech Enhancement
    Thomas Serre, Mathieu Fontaine, Éric Benhaim, Slim Essid
  • Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects
    Victor Deng, Changhong Wang, Gaël Richard, Brian McFee
  • Learning Source Disentanglement in Neural Audio Codec
    Xiaoyu Bie, Xubo Liu, Gaël Richard

In addition to the main conference, Prof. Geoffroy Peeters has been invited as a keynote speaker at the Workshop on Indian Music Analysis and Generative Applications (WIMAGA):

  • Self-Supervised Learning for Invariant and Equivariant representations: How can it be applied to Indian music?
    Geoffroy Peeters