Starting 2020 [129 publications]

2024

Conference Articles

  1. Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis
    Morgan Buisson, Brian McFee, Slim Essid
    25th International Society for Music Information Retrieval (ISMIR) (2024), San Francisco (CA), United States, November 2024.
  2. Speech dereverberation constrained on room impulse response characteristics
    Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard
    INTERSPEECH, Kos Island, Greece, September 2024.
  3. RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
    Liam Kelley, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
    INTERSPEECH, Kos International Convention Center, Kos Island, Greece, September 2024.
  4. Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing
    Martin Lebourdais, Théo Mariotte, Antonio Almudévar, Marie Tahon, Alfonso Ortega
    Interspeech 2024, Kos, Greece, September 2024.
  5. Multifrequency Highly Oscillating Aperiodic Amplitude Estimation for Nonlinear Chirp Signal
    Anton Emelchenkov, Mathieu Fontaine, Yves Grenier, Hervé Mahé, François Roueff
    European Signal Processing Conference (EUSIPCO), Lyon, France, August 2024.
  6. Invariance-based layer regularization for sound event detection
    David Perera, Slim Essid, Gaël Richard
    European Signal Processing Conference, Lyon, France, August 2024.
  7. Winner-takes-all learners are geometry-aware conditional density estimators
    Victor Letzelter, David Perera, Cédric Rommel, Mathieu Fontaine, Slim Essid, Gael Richard, Patrick Pérez
    International Conference on Machine Learning, Vienna, Austria, July 2024.
  8. Embodied exploration of deep latent spaces in interactive dance-music performance
    Sarah Nabi, Philippe Esling, Geoffroy Peeters, Frédéric Bevilacqua
    9th International Conference on Movement and Computing (MOCO ’24), Utrecht, Netherlands, May 2024.
  9. Structure-informed Positional Encoding for Music Generation
    Manvi Agarwal, Changhong Wang, Gaël Richard
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
  10. SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
    Teysir Baoueb, Haocheng Liu, Mathieu Fontaine, Jonathan Le Roux, Gael Richard
    IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
  11. Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction
    Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
    ICASSP, Seoul, South Korea, April 2024.
  12. Adapting Pitch-Based Self Supervised Learning Models for Tempo Estimation
    Antonin Gagneré, Slim Essid, Geoffroy Peeters
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
  13. Online Speaker Diarization of Meetings Guided by Speech Separation
    Elio Gruttadauria, Mathieu Fontaine, Slim Essid
    IEEE International Conference on Acoustics, Speech, and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
  14. GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
    Haocheng Liu, Teysir Baoueb, Mathieu Fontaine, Jonathan Le Roux, Gael Richard
    IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
  15. Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing
    Côme Peladeau, Geoffroy Peeters
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
  16. On the Choice of the Optimal Temporal Support for Audio Classification with Pre-trained Embeddings
    Aurian Quelennec, Michel Olvera, Geoffroy Peeters, Slim Essid
    ICASSP, Seoul, South Korea, April 2024.
  17. A fully differentiable model for unsupervised singing voice separation
    Gael Richard, Pierre Chouteau, Bernardo Torres
    IEEE International Conference on Acoustics, Speech, and Signal Processing, Seoul, South Korea, April 2024.
  18. A Lightweight Dual-Stage Framework for Personalized Speech Enhancement Based on DeepFilterNet2
    Thomas Serre, Mathieu Fontaine, Éric Benhaim, Geoffroy Dutour, Slim Essid
    ICASSP, Seoul, South Korea, April 2024. Accepted at ....
  19. Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport
    Bernardo Torres, Geoffroy Peeters, Gaël Richard
    IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted in ....

Journal Articles

  1. Statistical wave field theory
    Roland Badeau
    Journal of the Acoustical Society of America, July 2024.
  2. Absorptive nature of scattering coefficients in stress-energy tensor formalism for room acoustics
    Jean-Dominique Polack, Hugo Dujourdy, Roland Badeau
    Journal of the Acoustical Society of America, April 2024.
  3. Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization
    Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gael Richard, Florence d’Alché-Buc
    IEEE/ACM Transactions on Audio, Speech and Language Processing, January 2024.
  4. Self-Supervised Learning of Multi-level Audio Representations for Music Segmentation
    Morgan Buisson, Brian McFee, Slim Essid, Hélène Crayencour
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024.
  5. Model-Based Deep Learning for Music Information Research
    Gael Richard, Vincent Lostanlen, Yi-Hsuan Yang, Meinard Müller
    IEEE Signal Processing Magazine, 2024.

Technical Reports

  1. Degradation-Invariant Music Indexing
    Rémi Mignot, Geoffroy Peeters
    March 2024.

2023

Conference Articles

  1. Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis
    Victor Letzelter, Mathieu Fontaine, Mickaël Chen, Patrick Pérez, Slim Essid, Gael Richard
    Advances in Neural Information Processing Systems, New Orleans, United States, December 2023.
  2. A Repetition-based Triplet Mining Approach for Music Segmentation
    Morgan Buisson, Brian McFee, Slim Essid, Helene-Camille Crayencour
    International Society for Music Information Retrieval (ISMIR), Milan, Italy, November 2023.
  3. The Hi-Audio Online Platform for Distributed Music Crowdsourcing Database Collection
    Jose Manuel Gil Panal, Aurélien David, Gael Richard
    Late Breaking Demo - International Society for Music Information Retrieval Conference (ISMIR), Milan, Italy, November 2023.
  4. Self-Similarity-Based and Novelty-based loss for music structure analysis
    Geoffroy Peeters
    Conference of the International Society for Music Information Retrieval, Milan, Italy, November 2023.
  5. PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective
    Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
    International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 2023.
  6. Singer Identity Representation Learning using Self-Supervised Techniques
    Bernardo Torres, Stefan Lattner, Gael Richard
    International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 2023.
  7. Transfer Learning and Bias Correction with Pre-trained Audio Embeddings
    Changhong Wang, Gaël Richard, Brian McFee
    The 24th conference of the International Society for Music Information Retrieval (ISMIR), Milan, Italy, November 2023.
  8. Signal Inpainting from Fourier Magnitudes
    Louis Bahrman, Marina Krémé, Paul Magron, Antoine Deleforge
    EUSIPCO 2023, Helsinki, Finland, September 2023.
  9. Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
    Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli
    INTERSPEECH 2023, Dublin, Ireland, August 2023.
  10. Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
    Salah Zaiem, Titouan Parcollet, Slim Essid
    INTERSPEECH 2023, Dublin, Ireland, August 2023.
  11. Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple Cities
    Florian Angulo, Slim Essid, Geoffroy Peeters, Christophe Mietlicki
    ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, June 2023. Copyright 20....
  12. Learning Interpretable Filters in Wav-UNet for Speech Enhancement
    Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023.
  13. Explainable Audio Classification of Playing Techniques with Layer-wise Relevance Propagation
    Changhong Wang, Vincent Lostanlen, Mathieu Lagrange
    2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023.
  14. Fine-tuning strategies for faster inference using speech self-supervised models: a comparative study
    Salah Zaiem, Robin Algayres, Titouan Parcollet, Slim Essid, Mirco Ravanelli
    ICASSP 2023 - International Conference on Acoustics, Speech, and Signal Processing, Rhodes, Greece, June 2023.
  15. One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
    Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière
    IEEE/CVF Conference on Computer Vision and Pattern Recognition - Workshop on Generative Models for Computer Vision, Vancouver, Canada, 2023. Proceedings ....

Journal Articles

  1. Audio Signal Processing in the 21st Century
    Gaël Richard, Paris Smaragdis, Sharon Gannot, Patrick A Naylor, Shoji Makino, Walter Kellermann, Akihiko Sugiyama
    IEEE Signal Processing Magazine, July 2023.
  2. Hi! PARIS: IA et Sciences des données pour la société
    Gael Richard, Nicolas Vieille, Eric Moulines
    Télécom : revue de l’Association Amicale des ingénieurs de l’Ecole Nationale Supérieure des télécommunications, June 2023.
  3. Unsupervised Music Source Separation Using Differentiable Parametric Source Models
    Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement Doire, Roland Badeau
    IEEE/ACM Transactions on Audio, Speech and Language Processing, March 2023.

2022

Conference Articles

  1. Learning Multi-Level Representations for Hierarchical Music Structure Analysis
    Morgan Buisson, Brian McFee, Slim Essid, Helene-Camille Crayencour
    International Society for Music Information Retrieval (ISMIR), Bengaluru, India, December 2022.
  2. Exploiting device and audio data to tag music with User-Aware listening contexts
    Karim M Ibrahim, Elena V. Epure, Geoffroy Peeters, Gael Richard
    International Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India, December 2022.
  3. SSM-Net: Feature Learning for Music Structure Analysis Using a Self-Similarity-Matrix Based Loss
    Geoffroy Peeters, Florian Angulo
    Late-Breaking/Demo Session of ISMIR (International Society for Music Information Retrieval), Bengaluru, India, December 2022.
  4. Latent and Adversarial Data Augmentation for Sound Event Detection and Classification
    David Perera, Slim Essid, Gaël Richard
    International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nancy, France, November 2022.
  5. The absorptive nature of the scattering coefficient in the stress-energy tensor formalism for room acoustics
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau
    24th International Congress on Acoustics (ICA 2022), Gyeongju, South Korea, October 2022.
  6. Scattering at the angles of polyhedral rooms: application of stress-energy tensor conservation in Riemannian spaces
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
    24th International Congress on Acoustics, Gyeongju, South Korea, October 2022.
  7. Apprentissage de bancs de filtres pour la séparation aveugle de sources sonores
    Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
    Colloque Francophone de Traitement du Signal et des Images (GRETSI), Nancy, France, September 2022.
  8. Impact de perturbations internes sur l’entraînement de réseaux profonds pour la détection d’évènements sonores
    David Perera, Slim Essid, Gael Richard
    Colloque Francophone de Traitement du Signal et des Images (GRETSI), Nancy, France, September 2022.
  9. Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
    Salah Zaiem, Titouan Parcollet, Slim Essid
    Interspeech 2022, Incheon, South Korea, September 2022.
  10. FVTD simulation of the acoustics of the Phonocamptic Cave in Noyon
    Hugo Duval, Antoine Thomas, Aidan Meacham, Roland Badeau, Jean-Christophe Valière, Jean-Dominique Polack
    The Acoustics of Ancient Theatres, Verona, Italy, July 2022.
  11. Adapting the EST method to ancient theatres: a proposal
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
    The Acoustics of Ancient Theatres, Verona, Italy, July 2022.
  12. Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms
    Milad Sefidgaran, Amin Gohari, Gael Richard, Umut Şimşekli
    COLT 2022 - 35th Annual Conference on Learning Theory, London, United Kingdom, July 2022.
  13. Opinions in Interactions: New Annotations of the SEMAINE Database
    Valentin Barrière, Chloé Clavel, Slim Essid
    LREC, Marseille, France, June 2022.
  14. End-to-End Speech Recognition from Federated Acoustic Models
    Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro Gusmao, Daniel Beutel, Nicholas Lane
    The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), Singapore, Singapore, May 2022.
  15. Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation
    Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
    ICASSP, Singapore, Singapore, May 2022.
  16. Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
    2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, Singapore, May 2022.
  17. Algorithmes rapides pour la modélisation d’une réponse de salle dont l’atténuation dépend de la fréquence
    Achille Aknin, Roland Badeau
    16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
  18. Confirming dimensional reduction assumptions for the energy-stress tensor through comparison with high-frequency wave-based pressure simulations
    Aidan Meacham, Roland Badeau, Jean-Dominique Polack
    16ème Congrès Français d’Acoustique, CFA2022, Marseille, France, April 2022.
  19. Confirming dimensional reduction assumptions for the energy-stress tensor through comparison with high-frequency wave-based pressure simulations
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau
    16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
  20. Riemannian space tessellation with polyhedral room images
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
    16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
  21. Riemannian space tessellation with polyhedral room images
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
    16ème Congrès Français d’Acoustique, CFA2022, Marseille, France, April 2022.
  22. Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
    Yicheng Du, Aditya Arie Nugraha, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
    INTERSPEECH, Incheon, South Korea, 2022.
  23. Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF
    Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Florence d’Alché-Buc, Gael Richard
    Advances in Neural Information Processing Systems, New Orleans, United States, 2022.
  24. DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
    17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany, 2022.
  25. Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
    Kouhei Sekiguchi, Aditya Arie Nugraha, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
    2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), Kyoto, Japan, 2022.

Journal Articles

  1. The Jazz Ontology: A semantic model and large-scale RDF repositories for jazz
    Polina Proutskova, Daniel Wolff, György Fazekas, Klaus Frieler, Frank Höger, Olga Velichkina, Gabriel Solis, Tillman Weyde, Martin Pfleiderer, Hélène Camille Crayencour, Geoffroy Peeters, Simon Dixon
    Journal of Web Semantics, October 2022.
  2. Pretext Tasks selection for multitask self-supervised speech representation learning
    Salah Zaiem, Titouan Parcollet, Slim Essid, Abdelwahab Heba
    IEEE Journal of Selected Topics in Signal Processing, October 2022.
  3. The Jazz Ontology: A semantic model and large-scale RDF repositories for jazz
    Polina Proutskova, Daniel Wolff, György Fazekas, Klaus Frieler, Frank Höger, Olga Velichkina, Gabriel Solis, Tillman Weyde, Martin Pfleiderer, Hélène Camille Crayencour, Geoffroy Peeters, Simon Dixon
    Journal of Web Semantics, June 2022.
  4. Lyrics segmentation via bimodal text–audio representation
    Michael Fell, Yaroslav Nechaev, Gabriel Meseguer-Brocal, Elena Cabrio, Fabien Gandon, Geoffroy Peeters
    Natural Language Engineering, 2022.
  5. Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
    Mathieu Fontaine, Kouhei Sekiguchi, Aditya Nugraha, Yoshiaki Bando, Kazuyoshi Yoshii
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.
  6. Video-to-Music Recommendation using Temporal Alignment of Segments
    Laure Prétet, Gael Richard, Clément Souchier, Geoffroy Peeters
    IEEE Transactions on Multimedia, 2022.
  7. Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii, Tatsuya Kawahara
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.
  8. Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings
    Christof Weiss, Geoffroy Peeters
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.

2021

Conference Articles

  1. Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks
    Melih Barsbey, Milad Sefidgaran, Murat A Erdogdu, Gael Richard, Umut Şimşekli
    35th Conference on Neural Information Processing Systems (NeurIPS), Online, United States, December 2021.
  2. Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections
    Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli
    35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online, France, December 2021.
  3. DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs
    Javier Nistal Hurlé, Stefan Lattner, Gael Richard
    International Society for Music Information Retrieval, Virtual, France, November 2021.
  4. Is There a “Language of Music-Video Clips”? A Qualitative and Quantitative Study
    Laure Prétet, Gaël Richard, Geoffroy Peeters
    ISMIR, Virtual Event, France, November 2021.
  5. The Words Remain the Same: Cover Detection with Lyrics Transcription
    Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard
    22nd International Society for Music Information Retrieval Conference ISMIR 2021, Online, India, November 2021.
  6. Training Deep Pitch-Class Representations With a Multi-Label CTC Loss
    Christof Weiss, Geoffroy Peeters
    International Society for Music Information Retrieval Conference (ISMIR), Virtual Event, France, November 2021.
  7. On the topic of frequency dependent exponential decay matrices and Lie groups
    Achille Aknin, Roland Badeau
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, United States, October 2021.
  8. User-guided one-shot deep model adaptation for music source separation
    Giorgia Cantisani, Alexey Ozerov, Slim Essid, Gael Richard
    2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, United States, October 2021.
  9. VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
    Javier Nistal Hurlé, Cyran Aouameur, Stefan Lattner, Gael Richard
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2021.
  10. Learning Multi-Pitch Estimation From Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss
    Christof Weiss, Geoffroy Peeters
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, United States, October 2021.
  11. Damped Chirp Mixture Estimation via Nonlinear Bayesian Regression
    Julian Neri, Philippe Depalle, Roland Badeau
    23rd International Conference on Digital Audio Effects (DAFx2020), Vienna, Austria, September 2021.
  12. Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
    Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
    EUSIPCO 2021 - 29th European Signal Processing Conference, Dublin / Virtual, Ireland, August 2021.
  13. Unsupervised Blind Source Separation with Variational Auto-Encoders
    Julian Neri, Roland Badeau, Philippe Depalle
    29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, August 2021.
  14. Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning
    Salah Zaiem, Titouan Parcollet, Slim Essid
    Interspeech 2021, Brno, Czech Republic, August 2021.
  15. Relative Positional Encoding for Transformers with Linear Complexity
    Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang, Gael Richard
    ICML 2021 - 38th International Conference on Machine Learning, Virtual Only, United States, July 2021.
  16. Cross-Modal Music-Video Recommendation: A Study of Design Choices
    Laure Prétet, Gael Richard, Geoffroy Peeters
    Special Session of the International Joint Conference on Neural Networks (IJCNN 2021), Shenzhen, China, July 2021.
  17. Neuro-Steered Music Source Separation with EEG-Based Auditory Attention Decoding and Contrastive-NMF
    Giorgia Cantisani, Slim Essid, Gael Richard
    2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto (virtual conference), Canada, June 2021.
  18. Self-Supervised VQ-VAE for One-Shot Music Style Transfer
    Ondřej Cífka, Alexey Ozerov, Umut Şimşekli, Gael Richard
    ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto / Virtual, Canada, June 2021.
  19. Distributed speech separation in spatially unconstrained microphone arrays
    Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
    ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Toronto / Virtual, Canada, June 2021.
  20. Comparing Representations for Audio Synthesis Using Generative Adversarial Networks
    Javier Nistal Hurlé, Stefan Lattner, Gael Richard
    2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam (virtual), Netherlands, January 2021.
  21. Comparing Representations for Audio Synthesis Using Generative Adversarial Networks
    Gaël Richard, Javier Nistal, Stefan Lattner
    2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam (Virtual), Netherlands, January 2021.

Theses

  1. Personalized audio auto-tagging as proxy for contextual music recommendation
    Karim Magdi Abdelfattah Ibrahim
    December 2021.

Patents

  1. Conversion de la parole par apprentissage statistique avec modélisation complexe des modifications temporelles
    Enguerrand Gentet, Sebastien Denjean, Vincent Roussarie, David Bertrand, Gael Richard
    France, July 2021.

Journal Articles

  1. DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays
    Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021.
  2. Approximate Inference and Learning of State Space Models with Laplace Noise
    Julian Neri, Philippe Depalle, Roland Badeau
    IEEE Transactions on Signal Processing, 2021.
  3. Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
    Kilian Schulze-Forster, Clement S J Doire, Gael Richard, Roland Badeau
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021.

2020

Conference Articles

  1. Auralization of a Hybrid Sound Field using a Wave-Stress Tensor Based Model
    Aidan Meacham, Roland Badeau, Jean-Dominique Polack
    Forum Acusticum, Lyon, France, December 2020.
  2. Extending Deep Rhythm for Tempo and Genre Estimation Using Complex Convolutions, Multitask Learning and Multi-input Network
    Hadrien Foroughmand, Geoffroy Peeters
    The 2020 Joint Conference on AI Music Creativity, Stockholm, Sweden, October 2020.
  3. Should We Consider the Users in Contextual Music Auto-Tagging Models?
    Karim M Ibrahim, Elena V Epure, Geoffroy Peeters, Gael Richard
    21st International Society for Music Information Retrieval Conference, Montreal, Canada, October 2020.
  4. Content Based Singing Voice Source Separation via Strong Conditioning Using Aligned Phonemes
    Gabriel Meseguer-Brocal, Geoffroy Peeters
    21st International Society for Music Information Retrieval Conference, Montréal (virtual), Canada, October 2020.
  5. Multilingual Lyrics-to-Audio Alignment
    Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard, Florence d’Alché-Buc
    International Society for Music Information Retrieval Conference (ISMIR), Montreal, Canada, October 2020.
  6. Evaluation of a Stochastic Reverberation Model Based on the Image Source Principle
    Achille Aknin, Théophile Dupré, Roland Badeau
    International Conference on Digital Audio Effects, Vienna, Austria, September 2020.
  7. DrumGAN: Synthesis of drum sounds with timbral feature conditioning using Generative Adversarial Networks
    Javier Nistal Hurlé, Stefan Lattner, Gael Richard
    21st International Society for Music Information Retrieval Conference (ISMIR), Toronto, Canada, August 2020.
  8. Confidence-based Weighted Loss for Multi-label Classification with Missing Labels
    Karim M Ibrahim, Elena Epure, Geoffroy Peeters, Gael Richard
    The 2020 International Conference on Multimedia Retrieval (ICMR ’20), Dublin, Ireland, June 2020.
  9. A Prototypical Triplet Loss for Cover Detection
    Guillaume Doras, Geoffroy Peeters
    ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
  10. DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays
    Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
    ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020. Submitted to....
  11. Speech Intelligibility Enhancement by Equalization for in-Car Applications
    Enguerrand Gentet, Bertrand David, Sebastien Denjean, Gael Richard, Vincent Roussarie
    ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
  12. Neutral to Lombard Speech Conversion with Deep Learning
    Enguerrand Gentet, Bertrand David, Sebastien Denjean, Gael Richard, Vincent Roussarie
    ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
  13. Audio-Based Auto-Tagging with Contextual Tags for Music
    Karim M Ibrahim, Jimena Royo-Letelier, Elena V. Epure, Geoffroy Peeters, Gael Richard
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
  14. Approximate Bayesian computation with the sliced-Wasserstein distance
    Kimia Nadjahi, Valentin Bortoli, Alain Durmus, Roland Badeau, Umut Şimşekli
    45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
  15. Laplace state space filter with exact inference and moment matching
    Julian Neri, Philippe Depalle, Roland Badeau
    45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
  16. Probabilistic filter and smoother for variational inference of Bayesian linear dynamical systems
    Julian Neri, Roland Badeau, Philippe Depalle
    45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
  17. Learning to Rank Music Tracks Using Triplet Loss
    Laure Prétet, Gael Richard, Geoffroy Peeters
    ICASSP, Barcelona, Spain, May 2020.
  18. Joint phoneme alignment and text-informed speech separation on highly corrupted speech
    Kilian Schulze-Forster, Clément Doire, Gael Richard, Roland Badeau
    45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), Barcelona, Spain, May 2020.
  19. Audio-Based Detection of Explicit Content in Music
    Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard, Florence d’Alché-Buc
    ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
  20. Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
    Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha, Kazuyoshi Yoshii
    Proc. Interspeech 2020, 2020.
  21. Matrix Factorization for High Frequency Non Intrusive Load Monitoring
    Simon Henriet, Benoît Fuentes, Umut Şimşekli, Gael Richard
    BuildSys ’20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Virtual Event, Japan, 2020.
  22. The POTUS Corpus, a database of weekly addresses for the study of stance in politics and virtual agents
    Thomas Janssoone, Kevin Bailly, Gael Richard, Chloé Clavel
    Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
  23. Statistical and Topological Properties of Sliced Probability Divergences
    Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli
    Advances in Neural Information Processing Systems, Online, France, 2020.

Patents

  1. Method and System for Broadcasting a Multichannel Audio Stream to Terminals of Spectators Attending a Sports Event
    Raphael Blouet, Slim Essid
    September 2020.

Journal Articles

  1. Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes
    Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters
    Transactions of the International Society for Music Information Retrieval (TISMIR), June 2020.
  2. Separation of Alpha-Stable Random Vectors
    Mathieu Fontaine, Roland Badeau, Antoine Liutkus
    Signal Processing, January 2020.
  3. Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data
    Ondřej Cífka, Umut Şimşekli, Gael Richard
    IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020.

2015 - 2019 [102 publications]

2019

Conference Articles

  1. Generalized Sliced Wasserstein Distances
    Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Roland Badeau, Gustavo K.
    NeurIPS 2019, Vancouver, Canada, December 2019.
  2. Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance
    Kimia Nadjahi, Alain Durmus, Umut Simsekli, Roland Badeau
    NeurIPS 2019, Vancouver, Canada, December 2019.
  3. First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise
    Thanh Huy Nguyen, Umut Simsekli, Mert Gürbüzbalaban, Gael Richard
    33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada, December 2019.
  4. Supervised Symbolic Music Style Translation Using Synthetic Data
    Ondřej Cífka, Umut Şimşekli, Gael Richard
    20th International Society for Music Information Retrieval Conference (ISMIR), Delft, Netherlands, November 2019.
  5. Tracking Beats and Microtiming in Afro-Latin American Music Using Conditional Random Fields and Deep Learning
    Magdalena Fuentes, Lucas S Maia, Martín Rocamora, Luiz W P Biscainho, Hélène C Crayencour, Slim Essid, Juan P. Bello
    ISMIR, Delft, Netherlands, November 2019.
  6. From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining
    Alexandre Garcia, Pierre Colombo, Slim Essid, Florence d’Alché-Buc, Chloé Clavel
    2019 Conference on Empirical Methods in Natural Language Processing, Hong Kong, China, November 2019.
  7. SAMBASET: A Dataset of Historical Samba de Enredo Recordings for Computational Music Analysis
    Lucas S Maia, Magdalena Fuentes, Luiz W P Biscainho, Martín Rocamora, Slim Essid
    The 20th International Society for Music Information Retrieval Conference, Delft, Netherlands, November 2019.
  8. Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
    Gabriel Meseguer-Brocal, Geoffroy Peeters
    Proceedings of the 20th International Society for Music Information Retrieval Conference, Delft, Netherlands, November 2019.
  9. EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music
    Giorgia Cantisani, Slim Essid, Gael Richard
    2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, United States, October 2019.
  10. Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision
    Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Duong, Patrick Pérez, Gael Richard
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2019.
  11. Weakly informed audio source separation
    Kilian Schulze-Forster, Clément Doire, Gael Richard, Roland Badeau
    WASPAA, New Paltz, New York, United States, October 2019.
  12. MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music
    Giorgia Cantisani, Gabriel Trégoat, Slim Essid, Gael Richard
    Speech, Music and Mind (SMM), Satellite Workshop of Interspeech 2019, Vienna, Austria, September 2019.
  13. Cauchy Multichannel Speech Enhancement with a Deep Speech Prior
    Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, Antoine Liutkus
    EUSIPCO 2019 - 27th European Signal Processing Conference, A Coruña, Spain, September 2019.
  14. Lower Bound on Frequency Validity of Energy-Stress Tensor Based Diffuse Sound Field Model
    Aidan Meacham, Roland Badeau, Jean-Dominique Polack
    ICA 2019, Aachen, Germany, September 2019.
  15. Factorisation Matricielle Semi Non-Négative: Application à la Décomposition de Consommations Electriques
    Simon Henriet, Umut Simsekli, Sérgio F. Santos, Benoît Fuentes, Gael Richard
    Colloque francophone de traitement du signal et des images (GRETSI), Lille, France, August 2019.
  16. Generalized formulation of acoustics
    Jean-Dominique Polack, Aidan Meacham, Roland Badeau
    Congrès Français de Mécanique, Brest, France, August 2019.
  17. Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization
    Thanh Huy Nguyen, Umut Şimşekli, Gael Richard
    International Conference on Machine Learning (ICML), Long Beach, United States, June 2019.
  18. A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning
    Magdalena Fuentes, Brian Mcfee, Helene-Camille Crayencour, Slim Essid, Juan P. Bello
    ICASSP, Brighton, United Kingdom, May 2019.
  19. Singing Voice Separation: A Study on Training Data
    Laure Prétet, Romain Hennequin, Jimena Royo-Letelier, Andrea Vaglio
    ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019.
  20. mirdata: Software for Reproducible Usage of Datasets
    R. M. Bittner, M. Fuentes, D. Rubinstein, A. Jansson, K. Choi, T. Kell
    20th International Society for Music Information Retrieval Conference, 2019.

Journal Articles

  1. Weakly Supervised Representation Learning for Audio-Visual Scene Analysis
    Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gael Richard
    IEEE/ACM Transactions on Audio, Speech and Language Processing, December 2019.
  2. Independent-Variation Matrix Factorization With Application to Energy Disaggregation
    Simon Henriet, Umut Simsekli, Sérgio F. Santos, Benoît Fuentes, Gael Richard
    IEEE Signal Processing Letters, November 2019.
  3. Common mathematical framework for stochastic reverberation models
    Roland Badeau
    Journal of the Acoustical Society of America, April 2019.
  4. De Fourier à la reconnaissance musicale
    Gael Richard, Sebastien Fenet, Yves Grenier
    Interstices, February 2019.
  5. Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction
    Atef Ben Youssef, Chloé Clavel, Slim Essid
    IEEE Transactions on Affective Computing, January 2019.
  6. On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction
    Atef Ben Youssef, Giovanna Varni, Slim Essid, Chloé Clavel
    International Journal of Social Robotics, January 2019.
  7. Audiovisual Analysis of Music Performances: Overview of an Emerging Field
    Zhiyao Duan, Slim Essid, Cynthia Liem, Gael Richard, Gaurav Sharma
    IEEE Signal Processing Magazine, January 2019.

Technical Reports

  1. Stochastic reverberation model for uniform and non-diffuse acoustic fields
    Roland Badeau
    April 2019.
  2. General stochastic reverberation model
    Roland Badeau
    February 2019.

Theses

  1. Processus alpha-stables pour le traitement du signal
    Mathieu Fontaine
    2019.

2018

Patents

  1. Procédé de traitement d’un signal audio et dispositif électronique correspondant, produit-programme lisible par ordinateur non transitoire et support d’informations lisible par ordinateur
    Sanjeel Parekh, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Gael Richard, Slim Essid, Patrick Pérez
    France, October 2018.
  2. Procédé de classification et de localisation d’événements audiovisuels et appareil correspondant, produit-programme lisible par ordinateur et support d’informations lisible par ordinateur
    Quang-Khanh-Ngoc Duong, Alexey Ozerov, Sanjeel Parekh, Slim Essid, Gael Richard, Patrick Pérez
    France, March 2018.
  3. Procédé et Système de Diffusion d’un Flux Audio Multicanal à des terminaux de spectateurs assistant à un événement sportif
    Raphael Blouet, Slim Essid
    March 2018.

Conference Articles

  1. Unified Stochastic Reverberation Modeling
    Roland Badeau
    26th European Signal Processing Conference (EUSIPCO), Rome, Italy, September 2018.
  2. Main Melody Extraction with Source-Filter NMF and CRNN
    Dogac Basaran, Slim Essid, Geoffroy Peeters
    19th International Society for Music Information Retrieval, Paris, France, September 2018.
  3. Analysis of Common Design Choices in Deep Learning Systems for Downbeat Tracking
    Magdalena Fuentes, Brian Mcfee, Hélène C Crayencour, Slim Essid, Juan P Bello
    The 19th International Society for Music Information Retrieval Conference, Paris, France, September 2018.
  4. Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization
    Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid
    2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, September 2018.
  5. Non-linear auto-regressive models for cross-frequency coupling in neural time series
    Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
    BIOMAG, Philadelphia, USA, August 2018.
  6. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
    Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
    LVA/ICA: Latent Variable Analysis and Signal Separation, Surrey, United Kingdom, July 2018.
  7. Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields
    Valentin Barriere, Chloé Clavel, Slim Essid
    ICASSP 2018 - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, April 2018.
  8. Driver estimation in non-linear autoregressive models
    Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
    43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, April 2018.
  9. Optimisation d’un critère d’Intelligibilité de la Parole dans un Contexte Bruité Automobile
    Enguerrand Gentet, Bertrand David, Sébastien Denjean, Gael Richard, Vincent Roussarie
    CFA 2018, Le Havre, France, April 2018.
  10. Alpha-stable low-rank plus residual decomposition for speech enhancement
    Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gael Richard
    ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, April 2018.
  11. Energy Disaggregation for Commercial Buildings: A Statistical Analysis
    Simon Henriet, Umut Simsekli, Gael Richard, Benoît Fuentes
    International Workshop on Non-Intrusive Load Monitoring (NILM2018), Austin, TX, United States, March 2018.
  12. Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events
    Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q K Duong, Patrick Pérez, Gael Richard
    CVPR Workshop, Salt Lake City, United States, 2018.
  13. A Novel Database of Brazilian Rhythmic Instruments and Some Experiments in Computational Rhythm Analysis
    L.S. Maia, P. D. Tomaz Jr., M. Fuentes, M. Rocamora, L. W. P. Biscainho, M. V. M. Costa, S. Cohen
    Audio Engineering Society Latin American Conference, 2018.
  14. An ENF-Based Audio Authenticity Method Robust to MP3 Compression
    P. Zinemanas, M. Fuentes, P. Cancela, J. A. Apolinário Jr.
    Circuits, Systems and Signal Processing, Springer, 2018.

Journal Articles

  1. Student’s t Source and Mixing Models for Multichannel Audio Source Separation
    Simon Leglaive, Roland Badeau, Gael Richard
    IEEE/ACM Transactions on Audio, Speech and Language Processing, June 2018.
  2. Model-based STFT phase recovery for audio source separation
    Paul Magron, Roland Badeau, Bertrand David
    IEEE Transactions on Audio, Speech and Language Processing, June 2018.
  3. Training and Compensation of Class-conditioned NMF Bases for Speech Enhancement
    Hanwook Chung, Roland Badeau, Eric Plourde, Benoît Champagne
    Neurocomputing, 2018.
  4. A Generative Model for Non-Intrusive Load Monitoring in Commercial Buildings
    Simon Henriet, Umut Şimşekli, Benoît Fuentes, Gael Richard
    Energy and Buildings, 2018.

Technical Reports

  1. Research report on unified stochastic reverberation modeling
    Roland Badeau
    February 2018.

2017

Journal Articles

  1. Non-linear auto-regressive models for cross-frequency coupling in neural time series
    Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
    PLoS Computational Biology, December 2017.
  2. SMART : Règles d’associations temporelles de signaux sociaux pour la synthèse d’un Agent Conversationnel Animé avec une attitude spécifique
    Kévin Bailly, Chloé Clavel, Thomas Janssoone, Gael Richard
    Revue des Sciences et Technologies de l’Information - Série RIA : Revue d’Intelligence Artificielle, July 2017.
  3. Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification
    Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
    IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2017.
  4. Règles d’Associations Temporelles de signaux sociaux pour la synthèse d’Agents Conversationnels Animés : Application aux attitudes sociales
    Thomas Janssoone, Chloé Clavel, Kevin Bailly, Gael Richard
    Revue des Sciences et Technologies de l’Information - Série RIA : Revue d’Intelligence Artificielle, 2017.

Conference Articles

  1. UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions
    Atef Ben-Youssef, Chloé Clavel, Slim Essid, Miriam Bilac, Marine Chamoux, Angelica Lim
    The 19th ACM International Conference, Glasgow, United Kingdom, November 2017.
  2. Amplitude and Phase Dereverberation of Harmonic Signals
    Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
  3. Explaining the Parameterized Wiener Filter with Alpha-Stable Processes
    Mathieu Fontaine, Antoine Liutkus, Laurent Girin, Roland Badeau
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
  4. Separating Time-Frequency Sources from Time-Domain Convolutive Mixtures Using Non-negative Matrix Factorization
    Simon Leglaive, Roland Badeau, Gael Richard
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
  5. Lévy NMF for Robust Nonnegative Source Separation
    Paul Magron, Roland Badeau, Antoine Liutkus
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017), New Paltz, NY, United States, October 2017.
  6. Guiding Audio Source Separation by Video Object Information
    Sanjeel Parekh, Slim Essid, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Patrick Pérez, Gael Richard
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
  7. Amplitude and Phase Dereverberation of Harmonic Signals
    Arthur Belhomme, Roland Badeau, Yves Grenier, Éric Humbert
    WASPAA, New Paltz, New York, USA, October 2017.
  8. Séparation de sources audio en milieu réverbérant : Factorisation en matrices non-négatives et représentation temporelle du mélange convolutif
    Simon Leglaive, Roland Badeau, Gael Richard
    Colloque GRETSI, Juan-Les-Pins, France, September 2017.
  9. Lévy NMF : un modèle robuste de séparation de sources non-négatives
    Paul Magron, Roland Badeau, Antoine Liutkus
    Colloque GRETSI, Juan-Les-Pins, France, September 2017.
  10. Histoire de la transformée de Mellin
    Jean-Marie Nicolas, Roland Badeau
    Colloque GRETSI, Juan-Les-Pins, France, September 2017.
  11. Non-linear auto-regressive models for cross-frequency coupling in neural time series
    Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
    C3S, Cologne, Germany, September 2017.
  12. Amplitude and Phase Dereverberation of Monocomponent Signals
    Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
    25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
  13. EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation
    Anne-Claire Conneau, Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid
    2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
  14. Scalable Source Localization with Multichannel Alpha-Stable Distributions
    Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
    25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
  15. Semi-Blind Student’s t Source Separation for Multichannel Audio Convolutive Mixtures
    Simon Leglaive, Roland Badeau, Gael Richard
    25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
  16. Amplitude and Phase Dereverberation of Monocomponent Signals
    Arthur Belhomme, Roland Badeau, Yves Grenier, Éric Humbert
    EUSIPCO, Kos, Greece, August 2017.
  17. Non-linear auto-regressive models for cross-frequency coupling in neural time series
    Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
    OHBM, Vancouver, Canada, June 2017.
  18. Overlapping sound event detection with supervised Nonnegative Matrix Factorization
    Victor Bisot, Slim Essid, Gael Richard
    2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, March 2017.
  19. Parametric estimation of spectrum driven by an exogenous signal
    Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
    42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, LA, United States, March 2017.
  20. Parametric estimation of spectrum driven by an exogenous signal
    Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
    ICASSP, New Orleans, March 2017.
  21. Nonnegative Matrix Factorisation for multimodal data analysis
    Slim Essid
    Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano, Milan, Italy, February 2017.
  22. Parametric models of phase-amplitude coupling in neural time series
    Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
    BASP, Villars-sur-Ollon, Switzerland, January 2017.
  23. EMOEEG: a New Multimodal Dataset for Dynamic EEG-based Emotion Recognition with Audiovisual Elicitation
    Anne-Claire Conneau, Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid
    The European Signal Processing Conference (EUSIPCO), Kos Island, Greece, 2017.
  24. Sketching for nearfield acoustic imaging of heavy-tailed sources
    Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
    International Conference on Latent Variable Analysis and Signal Separation, 2017.

Patents

  1. Procédé et dispositif pour estimer un signal déréverbéré
    Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
    France, May 2017.

2016

Patents

  1. Procédé et dispositif pour estimer la réverbération acoustique
    Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
    France, December 2016.
  2. Dispositif à Casque Audio Perfectionné
    Slim Essid, Raphael Blouet
    November 2016.

Conference Articles

  1. Anechoic phase estimation from reverberant signals
    Arthur Belhomme, Yves Grenier, Roland Badeau, Eric Humbert
    15th International Workshop on Acoustic Signal Enhancement (IWAENC), Xi’an, China, September 2016.
  2. Supervised Nonnegative Matrix Factorization for Acoustic Scene Classification
    Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
    IEEE international evaluation campaign on detection and classification of acoustic scenes and events (DCASE 2016), Budapest, Hungary, September 2016.
  3. Feature Adapted Convolutional Neural Networks for Downbeat Tracking
    Simon Durand, Juan P. Bello, Bertrand David, Gael Richard
    ICASSP 2016, Shanghai, China, September 2016.
  4. Using Temporal Association Rules for the Synthesis of Embodied Conversational Agents with a Specific Stance
    Thomas Janssoone, Chloé Clavel, Kévin Bailly, Gael Richard
    International Conference on Intelligent Virtual Agents, Los Angeles, United States, September 2016.
  5. Downbeat Detection with Conditional Random Fields and Deep Learned Features
    Simon Durand, Slim Essid
    International Society for Music Information Retrieval (ISMIR), New York City, United States, August 2016.
  6. Research on Nonnegative Matrix Factorisation at Telecom ParisTech
    Slim Essid
    Spotify Research Seminar, New York, United States, August 2016.
  7. Analyse et reconnaissance multimodale de signaux sociaux : application à la synthèse d’attitudes sociales d’un agent conversationnel animé
    Thomas Janssoone, Chloé Clavel, Kévin Bailly, Gael Richard
    WACAI, Brest, France, June 2016.
  8. Acoustic scene classification with matrix factorization for unsupervised feature learning
    Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
    ICASSP, Shanghai, China, March 2016.
  9. Formant shifting for speech Intelligibility improvement in car noise environment
    Karan Nathwani, Morgane Daniel, Gael Richard, Bertrand David, Vincent Roussarie
    ICASSP, Shanghai, China, March 2016.
  10. Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification
    Romain Serizel, Slim Essid, Gael Richard
    ICASSP, Shanghai, China, March 2016.
  11. Blind estimation of room acoustic parameters using kernel regression
    Arthur Belhomme, Yves Grenier, Roland Badeau, Eric Humbert
    AES 60th Conference, Leuven, Belgium, February 2016.

Technical Reports

  1. An iterative algorithm for recovering the phase of complex components from their mixture
    Paul Magron, Roland Badeau, Bertrand David
    June 2016.

2015

Conference Articles

  1. Melody Extraction by Contour Classification
    Rachel M Bittner, Justin Salamon, Slim Essid, Juan P Bello
    International Conference on Music Information Retrieval (ISMIR), Malaga, Spain, September 2015.
  2. Multipitch estimation using a PLCA-based model: Impact of partial user annotation
    Camila Andrade Scatolini, Gael Richard, Benoît Fuentes
    ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Australia, April 2015.
  3. A conditional random field system for beat tracking
    Thomas Fillon, C. Joder, Simon Durand, Slim Essid
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, April 2015.
  4. Nonnegative matrix Factorisation for Audiovisual Document Analysis
    Slim Essid
    Séminaire Traitement du Langage Parlé, LIMSI, Orsay, France, 2015.

Technical Reports

  1. Phase reconstruction of spectrograms with linear unwrapping : application to audio signal restoration
    Paul Magron, Roland Badeau, Bertrand David
    April 2015.

Journal Articles

  1. TPT-Dance&Actions : un corpus multimodal d’activités humaines
    Aymeric Masurelle, Ahmed Rida Sekkat, Slim Essid, Gael Richard
    Revue Traitement du Signal (Presse universitaire de Grenoble), April 2015.

Patents

  1. Procédé de suppression de la réverbération tardive d’un signal sonore
    Nicolás López, Yves Grenier, Gael Richard
    France, January 2015.

2010 - 2014 [104 publications]

2014

Journal Articles

  1. Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
    Roland Badeau, Mark D. Plumbley
    IEEE Transactions on Audio, Speech and Language Processing, November 2014.

Conference Articles

  1. Romeo2 Project: Humanoid Robot Assistant and Companion for Everyday Life: I. Situation Assessment for Social Intelligence
    Amit Kumar Pandey, Rodolphe Gelin, Rachid Alami, Renaud Viry, Axel Buendia, Roland Meertens, Mohamed Chetouani, Laurence Devillers, Marie Tahon, David Filliat, Yves Grenier, Mounira Maazaoui, Abderrahmane Kheddar, Frédéric Lerasle, Laurent Fitte-Duval
    AIC: Artificial Intelligence and Cognition, Torino, Italy, November 2014.
  2. Template adaptation for improving automatic music transcription
    Emmanouil Benetos, Roland Badeau, Tillman Weyde, Gael Richard
    ISMIR 2014 The 15th International Society for Music Information Retrieval Conference, Taipei, Taiwan, October 2014.
  3. Controlling the Convergence Rate to Help Parameter Estimation in a PLCA-based Model
    Benoît Fuentes, Roland Badeau, Gael Richard
    EUSIPCO, Lisbon, Portugal, September 2014.
  4. A tutorial on Nonnegative Matrix Factorisation with applications to audiovisual content analysis
    Slim Essid, Alexey Ozerov
    Tutorial at ICME 2014, Chengdu, China, July 2014.
  5. Assessment of new spectral features for EEG-based emotion recognition
    Anne-Claire Conneau, Slim Essid
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.
  6. Enhancing downbeat detection when facing different music styles
    Simon Durand, Bertrand David, Gael Richard
    ICASSP, Florence, Italy, May 2014.
  7. Towards complex matrix decomposition of spectrograms based on the relative phase offsets of harmonic sounds
    Holger Kirchhoff, Roland Badeau, Simon Dixon
    Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014.
  8. Single Channel Reverberation Suppression Based on Sparse Linear Prediction
    Nicolás López, Yves Grenier, Gael Richard, Ivan Bourmeyster
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.
  9. Piecewise constant nonnegative matrix factorization
    N. Seichepine, Slim Essid, C. Fevotte, O. Cappe
    ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014.
  10. Single Channel Reverberation Suppression Based on Sparse Linear Prediction
    Nicolas López, Yves Grenier, Gaël Richard, Ivan Bourmeyster
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.
  11. Informed Audio source Separation
    Gael Richard
    AES International Conference on Semantic Audio, London, United Kingdom, 2014.
  12. Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes
    A. Masurelle, S. Essid, G. Richard
    Proceedings of conference on Acoustics, Speech, and Signal Processing, 2014.

Technical Reports

  1. Proof of Wiener-like linear regression of isotropic complex symmetric alpha-stable random variables
    Roland Badeau, Antoine Liutkus
    September 2014.
  2. Scale-invariant probabilistic latent component analysis
    Romain Hennequin, Bertrand David, Roland Badeau
    March 2014.

2013

Conference Articles

  1. Multimodal Signal Analysis at Telecom ParisTech
    Slim Essid
    Séminaire scientifique de Technicolor R&D, Rennes, France, December 2013.
  2. An Extended Audio-Fingerprint Method with Capabilities for Similar Music Detection
    Sébastien Fenet, Yves Grenier, Gael Richard
    ISMIR, Curitiba, Brazil, November 2013.
  3. Nonnegative Tensor Factorization for Single-Channel EEG Artifact Rejection
    Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid
    IEEE International Workshop on Machine Learning for Signal Processing, Southampton, United Kingdom, September 2013.
  4. Does dereverberation help multichannel blind source separation? A study case
    Nicolás López, Mounira Maazaoui, Yves Grenier, Gael Richard, Ivan Bourmeyster
    European Signal Processing Conference (EUSIPCO), Marrakech, Morocco, September 2013.
  5. Co-factorisation douce en matrices non-négatives. Application au regroupement multimodal de locuteurs
    Nicolas Seichepine, Slim Essid, Cédric Févotte, Olivier Cappé
    GRETSI, Brest, France, September 2013.
  6. Probabilistic dance performance alignment by fusion of multimodal features
    Angelique Dremeau, Slim Essid
    IEEE Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
  7. Soft nonnegative matrix co-factorization with application to multimodal speaker diarization
    N. Seichepine, Slim Essid, C. Fevotte, O. Cappe
    ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
  8. Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)
    Roland Badeau, Angélique Dremeau
    ICASSP, Vancouver, Canada, 2013.
  9. Probabilistic Time-Frequency Source-Filter Decomposition of Non-Stationary Signals
    Roland Badeau, Mark D. Plumbley
    EUSIPCO, Marrakech, Morocco, 2013.
  10. Multichannel HR-NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
    Roland Badeau, Mark D. Plumbley
    WASPAA, New Paltz, New York, United States, 2013.
  11. Fast multilinear SVD for structured tensors and applications to Harmonic analysis and Volterra series
    Remy Boyer, Roland Badeau, Gérard Favier
    Assemblée Générale du GdR ISIS 2013, France, 2013.
  12. Outil d’analyse temps-fréquence multi-résolution appliqué aux signaux audio
    Thomas Fillon, Jacques Prado, Roland Badeau
    Colloque GRETSI 2013, Brest, France, 2013.
  13. Low bitrate informed source separation of realistic mixtures
    Antoine Liutkus, Roland Badeau, Gael Richard
    ICASSP, Vancouver, Canada, 2013.
  14. Débruitage Aveugle par Décompositions Parcimonieuses et Aléatoires
    Manuel Moussallam, Alexandre Gramfort, Gael Richard, Laurent Daudet
    GRETSI, Brest, France, 2013.
  15. Multimodal Classification of Dance Movements using Body Joint Trajectories and Step Sounds
    A. Masurelle, S. Essid, G. Richard
    Proceedings of workshop on Image and Audio Analysis for Multimedia Interactive Services, 2013.

Technical Reports

  1. Estimating an AR Model with Exogenous Driver
    Yves Grenier
    October 2013.
  2. Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
    Roland Badeau, Mark D. Plumbley
    2013.

Journal Articles

  1. Learning Optimal Features for Polyphonic Audio-to-Score Alignment
    Cyril Joder, Slim Essid, Gael Richard
    IEEE Transactions on Audio, Speech and Language Processing, October 2013.
  2. A Multimodal Approach to Speaker Diarization on TV Talk-Shows
    Félicien Vallet, Slim Essid, Jean Carrive
    IEEE Transactions on Multimedia, April 2013.
  3. Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring
    Slim Essid, Cédric Févotte
    IEEE Transactions on Multimedia, February 2013.
  4. Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription
    Benoît Fuentes, Roland Badeau, Gael Richard
    IEEE Transactions on Audio, Speech and Language Processing, 2013.

Patents

  1. Génération d’une Signature d’un Signal Audio Musical
    Sébastien Fenet, Yves Grenier, Gael Richard
    France, February 2013.

2012

Conference Articles

  1. Analysis of dance movements using gaussian processes
    Antoine Liutkus, Angélique Drémeau, Dimitrios Alexiadis, Slim Essid, Petros Daras
    The 20th ACM International Conference, Nara, Japan, October 2012.
  2. Decomposing the video editing structure of a talk-show using nonnegative matrix factorization
    Slim Essid, C. Fevotte
    2012 19th IEEE International Conference on Image Processing (ICIP 2012), Orlando, United States, September 2012.
  3. Low variance blind estimation of the reverberation time
    Nicolás López, Yves Grenier, Gael Richard, Ivan Bourmeyster
    13th International Workshop on Acoustic Signal Enhancement (IWAENC 2012), Aachen, Germany, September 2012.
  4. Low variance blind estimation of the reverberation time
    Nicolas López, Yves Grenier, Gaël Richard, Ivan Bourmeyster
    13th International Workshop on Acoustic Signal Enhancement (IWAENC 2012), Aachen, Germany, September 2012.
  5. A Framework for Fingerprint-Based Detection of Repeating Objects in Multimedia Streams
    Sébastien Fenet, Manuel Moussallam, Yves Grenier, Gael Richard, Laurent Daudet
    EUSIPCO, Bucharest, Romania, August 2012.
  6. A Framework for Fingerprint-Based Detection of Repeating Objects in Multimedia Streams
    Sébastien Fenet, Manuel Moussallam, Yves Grenier, Gaël Richard, Laurent Daudet
    EUSIPCO, Bucharest, Romania, August 2012.
  7. Adaptive blind source separation with HRTFs beamforming preprocessing
    Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
    The seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, United States, June 2012.
  8. Adaptive blind source separation with HRTFs beamforming preprocessing and varying number of sources
    Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
    The seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, New Jersey, United States, June 2012.
  9. Adaptive blind source separation with HRTFs beamforming preprocessing and varying number of sources
    Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
    The seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, New Jersey, USA, June 2012.
  10. From Binaural to Multichannel Blind Source Separation using Fixed Beamforming with HRTFs
    Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
    The 19th International Conference on Systems, Signals and Image Processing, IWSSIP 2012, Austria, April 2012.
  11. From Binaural to Multichannel Blind Source Separation using Fixed Beamforming with HRTFs
    Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
    The 19th International Conference on Systems, Signals and Image Processing, IWSSIP 2012, Vienne, Autriche, April 2012.
  12. AN ADVANCED VIRTUAL DANCE PERFORMANCE EVALUATOR
    Slim Essid, Dimitrios Alexiadis, Robin Tournemenne, Marc Gowing, Philip Kelly, David Monhagan, Petros Daras, Angelique Dremeau, N. E. O’Connor
    IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, March 2012.
  13. A probabilistic approach to simultaneous extraction of beats and downbeats
    Maksim Khadkevich, Thomas Fillon, Gael Richard, Maurizio Omologo
    ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, France, March 2012.
  14. Blind Harmonic Adaptive Decomposition Applied to Supervised Source Separation
    Benoît Fuentes, Roland Badeau, Gael Richard
    20th European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 2012.
  15. Probabilistic model for main melody extraction using constant-Q transform
    Benoît Fuentes, Antoine Liutkus, Roland Badeau, Gael Richard
    37th International Conference on Acoustics, Speech, and Signal Processing ICASSP’12, Kyoto, Japan, 2012.
  16. Adaptive filtering for music/voice separation exploiting the repeating musical structure
    Antoine Liutkus, Zafar Rafii, Roland Badeau, Bryan Pardo, Gael Richard
    37th International Conference on Acoustics, Speech, and Signal Processing ICASSP’12, Kyoto, Japan, 2012.

Journal Articles

  1. A multi-modal dance corpus for research into interaction between humans in virtual environments
    Slim Essid, Marc Gowing, Georgios Kordelas, Anil Aksay, P. Kelly, Thomas Fillon, Qianqian Zhang, Alfred Dielmann, Gael Richard
    Journal on Multimodal User Interfaces, August 2012.
  2. Blind Source Separation for Robot Audition using fixed HRTF beamforming
    Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
    EURASIP Journal on Advances in Signal Processing, March 2012.

2011

Conference Articles

  1. An audio-driven virtual dance-teaching assistant
    Slim Essid, Yves Grenier, Mounira Maazaoui, Gael Richard, Robin Tournemenne
    ACM Multimedia, Scottsdale, Arizona, United States, November 2011.
  2. Enhanced visualisation of dance performance from automatically synchronised multimodal recordings
    Marc Gowing, Xinyu Lin, Qianni Zhang, Philip Kelly, Noel O’Connor, Cyril Concolato, Slim Essid, Jean Lefeuvre, Robin Tournemenne, Ebroul Izquierdo, Vlado Kitanovski
    The 19th ACM international conference, Scottsdale, United States, November 2011.
  3. A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting
    Sébastien Fenet, Gael Richard, Yves Grenier
    ISMIR, Miami, United States, October 2011.
  4. Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment
    Cyril Joder, Slim Essid, Gael Richard
    2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2011.
  5. Une empreinte audio à base de CQT appliquée à la surveillance de flux radiophoniques
    Sébastien Fenet, Yves Grenier, Gael Richard
    GRETSI, Bordeaux, France, September 2011.
  6. Blind Source Separation for Robot Audition using Fixed Beamforming with HRTFs
    Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
    12th Annual Conference of the International Speech Communication Association (Interspeech-2011), Florence, Italy, September 2011.
  7. Frequency Domain Blind Source Separation for Robot Audition Using a Parameterized Sparsity Criterion
    Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
    The European Signal Processing Conference (EUSIPCO-2011), Barcelona, Spain, September 2011.
  8. Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment
    Cyril Joder, Slim Essid, Gael Richard
    ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011.
  9. Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition
    Felix Weninger, Jean-Louis Durrieu, Florian Eyben, Gael Richard, Björn Schuller
    ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011.
  10. Gaussian modeling of mixtures of non-stationary signals in the time-frequency domain (HR-NMF)
    Roland Badeau
    Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, 2011.
  11. Analyse des structures harmoniques dans les signaux audio : modéliser les variations de hauteur et d’enveloppe spectrale
    Benoit Fuentes, Roland Badeau, Gael Richard
    Actes du XXIIIème Colloque GRETSI, Bordeaux, France, 2011.
  12. Adaptive harmonic decomposition using shift-invariant PLCA
    Benoit Fuentes, Roland Badeau, Gael Richard
    Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011.
  13. An Interactive System for Electro-Acoustic Music Analysis
    Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
    ISMIR, Miami, United States, 2011.
  14. Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation
    Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
    AES Conference, Ilmenau, Germany, 2011.
  15. Scale-invariant probabilistic latent component analysis
    Romain Hennequin, Roland Badeau, Bertrand David
    Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, 2011.
  16. Score informed audio source separation using a parametric model of non-negative spectrogram
    Romain Hennequin, Bertrand David, Roland Badeau
    Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011.

Journal Articles

  1. A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching
    Cyril Joder, Slim Essid, Gael Richard
    IEEE Transactions on Audio, Speech and Language Processing, November 2011.
  2. Probabilistic template-based chord recognition
    Laurent Oudre, Cédric Févotte, Yves Grenier
    IEEE Transactions on Audio, Speech and Language Processing, November 2011.
  3. A musically motivated mid-level representation for pitch estimation and musical audio source separation
    Jean-Louis Durrieu, Bertrand David, Gael Richard
    IEEE Journal on Selected Topics in Signal Processing, October 2011.
  4. Décompositions en éléments sonores et applications musicales
    Mathieu Lagrange, Roland Badeau, Bertrand David, Nancy Bertin, Olivier Derrien, Sylvain Marchand, Laurent Daudet
    Traitement du Signal, October 2011.
  5. Signal Processing for Music Analysis
    Meinard Müller, Daniel P.W. Ellis, Anssi Klapuri, Gael Richard
    IEEE Journal of Selected Topics in Signal Processing, October 2011.
  6. Chord recognition by fitting rescaled chroma vectors to chord templates
    Laurent Oudre, Yves Grenier, Cédric Févotte
    IEEE Transactions on Audio, Speech and Language Processing, September 2011.
  7. Greedy sparse decompositions: a comparative study
    Przemyslaw Dymarski, Nicolas Moreau, Gael Richard
    EURASIP Journal on Advances in Signal Processing, 2011.
  8. NMF with time-frequency activations to model non-stationary audio events
    Romain Hennequin, Roland Badeau, Bertrand David
    IEEE Transactions on Audio, Speech and Language Processing, 2011.
  9. Beta-divergence as a subclass of Bregman divergence
    Romain Hennequin, Bertrand David, Roland Badeau
    IEEE Signal Processing Letters, 2011.

2010

Conference Articles

  1. Descripteurs visuels robustes pour l’identification de locuteurs dans des émissions televisées de talk-shows
    Félicien Vallet, Slim Essid, Jean Carrive, Gaël Richard
    Compression et Représentation des Signaux Audiovisuels (CORESA), Lyon, France, October 2010.
  2. A conditional random field viewpoint of symbolic audio-to-score matching
    Cyril Joder, Slim Essid, Gael Richard
    the international conference, Firenze, Italy, October 2010.
  3. Approche hiérarchique pour un alignement musique-sur-partition efficace
    Cyril Joder, Slim Essid, Gael Richard
    Compression et Représentation des Signaux Audiovisuels (CORESA), Lyon, France, October 2010. Prix du meil....
  4. How sparsely can a signal be approximated while keeping its class identity?
    Manuel Moussallam, Thomas Fillon, Gael Richard, Laurent Daudet
    3rd international workshop, Firenze, Italy, October 2010.
  5. Probabilistic framework for template-based chord recognition
    Laurent Oudre, Cédric Févotte, Yves Grenier
    IEEE International Workshop on Multimedia Signal Processing (MMSP), St Malo, France, October 2010.
  6. Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows
    Félicien Vallet, Slim Essid, Jean Carrive, Gael Richard
    2010 17th IEEE International Conference on Image Processing (ICIP 2010), Hong Kong, China, September 2010.
  7. Robust frequency-based Audio Fingerprinting
    Elsa Dupraz, Gael Richard
    2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Dallas, Texas, United States, March 2010.
  8. A comparative study of tonal acoustic features for a symbolic level music-to-score alignment
    Cyril Joder, Slim Essid, Gael Richard
    2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Dallas, Texas, United States, March 2010.
  9. A Multimodal Approach to Initialisation for Top-Down Speaker Diarization of Television Shows
    Simon Bozonnet, Félicien Vallet, Nicholas Evans, Slim Essid, Gael Richard, Jean Carrive
    EUSIPCO, Aalborg, Denmark, 2010.
  10. Time-dependent parametric and harmonic templates in non-negative matrix factorization
    Romain Hennequin, Roland Badeau, Bertrand David
    Proc. of the 13th International Conference on Digital Audio Effects (DAFx), Graz, Austria, 2010.
  11. NMF with time-frequency activations to model non-stationary audio events
    Romain Hennequin, Roland Badeau, Bertrand David
    Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, Texas, United States, 2010.
  12. An Improved Hierarchical Approach for Music-to-Symbolic Score Alignment
    Cyril Joder, Slim Essid, Gael Richard
    ISMIR, Utrecht, Netherlands, 2010.
  13. Robust similarity metrics between audio signals based on asymmetrical spectral envelope matching
    Mathieu Lagrange, Roland Badeau, Gael Richard
    Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, Texas, United States, 2010.
  14. YAAFE, an Easy to Use and Efficient Audio Feature Extraction Software
    Benoît Mathieu, Slim Essid, Thomas Fillon, Jacques Prado, Gael Richard
    ISMIR, Utrecht, Netherlands, 2010.

Patents

  1. Method and device for forming a digital audio mixed signal, method and device for separating signals, and corresponding signal
    Laurent Girin, Antoine Liutkus, Gael Richard, Roland Badeau
    France, October 2010.

Journal Articles

  1. Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals
    Jean-Louis Durrieu, Gael Richard, Bertrand David, Cédric Févotte
    IEEE Transactions on Audio, Speech and Language Processing, March 2010.
  2. Audio signal representations for indexing in the transform domain
    Emmanuel Ravelli, Gael Richard, Laurent Daudet
    IEEE Transactions on Audio, Speech and Language Processing, March 2010.
  3. Explicit Modeling of Temporal Dynamics within Musical Signals for Acoustical Unit Formation and Similarity
    Mathieu Lagrange, Martin Raspaud, Roland Badeau, Gael Richard
    Pattern Recognition Letters, 2010.

2005 - 2009 [87 publications]

2009

Conference Articles

  1. Fast Bayesian constrained NMF for polyphonic pitch transcription
    Nancy Bertin, Emmanuel Vincent, Roland Badeau
    Music Information Retrieval Evaluation eXchange (MIREX), International Society for Music Information Retrieval, Kobe, Japan, October 2009. Article acco....
  2. Fast Bayesian NMF algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription
    Nancy Bertin, Emmanuel Vincent, Roland Badeau
    WASPAA, New Paltz, United States, October 2009.
  3. Template-based chord recognition: influence of the chord types
    Laurent Oudre, Yves Grenier, Cédric Févotte
    International Symposium on Music Information Retrieval (ISMIR), Kobe, Japan, October 2009.
  4. Chord recognition using measures of fit, chord templates and filtering methods
    Laurent Oudre, Yves Grenier, Cédric Févotte
    IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, October 2009.
  5. Interactive Segmentation of Electro-Acoustic Music
    Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
    2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD), Bled, Slovenia, September 2009.
  6. Étude des descripteurs acoustiques pour l’alignement temporel audio-sur-partition musicale
    Cyril Joder, Slim Essid, Gaël Richard
    GRETSI, Dijon, France, September 2009.
  7. Incorporating prior knowledge on the digital media creation process into audio classifiers
    M. Lardeur, Slim Essid, G. Richard, M. Haller, T. Sikora
    ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, April 2009.
  8. A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription
    Nancy Bertin, Cédric Févotte, Roland Badeau
    Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Taipei, Taiwan, 2009.

Journal Articles

  1. Temporal Integration for Audio Classification With Application to Musical Instrument Classification
    Cyril Joder, Slim Essid, Gael Richard
    IEEE Transactions on Audio, Speech and Language Processing, January 2009.
  2. Sympathetic string modes in the concert harp
    Jean-Loic Le Carrou, François Gautier, Roland Badeau
    Acta Acustica united with Acustica, 2009.

Technical Reports

  1. Supporting document for the paper “Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization”
    Roland Badeau, Nancy Bertin, Emmanuel Vincent
    2009.
  2. Adaptive harmonic spectral decomposition for multiple pitch estimation
    Emmanuel Vincent, Nancy Bertin, Roland Badeau
    2009. This technic....

2008

Journal Articles

  1. Union of MDCT Bases for Audio Coding
    Emmanuel Ravelli, Gael Richard, Laurent Daudet
    IEEE Transactions on Audio, Speech and Language Processing, November 2008.
  2. A general framework for second order blind separation of stationary colored sources
    Abdeldjalil Aissa El Bey, Karim Abed-Meraim, Yves Grenier, Yingbo Hua
    Signal Processing, September 2008.
  3. Fear-type emotion recognition for future audio-based surveillance systems
    C. Clavel, I. Vasilescu, L. Devillers, Gael Richard, T. Ehrette
    Speech Communication, May 2008.
  4. Transcription and Separation of Drum Signals From Polyphonic Music
    Olivier Gillet, Gael Richard
    IEEE Transactions on Audio, Speech and Language Processing, March 2008.
  5. Estimation of Frequency for AM/FM Models Using the Phase Vocoder Framework
    Michaël Betser, Patrice Collen, Gael Richard, Bertrand T. David
    IEEE Transactions on Signal Processing, February 2008.
  6. Multilinear Singular Value Decomposition for Structured Tensors
    Roland Badeau, Remy Boyer
    SIAM Journal on Matrix Analysis and Applications, 2008.
  7. Cramér-Rao bounds for multiple poles and coefficients of quasipolynomials in colored noise
    Roland Badeau, Bertrand David, Gael Richard
    IEEE Transactions on Signal Processing, 2008.
  8. Fast and stable YAST algorithm for principal and minor subspace tracking
    Roland Badeau, Gael Richard, Bertrand David
    IEEE Transactions on Signal Processing, 2008.
  9. Performance of ESPRIT for estimating mixtures of complex exponentials modulated by polynomials
    Roland Badeau, Gael Richard, Bertrand David
    IEEE Transactions on Signal Processing, 2008.
  10. Instrument-specific harmonic atoms for mid-level music representation
    Pierre Leveau, Emmanuel Vincent, Gael Richard, Laurent Daudet
    IEEE Transactions on Audio, Speech and Language Processing, 2008.
  11. Audio Indexing
    Gael Richard
    Encyclopedia of Data Warehousing and Mining, 2008.

Conference Articles

  1. Automatic transcription of piano music based on HMM tracking of jointly-estimated pitches
    Valentin Emiya, Roland Badeau, Bertrand David
    2008 Music Information Retrieval Evaluation eXchange (MIREX), Philadelphia, PA, United States, September 2008.
  2. Alignment Kernels for Audio Classification with Application to Music Instrument Recognition
    Cyril Joder, Slim Essid, Gaël Richard
    16th European Signal Processing Conference, Lausanne, Switzerland, August 2008.
  3. On the Robustness of Audio Features for Musical Instrument Classification
    S Wegener, M Haller, J J Burred, T Sikora, Slim Essid, Gael Richard
    16th European Signal Processing Conference, Lausanne, Switzerland, August 2008.
  4. Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription
    Emmanuel Vincent, Nancy Bertin, Roland Badeau
    2008 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, United States, March 2008.
  5. Weighted maximum likelihood autoregressive and moving average spectrum modeling
    Roland Badeau, Bertrand David
    Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, United States, 2008.
  6. A Collaborative Approach to Video Summarization
    Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Daragh Byrne, Hervé Bredin, Noel O’Connor, Gareth JF Jones, Martin Haller, Andreas Krutz, Thomas Sikora, Tomas Piatrik
    SAMT 2008, 3rd International Conference on Semantic and Digital Media Technologies, Koblenz, Germany, 2008.
  7. Rushes Video Summarization using a Collaborative Approach
    Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Herwig Rehatschek, Daragh Byrne, Hervé Bredin, Noel O’Connor, Gareth JF Jones, Alan F Smeaton, Martin Haller, Andreas Krutz, Thomas Sikora, Tomas Piatrik
    TRECVID 2008, ACM International Conference on Multimedia Information Retrieval, Vancouver, Canada, 2008.

2007

Conference Articles

  1. Multipitch detection for piano music: Benchmarking a few approaches
    Bertrand David, Roland Badeau, Nancy Bertin, Valentin Emiya, Gaël Richard
    154th Meeting of the Acoustical Society of America, New Orleans, United States, November 2007.
  2. Listening tests of the localization performance of Stereodipole and Ambisonic systems
    Andrea Capra, Simone Fontana, Fons Adriaensen, Angelo Farina, Yves Grenier
    123rd Convention of the Audio Engineering Society, New York, USA, October 2007.
  3. Multipitch estimation of quasi-harmonic sounds in colored noise
    Valentin Emiya, Roland Badeau, Bertrand David
    10th Int. Conf. on Digital Audio Effects (DAFx-07), Bordeaux, France, September 2007.
  4. Towards Polyphonic Musical Instruments Recognition
    Gael Richard, Pierre Leveau, Laurent Daudet, Slim Essid, Bertrand David
    19th International Congress on Acoustics, Madrid, Spain, September 2007.
  5. Séparation aveugle sous-déterminée de sources en utilisant la décomposition en paquet d’ondelettes
    Abdeldjalil Aissa El Bey, Karim Abed-Meraim, Yves Grenier
    21e Colloque GRETSI sur le traitement du signal et des images, Troyes, France, September 2007.