Starting 2020 [129 publications]
2024
Conference Articles
-
Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis
Morgan Buisson, Brian McFee, Slim Essid
25th International Society for Music Information Retrieval Conference (ISMIR 2024), San Francisco, CA, United States, November 2024.
-
Speech dereverberation constrained on room impulse response characteristics
Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard
INTERSPEECH, Kos Island, Greece, September 2024.
-
RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
Liam Kelley, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
INTERSPEECH, Kos International Convention Center, Kos Island, Greece, September 2024.
-
Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing
Martin Lebourdais, Théo Mariotte, Antonio Almudévar, Marie Tahon, Alfonso Ortega
Interspeech 2024, Kos, Greece, September 2024.
-
Multifrequency Highly Oscillating Aperiodic Amplitude Estimation for Nonlinear Chirp Signal
Anton Emelchenkov, Mathieu Fontaine, Yves Grenier, Hervé Mahé, François Roueff
European Signal Processing Conference (EUSIPCO), Lyon, France, August 2024.
-
Invariance-based layer regularization for sound event detection
David Perera, Slim Essid, Gaël Richard
European Signal Processing Conference, Lyon, France, August 2024.
-
Winner-takes-all learners are geometry-aware conditional density estimators
Victor Letzelter, David Perera, Cédric Rommel, Mathieu Fontaine, Slim Essid, Gael Richard, Patrick Pérez
International Conference on Machine Learning, Vienna, Austria, July 2024.
-
Embodied exploration of deep latent spaces in interactive dance-music performance
Sarah Nabi, Philippe Esling, Geoffroy Peeters, Frédéric Bevilacqua
9th International Conference on Movement and Computing (MOCO ’24), Utrecht, Netherlands, May 2024.
-
Structure-informed Positional Encoding for Music Generation
Manvi Agarwal, Changhong Wang, Gaël Richard
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
-
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Teysir Baoueb, Haocheng Liu, Mathieu Fontaine, Jonathan Le Roux, Gael Richard
IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
-
NEURAL STEERER: NOVEL STEERING VECTOR SYNTHESIS WITH A CAUSAL NEURAL FIELD OVER FREQUENCY AND DIRECTION
Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
ICASSP, Seoul, South Korea, April 2024.
-
Adapting Pitch-Based Self Supervised Learning Models for Tempo Estimation
Antonin Gagneré, Slim Essid, Geoffroy Peeters
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
-
ONLINE SPEAKER DIARIZATION OF MEETINGS GUIDED BY SPEECH SEPARATION
Elio Gruttadauria, Mathieu Fontaine, Slim Essid
IEEE International Conference on Acoustics, Speech, and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
-
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
Haocheng Liu, Teysir Baoueb, Mathieu Fontaine, Jonathan Le Roux, Gael Richard
IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted at ....
-
Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing
Côme Peladeau, Geoffroy Peeters
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, South Korea, April 2024.
-
ON THE CHOICE OF THE OPTIMAL TEMPORAL SUPPORT FOR AUDIO CLASSIFICATION WITH PRE-TRAINED EMBEDDINGS
Aurian Quelennec, Michel Olvera, Geoffroy Peeters, Slim Essid
ICASSP, Seoul, South Korea, April 2024.
-
A fully differentiable model for unsupervised singing voice separation
Gael Richard, Pierre Chouteau, Bernardo Torres
IEEE International Conference on Acoustics, Speech, and Signal Processing, Seoul, South Korea, April 2024.
-
A LIGHTWEIGHT DUAL-STAGE FRAMEWORK FOR PERSONALIZED SPEECH ENHANCEMENT BASED ON DEEPFILTERNET2
Thomas Serre, Mathieu Fontaine, Éric Benhaim, Geoffroy Dutour, Slim Essid
ICASSP, Seoul, South Korea, April 2024. Accepted at ....
-
Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport
Bernardo Torres, Geoffroy Peeters, Gaël Richard
IEEE International Conference on Acoustics, Speech and Signal Processing, Seoul, South Korea, April 2024. Accepted in ....
Journal Articles
-
Statistical wave field theory
Roland Badeau
Journal of the Acoustical Society of America, July 2024.
-
Absorptive nature of scattering coefficients in stress-energy tensor formalism for room acoustics
Jean-Dominique Polack, Hugo Dujourdy, Roland Badeau
Journal of the Acoustical Society of America, April 2024.
-
Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization
Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gael Richard, Florence d’Alché-Buc
IEEE/ACM Transactions on Audio, Speech and Language Processing, January 2024.
-
Self-Supervised Learning of Multi-level Audio Representations for Music Segmentation
Morgan Buisson, Brian McFee, Slim Essid, Hélène Crayencour
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024.
-
Model-Based Deep Learning for Music Information Research
Gael Richard, Vincent Lostanlen, Yi-Hsuan Yang, Meinard Müller
IEEE Signal Processing Magazine, 2024.
2023
Conference Articles
-
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis
Victor Letzelter, Mathieu Fontaine, Mickaël Chen, Patrick Pérez, Slim Essid, Gael Richard
Advances in Neural Information Processing Systems, New Orleans, United States, December 2023.
-
A Repetition-based Triplet Mining Approach for Music Segmentation
Morgan Buisson, Brian McFee, Slim Essid, Helene-Camille Crayencour
International Society for Music Information Retrieval (ISMIR), Milan, Italy, November 2023.
-
THE HI-AUDIO ONLINE PLATFORM FOR DISTRIBUTED MUSIC CROWDSOURCING DATABASE COLLECTION
Jose Manuel Gil Panal, Aurélien David, Gael Richard
Late Breaking Demo - International Society for Music Information Retrieval Conference (ISMIR), Milan, Italy, November 2023.
-
Self-Similarity-Based and Novelty-based loss for music structure analysis
Geoffroy Peeters
Conference of the International Society for Music Information Retrieval, Milan, Italy, November 2023.
-
PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective
Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 2023.
-
Singer Identity Representation Learning using Self-Supervised Techniques
Bernardo Torres, Stefan Lattner, Gael Richard
International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 2023.
-
Transfer Learning and Bias Correction with Pre-trained Audio Embeddings
Changhong Wang, Gaël Richard, Brian McFee
The 24th conference of the International Society for Music Information Retrieval (ISMIR), Milan, Italy, November 2023.
-
Signal Inpainting from Fourier Magnitudes
Louis Bahrman, Marina Krémé, Paul Magron, Antoine Deleforge
EUSIPCO 2023, Helsinki, Finland, September 2023.
-
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli
INTERSPEECH 2023, Dublin, Ireland, August 2023.
-
Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Salah Zaiem, Titouan Parcollet, Slim Essid
INTERSPEECH 2023, Dublin, Ireland, August 2023.
-
Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple Cities
Florian Angulo, Slim Essid, Geoffroy Peeters, Christophe Mietlicki
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, June 2023. Copyright 20....
-
LEARNING INTERPRETABLE FILTERS IN WAV-UNET FOR SPEECH ENHANCEMENT
Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023.
-
Explainable Audio Classification of Playing Techniques with Layer-wise Relevance Propagation
Changhong Wang, Vincent Lostanlen, Mathieu Lagrange
2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, Greece, June 2023.
-
Fine-tuning strategies for faster inference using speech self-supervised models: a comparative study
Salah Zaiem, Robin Algayres, Titouan Parcollet, Slim Essid, Mirco Ravanelli
ICASSP 2023 - International Conference on Acoustics, Speech, and Signal Processing, Rhodes, Greece, June 2023.
-
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière
IEEE/CVF Conference on Computer Vision and Pattern Recognition - Workshop on Generative Models for Computer Vision, Vancouver, Canada, 2023. Proceedings ....
Journal Articles
-
Audio Signal Processing in the 21st Century
Gaël Richard, Paris Smaragdis, Sharon Gannot, Patrick A Naylor, Shoji Makino, Walter Kellermann, Akihiko Sugiyama
IEEE Signal Processing Magazine, July 2023.
-
Hi! PARIS: IA et Sciences des données pour la société
Gael Richard, Nicolas Vieille, Eric Moulines
Télécom : revue de l’Association Amicale des ingénieurs de l’Ecole Nationale Supérieure des télécommunications, June 2023.
-
Unsupervised Music Source Separation Using Differentiable Parametric Source Models
Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement Doire, Roland Badeau
IEEE/ACM Transactions on Audio, Speech and Language Processing, March 2023.
2022
Conference Articles
-
Learning Multi-Level Representations for Hierarchical Music Structure Analysis
Morgan Buisson, Brian McFee, Slim Essid, Helene-Camille Crayencour
International Society for Music Information Retrieval (ISMIR), Bengaluru, India, December 2022.
-
Exploiting device and audio data to tag music with User-Aware listening contexts
Karim M Ibrahim, Elena V. Epure, Geoffroy Peeters, Gael Richard
International Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India, December 2022.
-
SSM-NET: FEATURE LEARNING FOR MUSIC STRUCTURE ANALYSIS USING A SELF-SIMILARITY-MATRIX BASED LOSS
Geoffroy Peeters, Florian Angulo
Late-Breaking/Demo Session of ISMIR (International Society for Music Information Retrieval), Bengaluru, India, December 2022.
-
Latent and Adversarial Data Augmentation for Sound Event Detection and Classification
David Perera, Slim Essid, Gaël Richard
International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nancy, France, November 2022.
-
The absorptive nature of the scattering coefficient in the stress-energy tensor formalism for room acoustics
Jean-Dominique Polack, Aidan Meacham, Roland Badeau
24th International Congress on Acoustics (ICA 2022), Gyeongju, South Korea, October 2022.
-
Scattering at the angles of polyhedral rooms: application of stress-energy tensor conservation in Riemannian spaces
Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
24th International Congress on Acoustics (ICA 2022), Gyeongju, South Korea, October 2022.
-
Apprentissage de bancs de filtres pour la séparation aveugle de sources sonores
Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
Colloque Francophone de Traitement du Signal et des Images (GRETSI), Nancy, France, September 2022.
-
Impact de perturbations internes sur l’entraînement de réseaux profonds pour la détection d’évènements sonores
David Perera, Slim Essid, Gael Richard
Colloque Francophone de Traitement du Signal et des Images (GRETSI), Nancy, France, September 2022.
-
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
Salah Zaiem, Titouan Parcollet, Slim Essid
Interspeech 2022, Incheon, South Korea, September 2022.
-
FVTD simulation of the acoustics of the Phonocamptic Cave in Noyon
Hugo Duval, Antoine Thomas, Aidan Meacham, Roland Badeau, Jean-Christophe Valière, Jean-Dominique Polack
The Acoustics of Ancient Theatres, Verona, Italy, July 2022.
-
Adapting the EST method to ancient theatres: a proposal
Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
The Acoustics of Ancient Theatres, Verona, Italy, July 2022.
-
Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms
Milad Sefidgaran, Amin Gohari, Gael Richard, Umut Şimşekli
COLT 2022 - 35th Annual Conference on Learning Theory, London, United Kingdom, July 2022.
-
Opinions in Interactions: New Annotations of the SEMAINE Database
Valentin Barrière, Chloé Clavel, Slim Essid
LREC, Marseille, France, June 2022.
-
END-TO-END SPEECH RECOGNITION FROM FEDERATED ACOUSTIC MODELS
Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro Gusmao, Daniel Beutel, Nicholas Lane
The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), Singapore, Singapore, May 2022.
-
PHASE SHIFTED BEDROSIAN FILTERBANK: AN INTERPRETABLE AUDIO FRONT-END FOR TIME-DOMAIN AUDIO SOURCE SEPARATION
Félix Mathieu, Thomas Courtat, Gael Richard, Geoffroy Peeters
ICASSP, Singapore, Singapore, May 2022.
-
Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, Singapore, May 2022.
-
Algorithmes rapides pour la modélisation d’une réponse de salle dont l’atténuation dépend de la fréquence
Achille Aknin, Roland Badeau
16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
-
Confirming dimensional reduction assumptions for the energy-stress tensor through comparison with high-frequency wave-based pressure simulations
Aidan Meacham, Roland Badeau, Jean-Dominique Polack
16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
-
Confirming dimensional reduction assumptions for the energy-stress tensor through comparison with high-frequency wave-based pressure simulations
Jean-Dominique Polack, Aidan Meacham, Roland Badeau
16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
-
Riemannian space tessellation with polyhedral room images
Jean-Dominique Polack, Aidan Meacham, Roland Badeau, Jean-Christophe Valière
16e Congrès Français d’Acoustique (CFA 2022), Marseille, France, April 2022.
-
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Yicheng Du, Aditya Arie Nugraha, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
INTERSPEECH, Incheon, South Korea, 2022.
-
Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF
Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Florence d’Alché-Buc, Gael Richard
Advances in Neural Information Processing Systems, New Orleans, United States, 2022.
-
DNN-FREE LOW-LATENCY ADAPTIVE SPEECH ENHANCEMENT BASED ON FRAME-ONLINE BEAMFORMING POWERED BY BLOCK-ONLINE FASTMNMF
Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany, 2022.
-
Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
Kouhei Sekiguchi, Aditya Arie Nugraha, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), Kyoto, Japan, 2022.
Journal Articles
-
The Jazz Ontology: A semantic model and large-scale RDF repositories for jazz
Polina Proutskova, Daniel Wolff, György Fazekas, Klaus Frieler, Frank Höger, Olga Velichkina, Gabriel Solis, Tillman Weyde, Martin Pfleiderer, Hélène Camille Crayencour, Geoffroy Peeters, Simon Dixon
Journal of Web Semantics, October 2022.
-
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem, Titouan Parcollet, Slim Essid, Abdelwahab Heba
IEEE Journal of Selected Topics in Signal Processing, October 2022.
-
The Jazz Ontology: A semantic model and large-scale RDF repositories for jazz
Polina Proutskova, Daniel Wolff, György Fazekas, Klaus Frieler, Frank Höger, Olga Velichkina, Gabriel Solis, Tillman Weyde, Martin Pfleiderer, Hélène Camille Crayencour, Geoffroy Peeters, Simon Dixon
Journal of Web Semantics, June 2022.
-
Lyrics segmentation via bimodal text–audio representation
Michael Fell, Yaroslav Nechaev, Gabriel Meseguer-Brocal, Elena Cabrio, Fabien Gandon, Geoffroy Peeters
Natural Language Engineering, 2022.
-
Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
Mathieu Fontaine, Kouhei Sekiguchi, Aditya Nugraha, Yoshiaki Bando, Kazuyoshi Yoshii
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.
-
Video-to-Music Recommendation using Temporal Alignment of Segments
Laure Prétet, Gael Richard, Clément Souchier, Geoffroy Peeters
IEEE Transactions on Multimedia, 2022.
-
Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii, Tatsuya Kawahara
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.
-
Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings
Christof Weiss, Geoffroy Peeters
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022.
2021
Conference Articles
-
Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks
Melih Barsbey, Milad Sefidgaran, Murat A Erdogdu, Gael Richard, Umut Şimşekli
35th Conference on Neural Information Processing Systems (NeurIPS), Online, United States, December 2021.
-
Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections
Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli
35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online, December 2021.
-
DARKGAN: EXPLOITING KNOWLEDGE DISTILLATION FOR COMPREHENSIBLE AUDIO SYNTHESIS WITH GANS
Javier Nistal Hurlé, Stefan Lattner, Gael Richard
International Society for Music Information Retrieval, Online, November 2021.
-
Is There a “Language of Music-Video Clips”? A Qualitative and Quantitative Study
Laure Prétet, Gaël Richard, Geoffroy Peeters
ISMIR, Virtual Event, November 2021.
-
THE WORDS REMAIN THE SAME: COVER DETECTION WITH LYRICS TRANSCRIPTION
Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard
22nd International Society for Music Information Retrieval Conference (ISMIR 2021), Online, November 2021.
-
Training Deep Pitch-Class Representations With a Multi-Label CTC Loss
Christof Weiss, Geoffroy Peeters
International Society for Music Information Retrieval Conference (ISMIR), Virtual Event, November 2021.
-
On the topic of frequency dependent exponential decay matrices and Lie groups
Achille Aknin, Roland Badeau
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, United States, October 2021.
-
User-guided one-shot deep model adaptation for music source separation
Giorgia Cantisani, Alexey Ozerov, Slim Essid, Gael Richard
2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, United States, October 2021.
-
VQCPC-GAN: VARIABLE-LENGTH ADVERSARIAL AUDIO SYNTHESIS USING VECTOR-QUANTIZED CONTRASTIVE PREDICTIVE CODING
Javier Nistal Hurlé, Cyran Aouameur, Stefan Lattner, Gael Richard
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2021.
-
Learning Multi-Pitch Estimation From Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss
Christof Weiss, Geoffroy Peeters
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, United States, October 2021.
-
Damped Chirp Mixture Estimation via Nonlinear Bayesian Regression
Julian Neri, Philippe Depalle, Roland Badeau
23rd International Conference on Digital Audio Effects (DAFx2020), Vienna, Austria, September 2021.
-
Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
EUSIPCO 2021 - 29th European Signal Processing Conference, Dublin / Virtual, Ireland, August 2021.
-
Unsupervised Blind Source Separation with Variational Auto-Encoders
Julian Neri, Roland Badeau, Philippe Depalle
29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, August 2021.
-
Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning
Salah Zaiem, Titouan Parcollet, Slim Essid
Interspeech 2021, Brno, Czech Republic, August 2021.
-
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang, Gael Richard
ICML 2021 - 38th International Conference on Machine Learning, Virtual Only, United States, July 2021.
-
Cross-Modal Music-Video Recommendation: A Study of Design Choices
Laure Prétet, Gael Richard, Geoffroy Peeters
Special Session of the International Joint Conference on Neural Networks (IJCNN 2021), Shenzhen, China, July 2021.
-
NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF
Giorgia Cantisani, Slim Essid, Gael Richard
2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto (virtual conference), Canada, June 2021.
-
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
Ondřej Cífka, Alexey Ozerov, Umut Şimşekli, Gael Richard
ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto / Virtual, Canada, June 2021.
-
Distributed speech separation in spatially unconstrained microphone arrays
Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Toronto / Virtual, Canada, June 2021.
-
Comparing Representations for Audio Synthesis Using Generative Adversarial Networks
Javier Nistal Hurlé, Stefan Lattner, Gael Richard
2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam (virtual), Netherlands, January 2021.
-
Comparing Representations for Audio Synthesis Using Generative Adversarial Networks
Gaël Richard, Javier Nistal, Stefan Lattner
2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam (Virtual), Netherlands, January 2021.
Theses
-
Personalized audio auto-tagging as proxy for contextual music recommendation
Karim Magdi Abdelfattah Ibrahim
December 2021.
Patents
-
Conversion de la parole par apprentissage statistique avec modélisation complexe des modifications temporelles
Enguerrand Gentet, Sebastien Denjean, Vincent Roussarie, Bertrand David, Gael Richard
France, July 2021.
Journal Articles
-
DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays
Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021.
-
Approximate Inference and Learning of State Space Models with Laplace Noise
Julian Neri, Philippe Depalle, Roland Badeau
IEEE Transactions on Signal Processing, 2021.
-
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
Kilian Schulze-Forster, Clement S J Doire, Gael Richard, Roland Badeau
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021.
2020
Conference Articles
-
Auralization of a Hybrid Sound Field using a Wave-Stress Tensor Based Model
Aidan Meacham, Roland Badeau, Jean-Dominique Polack
Forum Acusticum, Lyon, France, December 2020.
-
Extending Deep Rhythm for Tempo and Genre Estimation Using Complex Convolutions, Multitask Learning and Multi-input Network
Hadrien Foroughmand, Geoffroy Peeters
The 2020 Joint Conference on AI Music Creativity, Stockholm, Sweden, October 2020.
-
SHOULD WE CONSIDER THE USERS IN CONTEXTUAL MUSIC AUTO-TAGGING MODELS?
Karim M Ibrahim, Elena V Epure, Geoffroy Peeters, Gael Richard
21st International Society for Music Information Retrieval Conference, Montreal, Canada, October 2020.
-
CONTENT BASED SINGING VOICE SOURCE SEPARATION VIA STRONG CONDITIONING USING ALIGNED PHONEMES
Gabriel Meseguer-Brocal, Geoffroy Peeters
21st International Society for Music Information Retrieval Conference, Montréal (virtual), Canada, October 2020.
-
MULTILINGUAL LYRICS-TO-AUDIO ALIGNMENT
Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard, Florence d’Alché-Buc
International Society for Music Information Retrieval Conference (ISMIR), Montreal, Canada, October 2020.
-
EVALUATION OF A STOCHASTIC REVERBERATION MODEL BASED ON THE IMAGE SOURCE PRINCIPLE
Achille Aknin, Théophile Dupré, Roland Badeau
International Conference on Digital Audio Effects, Vienna, Austria, September 2020.
-
DrumGAN: Synthesis of drum sounds with timbral feature conditioning using Generative Adversarial Networks
Javier Nistal Hurlé, Stefan Lattner, Gael Richard
21st International Society for Music Information Retrieval Conference (ISMIR), Toronto, Canada, August 2020.
-
Confidence-based Weighted Loss for Multi-label Classification with Missing Labels
Karim M Ibrahim, Elena Epure, Geoffroy Peeters, Gael Richard
The 2020 International Conference on Multimedia Retrieval (ICMR ’20), Dublin, Ireland, June 2020.
-
A Prototypical Triplet Loss for Cover Detection
Guillaume Doras, Geoffroy Peeters
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
-
DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays
Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020. Submitted to....
-
Speech Intelligibility Enhancement by Equalization for in-Car Applications
Enguerrand Gentet, Bertrand David, Sebastien Denjean, Gael Richard, Vincent Roussarie
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
-
Neutral to Lombard Speech Conversion with Deep Learning
Enguerrand Gentet, Bertrand David, Sebastien Denjean, Gael Richard, Vincent Roussarie
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
-
AUDIO-BASED AUTO-TAGGING WITH CONTEXTUAL TAGS FOR MUSIC
Karim M Ibrahim, Jimena Royo-Letelier, Elena V. Epure, Geoffroy Peeters, Gael Richard
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
-
Approximate Bayesian computation with the sliced-Wasserstein distance
Kimia Nadjahi, Valentin Bortoli, Alain Durmus, Roland Badeau, Umut Şimşekli
45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
-
Laplace state space filter with exact inference and moment matching
Julian Neri, Philippe Depalle, Roland Badeau
45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
-
Probabilistic filter and smoother for variational inference of Bayesian linear dynamical systems
Julian Neri, Roland Badeau, Philippe Depalle
45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 2020.
-
LEARNING TO RANK MUSIC TRACKS USING TRIPLET LOSS
Laure Prétet, Gael Richard, Geoffroy Peeters
ICASSP, Barcelona, Spain, May 2020.
-
Joint phoneme alignment and text-informed speech separation on highly corrupted speech
Kilian Schulze-Forster, Clément Doire, Gael Richard, Roland Badeau
45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), Barcelona, Spain, May 2020.
-
Audio-Based Detection of Explicit Content in Music
Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard, Florence d’Alché-Buc
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, May 2020.
-
Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha, Kazuyoshi Yoshii
Proc. Interspeech 2020, 2020.
-
Matrix Factorization for High Frequency Non Intrusive Load Monitoring
Simon Henriet, Benoît Fuentes, Umut Şimşekli, Gael Richard
BuildSys ’20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Virtual Event, Japan, 2020.
-
The POTUS Corpus, a database of weekly addresses for the study of stance in politics and virtual agents
Thomas Janssoone, Kevin Bailly, Gael Richard, Chloé Clavel
Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
-
Statistical and Topological Properties of Sliced Probability Divergences
Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli
Advances in Neural Information Processing Systems, Online, 2020.
Patents
-
Method and System for Broadcasting a Multichannel Audio Stream to Terminals of Spectators Attending a Sports Event
Raphael Blouet, Slim Essid
September 2020.
Journal Articles
-
Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes
Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters
Transactions of the International Society for Music Information Retrieval (TISMIR), June 2020.
-
Separation of Alpha-Stable Random Vectors
Mathieu Fontaine, Roland Badeau, Antoine Liutkus
Signal Processing, January 2020.
-
Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data
Ondřej Cífka, Umut Şimşekli, Gael Richard
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020.
2015 - 2019 [102 publications]
2019
Conference Articles
-
Generalized Sliced Wasserstein Distances
Soheil Kolouri, Kimia Nadjahi, Umut Şimşekli, Roland Badeau, Gustavo K.
NeurIPS 2019, Vancouver, Canada, December 2019.
-
Asymptotic Guarantees for Learning Generative Models with the Sliced-Wasserstein Distance
Kimia Nadjahi, Alain Durmus, Umut Şimşekli, Roland Badeau
NeurIPS 2019, Vancouver, Canada, December 2019.
-
First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise
Thanh Huy Nguyen, Umut Şimşekli, Mert Gürbüzbalaban, Gael Richard
33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada, December 2019.
-
Supervised Symbolic Music Style Translation Using Synthetic Data
Ondřej Cífka, Umut Şimşekli, Gael Richard
20th International Society for Music Information Retrieval Conference (ISMIR), Delft, Netherlands, November 2019.
-
TRACKING BEATS AND MICROTIMING IN AFRO-LATIN AMERICAN MUSIC USING CONDITIONAL RANDOM FIELDS AND DEEP LEARNING
Magdalena Fuentes, Lucas S Maia, Martín Rocamora, Luiz W P Biscainho, Hélène C Crayencour, Slim Essid, Juan P. Bello
ISMIR, Delft, Netherlands, November 2019.
-
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining
Alexandre Garcia, Pierre Colombo, Slim Essid, Florence d’Alché-Buc, Chloé Clavel
2019 Conference on Empirical Methods in Natural Language Processing, Hong Kong, China, November 2019.
-
SAMBASET: A Dataset of Historical Samba de Enredo Recordings for Computational Music Analysis
Lucas S Maia, Magdalena Fuentes, Luiz W P Biscainho, Martín Rocamora, Slim Essid
The 20th International Society for Music Information Retrieval Conference, Delft, Netherlands, November 2019.
-
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
Gabriel Meseguer-Brocal, Geoffroy Peeters
Proceedings of the 20th International Society for Music Information Retrieval Conference, Delft, Netherlands, November 2019.
-
EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music
Giorgia Cantisani, Slim Essid, Gael Richard
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, United States, October 2019.
-
Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision
Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Duong, Patrick Pérez, Gael Richard
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2019.
-
Weakly informed audio source separation
Kilian Schulze-Forster, Clément Doire, Gael Richard, Roland Badeau
WASPAA, New Paltz, New York, United States, October 2019.
-
MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music
Giorgia Cantisani, Gabriel Trégoat, Slim Essid, Gael Richard
Speech, Music and Mind (SMM), Satellite Workshop of Interspeech 2019, Vienna, Austria, September 2019.
-
Cauchy Multichannel Speech Enhancement with a Deep Speech Prior
Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, Antoine Liutkus
EUSIPCO 2019 - 27th European Signal Processing Conference, Coruña, Spain, September 2019.
-
Lower Bound on Frequency Validity of Energy-Stress Tensor Based Diffuse Sound Field Model
Aidan Meacham, Roland Badeau, Jean-Dominique Polack
ICA 2019, Aachen, Germany, September 2019.
-
Factorisation Matricielle Semi Non-Négative : Application à la Décomposition de Consommations Electriques
Simon Henriet, Umut Simsekli, Sérgio F. Santos, Benoît Fuentes, Gael Richard
Colloque francophone de traitement du signal et des images (GRETSI), Lille, France, August 2019.
-
Generalized formulation of acoustics
Jean-Dominique Polack, Aidan Meacham, Roland Badeau
Congrès Français de Mécanique, Brest, France, August 2019.
-
Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization
Thanh Huy Nguyen, Umut Şimşekli, Gael Richard
International Conference on Machine Learning (ICML), Long Beach, United States, June 2019.
-
A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning
Magdalena Fuentes, Brian Mcfee, Helene-Camille Crayencour, Slim Essid, Juan P. Bello
ICASSP, Brighton, United Kingdom, May 2019.
-
Singing Voice Separation: A Study on Training Data
Laure Prétet, Romain Hennequin, Jimena Royo-Letelier, Andrea Vaglio
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019.
-
mirdata: Software for Reproducible Usage of Datasets
R. M. Bittner, M. Fuentes, D. Rubinstein, A. Jansson, K. Choi, T. Kell
20th International Society for Music Information Retrieval Conference, 2019.
Journal Articles
-
Weakly Supervised Representation Learning for Audio-Visual Scene Analysis
Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gael Richard
IEEE/ACM Transactions on Audio, Speech and Language Processing, December 2019.
-
Independent-Variation Matrix Factorization With Application to Energy Disaggregation
Simon Henriet, Umut Simsekli, Sérgio F. Santos, Benoît Fuentes, Gael Richard
IEEE Signal Processing Letters, November 2019.
-
Common mathematical framework for stochastic reverberation models
Roland Badeau
Journal of the Acoustical Society of America, April 2019.
-
De Fourier à la reconnaissance musicale
Gael Richard, Sebastien Fenet, Yves Grenier
Interstices, February 2019.
-
Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction
Atef Ben Youssef, Chloé Clavel, Slim Essid
IEEE Transactions on Affective Computing, January 2019.
-
On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction
Atef Ben Youssef, Giovanna Varni, Slim Essid, Chloé Clavel
International Journal of Social Robotics, January 2019.
-
Audiovisual Analysis of Music Performances: Overview of an Emerging Field
Zhiyao Duan, Slim Essid, Cynthia Liem, Gael Richard, Gaurav Sharma
IEEE Signal Processing Magazine, January 2019.
Technical Reports
-
Stochastic reverberation model for uniform and non-diffuse acoustic fields
Roland Badeau
April 2019.
-
General stochastic reverberation model
Roland Badeau
February 2019.
Theses
-
Processus alpha-stables pour le traitement du signal
Mathieu Fontaine
2019.
2018
Patents
-
Procédé de traitement d’un signal audio et dispositif électronique correspondant, produit-programme lisible par ordinateur non transitoire et support d’informations lisible par ordinateur
Sanjeel Parekh, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Gael Richard, Slim Essid, Patrick Pérez
France, October 2018.
-
Procédé de classification et de localisation d’événements audiovisuels et appareil correspondant, produit-programme lisible par ordinateur et support d’informations lisible par ordinateur
Quang-Khanh-Ngoc Duong, Alexey Ozerov, Sanjeel Parekh, Slim Essid, Gael Richard, Patrick Pérez
France, March 2018.
-
Procédé et Système de Diffusion d’un Flux Audio Multicanal à des terminaux de spectateurs assistant à un événement sportif
Raphael Blouet, Slim Essid
March 2018.
Conference Articles
-
Unified Stochastic Reverberation Modeling
Roland Badeau
26th European Signal Processing Conference (EUSIPCO), Rome, Italy, September 2018.
-
Main Melody Extraction with Source-Filter NMF and CRNN
Dogac Basaran, Slim Essid, Geoffroy Peeters
19th International Society for Music Information Retrieval, Paris, France, September 2018.
-
Analysis of Common Design Choices in Deep Learning Systems for Downbeat Tracking
Magdalena Fuentes, Brian Mcfee, Hélène C Crayencour, Slim Essid, Juan P Bello
The 19th International Society for Music Information Retrieval Conference, Paris, France, September 2018.
-
Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization
Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid
2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, September 2018.
-
Non-linear auto-regressive models for cross-frequency coupling in neural time series
Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
BIOMAG, Philadelphia, United States, August 2018.
-
Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
LVA/ICA: Latent Variable Analysis and Signal Separation, Surrey, United Kingdom, July 2018.
-
Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields
Valentin Barriere, Chloé Clavel, Slim Essid
ICASSP 2018 - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, April 2018.
-
Driver estimation in non-linear autoregressive models
Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, April 2018.
-
Optimisation d’un critère d’Intelligibilité de la Parole dans un Contexte Bruité Automobile
Enguerrand Gentet, Bertrand David, Sébastien Denjean, Gael Richard, Vincent Roussarie
CFA 2018, Le Havre, France, April 2018.
-
Alpha-stable low-rank plus residual decomposition for speech enhancement
Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gael Richard
ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Calgary, Canada, April 2018.
-
Energy Disaggregation for Commercial Buildings: A Statistical Analysis
Simon Henriet, Umut Simsekli, Gael Richard, Benoît Fuentes
International Workshop on Non-Intrusive Load Monitoring (NILM2018), Austin, TX, United States, March 2018.
-
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events
Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q K Duong, Patrick Pérez, Gael Richard
CVPR Workshop, Salt Lake City, United States, 2018.
-
A Novel Database of Brazilian Rhythmic Instruments and Some Experiments in Computational Rhythm Analysis
L.S. Maia, P. D. Tomaz Jr., M. Fuentes, M. Rocamora, L. W. P. Biscainho, M. V. M. Costa, S. Cohen
Audio Engineering Society Latin American Conference, 2018.
-
An ENF-Based Audio Authenticity Method Robust to MP3 Compression
P. Zinemanas, M. Fuentes, P. Cancela, J. A. Apolinário Jr.
Circuits, Systems and Signal Processing, Springer, 2018.
Journal Articles
-
Student’s t Source and Mixing Models for Multichannel Audio Source Separation
Simon Leglaive, Roland Badeau, Gael Richard
IEEE/ACM Transactions on Audio, Speech and Language Processing, June 2018.
-
Model-based STFT phase recovery for audio source separation
Paul Magron, Roland Badeau, Bertrand David
IEEE Transactions on Audio, Speech and Language Processing, June 2018.
-
Training and Compensation of Class-conditioned NMF Bases for Speech Enhancement
Hanwook Chung, Roland Badeau, Eric Plourde, Benoît Champagne
Neurocomputing, 2018.
-
A Generative Model for Non-Intrusive Load Monitoring in Commercial Buildings
Simon Henriet, Umut Şimşekli, Benoît Fuentes, Gael Richard
Energy and Buildings, 2018.
2017
Journal Articles
-
Non-linear auto-regressive models for cross-frequency coupling in neural time series
Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
PLoS Computational Biology, December 2017.
-
SMART : Règles d’associations temporelles de signaux sociaux pour la synthèse d’un Agent Conversationnel Animé avec une attitude spécifique
Kévin Bailly, Chloé Clavel, Thomas Janssoone, Gael Richard
Revue des Sciences et Technologies de l’Information - Série RIA : Revue d’Intelligence Artificielle, July 2017.
-
Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification
Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2017.
-
Règles d’Associations Temporelles de signaux sociaux pour la synthèse d’Agents Conversationnels Animés : Application aux attitudes sociales
Thomas Janssoone, Chloé Clavel, Kévin Bailly, Gael Richard
Revue des Sciences et Technologies de l’Information - Série RIA : Revue d’Intelligence Artificielle, 2017.
Conference Articles
-
UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions
Atef Ben-Youssef, Chloé Clavel, Slim Essid, Miriam Bilac, Marine Chamoux, Angelica Lim
The 19th ACM International Conference, Glasgow, United Kingdom, November 2017.
-
Amplitude and Phase Dereverberation of Harmonic Signals
Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
-
Explaining the Parameterized Wiener Filter with Alpha-Stable Processes
Mathieu Fontaine, Antoine Liutkus, Laurent Girin, Roland Badeau
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
-
Separating Time-Frequency Sources from Time-Domain Convolutive Mixtures Using Non-negative Matrix Factorization
Simon Leglaive, Roland Badeau, Gael Richard
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
-
Lévy NMF for Robust Nonnegative Source Separation
Paul Magron, Roland Badeau, Antoine Liutkus
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017), New Paltz, NY, United States, October 2017.
-
Guiding Audio Source Separation by Video Object Information
Sanjeel Parekh, Slim Essid, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Patrick Perez, Gael Richard
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, October 2017.
-
Séparation de sources audio en milieu réverbérant : Factorisation en matrices non-négatives et représentation temporelle du mélange convolutif
Simon Leglaive, Roland Badeau, Gael Richard
Colloque GRETSI, Juan-Les-Pins, France, September 2017.
-
Lévy NMF : un modèle robuste de séparation de sources non-négatives
Paul Magron, Roland Badeau, Antoine Liutkus
Colloque GRETSI, Juan-Les-Pins, France, September 2017.
-
Histoire de la transformée de Mellin
Jean-Marie Nicolas, Roland Badeau
Colloque GRETSI, Juan-Les-Pins, France, September 2017.
-
Non-linear auto-regressive models for cross-frequency coupling in neural time series
Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
C3S, Cologne, Germany, September 2017.
-
Amplitude and Phase Dereverberation of Monocomponent Signals
Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
-
EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation
Anne-Claire Conneau, Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid
2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
-
Scalable Source Localization with Multichannel Alpha-Stable Distributions
Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
-
Semi-Blind Student’s t Source Separation for Multichannel Audio Convolutive Mixtures
Simon Leglaive, Roland Badeau, Gael Richard
25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 2017.
-
Non-linear auto-regressive models for cross-frequency coupling in neural time series
Tom Dupré La Tour, Lucille Tallot, Laetitia Grabot, Valérie Doyère, Virginie Van Wassenhove, Yves Grenier, Alexandre Gramfort
OHBM, Vancouver, Canada, June 2017.
-
Overlapping sound event detection with supervised Nonnegative Matrix Factorization
Victor Bisot, Slim Essid, Gael Richard
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, March 2017.
-
Parametric estimation of spectrum driven by an exogenous signal
Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, LA, United States, March 2017.
-
Nonnegative Matrix Factorisation for multimodal data analysis
Slim Essid
Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano, Milan, Italy, February 2017.
-
Parametric models of phase-amplitude coupling in neural time series
Tom Dupré La Tour, Yves Grenier, Alexandre Gramfort
BASP, Villars-sur-Ollon, Switzerland, January 2017.
-
Sketching for nearfield acoustic imaging of heavy-tailed sources
Mathieu Fontaine, Charles Vanwynsberghe, Antoine Liutkus, Roland Badeau
International Conference on Latent Variable Analysis and Signal Separation, 2017.
Patents
-
Procédé et dispositif pour estimer un signal déréverbéré
Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
France, May 2017.
2016
Patents
-
Procédé et dispositif pour estimer la réverbération acoustique
Arthur Belhomme, Roland Badeau, Yves Grenier, Eric Humbert
France, December 2016.
-
Dispositif à Casque Audio Perfectionné
Slim Essid, Raphael Blouet
November 2016.
Conference Articles
-
Anechoic phase estimation from reverberant signals
Arthur Belhomme, Yves Grenier, Roland Badeau, Eric Humbert
15th International Workshop on Acoustic Signal Enhancement (IWAENC), Xi’an, China, September 2016.
-
Supervised Nonnegative Matrix Factorization for Acoustic Scene Classification
Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
IEEE international evaluation campaign on detection and classification of acoustic scenes and events (DCASE 2016), Budapest, Hungary, September 2016.
-
Feature Adapted Convolutional Neural Networks for Downbeat Tracking
Simon Durand, Juan P. Bello, Bertrand David, Gael Richard
ICASSP 2016, Shanghai, China, March 2016.
-
Using Temporal Association Rules for the Synthesis of Embodied Conversational Agent with a Specific Stance
Thomas Janssoone, Chloé Clavel, Kévin Bailly, Gael Richard
International Conference on Intelligent Virtual Agents, Los Angeles, United States, September 2016.
-
Downbeat Detection with Conditional Random Fields and Deep Learned Features
Simon Durand, Slim Essid
International Society for Music Information Retrieval (ISMIR), New York City, United States, August 2016.
-
Research on Nonnegative Matrix Factorisation at Telecom ParisTech
Slim Essid
Spotify Research Seminar, New York, United States, August 2016.
-
Analyse et reconnaissance multimodale de signaux sociaux : application à la synthèse d’attitudes sociales d’un agent conversationnel animé
Thomas Janssoone, Chloé Clavel, Kévin Bailly, Gael Richard
WACAI, Brest, France, June 2016.
-
Acoustic scene classification with matrix factorization for unsupervised feature learning
Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
ICASSP, Shanghai, China, March 2016.
-
Formant shifting for speech Intelligibility improvement in car noise environment
Karan Nathwani, Morgane Daniel, Gael Richard, Bertrand David, Vincent Roussarie
ICASSP, Shanghai, China, March 2016.
-
Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification
Romain Serizel, Slim Essid, Gael Richard
ICASSP, Shanghai, China, March 2016.
-
Blind estimation of room acoustic parameters using kernel regression
Arthur Belhomme, Yves Grenier, Roland Badeau, Eric Humbert
AES 60th Conference, Leuven, Belgium, February 2016.
Technical Reports
-
An iterative algorithm for recovering the phase of complex components from their mixture
Paul Magron, Roland Badeau, Bertrand David
June 2016.
2015
Conference Articles
-
Melody Extraction by Contour Classification
Rachel M Bittner, Justin Salamon, Slim Essid, Juan P Bello
International Conference on Music Information Retrieval (ISMIR), Malaga, Spain, September 2015.
-
Multipitch estimation using a PLCA-based model: Impact of partial user annotation
Camila Andrade Scatolini, Gael Richard, Benoît Fuentes
ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Australia, April 2015.
-
A conditional random field system for beat tracking
Thomas Fillon, C. Joder, Simon Durand, Slim Essid
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, April 2015.
-
Nonnegative matrix Factorisation for Audiovisual Document Analysis
Slim Essid
Séminaire Traitement du Langage Parlé, LIMSI, Orsay, France, 2015.
Technical Reports
-
Phase reconstruction of spectrograms with linear unwrapping : application to audio signal restoration
Paul Magron, Roland Badeau, Bertrand David
April 2015.
Journal Articles
-
TPT-Dance&Actions : un corpus multimodal d’activités humaines
Aymeric Masurelle, Ahmed Rida Sekkat, Slim Essid, Gael Richard
Revue Traitement du Signal (Presse universitaire de Grenoble), April 2015.
Patents
-
Procédé de suppression de la réverbération tardive d’un signal sonore
Nicolás López, Yves Grenier, Gael Richard
France, January 2015.
2010 - 2014 [104 publications]
2014
Journal Articles
-
Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
Roland Badeau, Mark D. Plumbley
IEEE Transactions on Audio, Speech and Language Processing, November 2014.
Conference Articles
-
Romeo2 Project: Humanoid Robot Assistant and Companion for Everyday Life: I. Situation Assessment for Social Intelligence
Amit Kumar Pandey, Rodolphe Gelin, Rachid Alami, Renaud Viry, Axel Buendia, Roland Meertens, Mohamed Chetouani, Laurence Devillers, Marie Tahon, David Filliat, Yves Grenier, Mounira Maazaoui, Abderrahmane Kheddar, Frédéric Lerasle, Laurent Fitte-Duval
AIC: Artificial Intelligence and Cognition, Torino, Italy, November 2014.
-
Template adaptation for improving automatic music transcription
Emmanouil Benetos, Roland Badeau, Tillman Weyde, Gael Richard
ISMIR 2014 The 15th International Society for Music Information Retrieval Conference, Taipei, Taiwan, October 2014.
-
Controlling the Convergence Rate to Help Parameter Estimation in a PLCA-based Model
Benoît Fuentes, Roland Badeau, Gael Richard
EUSIPCO, Lisbon, Portugal, September 2014.
-
A tutorial on Nonnegative Matrix Factorisation with applications to audiovisual content analysis
Slim Essid, Alexey Ozerov
Tutorial at ICME 2014, Chengdu, China, July 2014.
-
Assessment of new spectral features for EEG-based emotion recognition
Anne-Claire Conneau, Slim Essid
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.
-
Enhancing downbeat detection when facing different music styles
Simon Durand, Bertrand David, Gael Richard
ICASSP, Florence, Italy, May 2014.
-
Towards complex matrix decomposition of spectrograms based on the relative phase offsets of harmonic sounds
Holger Kirchhoff, Roland Badeau, Simon Dixon
Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014.
-
Single Channel Reverberation Suppression Based on Sparse Linear Prediction
Nicolás López, Yves Grenier, Gael Richard, Ivan Bourmeyster
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014.
-
Piecewise constant nonnegative matrix factorization
N. Seichepine, Slim Essid, C. Fevotte, O. Cappe
ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014.
-
Informed Audio source Separation
Gael Richard
AES International Conference on Semantic Audio, London, United Kingdom, 2014.
-
Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes
A. Masurelle, S. Essid, G. Richard
Proceedings of conference on Acoustics, Speech, and Signal Processing, 2014.
Technical Reports
-
Proof of Wiener-like linear regression of isotropic complex symmetric alpha-stable random variables
Roland Badeau, Antoine Liutkus
September 2014.
-
Scale-invariant probabilistic latent component analysis
Romain Hennequin, Bertrand David, Roland Badeau
March 2014.
2013
Conference Articles
-
Multimodal Signal Analysis at Telecom ParisTech
Slim Essid
Séminaire scientifique de Technicolor R&D, Rennes, France, December 2013.
-
An Extended Audio-Fingerprint Method with Capabilities for Similar Music Detection
Sébastien Fenet, Yves Grenier, Gael Richard
ISMIR, Curitiba, Brazil, November 2013.
-
Nonnegative Tensor Factorization for Single-Channel EEG Artifact Rejection
Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid
IEEE International Workshop on Machine Learning for Signal Processing, Southampton, United Kingdom, September 2013.
-
Does dereverberation help multichannel blind source separation? A study case
Nicolás López, Mounira Maazaoui, Yves Grenier, Gael Richard, Ivan Bourmeyster
European Signal Processing Conference (EUSIPCO), Marrakech, Morocco, September 2013.
-
Co-factorisation douce en matrices non-négatives. Application au regroupement multimodal de locuteurs
Nicolas Seichepine, Slim Essid, Cédric Févotte, Olivier Cappé
GRETSI, Brest, France, September 2013.
-
Probabilistic dance performance alignment by fusion of multimodal features
Angelique Dremeau, Slim Essid
IEEE Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
-
Soft nonnegative matrix co-factorization with application to multimodal speaker diarization
N. Seichepine, Slim Essid, C. Fevotte, O. Cappe
ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, May 2013.
-
Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)
Roland Badeau, Angélique Dremeau
ICASSP, Vancouver, Canada, 2013.
-
Probabilistic Time-Frequency Source-Filter Decomposition of Non-Stationary Signals
Roland Badeau, Mark D. Plumbley
EUSIPCO, Marrakech, Morocco, 2013.
-
Multichannel HR-NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
Roland Badeau, Mark D. Plumbley
WASPAA, New Paltz, New York, United States, 2013.
-
Fast multilinear SVD for structured tensors and applications to Harmonic analysis and Volterra series
Remy Boyer, Roland Badeau, Gérard Favier
Assemblée Générale du GdR ISIS 2013, France, 2013.
-
Outil d’analyse temps-fréquence multi-résolution appliqué aux signaux audio
Thomas Fillon, Jacques Prado, Roland Badeau
Colloque GRETSI 2013, Brest, France, 2013.
-
Low bitrate informed source separation of realistic mixtures
Antoine Liutkus, Roland Badeau, Gael Richard
ICASSP, Vancouver, Canada, 2013.
-
Débruitage Aveugle par Décompositions Parcimonieuses et Aléatoires
Manuel Moussallam, Alexandre Gramfort, Gael Richard, Laurent Daudet
GRETSI, Brest, France, 2013.
-
Multimodal Classification of Dance Movements using Body Joint Trajectories and Step Sounds
A. Masurelle, S. Essid, G. Richard
Proceedings of workshop on Image and Audio Analysis for Multimedia Interactive Services, 2013.
Technical Reports
-
Estimating an AR Model with Exogenous Driver
Yves Grenier
October 2013.
-
Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
Roland Badeau, Mark D. Plumbley
2013.
Journal Articles
-
Learning Optimal Features for Polyphonic Audio-to-Score Alignment
Cyril Joder, Slim Essid, Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, October 2013.
-
A Multimodal Approach to Speaker Diarization on TV Talk-Shows
Félicien Vallet, Slim Essid, Jean Carrive
IEEE Transactions on Multimedia, April 2013.
-
Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring
Slim Essid, Cédric Févotte
IEEE Transactions on Multimedia, February 2013.
-
Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription
Benoît Fuentes, Roland Badeau, Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, 2013.
Patents
-
Génération d’une Signature d’un Signal Audio Musical
Sébastien Fenet, Yves Grenier, Gael Richard
France, February 2013.
2012
Conference Articles
-
Analysis of dance movements using gaussian processes
Antoine Liutkus, Angélique Drémeau, Dimitrios Alexiadis, Slim Essid, Petros Daras
The 20th ACM international conference, Nara, Japan, October 2012.
-
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization
Slim Essid, C. Fevotte
2012 19th IEEE International Conference on Image Processing (ICIP 2012), Orlando, United States, September 2012.
-
Low variance blind estimation of the reverberation time
Nicolás López, Yves Grenier, Gael Richard, Ivan Bourmeyster
13th International Workshop on Acoustic Signal Enhancement (IWAENC 2012), Aachen, Germany, September 2012.
-
A Framework for Fingerprint-Based Detection of Repeating Objects in Multimedia Streams
Sébastien Fenet, Manuel Moussallam, Yves Grenier, Gael Richard, Laurent Daudet
EUSIPCO, Bucharest, Romania, August 2012.
-
Adaptive blind source separation with HRTFs beamforming preprocessing
Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
The seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, United States, June 2012.
-
Adaptive blind source separation with HRTFs beamforming preprocessing and varying number of sources
Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
The seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, New Jersey, United States, June 2012.
-
From Binaural to Multichannel Blind Source Separation using Fixed Beamforming with HRTFs
Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
The 19th International Conference on Systems, Signals and Image Processing, IWSSIP 2012, Vienna, Austria, April 2012.
-
An Advanced Virtual Dance Performance Evaluator
Slim Essid, Dimitrios Alexiadis, Robin Tournemenne, Marc Gowing, Philip Kelly, David Monaghan, Petros Daras, Angelique Dremeau, N. E. O’Connor
IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, March 2012.
-
A probabilistic approach to simultaneous extraction of beats and downbeats
Maksim Khadkevich, Thomas Fillon, Gael Richard, Maurizio Omologo
ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, March 2012.
-
Blind Harmonic Adaptive Decomposition Applied to Supervised Source Separation
Benoît Fuentes, Roland Badeau, Gael Richard
20th European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 2012.
-
Probabilistic model for main melody extraction using constant-Q transform
Benoît Fuentes, Antoine Liutkus, Roland Badeau, Gael Richard
37th International Conference on Acoustics, Speech, and Signal Processing ICASSP’12, Kyoto, Japan, 2012.
-
Adaptive filtering for music/voice separation exploiting the repeating musical structure
Antoine Liutkus, Zafar Rafii, Roland Badeau, Bryan Pardo, Gael Richard
37th International Conference on Acoustics, Speech, and Signal Processing ICASSP’12, Kyoto, Japan, 2012.
Journal Articles
-
A multi-modal dance corpus for research into interaction between humans in virtual environments
Slim Essid, Marc Gowing, Georgios Kordelas, Anil Aksay, P. Kelly, Thomas Fillon, Qianqian Zhang, Alfred Dielmann, Gael Richard
Journal on Multimodal User Interfaces, August 2012.
-
Blind Source Separation for Robot Audition using fixed HRTF beamforming
Mounira Maazaoui, Karim Abed-Meraim, Yves Grenier
EURASIP Journal on Advances in Signal Processing, March 2012.
2011
Conference Articles
-
An audio-driven virtual dance-teaching assistant
Slim Essid, Yves Grenier, Mounira Maazaoui, Gael Richard, Robin Tournemenne
The 19th ACM international conference, Scottsdale, United States, November 2011.
-
Enhanced visualisation of dance performance from automatically synchronised multimodal recordings
Marc Gowing, Xinyu Lin, Qianni Zhang, Philip Kelly, Noel O’Connor, Cyril Concolato, Slim Essid, Jean Lefeuvre, Robin Tournemenne, Ebroul Izquierdo, Vlado Kitanovski
The 19th ACM international conference, Scottsdale, United States, November 2011.
-
An audio-driven virtual dance-teaching assistant
Slim Essid, Yves Grenier, Mounira Maazaoui, Gaël Richard, Robin Tournemenne
ACM Multimedia, Scottsdale, Arizona, USA, November 2011.
-
A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting
Sébastien Fenet, Gael Richard, Yves Grenier
ISMIR, Miami, United States, October 2011.
-
Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment
Cyril Joder, Slim Essid, Gael Richard
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, United States, October 2011.
-
Une empreinte audio à base de CQT appliquée à la surveillance de flux radiophoniques
Sébastien Fenet, Yves Grenier, Gael Richard
GRETSI, Bordeaux, France, September 2011.
-
Blind Source Separation for Robot Audition using Fixed Beamforming with HRTFs
Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
12th Annual Conference of the International Speech Communication Association (Interspeech-2011), Florence, Italy, September 2011.
-
Frequency Domain Blind Source Separation for Robot Audition Using a Parameterized Sparsity Criterion
Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
The European Signal Processing Conference (EUSIPCO-2011), Barcelona, Spain, September 2011.
-
Frequency Domain Blind Source Separation for Robot Audition Using a Parameterized Sparsity Criterion
Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
The European Signal Processing Conference (EUSIPCO-2011), Spain, August 2011.
-
Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment
Cyril Joder, Slim Essid, Gael Richard
ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011.
-
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition
Felix Weninger, Jean-Louis Durrieu, Florian Eyben, Gael Richard, Björn Schuller
ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011.
-
Gaussian modeling of mixtures of non-stationary signals in the time-frequency domain (HR-NMF)
Roland Badeau
Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, 2011.
-
Analyse des structures harmoniques dans les signaux audio : modéliser les variations de hauteur et d’enveloppe spectrale
Benoit Fuentes, Roland Badeau, Gael Richard
Actes du XXIIIème Colloque GRETSI, Bordeaux, France, 2011.
-
Adaptive harmonic decomposition using shift-invariant PLCA
Benoit Fuentes, Roland Badeau, Gael Richard
Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011.
-
An interactive system for electro-acoustic music analysis
Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
ISMIR, Miami, United States, 2011.
-
Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation
Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
AES Conference, Ilmenau, Germany, 2011.
-
Scale-invariant probabilistic latent component analysis
Romain Hennequin, Roland Badeau, Bertrand David
Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, United States, 2011.
-
Score informed audio source separation using a parametric model of non-negative spectrogram
Romain Hennequin, Bertrand David, Roland Badeau
Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011.
Journal Articles
-
A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching
Cyril Joder, Slim Essid, Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, November 2011.
-
Probabilistic template-based chord recognition
Laurent Oudre, Cédric Févotte, Yves Grenier
IEEE Transactions on Audio, Speech and Language Processing, November 2011.
-
A musically motivated mid-level representation for pitch estimation and musical audio source separation
Jean-Louis Durrieu, Bertrand David, Gael Richard
IEEE Journal on Selected Topics in Signal Processing, October 2011.
-
Décompositions en éléments sonores et applications musicales
Mathieu Lagrange, Roland Badeau, Bertrand David, Nancy Bertin, Olivier Derrien, Sylvain Marchand, Laurent Daudet
Traitement du Signal, October 2011.
-
Signal Processing for Music Analysis
Meinard Müller, Daniel P.W. Ellis, Anssi Klapuri, Gael Richard
IEEE Journal of Selected Topics in Signal Processing, October 2011.
-
Chord recognition by fitting rescaled chroma vectors to chord templates
Laurent Oudre, Yves Grenier, Cédric Févotte
IEEE Transactions on Audio, Speech and Language Processing, September 2011.
-
Greedy sparse decompositions: a comparative study
Przemyslaw Dymarski, Nicolas Moreau, Gael Richard
EURASIP Journal on Advances in Signal Processing, 2011.
-
NMF with time-frequency activations to model non-stationary audio events
Romain Hennequin, Roland Badeau, Bertrand David
IEEE Transactions on Audio, Speech and Language Processing, 2011.
-
Beta-divergence as a subclass of Bregman divergence
Romain Hennequin, Bertrand David, Roland Badeau
IEEE Signal Processing Letters, 2011.
2010
Conference Articles
-
Descripteurs visuels robustes pour l’identification de locuteurs dans des émissions televisées de talk-shows
Félicien Vallet, Slim Essid, Jean Carrive, Gaël Richard
Compression et Représentation des Signaux Audiovisuels (CORESA), Lyon, France, October 2010.
-
A conditional random field viewpoint of symbolic audio-to-score matching
Cyril Joder, Slim Essid, Gael Richard
The international conference, Florence, Italy, October 2010.
-
Approche hiérarchique pour un alignement musique-sur-partition efficace
Cyril Joder, Slim Essid, Gael Richard
Compression et Représentation des Signaux Audiovisuels (CORESA), Lyon, France, October 2010. Prix du meil....
-
How sparsely can a signal be approximated while keeping its class identity?
Manuel Moussallam, Thomas Fillon, Gael Richard, Laurent Daudet
3rd international workshop, Florence, Italy, October 2010.
-
Probabilistic framework for template-based chord recognition
Laurent Oudre, Cédric Févotte, Yves Grenier
IEEE International Workshop on Multimedia Signal Processing (MMSP), St Malo, France, October 2010.
-
Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows
Félicien Vallet, Slim Essid, Jean Carrive, Gael Richard
2010 17th IEEE International Conference on Image Processing (ICIP 2010), Hong Kong, September 2010.
-
Robust frequency-based Audio Fingerprinting
Elsa Dupraz, Gael Richard
2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Dallas, United States, March 2010.
-
A comparative study of tonal acoustic features for a symbolic level music-to-score alignment
Cyril Joder, Slim Essid, Gael Richard
2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Dallas, United States, March 2010.
-
A multimodal approach to initialisation for top-down speaker diarization of television shows
Simon Bozonnet, Félicien Vallet, Nicholas Evans, Slim Essid, Gael Richard, Jean Carrive
EUSIPCO, Aalborg, Denmark, 2010.
-
Time-dependent parametric and harmonic templates in non-negative matrix factorization
Romain Hennequin, Roland Badeau, Bertrand David
Proc. of the 13th International Conference on Digital Audio Effects (DAFx), Graz, Austria, 2010.
-
NMF with time-frequency activations to model non-stationary audio events
Romain Hennequin, Roland Badeau, Bertrand David
Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, Texas, United States, 2010.
-
An improved hierarchical approach for music-to-symbolic score alignment
Cyril Joder, Slim Essid, Gael Richard
ISMIR, Utrecht, Netherlands, 2010.
-
Robust similarity metrics between audio signals based on asymmetrical spectral envelope matching
Mathieu Lagrange, Roland Badeau, Gael Richard
Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, Texas, United States, 2010.
-
YAAFE, an easy to use and efficient audio feature extraction software
Benoît Mathieu, Slim Essid, Thomas Fillon, Jacques Prado, Gael Richard
ISMIR, Utrecht, Netherlands, 2010.
Patent
-
Method and device for forming a digital audio mixed signal, method and device for separating signals, and corresponding signal
Laurent Girin, Antoine Liutkus, Gael Richard, Roland Badeau
France, October 2010.
Journal Articles
-
Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals
Jean-Louis Durrieu, Gael Richard, Bertrand David, Cédric Févotte
IEEE Transactions on Audio, Speech and Language Processing, March 2010.
-
Audio signal representations for indexing in the transform domain
Emmanuel Ravelli, Gael Richard, Laurent Daudet
IEEE Transactions on Audio, Speech and Language Processing, March 2010.
-
Explicit Modeling of Temporal Dynamics within Musical Signals for Acoustical Unit Formation and Similarity
Mathieu Lagrange, Martin Raspaud, Roland Badeau, Gael Richard
Pattern Recognition Letters, 2010.
2005 - 2009 [87 publications]
2009
Conference Articles
-
Fast Bayesian constrained NMF for polyphonic pitch transcription
Nancy Bertin, Emmanuel Vincent, Roland Badeau
Music Information Retrieval Evaluation eXchange (MIREX). International Society for Music Information Retrieval, Kobe, Japan, October 2009. Article acco....
-
Fast Bayesian NMF algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription
Nancy Bertin, Emmanuel Vincent, Roland Badeau
WASPAA, New Paltz, United States, October 2009.
-
Template-based chord recognition: influence of the chord types
Laurent Oudre, Yves Grenier, Cédric Févotte
International Symposium on Music Information Retrieval (ISMIR), Kobe, Japan, October 2009.
-
Chord recognition using measures of fit, chord templates and filtering methods
Laurent Oudre, Yves Grenier, Cédric Févotte
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, October 2009.
-
Interactive Segmentation of Electro-Acoustic Music
Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard
2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD), Bled, Slovenia, September 2009.
-
Étude des descripteurs acoustiques pour l’alignement temporel audio-sur-partition musicale
Cyril Joder, Slim Essid, Gaël Richard
GRETSI, Dijon, France, September 2009.
-
Incorporating prior knowledge on the digital media creation process into audio classifiers
M. Lardeur, Slim Essid, G. Richard, M. Haller, T. Sikora
ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, April 2009.
-
A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription
Nancy Bertin, Cédric Févotte, Roland Badeau
Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Taipei, Taiwan, 2009.
Journal Articles
-
Temporal Integration for Audio Classification With Application to Musical Instrument Classification
Cyril Joder, Slim Essid, Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, January 2009.
-
Sympathetic string modes in the concert harp
Jean-Loic Le Carrou, François Gautier, Roland Badeau
Acta Acustica united with Acustica, 2009.
Technical Reports
-
Supporting document for the paper “Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization”
Roland Badeau, Nancy Bertin, Emmanuel Vincent
2009.
-
Adaptive harmonic spectral decomposition for multiple pitch estimation
Emmanuel Vincent, Nancy Bertin, Roland Badeau
2009. This technic....
2008
Journal Articles
-
Union of MDCT Bases for Audio Coding
Emmanuel Ravelli, Gael Richard, Laurent Daudet
IEEE Transactions on Audio, Speech and Language Processing, November 2008.
-
A general framework for second order blind separation of stationary colored sources
Abdeldjalil Aissa El Bey, Karim Abed-Meraim, Yves Grenier, Yingbo Hua
Signal Processing, September 2008.
-
Fear-type emotion recognition for future audio-based surveillance systems
C. Clavel, I. Vasilescu, L. Devillers, Gael Richard, T. Ehrette
Speech Communication, May 2008.
-
Transcription and Separation of Drum Signals From Polyphonic Music
Olivier Gillet, Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, March 2008.
-
Estimation of Frequency for AM/FM Models Using the Phase Vocoder Framework
Michaël Betser, Patrice Collen, Gael Richard, Bertrand T. David
IEEE Transactions on Signal Processing, February 2008.
-
Multilinear singular value decomposition for structured tensors
Roland Badeau, Remy Boyer
SIAM Journal on Matrix Analysis and Applications, 2008.
-
Cramér-Rao bounds for multiple poles and coefficients of quasipolynomials in colored noise
Roland Badeau, Bertrand David, Gael Richard
IEEE Transactions on Signal Processing, 2008.
-
Fast and stable YAST algorithm for principal and minor subspace tracking
Roland Badeau, Gael Richard, Bertrand David
IEEE Transactions on Signal Processing, 2008.
-
Performance of ESPRIT for estimating mixtures of complex exponentials modulated by polynomials
Roland Badeau, Gael Richard, Bertrand David
IEEE Transactions on Signal Processing, 2008.
-
Instrument-specific harmonic atoms for mid-level music representation
Pierre Leveau, Emmanuel Vincent, Gael Richard, Laurent Daudet
IEEE Transactions on Audio, Speech and Language Processing, 2008.
-
Audio Indexing
Gael Richard
Encyclopedia of Data Warehousing and Mining, 2008.
Conference Articles
-
Automatic transcription of piano music based on HMM tracking of jointly-estimated pitches
Valentin Emiya, Roland Badeau, Bertrand David
2008 Music Information Retrieval Evaluation eXchange (MIREX), Philadelphia, PA, United States, September 2008.
-
Alignment kernels for audio classification with application to music instrument recognition
Cyril Joder, Slim Essid, Gaël Richard
16th European Signal Processing Conference, Lausanne, Switzerland, August 2008.
-
On the robustness of audio features for musical instrument classification
S Wegener, M Haller, J J Burred, T Sikora, Slim Essid, Gael Richard
16th European Signal Processing Conference, Lausanne, Switzerland, August 2008.
-
Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription
Emmanuel Vincent, Nancy Bertin, Roland Badeau
2008 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, United States, March 2008.
-
Weighted maximum likelihood autoregressive and moving average spectrum modeling
Roland Badeau, Bertrand David
Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, United States, 2008.
-
A Collaborative Approach to Video Summarization
Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Daragh Byrne, Hervé Bredin, Noel O’Connor, Gareth JF Jones, Martin Haller, Andreas Krutz, Thomas Sikora, Tomas Piatrik
SAMT 2008, 3rd International Conference on Semantic and Digital Media Technologies, Koblenz, Germany, 2008.
-
Rushes Video Summarization using a Collaborative Approach
Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Herwig Rehatschek, Daragh Byrne, Hervé Bredin, Noel O’Connor, Gareth JF Jones, Alan F Smeaton, Martin Haller, Andreas Krutz, Thomas Sikora, Tomas Piatrik
TRECVID 2008, ACM International Conference on Multimedia Information Retrieval, Vancouver, Canada, 2008.
2007
Conference Articles
-
Multipitch detection for piano music: Benchmarking a few approaches
Bertrand David, Roland Badeau, Nancy Bertin, Valentin Emiya, Gaël Richard
154th Meeting of the Acoustical Society of America, New Orleans, United States, November 2007.
-
Listening tests of the localization performance of Stereodipole and Ambisonic systems
Andrea Capra, Simone Fontana, Fons Adriaensen, Angelo Farina, Yves Grenier
123rd Convention of the Audio Engineering Society, New York, USA, October 2007.
-
Multipitch estimation of quasi-harmonic sounds in colored noise
Valentin Emiya, Roland Badeau, Bertrand David
10th Int. Conf. on Digital Audio Effects (DAFx-07), Bordeaux, France, September 2007.
-
Towards polyphonic musical instruments recognition
Gael Richard, Pierre Leveau, Laurent Daudet, Slim Essid, Bertrand David
19th International Congress on Acoustics, Madrid, Spain, September 2007.
-
Séparation aveugle sous-déterminée de sources en utilisant la décomposition en paquet d’ondelettes
Abdeldjalil Aissa El Bey, Karim Abed-Meraim, Yves Grenier
21e Colloque GRETSI sur le traitement du signal et des images, Troyes, France, September 2007.