Publications HAL de Xavier,Alameda-Pineda

2024

titre: A weighted-variance variational autoencoder model for speech enhancement
auteur: Ali Golmakani, Mostafa Sadeghi, Xavier Alameda-Pineda, Romain Serizel
article: ICASSP 2024 - International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2024, Seoul (Korea), South Korea. pp.1-5
Accès au texte intégral et bibtex

titre: A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning
auteur: Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier
article: Neural Networks, 2024, 172, pp.106120. ⟨10.1016/j.neunet.2024.106120⟩
Accès au bibtex

titre: Unsupervised Performance Analysis of 3D Face Alignment with a Statistically Robust Confidence Test
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda, Radu Horaud
article: Neurocomputing, 2024, 564, pp.1-16. ⟨10.1016/j.neucom.2023.126941⟩
Accès au texte intégral et bibtex

titre: Autoregressive GAN for Semantic Unconditional Head Motion Generation
auteur: Louis Airale, Xavier Alameda-Pineda, Stéphane Lathuilière, Dominique Vaufreydaz
article: ACM Transactions on Multimedia Computing, Communications and Applications, 2024, pp.1-11. ⟨10.1145/3635154⟩
Accès au texte intégral et bibtex

titre: Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
auteur: Xiaoyu Lin, Laurent Girin, Xavier Alameda-Pineda
article: Transactions on Machine Learning Research Journal, 2024, pp.1-19
Accès au texte intégral et bibtex

2023

titre: Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs
auteur: Basavasagar Patil, Xavier Alameda-Pineda, Chris Reinke
article: 2023
Accès au texte intégral et bibtex

titre: Motion-DVAE: Unsupervised learning for fast human motion denoising
auteur: Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Renaud Séguier
article: ACM SIGGRAPH Conference on Motion, Interaction and Games (ACM MIG), Nov 2023, Rennes, France. ⟨10.1145/3623264.3624454⟩
Accès au bibtex

titre: Continual Attentive Fusion for Incremental Learning in Semantic Segmentation
auteur: Guanglei Yang, Enrico Fini, Dan Xu, Paolo Rota, Mingli Ding, Hao Tang, Xavier Alameda-Pineda, Elisa Ricci
article: IEEE Transactions on Multimedia, 2023, 25, pp.3841-3854. ⟨10.1109/TMM.2022.3167555⟩
Accès au bibtex

titre: Variational Meta Reinforcement Learning for Social Robotics
auteur: Anand Ballou, Xavier Alameda-Pineda, Chris Reinke
article: Applied Intelligence, 2023, pp.1-16. ⟨10.1007/s10489-023-04691-5⟩
Accès au bibtex

titre: Unsupervised speech enhancement with deep dynamical generative speech and noise models
auteur: Xiaoyu Lin, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda
article: Interspeech 2023 - 24th Annual Conference of the International Speech Communication Association, ISCA, Aug 2023, Dublin, Ireland. pp.1-5
Accès au bibtex

titre: A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
auteur: Louis Airale, Dominique Vaufreydaz, Xavier Alameda-Pineda
article: 2023
Accès au texte intégral et bibtex

titre: Semi-supervised learning made simple with self-supervised clustering
auteur: Enrico Fini, Pietro Astolfi, Karteek Alahari, Xavier Alameda-Pineda, Julien Mairal, Moin Nabi, Elisa Ricci
article: CVPR 2023 – IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2023, Vancouver, Canada. pp.1-11
Accès au texte intégral et bibtex

titre: Speech Modeling with a Hierarchical Transformer Dynamical VAE
auteur: Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda
article: ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10096751⟩
Accès au bibtex

titre: Expression-preserving face frontalization improves visually assisted speech processing
auteur: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda
article: International Journal of Computer Vision, 2023, 131 (5), pp.1122-1140. ⟨10.1007/s11263-022-01742-1⟩
Accès au texte intégral et bibtex

titre: Successor Feature Representations
auteur: Chris Reinke, Xavier Alameda-Pineda
article: Transactions on Machine Learning Research Journal, 2023, pp.1-35
Accès au bibtex

titre: Learning and controlling the source-filter representation of speech with a variational autoencoder
auteur: Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier
article: Speech Communication, 2023, 148, pp.53-65. ⟨10.1016/j.specom.2023.02.005⟩
Accès au texte intégral et bibtex

titre: Back to MLP: A Simple Baseline for Human Motion Prediction
auteur: Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer
article: WACV 2023 - IEEE Winter Conference on Applications of Computer Vision, Jan 2023, Waikoloa, United States. pp.1-11
Accès au bibtex

2022

titre: TransCenter: Transformers With Dense Representations for Multiple-Object Tracking
auteur: Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, pp.1-16. ⟨10.1109/TPAMI.2022.3225078⟩
Accès au bibtex

titre: Unsupervised Speech Enhancement using Dynamical Variational Autoencoders
auteur: Xiaoyu Bie, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, 30, pp.2993 - 3007. ⟨10.1109/TASLP.2022.3207349⟩
Accès au texte intégral et bibtex

titre: Variational Inference and Learning of Piecewise-linear Dynamical Systems
auteur: Xavier Alameda-Pineda, Vincent Drouard, Radu Horaud
article: IEEE Transactions on Neural Networks and Learning Systems, 2022, 33 (8), pp.3753 - 3764. ⟨10.1109/TNNLS.2021.3054407⟩
Accès au texte intégral et bibtex

titre: Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining
auteur: Hanyu Xuan, Yihong Xu, Shuo Chen, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda
article: IJCAI 2022 - 31st International Joint Conference on Artificial Intelligence, Jul 2022, Vienna, Austria. pp.3643-3649, ⟨10.24963/ijcai.2022/506⟩
Accès au bibtex

titre: Self-Supervised Models are Continual Learners
auteur: Enrico Fini, Victor da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal
article: CVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. pp.9611-9620, ⟨10.1109/CVPR52688.2022.00940⟩
Accès au bibtex

titre: Multi-Person Extreme Motion Prediction
auteur: Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer
article: IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. ⟨10.1109/CVPR52688.2022.01271⟩
Accès au bibtex

titre: A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos
auteur: Hanyu Xuan, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda
article: CVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. pp.1-10, ⟨10.1109/CVPR52688.2022.00110⟩
Accès au texte intégral et bibtex

titre: Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole
auteur: Laurent Girin, Xiaoyu Bie, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda
article: JEP 2022 - 34e Journées d’Études sur la Parole, Université de Nantes, Jun 2022, Noirmoutier, France. pp.655-663, ⟨10.21437/JEP.2022-69⟩
Accès au texte intégral et bibtex

titre: The Impact of Removing Head Movements on Audio-visual Speech Enhancement
auteur: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda, Jacob Donley, Anurag Kumar
article: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5, ⟨10.1109/ICASSP43922.2022.9746401⟩
Accès au texte intégral et bibtex

titre: SocialInteractionGAN: Multi-person Interaction Sequence Generation
auteur: Louis Airale, Dominique Vaufreydaz, Xavier Alameda-Pineda
article: IEEE Transactions on Affective Computing, 2022, ⟨10.1109/TAFFC.2022.3171719⟩
Accès au texte intégral et bibtex

titre: Probabilistic Graph Attention Network with Conditional Kernels for Pixel-Wise Prediction
auteur: Dan Xu, Xavier Alameda-Pineda, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44 (5), pp.2673-2688. ⟨10.1109/TPAMI.2020.3043781⟩
Accès au bibtex

titre: Learning and controlling the source-filter representation of speech with a variational autoencoder
auteur: Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Seguier
article: CFA 2022 - 16ème Congrès Français d'Acoustique, Société Française d'Acoustique (SFA), Apr 2022, Marseille, France
Accès au bibtex

titre: Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation
auteur: Guanglei Yang, Enrico Fini, Dan Xu, Paolo Rota, Mingli Ding, Moin Nabi, Xavier Alameda-Pineda, Elisa Ricci
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, pp.1-14. ⟨10.1109/TPAMI.2022.3163806⟩
Accès au bibtex

2021

titre: Successor Feature Neural Episodic Control
auteur: David Emukpere, Xavier Alameda-Pineda, Chris Reinke
article: NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems, Dec 2021, Virtual, Canada. pp.1-12
Accès au bibtex

titre: Dynamical Variational Autoencoders: A Comprehensive Review
auteur: Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda
article: Foundations and Trends in Machine Learning, 2021, 15 (1-2), pp.1-175. ⟨10.1561/2200000089⟩
Accès au texte intégral et bibtex

titre: Deep Variational Generative Models for Audio-visual Speech Separation
auteur: Viet-Nhat Nguyen, Mostafa Sadeghi, Elisa Ricci, Xavier Alameda-Pineda
article: MLSP 2021 - IEEE International Workshop on Machine Learning for Signal Processing, Oct 2021, Gold Coast, Australia. ⟨10.1109/MLSP52302.2021.9596406⟩
Accès au bibtex

titre: A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
auteur: Xiaoyu Bie, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda
article: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.46-50, ⟨10.21437/Interspeech.2021-256⟩
Accès au texte intégral et bibtex

titre: Variational Structured Attention Networks for Deep Visual Representation Learning
auteur: Guanglei Yang, Paolo Rota, Xavier Alameda-Pineda, Dan Xu, Mingli Ding, Elisa Ricci
article: 2021
Accès au bibtex

titre: Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda
article: ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. pp.1-5, ⟨10.1109/ICASSP39728.2021.9414097⟩
Accès au texte intégral et bibtex

titre: Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers
auteur: Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (5), pp.1761-1776. ⟨10.1109/TPAMI.2019.2953020⟩
Accès au texte intégral et bibtex

titre: Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda
article: IEEE Transactions on Signal Processing, 2021, 69, pp.1899-1909. ⟨10.1109/TSP.2021.3066038⟩
Accès au texte intégral et bibtex

titre: ODANet: Online Deep Appearance Network for Identity-Consistent Multi-Person Tracking
auteur: Guillaume Delorme, Yutong Ban, Guillaume Sarrazin, Xavier Alameda-Pineda
article: ICPR 2021 - 25th International Conference on Pattern Recognition / Workshops, Jan 2021, Milano / Virtual, Italy. pp.803-818, ⟨10.1007/978-3-030-68780-9_60⟩
Accès au texte intégral et bibtex

titre: CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification
auteur: Guillaume Delorme, Yihong Xu, Stéphane Lathuilière, Radu Horaud, Xavier Alameda-Pineda
article: ICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milano, Italy. pp.4428-4435, ⟨10.1109/ICPR48806.2021.9412431⟩
Accès au texte intégral et bibtex

titre: CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification
auteur: Guillaume Delorme, Yihong Xu, Stéphane Lathuiliére, Radu Horaud, Xavier Alameda-Pineda
article: 25th International Conference on Pattern Recognition (ICPR), Jan 2021, Milan, Italy. ⟨10.1109/ICPR48806.2021.9412431⟩
Accès au bibtex

titre: PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation
auteur: Wen Guo, Enric Corona, Francesc Moreno-Noguer, Xavier Alameda-Pineda
article: WACV 2021 - IEEE Winter Conference on Applications of Computer vision, Jan 2021, Waikoloa, United States. pp.1-11, ⟨10.1109/WACV48630.2021.00284⟩
Accès au bibtex

2020

titre: Towards Probabilistic Generative Models for Socially Intelligent Robots
auteur: Xavier Alameda-Pineda
article: Computer Vision and Pattern Recognition [cs.CV]. Université Grenoble - Alpes, 2020
Accès au texte intégral et bibtex

titre: Unsupervised Performance Analysis of 3D Face Alignment
auteur: Mostafa Sadeghi, Sylvain Guy, Adrien Raison, Xavier Alameda-Pineda, Radu Horaud
article: 2020
Accès au texte intégral et bibtex

titre: Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
auteur: Yahui Liu, Marco de Nadai, Deng Cai, Huayang Li, Xavier Alameda-Pineda, Nicu Sebe, Bruno Lepri
article: 28th ACM International Conference on Multimedia, MM'20, Oct 2020, Seatle, United States. pp.1357-1365, ⟨10.1145/3394171.3413505⟩
Accès au bibtex

titre: A Comprehensive Analysis of Deep Regression
auteur: Stéphane Lathuilière, Pablo Mesejo, Xavier Alameda-Pineda, Radu Horaud
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42 (9), pp.2065-2081. ⟨10.1109/TPAMI.2019.2910523⟩
Accès au texte intégral et bibtex

titre: How To Train Your Deep Multi-Object Tracker
auteur: Yihong Xu, Aljosa Osep, Yutong Ban, Radu Horaud, Laura Leal-Taixé, Xavier Alameda-Pineda
article: IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle WA, United States. pp.6786-6795, ⟨10.1109/CVPR42600.2020.00682⟩
Accès au texte intégral et bibtex

titre: Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders
auteur: Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.1788-1800. ⟨10.1109/TASLP.2020.3000593⟩
Accès au texte intégral et bibtex

titre: Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
auteur: Mostafa Sadeghi, Xavier Alameda-Pineda
article: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp.7534-7538, ⟨10.1109/ICASSP40776.2020.9053730⟩
Accès au texte intégral et bibtex

titre: A Recurrent Variational Autoencoder for Speech Enhancement
auteur: Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: ICASSP 2020 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelone (virtual), Spain. pp.371-375, ⟨10.1109/ICASSP40776.2020.9053164⟩
Accès au texte intégral et bibtex

2019

titre: Audio-Visual Variational Fusion for Multi-Person Tracking with Robots
auteur: Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, Radu Horaud, Xiaofei Li, Bastien Mourgue, Guillaume Sarrazin
article: ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨10.1145/3343031.3350590⟩
Accès au texte intégral et bibtex

titre: Tracking Multiple Audio Sources with the Von Mises Distribution and Variational EM
auteur: Yutong Ban, Xavier Alameda-Pineda, Christine Evers, Radu Horaud
article: IEEE Signal Processing Letters, 2019, 26 (6), pp.798 - 802. ⟨10.1109/LSP.2019.2908376⟩
Accès au texte intégral et bibtex

titre: Increasing Image Memorability with Neural Style Transfer
auteur: Aliaksandr Siarohin, Gloria Zen, Cveta Majtanovic, Xavier Alameda-Pineda, Elisa Ricci, Nicu Sebe
article: ACM Transactions on Multimedia Computing, Communications and Applications, 2019, 15 (2), ⟨10.1145/3311781⟩
Accès au texte intégral et bibtex

titre: Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments
auteur: Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
article: IEEE Journal of Selected Topics in Signal Processing, 2019, 13 (1), pp.88-103. ⟨10.1109/JSTSP.2019.2903472⟩
Accès au texte intégral et bibtex

2018

titre: Multimodal behavior analysis in the wild
auteur: Xavier Alameda-Pineda, Elisa Ricci, Nicu Sebe
article: Academic Press (Elsevier), 2018
Accès au bibtex

titre: A Cascaded Multiple-Speaker Localization and Tracking System
auteur: Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
article: IWAENC - LOCATA Challenge Workshop - a satellite event of IWAENC 2018, Sep 2018, Tokyo, Japan. pp.1-5
Accès au texte intégral et bibtex

titre: DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model
auteur: Stéphane Lathuilière, Pablo Mesejo, Xavier Alameda-Pineda, Radu Horaud
article: ECCV 2018 - European Conference on Computer Vision, Sep 2018, Munich, Germany. pp.205-221, ⟨10.1007/978-3-030-01228-1_13⟩
Accès au texte intégral et bibtex

titre: Cross-Paced Representation Learning with Partial Curricula for Sketch-based Image Retrieval
auteur: Dan Xu, Xavier Alameda-Pineda, Jingkuan Song, Elisa Ricci, Nicu Sebe
article: IEEE Transactions on Image Processing, 2018, 27 (9), pp. 4410-4421. ⟨10.1109/TIP.2018.2837381⟩
Accès au texte intégral et bibtex

titre: Every Smile is Unique: Landmark-Guided Diverse Smile Generation
auteur: Wei Wang, Xavier Alameda-Pineda, Dan Xu, Pascal Fua, Elisa Ricci, Nicu Sebe
article: IEEE Conference on Computer Vision and Pattern Recognition, Jun 2018, Salk Lake City, United States. pp.7083-7092, ⟨10.1109/CVPR.2018.00740⟩
Accès au bibtex

titre: Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking
auteur: Yutong Ban, Xiaofei Li, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.6553-6557, ⟨10.1109/ICASSP.2018.8462100⟩
Accès au texte intégral et bibtex

2017

titre: Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction
auteur: Dan Xu, Wanli Ouyang, Xavier Alameda-Pineda, Elisa Ricci, Xiaogang Wang, Nicu Sebe
article: Advances in Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.3961-3970
Accès au texte intégral et bibtex

titre: MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes
auteur: Xavier Alameda-Pineda, Miriam Redi, Mohammad Soleymani, Nicu Sebe, Shih-Fu Chang, Samuel Gosling
article: MM 2017 - ACM on Multimedia Conference, Oct 2017, Mountain View CA, United States. pp.1974-1975, ⟨10.1145/3123266.3132057⟩
Accès au texte intégral et bibtex

titre: Exploiting the Complementarity of Audio and Visual Data in Multi-Speaker Tracking
auteur: Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud
article: ICCVW 2017 - IEEE International Conference on Computer Vision Workshops, Oct 2017, Venise, Italy. pp.446-454, ⟨10.1109/ICCVW.2017.60⟩
Accès au texte intégral et bibtex

titre: Exploiting the Intermittency of Speech for Joint Separation and Diarization
auteur: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud, Sharon Gannot
article: WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.41-45, ⟨10.1109/WASPAA.2017.8169991⟩
Accès au texte intégral et bibtex

titre: Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract
auteur: Diandra Fabre, Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Pierre Badin
article: Speech Communication, 2017, 93, pp.63 - 75. ⟨10.1016/j.specom.2017.08.002⟩
Accès au bibtex

titre: Tracking a Varying Number of People with a Visually-Controlled Robotic Head
auteur: Yutong Ban, Xavier Alameda-Pineda, Fabien Badeig, Sileye Ba, Radu Horaud
article: IEEE/RSJ International Conference on Intelligent Robots and Systems, Sep 2017, Vancouver, Canada. pp.4144-4151, ⟨10.1109/IROS.2017.8206274⟩
Accès au texte intégral et bibtex

titre: Viraliency: Pooling Local Virality
auteur: Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci
article: IEEE Conference on Computer Vision and Pattern Recognition, Jul 2017, Honolulu, Hawaii, United States. pp.484-492, ⟨10.1109/CVPR.2017.59⟩
Accès au texte intégral et bibtex

titre: How to Make an Image More Memorable? A Deep Style Transfer Approach
auteur: Aliaksandr Siarohin, Gloria Zen, Cveta Majtanovic, Xavier Alameda-Pineda, Elisa Ricci, Nicu Sebe
article: ICMR 2017 - ACM International Conference on Multimedia Retrieval, Jun 2017, Bucharest, Romania. pp.322-329, ⟨10.1145/3078971.3078986⟩
Accès au bibtex

titre: An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures
auteur: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud
article: ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.16-20, ⟨10.1109/ICASSP.2017.7951789⟩
Accès au texte intégral et bibtex

titre: Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping
auteur: Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩
Accès au texte intégral et bibtex

titre: Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework
auteur: Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda
article: LVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.459-468, ⟨10.1007/978-3-319-53547-0_43⟩
Accès au texte intégral et bibtex

2016

titre: Multi-Paced Dictionary Learning for Cross-Domain Retrieval and Recognition
auteur: Dan Xu, Jingkuan Song, Xavier Alameda-Pineda, Elisa Ricci, Nicu Sebe
article: IEEE International Conference on Pattern Recognition, Dec 2016, Cancun, Mexico. pp.3228-3233, ⟨10.1109/ICPR.2016.7900132⟩
Accès au texte intégral et bibtex

titre: An On-line Variational Bayesian Model for Multi-Person Tracking from Cluttered Scenes
auteur: Sileye Ba, Xavier Alameda-Pineda, Alessio Xompero, Radu Horaud
article: Computer Vision and Image Understanding, 2016, 153, pp.64-76. ⟨10.1016/j.cviu.2016.07.006⟩
Accès au texte intégral et bibtex

titre: EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis
auteur: Israel Dejene Gebru, Xavier Alameda-Pineda, Florence Forbes, Radu Horaud
article: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38 (12), pp.2402 - 2415. ⟨10.1109/TPAMI.2016.2522425⟩
Accès au texte intégral et bibtex

titre: Tracking Multiple Persons Based on a Variational Bayesian Model
auteur: Yutong Ban, Sileye Ba, Xavier Alameda-Pineda, Radu Horaud
article: Computer Vision – ECCV 2016 Workshops, Oct 2016, Amsterdam, Netherlands. pp.52-67, ⟨10.1007/978-3-319-48881-3_5⟩
Accès au texte intégral et bibtex

titre: A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures
auteur: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (8), pp.1408-1423. ⟨10.1109/TASLP.2016.2554286⟩
Accès au texte intégral et bibtex

titre: An Inverse-Gamma Source Variance Prior with Factorized Parameterization for Audio Source Separation
auteur: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud
article: ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Mar 2016, Shanghai, China. pp.136-140, ⟨10.1109/ICASSP.2016.7471652⟩
Accès au texte intégral et bibtex

2015

titre: Speaker-Adaptive Acoustic-Articulatory Inversion using Cascaded Gaussian Mixture Regression
auteur: Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Gérard Bailly
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (12), pp.2246-2259. ⟨10.1109/TASLP.2015.2464702⟩
Accès au bibtex

titre: A Variational EM Algorithm for the Separation of Moving Sound Sources
auteur: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud
article: WASPAA 2015 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE Signal Processing Society, Oct 2015, New Paltz, NY, United States. pp.1-5, ⟨10.1109/WASPAA.2015.7336936⟩
Accès au texte intégral et bibtex

titre: Vision-Guided Robot Hearing
auteur: Xavier Alameda-Pineda, Radu Horaud
article: The International Journal of Robotics Research, 2015, 34 (4-5), pp.437-456. ⟨10.1177/0278364914548050⟩
Accès au texte intégral et bibtex

2014

titre: Audio-Visual Speaker Localization via Weighted Clustering
auteur: Israel-Dejene Gebru, Xavier Alameda-Pineda, Radu Horaud, Florence Forbes
article: IEEE Workshop on Machine Learning for Signal Processing, Sep 2014, Reims, France. pp.1-6, ⟨10.1109/MLSP.2014.6958874⟩
Accès au texte intégral et bibtex

titre: A Geometric Approach to Sound Source Localization from Time-Delay Estimates
auteur: Xavier Alameda-Pineda, Radu Horaud
article: IEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (6), pp.1082-1095. ⟨10.1109/TASLP.2014.2317989⟩
Accès au texte intégral et bibtex

titre: Sound Representation and Classification Benchmark for Domestic Robots
auteur: Maxime Janvier, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: ICRA 2014 - IEEE International Conference on Robotics and Automation, May 2014, Hong Kong, China. pp.6285-6292, ⟨10.1109/ICRA.2014.6907786⟩
Accès au texte intégral et bibtex

2013

titre: The Geometry of Sound-Source Localization using Non-Coplanar microphone Arrays
auteur: Xavier Alameda-Pineda, Radu Horaud, Bernard Mourrain
article: WASPAA 2013 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2013, New Paltz, United States. pp.1-4, ⟨10.1109/WASPAA.2013.6701896⟩
Accès au texte intégral et bibtex

titre: Egocentric Audio-Visual Scene Analysis : a machine learning and signal processing approach
auteur: Xavier Alameda-Pineda
article: General Mathematics [math.GM]. Université de Grenoble, 2013. English. ⟨NNT : 2013GRENM024⟩
Accès au texte intégral et bibtex

titre: Active-Speaker Detection and Localization with Microphones and Cameras Embedded into a Robotic Head
auteur: Jan Cech, Ravi Mittal, Antoine Deleforge, Jordi Sanchez-Riera, Xavier Alameda-Pineda, Radu Horaud
article: Humanoids 2013 - IEEE-RAS International Conference on Humanoid Robots, IEEE Robotics Society, Oct 2013, Atlanta, United States. pp.203-210, ⟨10.1109/HUMANOIDS.2013.7029977⟩
Accès au texte intégral et bibtex

titre: Benchmarking Methods for Audio-Visual Recognition Using Tiny Training Sets
auteur: Xavier Alameda-Pineda, Jordi Sanchez-Riera, Radu Horaud
article: ICASSP 2013 - IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2013, Vancouver, Canada. pp.3662-3666, ⟨10.1109/ICASSP.2013.6638341⟩
Accès au texte intégral et bibtex

titre: RAVEL: An Annotated Corpus for Training Robots with Audiovisual Abilities
auteur: Xavier Alameda-Pineda, Jordi Sanchez-Riera, Johannes Wienke, Vojtech Franc, Jan Cech, Kaustubh Kulkarni, Antoine Deleforge, Radu Horaud
article: Journal on Multimodal User Interfaces, 2013, 7 (1-2), pp.79-91. ⟨10.1007/s12193-012-0111-y⟩
Accès au texte intégral et bibtex

2012

titre: Sound-Event Recognition with a Companion Humanoid
auteur: Maxime Janvier, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
article: Humanoids 2012 - IEEE International Conference on Humanoid Robotics, Nov 2012, Osaka, Japan. pp.104-111, ⟨10.1109/HUMANOIDS.2012.6651506⟩
Accès au texte intégral et bibtex

titre: Online Multimodal Speaker Detection for Humanoid Robots
auteur: Jordi Sanchez-Riera, Xavier Alameda-Pineda, Johannes Wienke, Antoine Deleforge, Soraya Arias, Jan Cech, Sebastian Wrede, Radu Horaud
article: Humanoids 2012 - IEEE International Conference on Humanoid Robotics, Nov 2012, Osaka, Japan. pp.126-133, ⟨10.1109/HUMANOIDS.2012.6651509⟩
Accès au texte intégral et bibtex

titre: Audio-Visual Robot Command Recognition
auteur: Jordi Sanchez-Riera, Xavier Alameda-Pineda, Radu Horaud
article: ICMI 2012 - 14th ACM International Conference on Multimodal Interaction, Oct 2012, Santa-Monica, CA, United States. pp.371-378, ⟨10.1145/2388676.2388760⟩
Accès au texte intégral et bibtex

titre: Geometrically-constrained Robust Time Delay Estimation Using Non-coplanar Microphone Arrays
auteur: Xavier Alameda-Pineda, Radu Horaud
article: EUSIPCO 2012 - 20th European Signal Processing Conference, Aug 2012, Bucharest, Romania. pp.1309-1313
Accès au texte intégral et bibtex

titre: Geometrically-constrained time delay estimation-based sound source localisation (gTDESSL)
auteur: Xavier Alameda-Pineda, Radu Horaud
article: [Research Report] RR-7988, INRIA. 2012, pp.28
Accès au texte intégral et bibtex

2011

titre: Finding Audio-Visual Events in Informal Social Gatherings
auteur: Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud, Florence Forbes
article: ACM/IEEE International Conference on Multimodal Interaction, Nov 2011, Alicante, Spain. pp.247-254, ⟨10.1145/2070481.2070527⟩
Accès au texte intégral et bibtex

titre: The Ravel data set
auteur: Xavier Alameda-Pineda, Jordi Sanchez-Riera, Vojtech Franch, Johannes Wienke, Jan Cech, Kaustubh Kulkarni, Antoine Deleforge, Radu Horaud
article: [Research Report] RR-7709, INRIA. 2011
Accès au texte intégral et bibtex