Patrick Cardinal

Cadre en ressourcement
Professeur

Département

Département de génie logiciel et TI

Formation

B.Ing (ÉTS), M.Sc. (McGill), Ph.D. (ÉTS)

Courriel

patrick.cardinal@etsmtl.ca

Professeur souriant devant des gradins rouges, représentant l'esprit dynamique de l'université.

Unité de recherche

Visiter le site web de : LIVIA – Laboratoire d'imagerie, de vision et d'intelligence artificielle

Axe de recherche

Systèmes logiciels, multimédia et cybersécurité

Expertises

Reconnaissance de la parole
Identification de la langue
Détection des émotions
Traitement parallèle

Cours

3 crédits Automne 2025

4 crédits Été 2025

4 crédits Automne 2025

4 crédits Hiver 2026

Encadrements

En codirection avec : Pierrich Plusquellec
Introduction aux réseaux conceptuels appliqués à l'apprentissage automatique des machines, par Patrice Boucher
Automne 2023
En codirection avec : Alessandro Lameiras Koerich
End-to-End Deep Learning for Audio Classification: From Waveforms to a Security Perspective, par Sajjad Abdoli
Automne 2021
En codirection avec : Alessandro Lameiras Koerich
Towards Reliable Data-Driven Sound Recognition Models: Developing Attack and Defense Algorithms, par Mohammad Esmaeilpour
Hiver 2022
En codirection avec : Éric Granger
Deep Regression Models for Spatio-temporal Expression Recognition in Videos, par Gnanapraveen Rajasekhar
Été 2023

En codirection avec : Pierre Dumouchel
Utilisation des caractéristiques prosodiques pour optimiser un système de compréhension du langage naturel, par Simon Boutin
Hiver 2016
En codirection avec : Jean-Marc Robert
SIPBIO - Biometrics SIP Extension, par Wilmar Perez
Été 2018
En codirection avec : Éric Granger
An Investigation of Ensemble Methods in Emotion Recognition Using LSTM and SVR, par Seyedeh Rafooneh Jafarian Bahri
Été 2019
En codirection avec : Alessandro Lameiras Koerich
Emotion Recognition Using Fusion of Audio And Video Features, par Juan David Silva Ortega
Hiver 2019
En codirection avec : Marco Pedersoli
Deep audio and video emotion detection, par Masih Aminbeidokhti
Hiver 2020
En codirection avec : Nawwaf Kharma
Improved Measures of Robustness and Evolvability for Evolutionary Systems, par Rémi Bédard-Couture
Automne 2023
En codirection avec : Jérémie Voix
Classification of Nonverbal Human-Produced Audio Events, par Philippe Chabot
Hiver 2020
En codirection avec : Marco Pedersoli
Variational Autoencoders with Gaussian Mixture Prior for Recommender Systems, par Kristof Boucher Charbonneau
Hiver 2020
En codirection avec : Elsa Vasseur
Applications of Deep Learning in Visual Recognition, par Mirmohammad Saadati
Hiver 2023
En codirection avec : Maxime Dumas
Unsupervised Abstractive Summaries of Controllable Length, par Stéphane Gazaille
Été 2020
En codirection avec : Laureano Moro-Velazquez
Predicting the Severity of Parkinson’s Disease Symptoms with Smartwatches during Daily Living, par Marie-Philippe Gill
Hiver 2021
En codirection avec : Rafael Menelau Oliveira Cruz
Identification des dialectes arabes avec la sélection dynamique de classifieurs, par Pierre-Marc Thibault
Automne 2021
En codirection avec : Alessandro Lameiras Koerich
Automatic Audio Anonymization, par Guillaume Baril
Automne 2021
Détection et suivi de joueurs dans le jeu de Blackjack, par Branavan Inthiranathan
Automne 2022
En codirection avec : Laureano Moro-Velazquez
Automatic Proficiency Assessment in Second-Language English Learners, par Armita Mohammadi
Été 2025
En codirection avec : Julien Gascon-Samson
Évaluation exhaustive des performances de courtiers MQTT dans divers environnements en périphérie, par Guillaume Simard
Hiver 2024
En codirection avec : Olivier Landon-Cardinal
Apprentissage machine quantique, par Ana Catarina Castro da Silva
Hiver 2026
En codirection avec : Olivier Landon-Cardinal
Robustesse des classificateurs quantiques face aux attaques adverses, par Félix Wilhelmy
Été 2026

Conception d'une application de conseil psychologique basée sur les LLM, par Joseph Corbin
Automne 2025

Développement de composantes d'extraction de contenu sémantique à partir d'enregistrements audio, en vue de leur application à la lutte contre la désinformation, par Amira Morsli
Hiver 2024

Valorisation des données pour améliorer la reconnaissance de la parole en français québécois, par Amira Morsli
Hiver 2024

En codirection avec : Pierre Dumouchel
Implémentation de l'algorithme SOLA - modification de l'échelle de temps en codage de moyenne à bas debit, par Freud Abner Romero
Automne 2015

Publications

R. Gnana Praveen, Patrick Cardinal, Eric Granger. 2023 « Audio-visual fusion for emotion recognition in the valence-arousal space using joint cross-attention ». IEEE Transactions on Biometrics, Behavior, and Identity Science vol. 5 , nº 3. p. 360-373
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2022 « From environmental sound representation to robustness of 2D CNN models against adversarial attacks ». Applied Acoustics vol. 195
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2022 « Multidiscriminator sobolev defense-GAN against adversarial attacks for end-to-end speech systems ». IEEE Transactions on Information Forensics and Security vol. 17. p. 2044-2058
Mohammad Esmaeilpour, Nourhene Chaalia, Adel Abusitta, Franois-Xavier Devailly, Wissem Maazoun, Patrick Cardinal. 2022 « Bi-discriminator GAN for tabular data synthesis ». Pattern Recognition Letters vol. 159. p. 204-210
Mohammad Esmaeilpour, Nourhene Chaalia, Patrick Cardinal. 2022 « RSD-GAN: Regularized sobolev defense GAN against speech-to-text adversarial attacks ». IEEE Signal Processing Letters vol. 29. p. 1998-2002
Philippe Chabot, Rachel E. Bouserhal, Patrick Cardinal, Jérémie Voix. 2021 « Detection and classification of human-produced nonverbal audio events ». Applied Acoustics vol. 171
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2021 « Cyclic defense GAN against speech adversarial attacks ». IEEE Signal Processing Letters vol. 28. p. 1769-1773
Gnana Praveen Rajasekhar, Eric Granger, Patrick Cardinal. 2021 « Deep domain adaptation with ordinal regression for pain assessment using weakly-labeled videos ». Image and Vision Computing vol. 110
Marie-Michèle Dufour, Marc J. Lanovaz, Patrick Cardinal. 2020 « Artificial intelligence for the measurement of vocal stereotypy ». Journal of the Experimental Analysis of Behavior vol. 114 , nº 3. p. 368-380
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2020 « A robust approach for securing audio classification against adversarial attacks ». IEEE Transactions on Information Forensics and Security vol. 15. p. 2147-2159
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2020 « Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network ». Applied Soft Computing vol. 86
Sajjad Abdoli, Patrick Cardinal, Alessandro Lameiras Koerich. 2019 « End-to-end environmental sound classification using a 1D convolutional neural network ». Expert Systems with Applications vol. 136. p. 252-263
Marc J. Lanovaz, Patrick Cardinal, Mary Francis. 2019 « Using a visual structured criterion for the analysis of alternating-treatment designs ». Behavior Modification vol. 43 , nº 1. p. 115-131
Marc J. Lanovaz, Stéphanie Turgeon, Patrick Cardinal, Tara L. Wheatley. 2019 « Using single-case designs in practical settings: Is within-subject replication always necessary? ». Perspectives on Behavior Science vol. 42 , nº 1. p. 153-162
Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne. 2013 « Large vocabulary speech recognition on parallel architectures ». IEEE Transactions on Audio, Speech, and Language Processing vol. 21 , nº 11. p. 2290-2300
Vishwa Nath Gupta, Gilles Boulianne, Patrick Cardinal. 2012 « CRIM’s content-based audio copy detection system for TRECVID 2009 ». Multimedia Tools and Applications vol. 60 , nº 2. p. 371-387

Hami Monsarrat-Chanon, Jérémie Voix, Rachel Bouserhal, Patrick Cardinal, Philippe Chabot. 2025-07-29 « In-ear nonverbal audio events classification system and method ». Brevet canadien CA 3079917.
Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2014-09-09 « Content based audio copy detection ». Brevet américain US 8,831,760.

Mohammed Senoussaoui, Milton O. Saria-Paja, Patrick Cardinal, Tiago H. Falk, François Michaud. 2020 « State-of-the-art speaker recognition methods applied to speakers with dysarthria ». In Voice Technologies for Speech Reconstruction and Enhancement. p. 7-34. De Gruyter
Gilles Boulianne, Jean-François Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath, Pierre Dumouchel. 2010 « Shadow speaking for real-time closed-captioning of TV broadcasts in french ». In Listening to subtitles : subtitles for the deaf and hard of hearing . Peter Lang International Academic Professional Publishers

Rachel Bouserhal, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2019 « Classification of nonverbal human produced audio events: a pilot study ». Communication lors de la conférence : National Hearing Conservation Association Annual Conference 2019 (Grapevine, TX, USA, Feb. 07-09, 2019)
Rachel Bouserhal, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2018 « Classification of nonverbal human produced audio events: a pilot study ». Communication lors de la conférence : Workshop on machine hearing and learning (Montreal, QC, Canada, Sept. 21, 2018)

R. Gnana Praveen, Eric Granger, Patrick Cardinal. 2023 « Recursive joint attention for audio-visual fusion in regression based emotion recognition ». IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Rhodes Island, Greece, June 04-10, 2023) Institute of Electrical and Electronics Engineers Inc.
Guillaume Simard, Cédric Melançon, Patrick Cardinal, Julien Gascon-Samson. 2023 « Performance characterization of MQTT brokers in a device-local edge deployment ». MiddleWEdge 2023 - Proceedings of the 2nd International Workshop on Middleware for the Edge (Bologna, Italia, Dec. 11, 2023) Association for Computing Machinery, Inc
Guillaume Baril, Patrick Cardinal, Alessandro Lameiras Koerich. 2022 « Named entity recognition for audio de-identification ». International Joint Conference on Neural Networks (IJCNN) (Padua, Italy, July 18-23, 2022) Institute of Electrical and Electronics Engineers Inc.
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2022 « Towards robust speech-to-text adversarial attack ». 47th IEEE International Conference on Acoustics, Speech, and Signal Processing (Singapore, Singapore, May 23-27, 2022) Institute of Electrical and Electronics Engineers Inc.
R. G. Praveen, W. C. de Melo, N. Ullah, H. Aslam, O. Zeeshan, T. Denorme, M. Pedersoli, A. L. Koerich, S. Bacon, P. Cardinal, E. Granger. 2022 « A joint cross-attention model for audio-visual fusion in dimensional emotion recognition ». IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (New Orleans, LA, USA, June 19-20, 2022) IEEE
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich. 2021 « Class-conditional defense GaN against end-to-end speech attacks ». IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Toronto, ON, Canada - En ligne, June 06-11,, 2021) Institute of Electrical and Electronics Engineers Inc.
R. Gnana Praveen, Eric Granger, Patrick Cardinal. 2021 « Cross attentional audio-visual fusion for dimensional emotion recognition ». 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021) (Jodhpur, India, Dec. 15-18, 2021) Institute of Electrical and Electronics Engineers Inc.
Mirmohammad Saadati, Marco Pedersoli, Patrick Cardinal, Peter Oliver. 2021 « RADARSAT-2 Synthetic-Aperture radar land cover segmentation using deep convolutional neural networks ». Pattern Recognition. ICPR International Workshops and Challenges, Virtual Event, January 10-15, 2021, Proceedings Part VIII (Milan, Italy, Jan. 10-15, 2021) Springer
Gnana R. Praveen, Eric Granger, Patrick Cardinal. 2020 « Deep weakly supervised domain adaptation for pain localization in videos ». 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG) (Buenos Aires, Argentina, Nov. 16-20, 2020) IEE Computer Society
Raymel Alfonso Sallo, Mohammad Esmaeilpour, Patrick Cardinal. 2020 « Adversarially training for audio classifiers ». 25th International Conference on Pattern Recognition (ICPR) (Milan, Italy, Jan. 10-15, 2021) IEEE
Masih Aminbeidokhti, Marco Pedersoli, Patrick Cardinal, Eric Granger. 2019 « Emotion recognition with spatial attention and temporal softmax pooling ». Image Analysis and Recognition : 16th International Conference, ICIAR : Proceedings (Waterloo, ON, Canada, Aug. 27-29, 2019) Springer International Publishing
Juan D. S. Ortega, Patrick Cardinal, Alessandro L. Koerich. 2019 « Emotion recognition using fusion of audio and video features ». IEEE International Conference on Systems, Man and Cybernetics (SMC) (Bari, Italy, Oct. 06-09, 2019) Institute of Electrical and Electronics Engineers Inc.
Rachel E. Bouserhal, Philippe Chabot, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2018 « Classification of nonverbal human produced audio events: A pilot study ». 19th Annual Conference of the International Speech Communication (INTERSPEECH 2018) (Hyderabad, India, Sept. 02-06, 2018) International Speech Communication Association
I. Verduyckt, P. Cardinal, A. Loubnani, A. Alpan. 2017 « MyOrtho – A vocal coach application with visual feed-back for monitoring and storing of patient progress in a home environment ». 10th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications (Firenze, Italy, Dec. 13-15, 2017) Firenze University Press
Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, Steve Renals. 2016 « Automatic dialect detection in Arabic broadcast speech ». 17th Annual Conference of the International Speech Communication Association, (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016) International Speech and Communication Association
Patrice Boucher, Pierre Dufour, Pierrich Plusquellec, Najim Dehak, Pierre Dumouchel, Patrick Cardinal. 2016 « PHYSIOSTRESS: A multimodal corpus of data on acute stress and physiological activation ». Workshop on Multimodal Corpora : Computer vision and language processing (MMC 2016) (Portoroz, Slovenia, May 23-28, 2016) European Language Resources Association (ELRA)
Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro L. Koerich. 2016 « Native language detection using the i-vector framework ». 17th Annual Conference of the International Speech Communication Association (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016) International Speech and Communication Association
Simon Boutin, Réal Tremblay, Patrick Cardinal, Doug Peters, Pierre Dumouchel. 2015 « Audio quotation marks for natural language understanding ». INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015) International Speech Communication Association
Patrick Cardinal, Najim Dehak, Alessandro Koerich Lameiras, Jahangir Alam, Patrice Boucher. 2015 « ETS System for AV+EC 2015 Challenge ». Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge (Brisbane, Australia, Oct. 26-30, 2015) ACM
Patrick Cardinal, Najim Dehak, Yu Zhang, James Glass. 2015 « Speaker adaptation using the I-vector technique for bottleneck features ». INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015) International Speech Communication Association
Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James Glass. 2014 « A complete KALDI recipe for building Arabic speech recognition systems ». 2014 IEEE Spoken Language Technology Workshop (STL) (South Lake Tahoe, NV, USA, Dec. 7-10, 2014) IEEE
Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel. 2014 « Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera ». INTERSPEECH 2014. 15th Annual Conference of the International Speech Communication Association (Singapore, Singapore, Sept. 14-18, 2014) International Speech Communication Association
Patrick Cardinal, Gilles Boulianne, Pierre Dumouchel. 2012 « The A* speech recognition system on parallel architectures ». 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012) IEEE Computer Society
Patrick Cardinal, Gilles Boulianne, Pierre Dumouchel. 2012 « Using A* for the parallelization of speech recognition systems ». 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Kyoto, Japan, Mar. 25-30, 2012) Institute of Electrical and Electronics Engineers Inc.
Patrick Cardinal, Vishwa Gupta, Gilles Boulianne. 2010 « Content-based advertisement detection ». INTERSPEECH 2010. 11th Annual Conference of the International Speech Communication Association (Chiba, Makuhari, Japan, Sept. 26-30, 2010) International Speech Communication Association
Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2010 « CRIM's content-based audio copy detection system for TRECVID 2009 ». 2010 International Workshop on Content-Based Multimedia Indexing (CBMI) (Grenoble, France, June 23-25, 2010) IEEE
Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2010 « Content-based audio copy detection using nearest-neighbor mapping ». 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) (Dallas, TX, USA, Mar. 14-19, 2010) IEEE
Patrick Cardinal, Gilles Boulianne. 2009 « Real-time correction of closed-captions ». INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009) International Speech and Communication Association
Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne. 2009 « Using parallel architectures in speech recognition ». INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009) International Speech and Communication Association
Maguelonne Héritier, Vishwa Gupta, Langis Gagnon, Gilles Boulianne, Samuel Foucher, Patrick Cardinal. 2009 « CRIM's content-based copy detection system for TRECVID ». 2009 TREC Video Retrieval Evaluation Notebook Papers (Gaithesburg, MD, USA, Nov. 16, 2009) National Institute of Standards and Technology
Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne, Michel Comeau. 2008 « GPU accelerated acoustics likelihood computations ». 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008) International Speech Communication Association
P. Cardinal, G. Boulianne, M. Comeau, M. Boisvert. 2007 « Real-time correction of closed captions ». Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (Prague, Czech Republic, June 24-29, 2007) Association for Computational Linguistics (ACL)
G. Boulianne, J.-F. Beaumont, M. Boisvert, J. Brousseau, P. Cardinal, C. Chapdelaine, M. Comeau, P. Ouellet, F. Osterrath. 2006 « Computer-assisted closed-captioning of live TV broadcasts in French ». INTERSPEECH 2006 : ICSLP ; Proceedings of the Ninth International Conference on Spoken Language Processing (Pittsburgh, PA, USA, Sept. 17-21, 2006) International Speech and Communication Association
Patrick Cardinal, Gilles Boulianne, Michel Comeau. 2005 « Segmentation of recordings based on partial transcriptions ». Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech'2005-Eurospeech) (Lisbon, Portugal, Sept. 4-8, 2005) International Speech and Communication Association
Gilles Boulianne, Jean-François Beaumont, Patrick Cardinal, Michel Comeau, Pierre Ouellet, Pierre Dumouchel. 2003 « Automatic segmentation of film dialogues into phonemes and graphemes ». Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003) International Speech and Communication Association
Julie Brousseau, Jean-François Beaumont, Gilles Boulianne, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Frédéric Osterrath, Pierre Ouellet. 2003 « Automated closed-captioning of live TV broadcast news in French ». Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003) International Speech and Communication Association
N. Smaili, P. Cardinal, G. Boulianne, P. Dumouchel. 2002 « Disambiguation of finite-state transducers ». Proceedings of the 19th International Conference on Computational Linguistics (COLING2002) (Taipei, Taiwan, Aug. 26-30, 2002) Association for Computational Linguistics (ACL)

Patrick Cardinal. 2013 « Speech recognition on multi-core processors and GPUs ». 145 p.Thèse de doctorat. Montréal, (Québec), École de technologie supérieure.
Patrick Cardinal. 2003 « Finite-state transducers and speech recognition ». Mémoire de maîtrise. McGill University.

Patrick Cardinal. 2006 « E-Inclusion core speech forward-backward algorithm ». Centre de recherche informatique de Montréal. 6 p.

Portes ouvertes

Patrick Cardinal

Cadre en ressourcement Professeur

Unité de recherche

Axe de recherche

Expertises

Cours

Encadrements

Thèse de doctorat (recherche appliquée)

Mémoire à 30 crédits

Projet d'application à 15 crédits

Projet d'intervention en entreprise à 15 crédits

Stage industriel et rapport technique, 3 cr.

Rapport technique à 6 crédits

Publications

Article publié dans une revue, révisé par les pairs 16

Brevet 2

Chapitre de livre 2

Communication 2

Compte rendu de conférence 37

Mémoire ou thèse 2

Rapport technique 1

Cadre en ressourcement
Professeur