Patrick Cardinal

Patrick Cardinal
Directeur et professeur
Formation B.Ing (ÉTS), M.Sc. (McGill), Ph.D. (ÉTS)
Bureau A-4486
Téléphone 514 396-8573
Présentation

Département de génie logiciel et des TI

Axes de recherche :

  • Technologies de l’information et des communications

Expertises :

  • Reconnaissance de la parole
  • Identification de la langue
  • Détection des émotions
  • Traitement parallèle
Publications: article

Publications: conference_item
17th Annual Conference of the International Speech Communication Association, (INTERSPEECH)
Publications: conference_item
2014 IEEE Spoken Language Technology Workshop (STL)
Publications: conference_item
Image Analysis and Recognition : 16th International Conference, ICIAR : Proceedings
Publications: conference_item
Workshop on Multimodal Corpora : Computer vision and language processing (MMC 2016)
Publications: conference_item
INTERSPEECH 2006 : ICSLP ; Proceedings of the Ninth International Conference on Spoken Language Processing
Publications: book_section
Listening to subtitles : subtitles for the deaf and hard of hearing
Publications: conference_item
Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003)
Publications: conference_item
19th Annual Conference of the International Speech Communication (INTERSPEECH 2018)
Publications: conference_itemau

Publications: conference_itemau

Publications: conference_item
INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association
Publications: conference_item
Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003)
Publications: conference_item
Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
Publications: monograph

Publications: thesis

Publications: thesis

Publications: conference_item
INTERSPEECH 2014. 15th Annual Conference of the International Speech Communication Association
Publications: conference_item
INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association
Publications: conference_item
INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association
Publications: conference_item
Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech'2005-Eurospeech)
Publications: conference_item
2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)
Publications: conference_item
2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Publications: conference_item
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge
Publications: conference_item
INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association
Publications: article

Publications: conference_item
9th Annual Conference of the International Speech Communication Association (INTERSPEECH)
Publications: conference_item
INTERSPEECH 2010. 11th Annual Conference of the International Speech Communication Association
Publications: article

Publications: conference_item
2010 International Workshop on Content-Based Multimedia Indexing (CBMI)
Publications: patent

Publications: conference_item
2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
Publications: conference_item
2009 TREC Video Retrieval Evaluation Notebook Papers
Publications: article

Publications: article

Publications: conference_item
17th Annual Conference of the International Speech Communication Association (INTERSPEECH)
Publications: conference_item
Proceedings of the 19th International Conference on Computational Linguistics (COLING2002)
Publications: conference_item
10th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications
Cours et encadrements

Cours

LOG320 Structures de données et algorithmes (Été 2019)
LOG320 Structures de données et algorithmes (Hiver 2019)

Encadrements

Mémoire à 30 crédits

    Attaques par exemples contradictoires sur les machines à vecteurs de support, par Alfonso Sallo,Raymel.
    Hiver 2020

    Sélection dynamique pour la détection automatique des émotions, par Gill,Marie-Philippe.
    Hiver 2020

    En codirection avec : Granger, Éric
    An Investigation of Ensemble Methods in Emotion Recognition Using LSTM and SVR, par Jafarian Bahri,Seyedeh Rafooneh.
    Été 2019

    En codirection avec : Lameiras Koerich, Alessandro
    Emotion Recognition Using Fusion of Audio And Video Features, par Silva Ortega,Juan David.
    Hiver 2019

    En codirection avec : Pedersoli, Marco
    Ensemble methods for affective computing, par Aminbeidokhti,Masih.
    Automne 2019

    En codirection avec : Kharma, Nawwaf
    Gulded Function Discovery in Genetic Programming, par Bédard-Couture,Rémi.
    Hiver 2020

    En codirection avec : Pedersoli, Marco
    Machine learning for automatic malware detection, par Boucher Charbonneau,Kristof.
    Automne 2019

    En codirection avec : Vasseur, Elsa
    Weakly Supervised Approaches in Audio/Visual Applications, par Saadati,Mirmohammad.
    Automne 2019

    En codirection avec : Dumas, Maxime
    La détection d'anomalie appliquée à la sélection de contenu d'un système automatisé de génération de rapports financiers, par Gazaille,Stéphane.
    Automne 2019

Thèse de doctorat (recherche appliquée)

    En codirection avec : Plusquellec, Pierrich
    Détection multisensorielle ubiquitaire du stress humain, par Boucher,Patrice.
    Automne 2019

    En codirection avec : Lameiras Koerich, Alessandro
    Novel Featurure Representation Strategies for Audio Classification, par Esmaeilpour,Mohammad.
    Hiver 2020

Rapport technique à 6 crédits

    En codirection avec : Dumouchel, Pierre
    Implémentation de l'algorithme SOLA - modification de l'échelle de temps en codage de moyenne à bas debit, par Romero,Freud Abner.
    Automne 2015

Publications
Article publié dans une revue, révisé par les pairs (5)

Sajjad Abdoli, Patrick Cardinal, Alessandro Lameiras Koerich. 2019. « End-to-end environmental sound classification using a 1D convolutional neural network ». Expert Systems with Applications. vol. 136 p. 252-263.

Marc J. Lanovaz, Patrick Cardinal, Mary Francis. 2019. « Using a visual structured criterion for the analysis of alternating-treatment designs ». Behavior Modification. vol. 43 , nº 1. p. 115-131.

Marc J. Lanovaz, Stéphanie Turgeon, Patrick Cardinal, Tara L. Wheatley. 2019. « Using single-case designs in practical settings: Is within-subject replication always necessary? ». Perspectives on Behavior Science. vol. 42 , nº 1. p. 153-162.

Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne. 2013. « Large vocabulary speech recognition on parallel architectures ». IEEE Transactions on Audio, Speech and Language Processing. vol. 21 , nº 11. p. 2290-2300.

Vishwa Nath Gupta, Gilles Boulianne, Patrick Cardinal. 2012. « CRIM’s content-based audio copy detection system for TRECVID 2009 ». Multimedia Tools and Applications. vol. 60 , nº 2. p. 371-387.

Compte rendu de conférence (26)

Masih Aminbeidokhti, Marco Pedersoli, Patrick Cardinal, Eric Granger. 2019. « Emotion recognition with spatial attention and temporal softmax pooling ». Image Analysis and Recognition : 16th International Conference, ICIAR : Proceedings (Waterloo, ON, Canada, Aug. 27-29, 2019) p. 323-331. Cham, Switzerland : Springer International Publishing.

Rachel E. Bouserhal, Philippe Chabot, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2018. « Classification of nonverbal human produced audio events: A pilot study ». 19th Annual Conference of the International Speech Communication (INTERSPEECH 2018) (Hyderabad, India, Sept. 02-06, 2018) p. 1512-1516. International Speech Communication Association.

I. Verduyckt, P. Cardinal, A. Loubnani, A. Alpan. 2017. « MyOrtho – A vocal coach application with visual feed-back for monitoring and storing of patient progress in a home environment ». 10th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications (Firenze, Italy, Dec. 13-15, 2017) p. 31-34. Firenze University Press.

Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, Steve Renals. 2016. « Automatic dialect detection in Arabic broadcast speech ». 17th Annual Conference of the International Speech Communication Association, (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016) p. 2934-2938. Baixas, France : International Speech and Communication Association.

Patrice Boucher, Pierre Dufour, Pierrich Plusquellec, Najim Dehak, Pierre Dumouchel, Patrick Cardinal. 2016. « PHYSIOSTRESS: A multimodal corpus of data on acute stress and physiological activation ». Workshop on Multimodal Corpora : Computer vision and language processing (MMC 2016) (Portoroz, Slovenia, May 23-28, 2016) p. 45-48. European Language Resources Association (ELRA).

Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro L. Koerich. 2016. « Native language detection using the i-vector framework ». 17th Annual Conference of the International Speech Communication Association (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016) p. 2398-2402. Baixas, France : International Speech and Communication Association.

Simon Boutin, Réal Tremblay, Patrick Cardinal, Doug Peters, Pierre Dumouchel. 2015. « Audio quotation marks for natural language understanding ». INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015) p. 1349-1352. International Speech Communication Association.

Patrick Cardinal, Najim Dehak, Alessandro Koerich Lameiras, Jahangir Alam, Patrice Boucher. 2015. « ETS System for AV+EC 2015 Challenge ». Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge (Brisbane, Australia, Oct. 26-30, 2015) p. 17-23. ACM.

Patrick Cardinal, Najim Dehak, Yu Zhang, James Glass. 2015. « Speaker adaptation using the I-vector technique for bottleneck features ». INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015) p. 2867-2871. International Speech Communication Association.

Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James Glass. 2014. « A complete KALDI recipe for building Arabic speech recognition systems ». 2014 IEEE Spoken Language Technology Workshop (STL) (South Lake Tahoe, NV, USA, Dec. 7-10, 2014) p. 525-529. IEEE.

Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel. 2014. « Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera ». INTERSPEECH 2014. 15th Annual Conference of the International Speech Communication Association (Singapore, Singapore, Sept. 14-18, 2014) p. 2088-2092. International Speech Communication Association.

Patrick Cardinal, Gilles Boulianne, Pierre Dumouchel. 2012. « The A* speech recognition system on parallel architectures ». 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012) p. 108-113. Washington, DC : IEEE Computer Society.

Patrick Cardinal, Gilles Boulianne, Pierre Dumouchel. 2012. « Using A* for the parallelization of speech recognition systems ». 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Kyoto, Japan, Mar. 25-30, 2012) p. 4433-4436. Piscataway, NJ : Institute of Electrical and Electronics Engineers Inc..

Patrick Cardinal, Vishwa Gupta, Gilles Boulianne. 2010. « Content-based advertisement detection ». INTERSPEECH 2010. 11th Annual Conference of the International Speech Communication Association (Chiba, Makuhari, Japan, Sept. 26-30, 2010) p. 2214-2217. International Speech Communication Association.

Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2010. « CRIM's content-based audio copy detection system for TRECVID 2009 ». 2010 International Workshop on Content-Based Multimedia Indexing (CBMI) (Grenoble, France, June 23-25, 2010) IEEE.

Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2010. « Content-based audio copy detection using nearest-neighbor mapping ». 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) (Dallas, TX, USA, Mar. 14-19, 2010) p. 261-264. IEEE.

Patrick Cardinal, Gilles Boulianne. 2009. « Real-time correction of closed-captions ». INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009) p. 1447-1450. International Speech and Communication Association.

Patrick Cardinal, Gilles Boulianne. 2009. « Using parallel architectures in speech recognition ». INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009) p. 3039-3042. International Speech and Communication Association.

Maguelonne Héritier, Vishwa Gupta, Langis Gagnon, Gilles Boulianne, Samuel Foucher, Patrick Cardinal. 2009. « CRIM's content-based copy detection system for TRECVID ». 2009 TREC Video Retrieval Evaluation Notebook Papers (Gaithesburg, MD, USA, Nov. 16, 2009) National Institute of Standards and Technology.

Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne, Michel Comeau. 2008. « GPU accelerated acoustics likelihood computations ». 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008) p. 964-967. Bonn, Germany : International Speech Communication Association.

P. Cardinal, G. Boulianne, M. Comeau, M. Boisvert. 2007. « Real-time correction of closed captions ». Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (Prague, Czech Republic, June 24-29, 2007) p. 113-116. Association for Computational Linguistics (ACL).

G. Boulianne, J.-F. Beaumont, M. Boisvert, J. Brousseau, P. Cardinal, C. Chapdelaine, M. Comeau, P. Ouellet, F. Osterrath. 2006. « Computer-assisted closed-captioning of live TV broadcasts in French ». INTERSPEECH 2006 : ICSLP ; Proceedings of the Ninth International Conference on Spoken Language Processing (Pittsburgh, PA, USA, Sept. 17-21, 2006) p. 273-276. International Speech and Communication Association.

Patrick Cardinal, Gilles Boulianne, Michel Comeau. 2005. « Segmentation of recordings based on partial transcriptions ». Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech'2005-Eurospeech) (Lisbon, Portugal, Sept. 4-8, 2005) p. 3345-3348. International Speech and Communication Association.

Gilles Boulianne, Jean-François Beaumont, Patrick Cardinal, Michel Comeau, Pierre Ouellet, Pierre Dumouchel. 2003. « Automatic segmentation of film dialogues into phonemes and graphemes ». Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003) p. 1241-1244. International Speech and Communication Association.

Julie Brousseau, Jean-François Beaumont, Gilles Boulianne, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Frédéric Osterrath, Pierre Ouellet. 2003. « Automated closed-captioning of live TV broadcast news in French ». Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003) p. 1245-1248. International Speech and Communication Association.

N. Smaili, P. Cardinal, G. Boulianne, P. Dumouchel. 2002. « Disambiguation of finite-state transducers ». Proceedings of the 19th International Conference on Computational Linguistics (COLING2002) (Taipei, Taiwan, Aug. 26-30, 2002) Association for Computational Linguistics (ACL).

Communication (2)

Rachel Bouserhal, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2019. « Classification of nonverbal human produced audio events: a pilot study ». Communication lors de la conférence : National Hearing Conservation Association Annual Conference 2019 ( Grapevine, TX, USA, Feb. 07-09, 2019 )

Rachel Bouserhal, Milton Sarria-Paja, Patrick Cardinal, Jérémie Voix. 2018. « Classification of nonverbal human produced audio events: a pilot study ». Communication lors de la conférence : Workshop on machine hearing and learning ( Montreal, QC, Canada, Sept. 21, 2018 )

Brevet (1)

Vishwa Gupta, Gilles Boulianne, Patrick Cardinal. 2014-09-09. « Content based audio copy detection ».

Mémoire ou thèse (2)

Patrick Cardinal. 2013. « Speech recognition on multi-core processors and GPUs ». 145 p. Thèse de doctorat. Montréal , École de technologie supérieure

Patrick Cardinal. 2003. « Finite-state transducers and speech recognition ». Mémoire de maîtrise. , McGill University

Chapitre de livre (1)

Gilles Boulianne, Jean-François Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath, Pierre Dumouchel. 2010. « Shadow speaking for real-time closed-captioning of TV broadcasts in french ». In Listening to subtitles : subtitles for the deaf and hard of hearing . Peter Lang International Academic Professional Publishers.

Rapport technique (1)

Patrick Cardinal. 2006. « E-Inclusion core speech forward-backward algorithm ». «Collection scientifique et technique» Centre de recherche informatique de Montréal. 6 p.

Prix et distinctions