Patrick Cardinal

Orcid: 0009-0000-9439-9910

According to our database1, Patrick Cardinal authored at least 61 papers between 2002 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention.
IEEE Trans. Biom. Behav. Identity Sci., July, 2023

Performance Characterization of MQTT Brokers in a Device-Local Edge Deployment.
Proceedings of the 2nd International Workshop on Middleware for the Edge, 2023

Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Multidiscriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems.
IEEE Trans. Inf. Forensics Secur., 2022

RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks.
IEEE Signal Process. Lett., 2022

Bi-discriminator GAN for tabular data synthesis.
Pattern Recognit. Lett., 2022

RCC-GAN: Regularized Compound Conditional GAN for Large-Scale Tabular Data Synthesis.
CoRR, 2022

From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks.
CoRR, 2022

A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition.
CoRR, 2022

Named Entity Recognition for Audio De-Identification.
Proceedings of the International Joint Conference on Neural Networks, 2022

Towards Robust Speech-to-Text Adversarial Attack.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Cyclic Defense GAN Against Speech Adversarial Attacks.
IEEE Signal Process. Lett., 2021

Deep domain adaptation with ordinal regression for pain assessment using weakly-labeled videos.
Image Vis. Comput., 2021

Bi-Discriminator Class-Conditional Tabular GAN.
CoRR, 2021

Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems.
CoRR, 2021

Weakly Supervised Learning for Facial Behavior Analysis : A Review.
CoRR, 2021

Class-Conditional Defense GAN Against End-To-End Speech Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

2020
A Robust Approach for Securing Audio Classification Against Adversarial Attacks.
IEEE Trans. Inf. Forensics Secur., 2020

Conditioning Trick for Training Stable GANs.
CoRR, 2020

Deep Domain Adaptation for Ordinal Regression of Pain Intensity Estimation Using Weakly-Labelled Videos.
CoRR, 2020

Improving Stability of LS-GANs for Audio and Speech Signals.
CoRR, 2020

From Sound Representation to Model Robustness.
CoRR, 2020

Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network.
Appl. Soft Comput., 2020

Adversarially Training for Audio Classifiers.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

RADARSAT-2 Synthetic-Aperture Radar Land Cover Segmentation Using Deep Convolutional Neural Networks.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Detection of Adversarial Attacks and Characterization of Adversarial Subspace.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Weakly Supervised Domain Adaptation for Pain Localization in Videos.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

2019
End-to-end environmental sound classification using a 1D convolutional neural network.
Expert Syst. Appl., 2019

Universal Adversarial Audio Perturbations.
CoRR, 2019

Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction.
CoRR, 2019

Multimodal Fusion with Deep Neural Networks for Audio-Video Emotion Recognition.
CoRR, 2019

Speaker Sincerity Detection based on Covariance Feature Vectors and Ensemble Methods.
CoRR, 2019

Unsupervised Feature Learning for Environmental Sound Classification Using Cycle Consistent Generative Adversarial Network.
CoRR, 2019

Emotion Recognition Using Fusion of Audio and Video Features.
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019

Emotion Recognition with Spatial Attention and Temporal Softmax Pooling.
Proceedings of the Image Analysis and Recognition - 16th International Conference, 2019

2018
Classification of Nonverbal Human Produced Audio Events: A Pilot Study.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2016
Native Language Detection Using the I-Vector Framework.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Dialect Detection in Arabic Broadcast Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
ETS System for AV+EC 2015 Challenge.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

Speaker adaptation using the i-vector technique for bottleneck features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Audio quotation marks for natural language understanding.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
A complete KALDI recipe for building Arabic speech recognition systems.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Large Vocabulary Speech Recognition on Parallel Architectures.
IEEE Trans. Speech Audio Process., 2013

2012
CRIM's content-based audio copy detection system for TRECVID 2009.
Multim. Tools Appl., 2012

The A* speech recognition system on parallel architectures.
Proceedings of the 11th International Conference on Information Science, 2012

Using A* for the parallelization of speech recognition systems.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2010
Content-based advertisement detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Content-based audio copy detection using nearest-neighbor mapping.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
CRIM´s Content-Based Copy Detection System for TRECVID.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Using parallel architectures in speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Real-time correction of closed-captions.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
GPU accelerated acoustic likelihood computations.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Real-Time Correction of Closed-Captions.
Proceedings of the ACL 2007, 2007

2006
Computer-assisted closed-captioning of live TV broadcasts in French.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Segmentation of recordings based on partial transcriptions.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2003
Automated closed-captioning of live TV broadcast news in French.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic segmentation of film dialogues into phonemes and graphemes.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Disambiguation of Finite-State Transducers.
Proceedings of the 19th International Conference on Computational Linguistics, 2002


  Loading...