Yong Man Ro
ORCID: 0000-0001-5306-6853
Affiliations:
- Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Image and Video Systems Lab, Daejeon, South Korea
- Information and Communications University, Yusong, South Korea (former)
- Korea Advanced Institute of Science and Technology, Daejeon, South Korea (PhD 1992)
According to our database, Yong Man Ro authored at least 386 papers between 1995 and 2024.
Bibliography
2024
IEEE Trans. Neural Networks Learn. Syst., September, 2024
Integrating Language-Derived Appearance Elements With Visual Cues in Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., September, 2024
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model.
IEEE Trans. Multim., 2024
Defending Video Recognition Model Against Adversarial Perturbations via Defense Patterns.
IEEE Trans. Dependable Secur. Comput., 2024
Textless Unit-to-Unit Training for Many-to-Many Multilingual Speech-to-Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Pattern Recognit., 2024
Text-guided distillation learning to diversify video embeddings for text-video retrieval.
Pattern Recognit., 2024
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
CoRR, 2024
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models.
CoRR, 2024
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models.
CoRR, 2024
CoRR, 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection.
CoRR, 2024
What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models.
CoRR, 2024
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024
Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units.
CoRR, 2024
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Visual Speech Recognition for Languages with Limited Labeled Data Using Automatic Labels from Whisper.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens.
Proceedings of the IEEE International Conference on Acoustics, 2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Image Vis. Comput., August, 2023
IEEE Trans. Image Process., 2023
IEEE Trans. Inf. Forensics Secur., 2023
Incorporating Language-Driven Appearance Knowledge Units with Visual Cues in Pedestrian Detection.
CoRR, 2023
CoRR, 2023
Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model.
CoRR, 2023
Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation.
CoRR, 2023
CoRR, 2023
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition.
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Robust Multispectral Pedestrian Detection Via Spectral Position-Free Feature Mapping.
Proceedings of the IEEE International Conference on Image Processing, 2023
Mitigating Dataset Bias in Image Captioning Through Clip Confounder-Free Captioning Network.
Proceedings of the IEEE International Conference on Image Processing, 2023
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Similarity Relation Preserving Cross-Modal Learning for Multispectral Pedestrian Detection Against Adversarial Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Multispectral Invisible Coating: Laminated Visible-Thermal Physical Attack against Multispectral Object Detectors Using Transparent Low-E Films.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Multim., 2022
Defending Person Detection Against Adversarial Patch Attack by Using Universal Defensive Frame.
IEEE Trans. Image Process., 2022
Robust Perturbation for Visual Explanation: Cross-Checking Mask Optimization to Avoid Class Distortion.
IEEE Trans. Image Process., 2022
Assessing Individual VR Sickness Through Deep Feature Fusion of VR Video and Physiological Response.
IEEE Trans. Circuits Syst. Video Technol., 2022
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022
Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment.
IEEE Trans. Biom. Behav. Identity Sci., 2022
On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics.
IEEE Trans. Affect. Comput., 2022
Defending Against Person Hiding Adversarial Patch Attack with a Universal White Frame.
CoRR, 2022
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022
Defending Physical Adversarial Attack on Object Detection via Adversarial Patch-Feature Energy.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Robust Thermal Infrared Pedestrian Detection By Associating Visible Pedestrian Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection.
Proceedings of the Computer Vision - ECCV 2022, 2022
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Circuits Syst. Video Technol., 2021
IEEE Trans. Circuits Syst. Video Technol., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding.
IEEE Access, 2021
Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021
Adversarially Robust Multi-Sensor Fusion Model Training Via Random Feature Fusion For Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
M-CAM: Visual Explanation of Challenging Conditioned Dataset with Bias-reducing Memory.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE Trans. Image Process., 2020
IEEE Trans. Geosci. Remote. Sens., 2020
Lightweight and Effective Facial Landmark Detection using Adversarial Learning with Face Geometric Map Generative Network.
IEEE Trans. Circuits Syst. Video Technol., 2020
Deep Virtual Reality Image Quality Assessment With Human Perception Guider for Omnidirectional Image.
IEEE Trans. Circuits Syst. Video Technol., 2020
IEEE Trans. Circuits Syst. Video Technol., 2020
Encoding features robust to unseen modes of variation with attentive long short-term memory.
Pattern Recognit., 2020
Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers.
Image Vis. Comput., 2020
Investigating Vulnerability to Adversarial Examples on Multimodal Data Fusion in Deep Learning.
CoRR, 2020
Efficient Ensemble Model Generation for Uncertainty Estimation with Bayesian Approximation in Segmentation.
CoRR, 2020
IEEE Access, 2020
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Towards High-Performance Object Detection: Task-Specific Design Considering Classification and Localization Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-Based Symptom Relation Embedding.
Proceedings of the Computer Vision - ECCV 2020, 2020
Structure Boundary Preserving Segmentation for Medical Image With Ambiguous Boundary.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 31st British Machine Vision Conference 2020, 2020
2019
IEEE Trans. Image Process., 2019
Attended Relation Feature Representation of Facial Dynamics for Facial Authentication.
IEEE Trans. Inf. Forensics Secur., 2019
IEEE Trans. Circuits Syst. Video Technol., 2019
Multi-Objective Based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition.
IEEE Trans. Affect. Comput., 2019
Implementation of multimodal biometric recognition via multi-feature deep learning networks and feature fusion.
Multim. Tools Appl., 2019
Photo-Realistic Facial Emotion Synthesis Using Multi-level Critic Networks with Multi-level Generative Model.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
Region-guided adversarial learning for anatomical landmark detection in uterus ultrasound image.
Proceedings of the Medical Imaging 2019: Image Processing, 2019
Generation of Multimodal Justification Using Visual Word Constraint Model for Explainable Computer-Aided Diagnosis.
Proceedings of the Interpretability of Machine Intelligence in Medical Image Computing and Multimodal Learning for Clinical Decision Support, 2019
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019
Visual evidence for interpreting diagnostic decision of deep neural network in computer-aided diagnosis.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019
Generative Guiding Block: Synthesizing Realistic Looking Variants Capable of Even Large Change Demands.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Physiological Fusion Net: Quantifying Individual VR Sickness with Content Stimulus and Physiological Response.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Attentive Layer Separation for Object Classification and Object Localization in Object Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Deep Objective Assessment Model Based on Spatio-Temporal Perception of 360-Degree Video for VR Sickness Prediction.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Mode Variational LSTM Robust to Unseen Modes of Variation: Application to Facial Expression Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
VR IQA NET: Deep Virtual Reality Image Quality Assessment using Adversarial Learning.
CoRR, 2018
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the Medical Imaging 2018: Computer-Aided Diagnosis, 2018
Adversarial Spatial Frequency Domain Critic Learning for Age and Gender Classification.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Object Bounding Box-Critic Networks for Occlusion-Robust Object Detection in Road Scene.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018
Feature2Mass: Visual Feature Processing in Latent Space for Realistic Labeled Mass Generation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Facial Dynamics Interpreter Network: What Are the Important Relations Between Local Dynamics for Facial Trait Estimation?
Proceedings of the Computer Vision - ECCV 2018, 2018
Learning Spatio-Temporal Features With Partial Expression Sequences for On-the-Fly Prediction.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Multiview Stereoscopic Video Hole Filling Considering Spatiotemporal Consistency and Binocular Symmetry for Synthesized 3D Video.
IEEE Trans. Circuits Syst. Video Technol., 2017
Effective and efficient human action recognition using dynamic frame skipping and trajectory rejection.
Image Vis. Comput., 2017
Dynamics Transfer GAN: Generating Video by Transferring Arbitrary Temporal Dynamics from a Source Video to a Single Target Image.
CoRR, 2017
Differential Generative Adversarial Networks: Synthesizing Non-linear Facial Variations with Limited Number of Training Data.
CoRR, 2017
Measurement of exceptional motion in VR video contents for VR sickness assessment using deep convolutional autoencoder.
Proceedings of the 23rd ACM Symposium on Virtual Reality Software and Technology, 2017
Learning Features Robust to Image Variations with Siamese Networks for Facial Expression Recognition.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Visual comfort assessment of stereoscopic images using deep visual and disparity features based on human attention.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017
Proceedings of the 10th International Congress on Image and Signal Processing, 2017
Multi-Scale Facial Scanning via Spatial Lstm for Latent Facial Feature Representation.
Proceedings of the International Conference of the Biometrics Special Interest Group, 2017
2016
Critical Binocular Asymmetry Measure for the Perceptual Quality Assessment of Synthesized Stereo 3D Images in View Synthesis.
IEEE Trans. Circuits Syst. Video Technol., 2016
Partial Matching of Facial Expression Sequence Using Over-Complete Transition Dictionary for Emotion Recognition.
IEEE Trans. Affect. Comput., 2016
Collaborative expression representation using peak expression and intra class variation face images for practical subject-independent emotion recognition in videos.
Pattern Recognit., 2016
Feature scalability for a low complexity face recognition with unconstrained spatial resolution.
Multim. Tools Appl., 2016
Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography.
Expert Syst. Appl., 2016
Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Collaborative facial color feature learning of multiple color spaces for face recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Measurement of critical temporal inconsistency for quality assessment of synthesized video.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Spatio-temporal representation for face authentication by using multi-task learning with human attributes.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
A deep facial landmarks detection with facial contour and facial components constraint.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Latent feature representation with 3-D multi-view deep convolutional neural network for bilateral analysis in digital breast tomosynthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A new hole filling method based on 3D geometric transformation for synthesized image.
Proceedings of the Stereoscopic Displays and Applications XXVII, 2016
Learning based hole filling method using deep convolutional neural network for view synthesis.
Proceedings of the Image Processing: Machine Vision Applications IX, 2016
Two-step Learning of Deep Convolutional Neural Network for Discriminative Face Recognition under Varying Illumination.
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2016, 2016
Proceedings of the Digital Media Industry & Academic Forum, 2016
Facial dynamic modelling using long short-term memory network: Analysis and application to face authentication.
Proceedings of the 8th IEEE International Conference on Biometrics Theory, 2016
Bilateral hemiface feature representation learning for pose robust facial expression recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Image-based coin recognition using rotation-invariant region binary patterns based on gradient magnitudes.
J. Vis. Commun. Image Represent., 2015
Region based stellate features combined with variable selection using AdaBoost learning in mammographic computer-aided detection.
Comput. Biol. Medicine, 2015
High-Speed Periodic Motion Reconstruction Using an Off-the-shelf Camera with Compensation for Rolling Shutter Effect.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
A Sparse Representation-Based Label Pruning for Image Inpainting Using Global Optimization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015
Subtle Facial Expression Recognition Using Adaptive Magnification of Discriminative Facial Motion.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015
Combination of conspicuity improved synthetic mammograms and digital breast tomosynthesis: a promising approach for mass detection.
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015
Utilizing digital breast tomosynthesis projection views correlation for microcalcification enhancement for detection purposes.
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015
Pose-Robust and Discriminative Feature Representation by Multi-task Deep Learning for Multi-view Face Recognition.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Multispectral Texture Features from Visible and Near-Infrared Synthetic Face Images for Face Recognition.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Human action recognition using time-invariant key-trajectories describing spatio-temporal salient motion.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Temporally consistent hole filling method based on global optimization with label propagation for 3D video.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Face image assessment learned with objective and relative face image qualities for improved face recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Region matching based on local structure information in ipsilateral digital breast tomosynthesis views.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Feature extraction from bilateral dissimilarity in digital breast tomosynthesis reconstructed volume.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Efficient and effective human action recognition in video through motion boundary description with a compact set of trajectories.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015
Multi-view stereo image synthesis using binocular symmetry-based global optimization.
Proceedings of the Stereoscopic Displays and Applications XXVI, 2015
2014
Visual Comfort Amelioration Technique for Stereoscopic Images: Disparity Remapping to Mitigate Global and Local Discomfort Causes.
IEEE Trans. Circuits Syst. Video Technol., 2014
Visual comfort improvement in stereoscopic 3D displays using perceptually plausible assessment metric of visual comfort.
IEEE Trans. Consumer Electron., 2014
Intra-Class Variation Reduction Using Training Expression Images for Sparse Representation Based Facial Expression Recognition.
IEEE Trans. Affect. Comput., 2014
Adaptive weighted fusion with new spatial and temporal fingerprints for improved video copy detection.
Signal Process. Image Commun., 2014
Multim. Tools Appl., 2014
J. Vis. Commun. Image Represent., 2014
IEEE J. Sel. Areas Commun., 2014
Experimental investigation of discomfort combination: toward visual discomfort prediction for stereoscopic videos.
J. Electronic Imaging, 2014
Int. J. Imaging Syst. Technol., 2014
High resolution image formation method based on the realistic spaceborne SAR modeling and simulation.
Proceedings of the SAR Image Analysis, 2014
Level-set based free fluid segmentation with improved initialization using region growing in 3D ultrasound sonography.
Proceedings of the Medical Imaging 2014: Computer-Aided Diagnosis, San Diego, 2014
Investigating Cascaded Face Quality Assessment for Practical Face Recognition System.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Inter-view consistent hole filling in view extrapolation for multi-view image generation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Adaptive feature extraction for blurred face images in facial expression recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
Generation of conspicuity-improved synthetic image from digital breast tomosynthesis.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
Local disparity remapping to enhance depth quality of stereoscopic 3D images using stereoacuity function.
Proceedings of the Stereoscopic Displays and Applications XXV, 2014
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014
Mass detection based on pooled mass probability map of 3D reconstructed slices in digital breast tomosynthesis.
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014
Breast tissue removal for enhancing microcalcification cluster detection in mammograms.
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014
2013
Visual Importance- and Discomfort Region-Selective Low-Pass Filtering for Reducing Visual Discomfort in Stereoscopic Displays.
IEEE Trans. Circuits Syst. Video Technol., 2013
IEEE Trans. Circuits Syst. Video Technol., 2013
Predicting Visual Discomfort Using Object Size and Disparity Information in Stereoscopic Images.
IEEE Trans. Broadcast., 2013
Effect of Stimulus Width on the Perceived Visual Discomfort in Viewing Stereoscopic 3-D-TV.
IEEE Trans. Broadcast., 2013
J. Commun. Networks, 2013
Expert Syst. Appl., 2013
Improving positive predictive value in computer-aided diagnosis using mammographic mass and microcalcification confidence score fusion based on co-location information.
Proceedings of the Medical Imaging 2013: Computer-Aided Diagnosis, 2013
Boosting framework for mammographic mass classification with combination of CC and MLO view information.
Proceedings of the Medical Imaging 2013: Computer-Aided Diagnosis, 2013
Improved License Plate Recognition for Low-Resolution CCTV Forensics by Integrating Sparse Representation-Based Super-Resolution.
Proceedings of the Digital-Forensics and Watermarking - 12th International Workshop, 2013
Crosstalk reduction in stereoscopic displays: A combined approach of disparity adjustment and crosstalk cancellation.
Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013
Sparse Representation-Based Human Action Recognition Using an Action Region-Aware Dictionary.
Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013
Subjective assessment of visual discomfort induced by binocular disparity and stimulus width in stereoscopic image.
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013
Subjective and objective measurements of visual fatigue induced by excessive disparities in stereoscopic images.
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013
Proceedings of the Computational Color Imaging - 4th International Workshop, 2013
2012
Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2012
Local Color Vector Binary Patterns From Multichannel Face Images for Face Recognition.
IEEE Trans. Image Process., 2012
IEEE Trans. Image Process., 2012
Near-Duplicate Video Clip Detection Using Model-Free Semantic Concept Detection and Adaptive Semantic Distance Measurement.
IEEE Trans. Circuits Syst. Video Technol., 2012
IEEE Trans. Consumer Electron., 2012
Visual comfort assessment metric based on salient object motion information in stereoscopic video.
J. Electronic Imaging, 2012
Towards data-driven estimation of image tag relevance using visually similar and dissimilar folksonomy images.
Proceedings of the 2012 international workshop on Socially-aware multimedia, 2012
Mammographic enhancement with combining local statistical measures and sliding band filter for improved mass segmentation in mammograms.
Proceedings of the Medical Imaging 2012: Computer-Aided Diagnosis, San Diego, 2012
Multiresolution Local Binary Pattern texture analysis for false positive reduction in computerized detection of breast masses on mammograms.
Proceedings of the Medical Imaging 2012: Computer-Aided Diagnosis, San Diego, 2012
Proceedings of the Digital Forensics and Watermarking - 11th International Workshop, 2012
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012
A novel mammographic mass detection approach to combining supervised and unsupervised detection algorithms.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Region based stellate features for classification of mammographic spiculated lesions in computer-aided detection.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Combining multiple feature representations and AdaBoost ensemble learning for reducing false-positive detections in Computer-aided Detection of masses on mammograms.
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012
Investigation of object thickness for visual discomfort prediction in stereoscopic images.
Proceedings of the Stereoscopic Displays and Applications XXIII, 2012
Combining multiresolution local binary pattern texture analysis and variable selection strategy applied to computer-aided detection of breast masses on mammograms.
Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics, 2012
2011
Collaborative Face Recognition for Improved Face Annotation in Personal Photo Collections Shared on Online Social Networks.
IEEE Trans. Multim., 2011
IEEE Trans. Image Process., 2011
Privacy Protection in Video Surveillance Systems: Analysis of Subband-Adaptive Scrambling in JPEG XR.
IEEE Trans. Circuits Syst. Video Technol., 2011
Bimodal fusion of low-level visual features and high-level semantic features for near-duplicate video clip detection.
Signal Process. Image Commun., 2011
A comparative study of preprocessing mismatch effects in color image based face recognition.
Pattern Recognit., 2011
IEICE Trans. Inf. Syst., 2011
Contribution of Non-scrambled Chroma Information in Privacy-Protected Face Images to Privacy Leakage.
Proceedings of the Digital Forensics and Watermarking - 10th International Workshop, 2011
Leveraging an image folksonomy and the Signature Quadratic Form Distance for semantic-based detection of near-duplicate video clips.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
Towards a better understanding of model-free semantic concept detection for annotation and near-duplicate video clip detection.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Enhanced classification of focal hepatic lesions in ultrasound images using novel texture features.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Proceedings of the 17th International Conference on Digital Signal Processing, 2011
Proceedings of the 17th International Conference on Digital Signal Processing, 2011
Proceedings of the Stereoscopic Displays and Applications XXII, 2011
2010
Privacy Enhancing Solutions for Personal Information Based Multimedia Content Sharing.
Proceedings of the Intelligent Multimedia Analysis for Security Applications, 2010
Automatic Face Annotation in Personal Photo Collections Using Context-Based Unsupervised Clustering and Face Information Fusion.
IEEE Trans. Circuits Syst. Video Technol., 2010
Towards an automatic face indexing system for actor-based video services in an IPTV environment.
IEEE Trans. Consumer Electron., 2010
IEEE Trans. Broadcast., 2010
Tag refinement in an image folksonomy using visual similarity and tag co-occurrence statistics.
Signal Process. Image Commun., 2010
Pattern Recognit. Lett., 2010
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010
Training Strategy of Semantic Concept Detectors Using Support Vector Machine in Naked Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010
Proceedings of the 4th International Conference on Multimedia and Ubiquitous Engineering, 2010
Semantic Concept Detection for User-Generated Video Content Using a Refined Image Folksonomy.
Proceedings of the Advances in Multimedia Modeling, 2010
Proceedings of the Digital Watermarking - 9th International Workshop, 2010
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Image tag refinement along the 'what' dimension using tag categorization and neighbor voting.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Face annotation for online personal videos using color feature fusion based face recognition.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Exploiting collective knowledge in an image folksonomy for semantic-based near-duplicate video detection.
Proceedings of the International Conference on Image Processing, 2010
Proceedings of the International Conference on Image Processing, 2010
Proceedings of the FUZZ-IEEE 2010, 2010
Color component feature selection in feature-level fusion based color face recognition.
Proceedings of the FUZZ-IEEE 2010, 2010
Proceedings of the Advances in Web Technologies and Applications, 2010
2009
IEEE Trans. Syst. Man Cybern. Part B, 2009
Content sharing between home networks by using personal information and associated fuzzy vault scheme.
IEEE Trans. Consumer Electron., 2009
Signal Process. Image Commun., 2009
IEICE Trans. Inf. Syst., 2009
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Proceedings of the Digital Watermarking, 8th International Workshop, 2009
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009
Proceedings of the Image Processing: Algorithms and Systems VII, 2009
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, 2009
Proceedings of the International Conference on Image Processing, 2009
Proceedings of the International Conference on Image Processing, 2009
Face annotation for personal photos using collaborative face recognition in online social networks.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009
Proceedings of the Handbook of Multimedia for Digital Entertainment and Arts, 2009
2008
Multim. Tools Appl., 2008
IEICE Trans. Commun., 2008
Quantification and Standardized Description of Color Vision Deficiency Caused by Anomalous Trichromats - Part II: Modeling and Color Compensation.
EURASIP J. Image Video Process., 2008
Quantification and Standardized Description of Color Vision Deficiency Caused by Anomalous Trichromats - Part I: Simulation and Measurement.
EURASIP J. Image Video Process., 2008
Adv. Multim., 2008
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008
Proceedings of the Digital Watermarking, 7th International Workshop, 2008
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008
Proceedings of the Image Processing: Algorithms and Systems VI, 2008
Improving visual content accessibility for low-vision users in the MPEG-21 multimedia framework.
Proceedings of the Human Vision and Electronic Imaging XIII, 2008
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008
2007
Inf. Process. Manag., 2007
IEICE Trans. Inf. Syst., 2007
IEICE Trans. Inf. Syst., 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Scalable Security and Conditional Access Control for Multiple Regions of Interest in Scalable Video Coding.
Proceedings of the Digital Watermarking, 6th International Workshop, 2007
Proceedings of the International Conference on Wireless Communications and Mobile Computing, 2007
Advertisement Insertion based on MPEG-4 File Format and MPEG-21 DID.
Proceedings of the 2007 International Conference on Image Processing, 2007
Perceived Image Contrast Measurement using Multi-scale Adaptation and Spatial Vision Characteristics.
Proceedings of the 2007 International Conference on Image Processing, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the Human Vision and Electronic Imaging XII, San Jose, CA, USA, January 29, 2007
2006
Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment.
IEICE Trans. Inf. Syst., 2006
A Microcalcification Detection Using Adaptive Contrast Enhancement on Wavelet Transform and Neural Network.
IEICE Trans. Inf. Syst., 2006
Intelligent broadcasting system and services for personalized semantic contents consumption.
Expert Syst. Appl., 2006
Proceedings of the 1st International Workshop on Semantic-Enhanced Multimedia Presentation Systems (SEMPS-2006), 2006
Proceedings of the 1st International Workshop on Semantic-Enhanced Multimedia Presentation Systems (SEMPS-2006), 2006
Proceedings of the Digital Watermarking, 5th International Workshop, 2006
Conditional Access Control in Secured SVC Bitstream.
Proceedings of the 2006 International Conference on Image Processing, 2006
Multimedia Packaging for TVAF-based Broadcasting Contents.
Proceedings of the 2006 International Conference on Image Processing, 2006
2005
IEEE Trans. Multim., 2005
Signal Process. Image Commun., 2005
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2005, 2005
Proceedings of the Advances in Multimedia Information Processing, 2005
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005
MPEG-21 Based Broadcasting Contents Service System in Ubiquitous Environment.
Proceedings of The 2005 International Conference on Imaging Science, 2005
Microcalcification Detection System for Computer Aided Diagnosis.
Proceedings of The 2005 International Conference on Imaging Science, 2005
Proceedings of the Information Retrieval Technology, 2005
2004
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
Proceedings of the Digital Watermarking, Third International Workshop, 2004
Proceedings of the 2004 International Conference on Image Processing, 2004
Joint control for hybrid transcoding using multidimensional rate distortion modeling.
Proceedings of the 2004 International Conference on Image Processing, 2004
Proceedings of the Computational Science and Its Applications, 2004
Proceedings of the Computational Science, 2004
Proceedings of the Human Vision and Electronic Imaging IX, 2004
Proceedings of the Image and Video Retrieval: Third International Conference, 2004
Semantic Event Detection in Sports Video Using Hidden Markov Model.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004
Intelligent Agent-Based Systems for Personalized Broadcasting Services.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004
Watermarking System for Contents Adaptation in Ubiquitous Environment.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004
2003
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003
Proceedings of the Signal and Image Processing (SIP 2003), 2003
Proceedings of the Digital Watermarking, Second International Workshop, 2003
Proceedings of the Digital Watermarking, Second International Workshop, 2003
Proceedings of the Image Processing: Algorithms and Systems II, 2003
Proceedings of the 2003 International Conference on Image Processing, 2003
Proceedings of the Human Vision and Electronic Imaging VIII, 2003
Modality Conversion in Content Adaptation for Universal Multimedia Access.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003
Soccer Video Summarization System Based on Hidden Markov Model with Multiple MPEG-7 Descriptors.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003
Visual Media Adaptation System for Active Media.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003
2002
Proceedings of the Digital Watermarking, First International Workshop, 2002
Proceedings of the Image Processing: Algorithms and Systems, 2002
2001
A Metadata Repository System for an Efficient Description of Visual Multimedia Documents.
Concurr. Eng. Res. Appl., 2001
Proceedings of the Security and Watermarking of Multimedia Contents III, 2001
Proceedings of the Security and Watermarking of Multimedia Contents III, 2001
Proceedings of the Computer Analysis of Images and Patterns, 9th International Conference, 2001
2000
Proceedings of the 2000 International Conference on Image Processing, 2000
1999
Proceedings of the Medical Imaging 1999: Image Processing, 1999
Proceedings of the 1999 International Conference on Image Processing, 1999
1998
Proceedings of the Medical Imaging 1998: Image Processing, 1998
1995
Susceptibility effect-enhanced functional MR imaging using tailored RF gradient echo (TRFGE) sequence.
Int. J. Imaging Syst. Technol., 1995