Yong Man Ro

Orcid: 0000-0001-5306-6853

Affiliations:
  • Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Image and Video Systems Lab, Daejeon, South Korea
  • Information and Communications University, Yusong, South Korea (former)
  • Korea Advanced Institute of Science and Technology, Daejeon, South Korea (PhD 1992)


According to our database, Yong Man Ro authored at least 386 papers between 1995 and 2024.

Bibliography

2024
Advancing Adversarial Training by Injecting Booster Signal.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Integrating Language-Derived Appearance Elements With Visual Cues in Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model.
IEEE Trans. Multim., 2024

Defending Video Recognition Model Against Adversarial Perturbations via Defense Patterns.
IEEE Trans. Dependable Secur. Comput., 2024

Textless Unit-to-Unit Training for Many-to-Many Multilingual Speech-to-Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Robust pedestrian detection via constructing versatile pedestrian knowledge bank.
Pattern Recognit., 2024

Text-guided distillation learning to diversify video embeddings for text-video retrieval.
Pattern Recognit., 2024

Phantom of Latent for Large Language and Vision Models.
CoRR, 2024

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
CoRR, 2024

SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models.
CoRR, 2024

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models.
CoRR, 2024

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models.
CoRR, 2024

MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection.
CoRR, 2024

What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models.
CoRR, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024

Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units.
CoRR, 2024

Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Visual Speech Recognition for Languages with Limited Labeled Data Using Automatic Labels from Whisper.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Phonetic Context-Aware Lip-Sync for Talking Face Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Hyperspectral Skin Vision Challenge: Can Your Camera See Beyond Your Skin?
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens.
Proceedings of the IEEE International Conference on Acoustics, 2024

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TroL: Traversal of Layers for Large Language and Vision Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MoAI: Mixture of All Intelligence for Large Language and Vision Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CoLLaVO: Crayon Large Language and Vision mOdel.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Adversarial anchor-guided feature refinement for adversarial defense.
Image Vis. Comput., August, 2023

Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection.
IEEE Trans. Image Process., 2023

Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning.
IEEE Trans. Inf. Forensics Secur., 2023

Incorporating Language-Driven Appearance Knowledge Units with Visual Cues in Pedestrian Detection.
CoRR, 2023

Causal Unsupervised Semantic Segmentation.
CoRR, 2023

DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion.
CoRR, 2023

Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model.
CoRR, 2023

Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation.
CoRR, 2023

Reprogramming Audio-driven Talking Face Synthesis into Text-driven.
CoRR, 2023

Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation.
CoRR, 2023

Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition.
CoRR, 2023

Intelligible Lip-to-Speech Synthesis with Speech Units.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Multispectral Pedestrian Detection Via Spectral Position-Free Feature Mapping.
Proceedings of the IEEE International Conference on Image Processing, 2023

Mitigating Dataset Bias in Image Captioning Through CLIP Confounder-Free Captioning Network.
Proceedings of the IEEE International Conference on Image Processing, 2023

Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Temporal Lip-Audio Memory for Visual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Similarity Relation Preserving Cross-Modal Learning for Multispectral Pedestrian Detection Against Adversarial Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lip-to-Speech Synthesis in the Wild with Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multispectral Invisible Coating: Laminated Visible-Thermal Physical Attack against Multispectral Object Detectors Using Transparent Low-E Films.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition.
IEEE Trans. Multim., 2022

Defending Person Detection Against Adversarial Patch Attack by Using Universal Defensive Frame.
IEEE Trans. Image Process., 2022

Robust Perturbation for Visual Explanation: Cross-Checking Mask Optimization to Avoid Class Distortion.
IEEE Trans. Image Process., 2022

Assessing Individual VR Sickness Through Deep Feature Fusion of VR Video and Physiological Response.
IEEE Trans. Circuits Syst. Video Technol., 2022

Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment.
IEEE Trans. Biom. Behav. Identity Sci., 2022

On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics.
IEEE Trans. Affect. Comput., 2022

Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks.
CoRR, 2022

Defending Against Person Hiding Adversarial Patch Attack with a Universal White Frame.
CoRR, 2022

IVIST: Interactive Video Search Tool in VBS 2022.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Defending Physical Adversarial Attack on Object Detection via Adversarial Patch-Feature Energy.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Robust Thermal Infrared Pedestrian Detection By Associating Visible Pedestrian Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2022

MAP: Multispectral Adversarial Patch to Attack Person Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment.
Proceedings of the Computer Vision - ECCV 2022, 2022

Speaker-Adaptive Lip Reading with User-Dependent Padding.
Proceedings of the Computer Vision - ECCV 2022, 2022

VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Robust Video Frame Interpolation With Exceptional Motion Map.
IEEE Trans. Circuits Syst. Video Technol., 2021

CUA Loss: Class Uncertainty-Aware Gradient Modulation for Robust Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2021

Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding.
IEEE Access, 2021

Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Lip to Speech Synthesis with Visual Context Attentional GAN.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

IVIST: Interactive Video Search Tool in VBS 2021.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Adversarially Robust Multi-Sensor Fusion Model Training Via Random Feature Fusion For Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Robust Decision-Based Black-Box Adversarial Attack via Coarse-To-Fine Random Search.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Interpretation of Lesional Detection via Counterfactual Generation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

M-CAM: Visual Explanation of Challenging Conditioned Dataset with Bias-reducing Memory.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
BMAN: Bidirectional Multi-Scale Aggregation Networks for Abnormal Event Detection.
IEEE Trans. Image Process., 2020

MCSIP Net: Multichannel Satellite Image Prediction via Deep Neural Network.
IEEE Trans. Geosci. Remote. Sens., 2020

Lightweight and Effective Facial Landmark Detection using Adversarial Learning with Face Geometric Map Generative Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Deep Virtual Reality Image Quality Assessment With Human Perception Guider for Omnidirectional Image.
IEEE Trans. Circuits Syst. Video Technol., 2020

BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2020

Encoding features robust to unseen modes of variation with attentive long short-term memory.
Pattern Recognit., 2020

Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers.
Image Vis. Comput., 2020

Investigating Vulnerability to Adversarial Examples on Multimodal Data Fusion in Deep Learning.
CoRR, 2020

Efficient Ensemble Model Generation for Uncertainty Estimation with Bayesian Approximation in Segmentation.
CoRR, 2020

Dual-Branch Structured De-Striping Convolution Network Using Parametric Noise Model.
IEEE Access, 2020

IVIST: Interactive VIdeo Search Tool in VBS 2020.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Towards Human-Like Interpretable Object Detection Via Spatial Relation Encoding.
Proceedings of the IEEE International Conference on Image Processing, 2020

Estimating VR Sickness Caused By Camera Shake in VR Videography.
Proceedings of the IEEE International Conference on Image Processing, 2020

Robust Video Facial Authentication With Unsupervised Mode Disentanglement.
Proceedings of the IEEE International Conference on Image Processing, 2020

Revisiting Role of Autoencoders in Adversarial Settings.
Proceedings of the IEEE International Conference on Image Processing, 2020

Class Incremental Learning With Task-Selection.
Proceedings of the IEEE International Conference on Image Processing, 2020

Learning Style Correlation for Elaborate Few-Shot Classification.
Proceedings of the IEEE International Conference on Image Processing, 2020

Comprehensive Facial Expression Synthesis Using Human-Interpretable Language.
Proceedings of the IEEE International Conference on Image Processing, 2020

Fake Video Detection With Certainty-Based Attention Network.
Proceedings of the IEEE International Conference on Image Processing, 2020

Video Frame Interpolation Via Exceptional Motion-Aware Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards High-Performance Object Detection: Task-Specific Design Considering Classification and Localization Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-Based Symptom Relation Embedding.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structure Boundary Preserving Segmentation for Medical Image With Ambiguous Boundary.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Ensemble Model Training via Random Layer Sampling Against Adversarial Attack.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
VRSA Net: VR Sickness Assessment Considering Exceptional Motion for 360° VR Video.
IEEE Trans. Image Process., 2019

Attended Relation Feature Representation of Facial Dynamics for Facial Authentication.
IEEE Trans. Inf. Forensics Secur., 2019

Binocular Fusion Net: Deep Learning Visual Comfort Assessment for Stereoscopic 3D.
IEEE Trans. Circuits Syst. Video Technol., 2019

Multi-Objective Based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition.
IEEE Trans. Affect. Comput., 2019

Implementation of multimodal biometric recognition via multi-feature deep learning networks and feature fusion.
Multim. Tools Appl., 2019

Photo-Realistic Facial Emotion Synthesis Using Multi-level Critic Networks with Multi-level Generative Model.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Region-guided adversarial learning for anatomical landmark detection in uterus ultrasound image.
Proceedings of the Medical Imaging 2019: Image Processing, 2019

Generation of Multimodal Justification Using Visual Word Constraint Model for Explainable Computer-Aided Diagnosis.
Proceedings of the Interpretability of Machine Intelligence in Medical Image Computing and Multimodal Learning for Clinical Decision Support, 2019

Realistic Breast Mass Generation Through BIRADS Category.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Visual evidence for interpreting diagnostic decision of deep neural network in computer-aided diagnosis.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

Generative Guiding Block: Synthesizing Realistic Looking Variants Capable of Even Large Change Demands.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Probenet: Probing Deep Networks.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Physiological Fusion Net: Quantifying Individual VR Sickness with Content Stimulus and Physiological Response.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Attentive Layer Separation for Object Classification and Object Localization in Object Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Deep Objective Assessment Model Based on Spatio-Temporal Perception of 360-Degree Video for VR Sickness Prediction.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Building a Breast-Sentence Dataset: Its Usefulness for Computer-Aided Diagnosis.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Mode Variational LSTM Robust to Unseen Modes of Variation: Application to Facial Expression Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
VR IQA NET: Deep Virtual Reality Image Quality Assessment using Adversarial Learning.
CoRR, 2018

Convolution with Logarithmic Filter Groups for Efficient Shallow CNN.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Teacher and Student Joint Learning for Compact Facial Landmark Detection Network.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Session details: Demo + Video + Makers' Program.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ICADx: interpretable computer aided diagnosis of breast masses.
Proceedings of the Medical Imaging 2018: Computer-Aided Diagnosis, 2018

Adversarial Spatial Frequency Domain Critic Learning for Age and Gender Classification.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Object Bounding Box-Critic Networks for Occlusion-Robust Object Detection in Road Scene.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

STAN: Spatio-Temporal Adversarial Networks for Abnormal Event Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Fast Recognition of Human Actions Using Autocorrelation Sequence.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

Feature2Mass: Visual Feature Processing in Latent Space for Realistic Labeled Mass Generation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Facial Dynamics Interpreter Network: What Are the Important Relations Between Local Dynamics for Facial Trait Estimation?
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Spatio-Temporal Features With Partial Expression Sequences for On-the-Fly Prediction.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Multiview Stereoscopic Video Hole Filling Considering Spatiotemporal Consistency and Binocular Symmetry for Synthesized 3D Video.
IEEE Trans. Circuits Syst. Video Technol., 2017

Effective and efficient human action recognition using dynamic frame skipping and trajectory rejection.
Image Vis. Comput., 2017

Dynamics Transfer GAN: Generating Video by Transferring Arbitrary Temporal Dynamics from a Source Video to a Single Target Image.
CoRR, 2017

Interpretable Facial Relational Network Using Relational Importance.
CoRR, 2017

Differential Generative Adversarial Networks: Synthesizing Non-linear Facial Variations with Limited Number of Training Data.
CoRR, 2017

EvaluationNet: Can Human Skill be Evaluated by Deep Networks?
CoRR, 2017

Measurement of exceptional motion in VR video contents for VR sickness assessment using deep convolutional autoencoder.
Proceedings of the 23rd ACM Symposium on Virtual Reality Software and Technology, 2017

Learning Features Robust to Image Variations with Siamese Networks for Facial Expression Recognition.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network.
Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Adaptive attention fusion network for visual question answering.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Visual comfort assessment of stereoscopic images using deep visual and disparity features based on human attention.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Color channel-wise recurrent learning for facial expression recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Iterative deep convolutional encoder-decoder network for medical image segmentation.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

Modality-bridge transfer learning for medical image classification.
Proceedings of the 10th International Congress on Image and Signal Processing, 2017

Multi-Scale Facial Scanning via Spatial LSTM for Latent Facial Feature Representation.
Proceedings of the International Conference of the Biometrics Special Interest Group, 2017

2016
Critical Binocular Asymmetry Measure for the Perceptual Quality Assessment of Synthesized Stereo 3D Images in View Synthesis.
IEEE Trans. Circuits Syst. Video Technol., 2016

Partial Matching of Facial Expression Sequence Using Over-Complete Transition Dictionary for Emotion Recognition.
IEEE Trans. Affect. Comput., 2016

Collaborative expression representation using peak expression and intra class variation face images for practical subject-independent emotion recognition in videos.
Pattern Recognit., 2016

Feature scalability for a low complexity face recognition with unconstrained spatial resolution.
Multim. Tools Appl., 2016

Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography.
Expert Syst. Appl., 2016

Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Collaborative facial color feature learning of multiple color spaces for face recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Measurement of critical temporal inconsistency for quality assessment of synthesized video.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Spatio-temporal representation for face authentication by using multi-task learning with human attributes.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

A deep facial landmarks detection with facial contour and facial components constraint.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Latent feature representation with 3-D multi-view deep convolutional neural network for bilateral analysis in digital breast tomosynthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A new hole filling method based on 3D geometric transformation for synthesized image.
Proceedings of the Stereoscopic Displays and Applications XXVII, 2016

Learning based hole filling method using deep convolutional neural network for view synthesis.
Proceedings of the Image Processing: Machine Vision Applications IX, 2016

Two-step Learning of Deep Convolutional Neural Network for Discriminative Face Recognition under Varying Illumination.
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2016, 2016

Real-time quality evaluation of adaptation strategies in VoD streaming.
Proceedings of the Digital Media Industry & Academic Forum, 2016

Facial dynamic modelling using long short-term memory network: Analysis and application to face authentication.
Proceedings of the 8th IEEE International Conference on Biometrics Theory, 2016

Bilateral hemiface feature representation learning for pose robust facial expression recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Image-based coin recognition using rotation-invariant region binary patterns based on gradient magnitudes.
J. Vis. Commun. Image Represent., 2015

Region based stellate features combined with variable selection using AdaBoost learning in mammographic computer-aided detection.
Comput. Biol. Medicine, 2015

High-Speed Periodic Motion Reconstruction Using an Off-the-shelf Camera with Compensation for Rolling Shutter Effect.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A Sparse Representation-Based Label Pruning for Image Inpainting Using Global Optimization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Subtle Facial Expression Recognition Using Adaptive Magnification of Discriminative Facial Motion.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Feature extraction from inter-view similarity of DBT projection views.
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015

Combination of conspicuity improved synthetic mammograms and digital breast tomosynthesis: a promising approach for mass detection.
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015

Utilizing digital breast tomosynthesis projection views correlation for microcalcification enhancement for detection purposes.
Proceedings of the Medical Imaging 2015: Computer-Aided Diagnosis, 2015

Pose-Robust and Discriminative Feature Representation by Multi-task Deep Learning for Multi-view Face Recognition.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Multispectral Texture Features from Visible and Near-Infrared Synthetic Face Images for Face Recognition.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Human action recognition using time-invariant key-trajectories describing spatio-temporal salient motion.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Temporally consistent hole filling method based on global optimization with label propagation for 3D video.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Face image assessment learned with objective and relative face image qualities for improved face recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Region matching based on local structure information in ipsilateral digital breast tomosynthesis views.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Feature extraction from bilateral dissimilarity in digital breast tomosynthesis reconstructed volume.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Efficient and effective human action recognition in video through motion boundary description with a compact set of trajectories.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Multi-view stereo image synthesis using binocular symmetry-based global optimization.
Proceedings of the Stereoscopic Displays and Applications XXVI, 2015

2014
Visual Comfort Amelioration Technique for Stereoscopic Images: Disparity Remapping to Mitigate Global and Local Discomfort Causes.
IEEE Trans. Circuits Syst. Video Technol., 2014

Visual comfort improvement in stereoscopic 3D displays using perceptually plausible assessment metric of visual comfort.
IEEE Trans. Consumer Electron., 2014

Intra-Class Variation Reduction Using Training Expression Images for Sparse Representation Based Facial Expression Recognition.
IEEE Trans. Affect. Comput., 2014

Adaptive weighted fusion with new spatial and temporal fingerprints for improved video copy detection.
Signal Process. Image Commun., 2014

Visually weighted neighbor voting for image tag relevance learning.
Multim. Tools Appl., 2014

Rotation and flipping robust region binary patterns for video copy detection.
J. Vis. Commun. Image Represent., 2014

An Evaluation of Bitrate Adaptation Methods for HTTP Live Streaming.
IEEE J. Sel. Areas Commun., 2014

Experimental investigation of discomfort combination: toward visual discomfort prediction for stereoscopic videos.
J. Electronic Imaging, 2014

fMRI analysis of excessive binocular disparity on the human brain.
Int. J. Imaging Syst. Technol., 2014

High resolution image formation method based on the realistic spaceborne SAR modeling and simulation.
Proceedings of the SAR Image Analysis, 2014

Level-set based free fluid segmentation with improved initialization using region growing in 3D ultrasound sonography.
Proceedings of the Medical Imaging 2014: Computer-Aided Diagnosis, San Diego, 2014

Investigating Cascaded Face Quality Assessment for Practical Face Recognition System.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Inter-view consistent hole filling in view extrapolation for multi-view image generation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Local age group modeling in unconstrained face images for facial age classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Adaptive feature extraction for blurred face images in facial expression recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Face detection for low power event detection in intelligent surveillance system.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Generation of conspicuity-improved synthetic image from digital breast tomosynthesis.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Investigating experienced quality factors in synthesized multi-view stereo images.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Local disparity remapping to enhance depth quality of stereoscopic 3D images using stereoacuity function.
Proceedings of the Stereoscopic Displays and Applications XXV, 2014

Improvement of subtle microcalcifications detection in DBT slices.
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014

Mass detection based on pooled mass probability map of 3D reconstructed slices in digital breast tomosynthesis.
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014

Breast tissue removal for enhancing microcalcification cluster detection in mammograms.
Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics, 2014

2013
Visual Importance- and Discomfort Region-Selective Low-Pass Filtering for Reducing Visual Discomfort in Stereoscopic Displays.
IEEE Trans. Circuits Syst. Video Technol., 2013

Predicting Visual Discomfort of Stereoscopic Images Using Human Attention Model.
IEEE Trans. Circuits Syst. Video Technol., 2013

Predicting Visual Discomfort Using Object Size and Disparity Information in Stereoscopic Images.
IEEE Trans. Broadcast., 2013

Effect of Stimulus Width on the Perceived Visual Discomfort in Viewing Stereoscopic 3-D-TV.
IEEE Trans. Broadcast., 2013

Adaptive video streaming over HTTP with dynamic resource estimation.
J. Commun. Networks, 2013

Multiple ROI selection based focal liver lesion classification in ultrasound images.
Expert Syst. Appl., 2013

Improving positive predictive value in computer-aided diagnosis using mammographic mass and microcalcification confidence score fusion based on co-location information.
Proceedings of the Medical Imaging 2013: Computer-Aided Diagnosis, 2013

Boosting framework for mammographic mass classification with combination of CC and MLO view information.
Proceedings of the Medical Imaging 2013: Computer-Aided Diagnosis, 2013

Improved License Plate Recognition for Low-Resolution CCTV Forensics by Integrating Sparse Representation-Based Super-Resolution.
Proceedings of the Digital-Forensics and Watermarking - 12th International Workshop, 2013

Crosstalk reduction in stereoscopic displays: A combined approach of disparity adjustment and crosstalk cancellation.
Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Sparse Representation-Based Human Action Recognition Using an Action Region-Aware Dictionary.
Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013

Using color texture sparsity for facial expression recognition.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Denoising 3D ultrasound volumes using sparse representation.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Disparity remapping to ameliorate visual comfort of stereoscopic video.
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013

Subjective assessment of visual discomfort induced by binocular disparity and stimulus width in stereoscopic image.
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013

Subjective and objective measurements of visual fatigue induced by excessive disparities in stereoscopic images.
Proceedings of the Stereoscopic Displays and Applications XXIV, 2013

A Comparative Study of Color Texture Features for Face Analysis.
Proceedings of the Computational Color Imaging - 4th International Workshop, 2013

2012
Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Local Color Vector Binary Patterns From Multichannel Face Images for Face Recognition.
IEEE Trans. Image Process., 2012

Color Local Texture Features for Color Face Recognition.
IEEE Trans. Image Process., 2012

Near-Duplicate Video Clip Detection Using Model-Free Semantic Concept Detection and Adaptive Semantic Distance Measurement.
IEEE Trans. Circuits Syst. Video Technol., 2012

Visual discomfort visualizer using stereo vision and time-of-flight depth cameras.
IEEE Trans. Consumer Electron., 2012

Visual comfort assessment metric based on salient object motion information in stereoscopic video.
J. Electronic Imaging, 2012

Towards data-driven estimation of image tag relevance using visually similar and dissimilar folksonomy images.
Proceedings of the 2012 international workshop on Socially-aware multimedia, 2012

Mammographic enhancement with combining local statistical measures and sliding band filter for improved mass segmentation in mammograms.
Proceedings of the Medical Imaging 2012: Computer-Aided Diagnosis, San Diego, 2012

Multiresolution Local Binary Pattern texture analysis for false positive reduction in computerized detection of breast masses on mammograms.
Proceedings of the Medical Imaging 2012: Computer-Aided Diagnosis, San Diego, 2012

Face Verification Using Color Sparse Representation.
Proceedings of the Digital Forensics and Watermarking - 11th International Workshop, 2012

Visualizing the Perceived Discomfort of Stereoscopic Video.
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012

Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

A novel mammographic mass detection approach to combining supervised and unsupervised detection algorithms.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Region based stellate features for classification of mammographic spiculated lesions in computer-aided detection.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Combining multiple feature representations and AdaBoost ensemble learning for reducing false-positive detections in Computer-aided Detection of masses on mammograms.
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012

Investigation of object thickness for visual discomfort prediction in stereoscopic images.
Proceedings of the Stereoscopic Displays and Applications XXIII, 2012

Combining multiresolution local binary pattern texture analysis and variable selection strategy applied to computer-aided detection of breast masses on mammograms.
Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics, 2012

2011
Collaborative Face Recognition for Improved Face Annotation in Personal Photo Collections Shared on Online Social Networks.
IEEE Trans. Multim., 2011

Boosting Color Feature Selection for Color Face Recognition.
IEEE Trans. Image Process., 2011

Privacy Protection in Video Surveillance Systems: Analysis of Subband-Adaptive Scrambling in JPEG XR.
IEEE Trans. Circuits Syst. Video Technol., 2011

Bimodal fusion of low-level visual features and high-level semantic features for near-duplicate video clip detection.
Signal Process. Image Commun., 2011

A comparative study of preprocessing mismatch effects in color image based face recognition.
Pattern Recognit., 2011

Enhanced Distal Radius Segmentation in DXA Using Modified ASM.
IEICE Trans. Inf. Syst., 2011

Contribution of Non-scrambled Chroma Information in Privacy-Protected Face Images to Privacy Leakage.
Proceedings of the Digital Forensics and Watermarking - 10th International Workshop, 2011

Leveraging an image folksonomy and the Signature Quadratic Form Distance for semantic-based detection of near-duplicate video clips.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Towards a better understanding of model-free semantic concept detection for annotation and near-duplicate video clip detection.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Enhanced classification of focal hepatic lesions in ultrasound images using novel texture features.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Local color vector binary pattern for face recognition.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Improving image tag recommendation using favorite image context.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Attention model-based visual comfort assessment for stereoscopic depth perception.
Proceedings of the 17th International Conference on Digital Signal Processing, 2011

Human brain response to visual fatigue caused by stereoscopic depth perception.
Proceedings of the 17th International Conference on Digital Signal Processing, 2011

Visual discomfort induced by fast salient object motion in stereoscopic video.
Proceedings of the Stereoscopic Displays and Applications XXII, 2011

2010
Privacy Enhancing Solutions for Personal Information Based Multimedia Content Sharing.
Proceedings of the Intelligent Multimedia Analysis for Security Applications, 2010

Automatic Face Annotation in Personal Photo Collections Using Context-Based Unsupervised Clustering and Face Information Fusion.
IEEE Trans. Circuits Syst. Video Technol., 2010

Towards an automatic face indexing system for actor-based video services in an IPTV environment.
IEEE Trans. Consumer Electron., 2010

Full-Reference Video Quality Metric for Fully Scalable and Mobile SVC Content.
IEEE Trans. Broadcast., 2010

Tag refinement in an image folksonomy using visual similarity and tag co-occurrence statistics.
Signal Process. Image Commun., 2010

MAP-based image tag recommendation using a visual folksonomy.
Pattern Recognit. Lett., 2010

Privacy-Preserving Watch List Screening in Video Surveillance System.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Training Strategy of Semantic Concept Detectors Using Support Vector Machine in Naked Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Semantic Detection of Adult Image Using Semantic Features.
Proceedings of the 4th International Conference on Multimedia and Ubiquitous Engineering, 2010

Semantic Concept Detection for User-Generated Video Content Using a Refined Image Folksonomy.
Proceedings of the Advances in Multimedia Modeling, 2010

Privacy Preserving Facial and Fingerprint Multi-biometric Authentication.
Proceedings of the Digital Watermarking - 9th International Workshop, 2010

Towards using semantic features for near-duplicate video detection.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Image tag refinement along the 'what' dimension using tag categorization and neighbor voting.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Face annotation for online personal videos using color feature fusion based face recognition.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Exploiting collective knowledge in an image folksonomy for semantic-based near-duplicate video detection.
Proceedings of the International Conference on Image Processing, 2010

Using colour local binary pattern features for face recognition.
Proceedings of the International Conference on Image Processing, 2010

Enhanced weakly trained frontal face detector for surveillance purposes.
Proceedings of the FUZZ-IEEE 2010, 2010

Color component feature selection in feature-level fusion based color face recognition.
Proceedings of the FUZZ-IEEE 2010, 2010

Multi-Factor Authentication Using Fingerprints and User-Specific Random Projection.
Proceedings of the Advances in Web Technologies and Applications, 2010

2009
Color Face Recognition for Degraded Face Images.
IEEE Trans. Syst. Man Cybern. Part B, 2009

Content sharing between home networks by using personal information and associated fuzzy vault scheme.
IEEE Trans. Consumer Electron., 2009

Improved BSDL-based content adaptation for JPEG 2000 and HD Photo (JPEG XR).
Signal Process. Image Commun., 2009

An Objective Perceptual Quality-Based ADTE for Adapting Mobile SVC Video Content.
IEICE Trans. Inf. Syst., 2009

Region-of-interest scrambling for scalable surveillance video using JPEG XR.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

A Statistical and Iterative Method for Data Hiding in Palette-Based Images.
Proceedings of the Digital Watermarking, 8th International Workshop, 2009

Near-Duplicate Video Detection Using Temporal Patterns of Semantic Concepts.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

Semantic home video categorization.
Proceedings of the Image Processing: Algorithms and Systems VII, 2009

Malicious content filtering based on semantic features.
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, 2009

Semantic annotation of personal video content using an image folksonomy.
Proceedings of the International Conference on Image Processing, 2009

Image compression mismatch effect on color image based face recognition system.
Proceedings of the International Conference on Image Processing, 2009

Face annotation for personal photos using collaborative face recognition in online social networks.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009

Privacy Protection in Video Surveillance Systems Using Scalable Video Coding.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Real-Time Content Filtering for Live Broadcasts in TV Terminals.
Proceedings of the Handbook of Multimedia for Digital Entertainment and Arts, 2009

2008
Real-time content filtering for live broadcasts in TV terminals.
Multim. Tools Appl., 2008

Measuring Video Quality on Full Scalability of H.264/AVC Scalable Video Coding.
IEICE Trans. Commun., 2008

Quantification and Standardized Description of Color Vision Deficiency Caused by Anomalous Trichromats - Part II: Modeling and Color Compensation.
EURASIP J. Image Video Process., 2008

Quantification and Standardized Description of Color Vision Deficiency Caused by Anomalous Trichromats - Part I: Simulation and Measurement.
EURASIP J. Image Video Process., 2008

Optimal Multilayer Adaptation of SVC Video over Heterogeneous Environments.
Adv. Multim., 2008

Face annotation for personal photos using context-assisted face recognition.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Content Sharing Based on Personal Information in Virtually Secured Space.
Proceedings of the Digital Watermarking, 7th International Workshop, 2008

Color Effect on the Face Recognition with Spatial Resolution Constraints.
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Microcalcification detection system in digital mammogram using two-layer SVM.
Proceedings of the Image Processing: Algorithms and Systems VI, 2008

Improving visual content accessibility for low-vision users in the MPEG-21 multimedia framework.
Proceedings of the Human Vision and Electronic Imaging XIII, 2008

Feature subspace determination in video-based mismatched face recognition.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

2007
Semantic Home Photo Categorization.
IEEE Trans. Circuits Syst. Video Technol., 2007

Video Event Filtering in Consumer Domain.
IEEE Trans. Broadcast., 2007

Semantic categorization of digital home photo using photographic region templates.
Inf. Process. Manag., 2007

Media Accessibility for Low-Vision Users in the MPEG-21 Multimedia Framework.
IEICE Trans. Inf. Syst., 2007

Improvement of Inter-Layer Motion Prediction in Scalable Video Coding.
IEICE Trans. Inf. Syst., 2007

Quality Measurement Modeling on Scalable Video Applications.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Scalable Security and Conditional Access Control for Multiple Regions of Interest in Scalable Video Coding.
Proceedings of the Digital Watermarking, 6th International Workshop, 2007

Optimal multi-layer adaptation of SVC video over heterogeneous environments.
Proceedings of the International Conference on Wireless Communications and Mobile Computing, 2007

Advertisement Insertion based on MPEG-4 File Format and MPEG-21 DID.
Proceedings of the 2007 International Conference on Image Processing, 2007

Perceived Image Contrast Measurement using Multi-scale Adaptation and Spatial Vision Characteristics.
Proceedings of the 2007 International Conference on Image Processing, 2007

Graph-Based Perceptual Quality Model for Audiovisual Contents.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Quality metric for H.264/AVC scalable video coding with full scalability.
Proceedings of the Human Vision and Electronic Imaging XII, San Jose, CA, USA, January 29, 2007

2006
Meaningful scene filtering for TV terminals.
IEEE Trans. Consumer Electron., 2006

Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment.
IEICE Trans. Inf. Syst., 2006

A Microcalcification Detection Using Adaptive Contrast Enhancement on Wavelet Transform and Neural Network.
IEICE Trans. Inf. Syst., 2006

Intelligent broadcasting system and services for personalized semantic contents consumption.
Expert Syst. Appl., 2006

Two-layered Photo Classification based on Semantic and Syntactic Features.
Proceedings of the 1st International Workshop on Semantic-Enhanced Multimedia Presentation Systems (SEMPS-2006), 2006

e-Learning Media Format for Enhanced Consumption on Mobile Application.
Proceedings of the 1st International Workshop on Semantic-Enhanced Multimedia Presentation Systems (SEMPS-2006), 2006

Scalable Protection and Access Control in Full Scalable Video Coding.
Proceedings of the Digital Watermarking, 5th International Workshop, 2006

Conditional Access Control in Secured SVC Bitstream.
Proceedings of the 2006 International Conference on Image Processing, 2006

Multimedia Packaging for TVAF-based Broadcasting Contents.
Proceedings of the 2006 International Conference on Image Processing, 2006

2005
Visual content adaptation according to user perception characteristics.
IEEE Trans. Multim., 2005

Effective adaptation of multimedia documents with modality conversion.
Signal Process. Image Commun., 2005

Automated situation clustering of home photos for digital albuming.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2005, 2005

Automatic Photo Indexing Based on Person Identity.
Proceedings of the Advances in Multimedia Information Processing, 2005

Semantic Quality for Content-Aware Video Adaptation.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Distortion Measures in MPEG-Compressed Domain for Multidimensional Transcoding.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Semantic Event Detection in Structured Video Using Hybrid HMM/SVM.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

MPEG-21 Based Broadcasting Contents Service System in Ubiquitous Environment.
Proceedings of The 2005 International Conference on Imaging Science, 2005

Microcalcification Detection System for Computer Aided Diagnosis.
Proceedings of The 2005 International Conference on Imaging Science, 2005

Home Photo Categorization Based on Photographic Region Templates.
Proceedings of the Information Retrieval Technology, 2005

2004
Color adaptation for anomalous trichromats.
Int. J. Imaging Syst. Technol., 2004

Video genre classification using multimodal features.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Dynamic Programming Based Adaptation of Multimedia Contents in UMA.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Automatic Video Genre Detection for Content-Based Authoring.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Watermarking System for QoS Aware Content Adaptation.
Proceedings of the Digital Watermarking, Third International Workshop, 2004

Visual content adaptation for low vision users in MPEG-21 framework.
Proceedings of the 2004 International Conference on Image Processing, 2004

Joint control for hybrid transcoding using multidimensional rate distortion modeling.
Proceedings of the 2004 International Conference on Image Processing, 2004

Robust Contrast Enhancement for Microcalcification in Mammography.
Proceedings of the Computational Science and Its Applications, 2004

Adaptive Microcalcification Detection in Computer Aided Diagnosis.
Proceedings of the Computational Science, 2004

Content adaptation for visual impairment in MPEG-21.
Proceedings of the Human Vision and Electronic Imaging IX, 2004

Video Segmentation Using Hidden Markov Model with Multimodal Features.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

Semantic Event Detection in Sports Video Using Hidden Markov Model.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004

Intelligent Agent-Based Systems for Personalized Broadcasting Services.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004

Watermarking System for Contents Adaptation in Ubiquitous Environment.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2004

2003
Novel Watermark Embedding Technique Based on Human Visual System.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003

Semantic event detection using MPEG-7.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Enhancement of Image Quality in Screen Mark Attack of Watermark.
Proceedings of the Signal and Image Processing (SIP 2003), 2003

Enhancement Methods of Image Quality in Screen Mark Attack.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

Metadata Hiding for Content Adaptation.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

Video abstraction generation supervised by user preference.
Proceedings of the Image Processing: Algorithms and Systems II, 2003

Visual contents adaptation for color vision deficiency.
Proceedings of the 2003 International Conference on Image Processing, 2003

Digital item adaptation for color vision variations.
Proceedings of the Human Vision and Electronic Imaging VIII, 2003

Modality Conversion in Content Adaptation for Universal Multimedia Access.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003

Soccer Video Summarization System Based on Hidden Markov Model with Multiple MPEG-7 Descriptors.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003

Visual Media Adaptation System for Active Media.
Proceedings of the International Conference on Imaging Science, Systems and Technology, 2003

2002
Spatial Frequency Band Division in Human Visual System Based-Watermarking.
Proceedings of the Digital Watermarking, First International Workshop, 2002

Generation of MPEG-7 descriptor in compressed domain.
Proceedings of the Image Processing: Algorithms and Systems, 2002

2001
MPEG-7 Texture Descriptors.
Int. J. Image Graph., 2001

A Metadata Repository System for an Efficient Description of Visual Multimedia Documents.
Concurr. Eng. Res. Appl., 2001

Content-based watermarking technique using MPEG-7 descriptor.
Proceedings of the Security and Watermarking of Multimedia Contents III, 2001

Novel watermark embedding technique based on human visual system.
Proceedings of the Security and Watermarking of Multimedia Contents III, 2001

Texture Descriptors in MPEG-7.
Proceedings of the Computer Analysis of Images and Patterns, 9th International Conference, 2001

2000
Hierarchical Block Matching Algorithm in MRME.
Proceedings of the 2000 International Conference on Image Processing, 2000

1999
Progressive multiresolution reconstruction in MRI.
Proceedings of the Medical Imaging 1999: Image Processing, 1999

Texture Featuring and Indexing Using Matching Pursuit in Radon Space.
Proceedings of the 1999 International Conference on Image Processing, 1999

1998
Adaptive coding using matching pursuit in MRI.
Proceedings of the Medical Imaging 1998: Image Processing, 1998

1995
Susceptibility effect-enhanced functional MR imaging using tailored RF gradient echo (TRFGE) sequence.
Int. J. Imaging Syst. Technol., 1995

