Chang Dong Yoo

Orcid: 0000-0002-0756-7179

According to our database1, Chang Dong Yoo authored at least 211 papers between 1995 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Scalable SoftGroup for 3D Instance Segmentation on Point Clouds.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Physics Informed Distillation for Diffusion Models.
Trans. Mach. Learn. Res., 2024

TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation.
CoRR, 2024

Predictive Coding for Decision Transformer.
CoRR, 2024

Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization.
CoRR, 2024

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation.
CoRR, 2024

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition.
CoRR, 2024

On the Perturbed States for Transformed Input-robust Reinforcement Learning.
CoRR, 2024

Towards Unsupervised Speech Recognition Without Pronunciation Models.
CoRR, 2024

Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses.
CoRR, 2024

Causal Localization Network for Radar Human Localization With Micro-Doppler Signature.
IEEE Access, 2024

Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning.
IEEE Access, 2024

FRAG: Frequency Adapting Group for Diffusion Video Editing.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross-view Masked Diffusion Transformers for Person Image Synthesis.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Progressive Fourier Neural Representation for Sequential Video Compilation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Querying Easily Flip-flopped Samples for Deep Active Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unsupervised Speech Recognition with N-skipgram and Positional Unigram Matching.
Proceedings of the IEEE International Conference on Acoustics, 2024

Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing.
Proceedings of the IEEE International Conference on Acoustics, 2024

G2PU: Grapheme-To-Phoneme Transducer with Speech Units.
Proceedings of the IEEE International Conference on Acoustics, 2024

AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DNI: Dilutional Noise Initialization for Diffusion Video Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

Implicit Steganography Beyond the Constraints of Modality.
Proceedings of the Computer Vision - ECCV 2024, 2024

FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-rigid Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Skew Class-Balanced Re-Weighting for Unbiased Scene Graph Generation.
Mach. Learn. Knowl. Extr., March, 2023

Efficient Convolutional Neural Networks for Semiconductor Wafer Bin Map Classification.
Sensors, February, 2023

Continual Learning: Forget-free Winning Subnetworks for Video Representations.
CoRR, 2023

Neutral Editing Framework for Diffusion-based Video Editing.
CoRR, 2023

Flexible Cross-Modal Steganography via Implicit Representations.
CoRR, 2023

Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection.
CoRR, 2023

DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution.
CoRR, 2023

Progressive Neural Representation for Sequential Video Compilation.
CoRR, 2023

Forget-free Continual Learning with Soft-Winning SubNetworks.
CoRR, 2023

Self-Supervised Visual Representation Learning via Residual Momentum.
IEEE Access, 2023

DimCL: Dimensional Contrastive Learning for Improving Self-Supervised Learning.
IEEE Access, 2023

Joint Path Alignment Framework for 3D Human Pose and Shape Estimation From Video.
IEEE Access, 2023

One-Shot Exemplification Modeling via Latent Sense Representations.
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

On the Soft-Subnetwork for Few-Shot Class Incremental Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Counterfactual Two-Stage Debiasing For Video Corpus Moment Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Efficient Latent Variable Modeling for Knowledge-Grounded Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Listen, Decipher and Sign: Toward Unsupervised Speech-to-Sign Language Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Theory of Unsupervised Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Seamless equal accuracy ratio for inclusive CTC speech recognition.
Speech Commun., 2022

Dual-Scale Doppler Attention for Human Identification.
Sensors, 2022

Visual Pretraining via Contrastive Predictive Model for Pixel-Based Reinforcement Learning.
Sensors, 2022

CE-BART: Cause-and-Effect BART for Visual Commonsense Generation.
Sensors, 2022

Fair Facial Attribute Classification via Causal Graph-Based Attribute Translation.
Sensors, 2022

Self-Supervised Visual Representation Learning via Residual Momentum.
CoRR, 2022

Selective Query-guided Debiasing Network for Video Corpus Moment Retrieval.
CoRR, 2022

SoftGroup++: Scalable 3D Instance Segmentation with Octree Pyramid Grouping.
CoRR, 2022

On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning.
CoRR, 2022

SoftGroup for 3D Instance Segmentation on Point Clouds.
CoRR, 2022

Noise Augmentation Is All You Need For FGSM Fast Adversarial Training: Catastrophic Overfitting And Robust Overfitting Require Different Augmentation.
CoRR, 2022

Cascaded MPN: Cascaded Moment Proposal Network for Video Corpus Moment Retrieval.
IEEE Access, 2022

LAD: A Hybrid Deep Learning System for Benign Paroxysmal Positional Vertigo Disorders Diagnostic.
IEEE Access, 2022

Utilizing Skipped Frames in Action Repeats for Improving Sample Efficiency in Reinforcement Learning.
IEEE Access, 2022

Survival Analysis of COVID-19 Patients With Symptoms Information by Machine Learning Algorithms.
IEEE Access, 2022

Corrections to "Blending Query Strategy of Active Learning for Imbalanced Data".
IEEE Access, 2022

Blending Query Strategy of Active Learning for Imbalanced Data.
IEEE Access, 2022

Frame-Level Stutter Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Forget-free Continual Learning with Winning Subnetworks.
Proceedings of the International Conference on Machine Learning, 2022

How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Semantic Association Network for Video Corpus Moment Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2022

Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness.
Proceedings of the Computer Vision - ECCV 2022, 2022

Selective Query-Guided Debiasing for Video Corpus Moment Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SoftGroup for 3D Instance Segmentation on Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fast and Efficient MMD-Based Fair PCA via Optimization over Stiefel Manifold.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Counterfactually Fair Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

GDCA: GAN-based single image super resolution with Dual discriminators and Channel Attention.
CoRR, 2021

Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold.
CoRR, 2021

Self-supervised Learning with Local Attention-Aware Feature.
CoRR, 2021

Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment.
IEEE Access, 2021

Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Sphererpn: Learning Spheres For High-Quality Region Proposals On 3d Point Clouds Object Detection.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Learning Imbalanced Datasets With Maximum Margin Loss.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Robust Maml: Prioritization Task Buffer with Adaptive Learning Process for Model-Agnostic Meta-Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Synthesis of New Words for Improved Dysarthric Speech Recognition on an Expanded Vocabulary.
Proceedings of the IEEE International Conference on Acoustics, 2021

SCNet: Training Inference Sample Consistency for Instance Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Semantic Grouping Network for Video Captioning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Structured Co-reference Graph Attention for Video-grounded Dialogue.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
CNN-Based Learnable Gammatone Filterbank and Equal-Loudness Normalization for Environmental Sound Classification.
IEEE Signal Process. Lett., 2020

GAPNet: Generic-Attribute-Pose Network For Fine-Grained Visual Categorization Using Multi-Attribute Attention Module.
Proceedings of the IEEE International Conference on Image Processing, 2020

VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Augmentation Network via Influence Functions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Modality Shifting Attention Network for Multi-Modal Video Question Answering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Object Detection for Similar Appearance Objects Based on Entropy.
Proceedings of the 7th International Conference on Robot Intelligence Technology and Applications, 2019

Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering.
Proceedings of the International Joint Conference on Neural Networks, 2019

Few-Shot Associative Domain Adaptation for Surface Normal Estimation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Progressive Attention Memory Network for Movie Story Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Edge-Labeling Graph Neural Network for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Unsupervised Domain Adaptation for Object Detection Using Distribution Matching in Various Feature Level.
Proceedings of the Digital Forensics and Watermarking - 17th International Workshop, 2018

Action Recognition: First-and Second-Order 3D Feature in Bi-Directional Attention Network.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Fast and Efficient Image Quality Enhancement via Desubpixel Convolutional Neural Networks.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Perception-Enhanced Image Super-Resolution via Relativistic Generative Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Pivot Correlational Neural Network for Multimodal Video Categorization.
Proceedings of the Computer Vision - ECCV 2018, 2018

ImaGAN: Unsupervised Training of Conditional Joint CycleGAN for Transferring Style with Core Structures in Content Preserved.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Complex Video Scene Analysis Using Kernelized-Collaborative Behavior Pattern Learning Based on Hierarchical Representative Object Behaviors.
IEEE Trans. Circuits Syst. Video Technol., 2017

A Resizable Mini-batch Gradient Descent based on a Randomized Weighted Majority.
CoRR, 2017

Meta-Learning via Feature-Label Memory Network.
CoRR, 2017

Early Improving Recurrent Elastic Highway Network.
CoRR, 2017

Content adaptive video summarization using spatio-temporal features.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep partial person re-identification via attention model.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Multi-view visual speech recognition based on multi task learning.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Melody extraction and detection through LSTM-RNN with harmonic sum loss.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Neural network-based autonomous navigation for a homecare mobile robot.
Proceedings of the 2017 IEEE International Conference on Big Data and Smart Computing, 2017

2016
Maximum Margin Learning of t-SPNs for Cell Classification With Filtered Input.
IEEE J. Sel. Top. Signal Process., 2016

Multimodal representation: Kneser-ney smoothing/skip-gram based neural language model.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Driver Drowsiness Detection System Based on Feature Representation Learning Using Various Deep Networks.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Underdetermined High-Resolution DOA Estimation: A 2ρth-Order Source-Signal/Noise Subspace Constrained Optimization.
IEEE Trans. Signal Process., 2015

Underdetermined Convolutive BSS: Bayes Risk Minimization Based on a Mixture of Super-Gaussian Posterior Approximation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Segment-wise online learning based on greedy algorithm for real-time multi-target tracking.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Face attribute classification using attribute-aware correlation map and gated convolutional neural networks.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Dense Image Registration and Deformable Surface Reconstruction in Presence of Occlusions and Minimal Texture.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Face detection using Local Hybrid Patterns.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Face alignment using cascade Gaussian process regression trees.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Image Segmentation UsingHigher-Order Correlation Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

A hierarchical-structured dictionary learning for image classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Salient object detection using bipartite dictionary.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Greedy algorithm for real-time multi-object tracking.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Joint learning of foreground region labeling and depth ordering.
Proceedings of the IEEE International Conference on Acoustics, 2014

Joint Estimation of Pose and Face Landmark.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Task-Specific Image Partitioning.
IEEE Trans. Image Process., 2013

A maximum likelihood approach for underdetermined TDOA estimation.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification.
IEEE Trans. Speech Audio Process., 2012

Phoneme Classification using Constrained Variational Gaussian Process Dynamical System.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Sparsity Sharing Embedding for Face Verification.
Proceedings of the Computer Vision, 2012

Joint Kernel Learning for Supervised Image Segmentation.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Large Margin Discriminative Semi-Markov Model for Phonetic Recognition.
IEEE Trans. Speech Audio Process., 2011

Melody Tracking Based on Sequential Bayesian Model.
IEEE J. Sel. Top. Signal Process., 2011

Higher-Order Correlation Clustering for Image Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Underdetermined convolutive blind source separation using a novel mixing matrix estimation and MMSE-based source estimation.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Melody Extraction based on Harmonic Coded Structure.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

A High Resolution Multiple Source Localization Based on Generalized Cumulant Structure (GCS) Matrix.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Learning a discriminative visual codebook using homonym scheme.
Proceedings of the IEEE International Conference on Acoustics, 2011

Variable grouping for energy minimization.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Distance Metric Learning for Content Identification.
IEEE Trans. Inf. Forensics Secur., 2010

Psychoacoustically Constrained and Distortion Minimized Speech Enhancement.
IEEE Trans. Speech Audio Process., 2010

Melody Extraction from Polyphonic Audio Based on Particle Filter.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Melody pitch estimation based on range estimation and candidate extraction using harmonic structure model.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A maximum a posteriori sound source localization in reverberant and noisy conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Parametric emotional singing voice synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2010

Largemargin training of semi-Markov model for phonetic recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Underdetermined blind source separation based on subspace representation.
IEEE Trans. Signal Process., 2009

Quantum hashing for multimedia.
IEEE Trans. Inf. Forensics Secur., 2009

Pairwise boosted audio fingerprint.
IEEE Trans. Inf. Forensics Secur., 2009

Robust Video Fingerprinting Based on Symmetric Pairwise Boosting.
IEEE Trans. Circuits Syst. Video Technol., 2009

Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion model.
Proceedings of the IEEE International Conference on Acoustics, 2009

Psychoacoustically constrained and distortion minimized speech enhancement algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2009

Humming-based human verification and identification.
Proceedings of the IEEE International Conference on Acoustics, 2009

Fingerprint matching based on distance metric learning.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Alias-Free Subband Adaptive Filtering With Critical Sampling.
IEEE Trans. Signal Process., 2008

Robust Video Fingerprinting for Content-Based Video Identification.
IEEE Trans. Circuits Syst. Video Technol., 2008

Development of a simple free viewpoint video system.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Music genre classification using novel features and a weighted voting method.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Robust video fingerprinting based on 2D-OPCA of affine covariant regions.
Proceedings of the International Conference on Image Processing, 2008

Robust video fingerprinting based on affine covariant regions.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Reversible Image Watermarking Based on Integer-to-Integer Wavelet Transform.
IEEE Trans. Inf. Forensics Secur., 2007

A Syllable Lattice Approach to Speaker Verification.
IEEE Trans. Speech Audio Process., 2007

Temporal Dynamics for Spectral Sub-Band Centroid Audio Fingerprints.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Boosted Binary Audio Fingerprint Based on Spectral Subband Moments.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Novel Adaptive Crosstalk Cancellation using Psychoacoustic Model for 3D Audio.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Enhancement Based on the Decomposition of Speech Into Deterministic and Stochastic Components and Psychoacoustic Model.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Image watermarking based on invariant regions of scale-space representation.
IEEE Trans. Signal Process., 2006

Audio fingerprinting based on normalized spectral subband moments.
IEEE Signal Process. Lett., 2006

Video Fingerprinting Based on Centroids of Gradient Orientations.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Syllable Lattice Based Re-Scoring For Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Novel Embedding Method For An Anti-Collusion Fingerprinting By Embedding Both A Code And An Orthogonal Fingerprint.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Audio fingerprinting based on normalized spectral subband centroids.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

An SVD-Based Watermarking Method for Image Content Authentication with Improved Security.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A robust image fingerprinting system using the Radon transform.
Signal Process. Image Commun., 2004

Localized image watermarking based on feature points of scale-space representation.
Pattern Recognit., 2004

Image watermarking based on scale-space representation.
Proceedings of the Security, Steganography, and Watermarking of Multimedia Contents VI, 2004

Blind separation of speech and sub-Gaussian signals in underdetermined case.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Hybrid utterance verification based on n-best models and model derived from kulback-leibler divergence.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Underdetermined Independent Component Analysis by Data Generation.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

2003
A novel transcoding algorithm for SMV and g.723.1 speech coders via direct parameter transformation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Accuracy improved double-talk detector based on state transition diagram.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A robust and sensitive word boundary decision algorithm.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A novel rate selection algorithm for transcoding CELP-type codec and SMV.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speaker adaptation based on confidence-weighted training.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Affine transform resilient image fingerprinting.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

A novel transcoding algorithm for AMR and EVRC speech codecs via direct parameter transformation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

The incorporation of masking threshold to subspace speech enhancement.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Improvements in speaker adaptation using weighted training.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Design of template in the autocorrelation domain.
Proceedings of the Security and Watermarking of Multimedia Contents IV, 2002

Subspace speech enhancement using subband whitening filter.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Correlation Detection of Asymmetric Watermark.
Proceedings of the Advances in Multimedia Information Processing, 2001

Speech/noise-dominant decision for speech enhancement.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

1999
Utilizing interband acoustical information for modeling stationary time-frequency regions of noisy speech.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1996
Speech enhancement: identification and modeling of stationary time-frequency regions.
PhD thesis, 1996

Selective all-pole modeling of degraded speech using M-band decomposition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
An iterative method for designing separable Wiener filters.
IEEE Trans. Signal Process., 1995

Speech enhancement based on the generalized dual excitation model with adaptive analysis window.
Proceedings of the 1995 International Conference on Acoustics, 1995


  Loading...