2025
SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal.
CoRR, May, 2025
Open World Object Detection: A Survey.
IEEE Trans. Circuits Syst. Video Technol., February, 2025
ByteNet: Rethinking Multimedia File Fragment Classification Through Visual Perspectives.
IEEE Trans. Multim., 2025
Incorporating vision-based artificial intelligence and large language model for smart traffic light control.
Appl. Soft Comput., 2025
2024
Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds.
IEEE Trans. Circuits Syst. Video Technol., June, 2024
DEO-Net: Joint Density Estimation and Object Detection for Crowd Counting.
IEEE Trans. Instrum. Meas., 2024
Intra- and inter-sector contextual information fusion with joint self-attention for file fragment classification.
Knowl. Based Syst., 2024
Top-down framework for weakly-supervised grounded image captioning.
Knowl. Based Syst., 2024
Image compressive sensing reconstruction via nonlocal low-rank residual-based ADMM framework.
Comput. Vis. Image Underst., 2024
CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models.
CoRR, 2024
CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition.
CoRR, 2024
Video sentence grounding with temporally global textual knowledge.
CoRR, 2024
MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition.
Proceedings of the 26th IEEE International Workshop on Multimedia Signal Processing, 2024
Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024
ViPose: Keypoint Visibility-Based Human Pose Estimation.
Proceedings of the 15th International Conference on Information and Communication Technology Convergence, 2024
Temporal Sentence Grounding with Temporally Global Textual Knowledge.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
CM2-Net: Continual Cross-Modal Mapping Network For Driver Action Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2024
Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose Lifting.
Proceedings of the IEEE International Conference on Image Processing, 2024
Multi-Modality Action Recognition Based on Dual Feature Shift in Vehicle Cabin Monitoring.
Proceedings of the IEEE International Conference on Acoustics, 2024
Contextual Human Object Interaction Understanding from Pre-Trained Large Language Model.
Proceedings of the IEEE International Conference on Acoustics, 2024
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning.
Remote. Sens., December, 2023
Reconciliation of statistical and spatial sparsity for robust visual classification.
Neurocomputing, April, 2023
SSN: Stockwell Scattering Network for SAR Image Change Detection.
IEEE Geosci. Remote. Sens. Lett., 2023
Learning-Based Biharmonic Augmentation for Point Cloud Classification.
CoRR, 2023
OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking.
CoRR, 2023
Top-Down Viewing for Weakly Supervised Grounded Image Captioning.
CoRR, 2023
Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Vision-Based Early Fire and Smoke Detection for Smart Factory Applications Using FFS-YOLO.
Proceedings of the 25th IEEE International Workshop on Multimedia Signal Processing, 2023
METFormer: A Motion Enhanced Transformer for Multiple Object Tracking.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023
Image Representation and Deep Inception-Attention for File-type and Malware Classification.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023
Nonlocal Low-Rank Residual Modeling for Image Compressive Sensing Reconstruction.
Proceedings of the IEEE International Conference on Image Processing, 2023
A Spatial-Focal Error Concealment Scheme for Corrupted Focal Stack Video.
Proceedings of the Data Compression Conference, 2023
TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings.
Proceedings of the 5th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2023
Object-Augmented Skeleton-Based Action Recognition.
Proceedings of the 5th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2023
2022
Collaborative learning mutual network for domain adaptation in person re-identification.
Neural Comput. Appl., 2022
Mixed Membership Generative Adversarial Networks.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Attribute Conditioned Fashion Image Captioning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Learning Transferable Human-Object Interaction Detector with Natural Language Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Attribute saliency network for person re-identification.
Image Vis. Comput., 2021
Reconciliation of Statistical and Spatial Sparsity For Robust Image and Image-Set Classification.
CoRR, 2021
Fusion Learning using Semantics and Graph Convolutional Network for Visual Food Recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
APNET: Attribute Parsing Network for Person Re-Identification.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
A Compact Joint Distillation Network for Visual Food Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Remote detection of idling cars using infrared imaging and deep networks.
Neural Comput. Appl., 2020
Semantic granularity metric learning for visual search.
J. Vis. Commun. Image Represent., 2020
JDNet: A Joint-Learning Distilled Network for Mobile Visual Food Recognition.
IEEE J. Sel. Top. Signal Process., 2020
Empirical Analysis Of Overfitting And Mode Drop In GAN Training.
Proceedings of the IEEE International Conference on Image Processing, 2020
Dynamically Modulated Deep Metric Learning for Visual Search.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
What Does Plate Glass Reveal About Camera Calibration?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Multi-Path Region Mining for Weakly Supervised 3D Semantic Segmentation on Point Clouds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Discovering Human Interactions With Novel Objects via Zero-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions.
CoRR, 2019
Interest Point Detection based on Adaptive Ternary Coding.
CoRR, 2019
DCI: Discriminative and Contrast Invertible Descriptor.
CoRR, 2019
Few-Shot and Many-Shot Fusion Learning in Mobile Visual Food Recognition.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019
Multitask Person Re-Identification using Homoscedastic Uncertainty Learning.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019
Convolutional Three-Stream Network Fusion for Driver Fatigue Detection from Infrared Videos.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019
The Unusual Effectiveness of Averaging in GAN Training.
Proceedings of the 7th International Conference on Learning Representations, 2019
AANet: Attribute Attention Network for Person Re-Identifications.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Brand-Aware Fashion Clothing Search using CNN Feature Encoding and Re-ranking.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018
Hybrid Supervised Deep Learning for Ethnicity Classification using Face Images.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018
Idling Car Detection with ConvNets in Infrared Image Sequences.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018
Autoregressive Generative Adversarial Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018
QL-Net: Quantized-by-LookUp CNN.
Proceedings of the 15th International Conference on Control, 2018
Tiered Deep Similarity Search for Fashion.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
2017
Feature Repetitiveness Similarity Metrics in Visual Search.
IEEE Signal Process. Lett., 2017
Lattice-Support repetitive local feature detection for visual search.
Pattern Recognit. Lett., 2017
Integrated 3D feature augmentation and view selection in commercial product search.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Laplace gradient based Discriminative and Contrast Invertible descriptor.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Contrast Invariant Interest Point Detection by Zero-Norm LoG Filter.
IEEE Trans. Image Process., 2016
2015
Bijective Weighted Kernel with Connected Component Analysis for Visual Object Search.
IEEE Signal Process. Lett., 2015
Feature weighting in visual product recognition.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015
Hybrid feature-based wallpaper visual search.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015
Augmented visual phrase in mobile product recognition.
Proceedings of the 10th International Conference on Information, 2015
2014
Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition.
IEEE Trans. Multim., 2014
Discriminative BoW Framework for Mobile Landmark Recognition.
IEEE Trans. Cybern., 2014
Beyond Bag-of-Words: combining generative and discriminative models for scene categorization.
Multim. Tools Appl., 2014
Context-aware Discriminative Vocabulary Tree Learning for mobile landmark recognition.
Digit. Signal Process., 2014
Mobile product recognition with efficient Bag-of-Phrase visual search.
Proceedings of the 6th International Symposium on Communications, 2014
2013
Joint Image Registration and Super-Resolution From Low-Resolution Images With Zooming Motion.
IEEE Trans. Circuits Syst. Video Technol., 2013
Context-Aware Discriminative Vocabulary Learning for Mobile Landmark Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2013
Context-aware mobile image annotation for media search and sharing.
Signal Process. Image Commun., 2013
An efficient approach for scene categorization based on discriminative codebook learning in bag-of-words framework.
Image Vis. Comput., 2013
2012
Content and Context Boosting for Mobile Landmark Recognition.
IEEE Signal Process. Lett., 2012
Vehicle license plate super-resolution using soft learning prior.
Multim. Tools Appl., 2012
Efficient mobile landmark recognition based on saliency-aware scalable vocabulary tree.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Multi-frame super-resolution from observations with zooming motion.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Discriminative bag-of-visual phrase learning for landmark recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Guest Editorial Special Issue on Advances in Multimedia Computing, Communications and Applications.
J. Signal Process. Syst., 2011
A Fuzzy Clustering Algorithm for Virtual Character Animation Representation.
IEEE Trans. Multim., 2011
Integrated Content and Context Analysis for Mobile Landmark Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2011
New regularization scheme for blind color image deconvolution.
J. Electronic Imaging, 2011
L1-norm multi-frame super-resolution from images with zooming motion.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011
Image based approach with k-mean clustering for the compression of human motion sequences.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011
A new blind robust image watermarking scheme in SVD-DCT composite domain.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
From universal bag-of-words to adaptive bag-of-phrases for mobile scene recognition.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
A discriminative learning technique for mobile landmark recognition.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Beyond bag of words: Combining generative and discriminative models for natural scene categorization.
Proceedings of the IEEE International Conference on Acoustics, 2011
High resolution vehicle license plate reconstruction using soft recognition learning.
Proceedings of the 8th International Conference on Information, 2011
A soft relevance method for content-based scene categorization in the BoW framework.
Proceedings of the 8th International Conference on Information, 2011
Content and context information fusion for mobile landmark recognition.
Proceedings of the 8th International Conference on Information, 2011
2010
Knowledge Propagation in Collaborative Tagging for Image Retrieval.
J. Signal Process. Syst., 2010
Guest Editorial: Special Issue on Recent Advances in Content Analysis for Media Computing.
J. Signal Process. Syst., 2010
Joint Rate Allocation for Multiprogram Video Coding Using FGS.
IEEE Trans. Circuits Syst. Video Technol., 2010
Bit-Rate Allocation for Broadcasting of Scalable Video Over Wireless Networks.
IEEE Trans. Broadcast., 2010
Adaptive resynchronization approach for scalable video over wireless channel.
J. Vis. Commun. Image Represent., 2010
A Comparative Study of Mobile-Based Landmark Recognition Techniques.
IEEE Intell. Syst., 2010
A Bayesian image annotation framework integrating search and context.
Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, 2010
Bit allocation for scalable video coding of multiple video programs.
Proceedings of the International Conference on Image Processing, 2010
2009
A Nonlinear L1-Norm Approach for Joint Image Registration and Super-Resolution.
IEEE Signal Process. Lett., 2009
A soft MAP framework for blind super-resolution image reconstruction.
Image Vis. Comput., 2009
A Survey on Mobile Landmark Recognition for Information Retrieval.
Proceedings of the MDM 2009, 2009
Broadcast of Scalable Video over Wireless Networks.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009
A Learning Approach for Single-frame Face Super-resolution.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009
Progressive Transmission of Motion Capture Data for Scalable Virtual Character Animation.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009
2008
A Novel Hybrid Model Framework to Blind Color Image Deconvolution.
IEEE Trans. Syst. Man Cybern. Part A, 2008
Subband Synthesis for Color Filter Array Demosaicking.
IEEE Trans. Syst. Man Cybern. Part A, 2008
An Effective Technique for Subpixel Image Registration Under Noisy Conditions.
IEEE Trans. Syst. Man Cybern. Part A, 2008
A Joint Source-Channel Video Coding Scheme Based on Distributed Source Coding.
IEEE Trans. Multim., 2008
A Collaborative Bayesian Image Annotation Framework.
Proceedings of the Advances in Multimedia Information Processing, 2008
Spatial resolution decision in scalable bitstream extraction for network and receiver aware adaptation.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
A new color image regularization scheme for blind image deconvolution.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Two-Dimensional Channel Coding Scheme for MCTF-Based Scalable Video Coding.
IEEE Trans. Multim., 2007
A Nonlinear Least Square Technique for Simultaneous Image Registration and Super-Resolution.
IEEE Trans. Image Process., 2007
Content-based image retrieval using fuzzy perceptual feedback.
Multim. Tools Appl., 2007
Efficient Recursive Multichannel Blind Image Restoration.
EURASIP J. Adv. Signal Process., 2007
A Motion-Based Selective Error Protection Method for Scalable Video Over Error-Prone Channel.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Joint Image Registration and Super-Resolution using Nonlinear Least Squares Method.
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Efficient discrete spatial techniques for blur support identification in blind image deconvolution.
IEEE Trans. Signal Process., 2006
Fuzzy SVM for content-based image retrieval: a pseudo-label support vector machine framework.
IEEE Comput. Intell. Mag., 2006
Two-dimensional channel rate allocation for SVC over error-prone channel.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006
Region-Based Image Retrieval using Radial Basis Function Network.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
A Novel Resynchronization Method for Scalable Video Over Wireless Channel.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Blind Super-Resolution Image Reconstruction using a Maximum a Posteriori Estimation.
Proceedings of the International Conference on Image Processing, 2006
A Bispectrum Technique to Subpixel Image Registration under Noisy Conditions.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
A soft double regularization approach to parametric blind image deconvolution.
IEEE Trans. Image Process., 2005
A soft relevance framework in content-based image retrieval systems.
IEEE Trans. Circuits Syst. Video Technol., 2005
Fuzzy relevance feedback in content-based image retrieval systems using radial basis function network.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Blind color image deconvolution based on wavelet decomposition.
Proceedings of the 2005 International Conference on Image Processing, 2005
Color filter array demosaicking using wavelet-based subband synthesis.
Proceedings of the 2005 International Conference on Image Processing, 2005
Regularized interpolation using Kronecker product for still images.
Proceedings of the 2005 International Conference on Image Processing, 2005
2004
A noisy chaotic neural network approach to image denoising.
Proceedings of the 2004 International Conference on Image Processing, 2004
An efficient radial basis function network approach for content-based image retrieval.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
A recursive soft-decision approach to blind image deconvolution.
IEEE Trans. Signal Process., 2003
A fuzzy K-nearest-neighbor algorithm to blind image deconvolution.
Proceedings of the IEEE International Conference on Systems, 2003
2002
A computational reinforced learning scheme to blind image deconvolution.
IEEE Trans. Evol. Comput., 2002
A fuzzy blur algorithm to adaptive blind image deconvolution.
Proceedings of the Seventh International Conference on Control, 2002
2001
An attractor space approach to blind image deconvolution.
Proceedings of the IEEE International Conference on Acoustics, 2001
Blind adaptive detection for CDMA systems based on regularized independent component analysis.
Proceedings of the Global Telecommunications Conference, 2001
2000
A Recursive Soft-Decision PSF and Neural Network Approach to Adaptive Blind Image Regularization.
Proceedings of the 2000 International Conference on Image Processing, 2000
1999
Image Restoration Based on Hierarchical Cluster Model with Evolutionary Optimization.
Proceedings of the 1999 International Conference on Image Processing, 1999