Xiaokang Yang

Orcid: 0000-0003-4029-3322

According to our database1, Xiaokang Yang authored at least 702 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
From Discrete Representation to Continuous Modeling: A Novel Audio-Visual Saliency Prediction Model With Implicit Neural Representations.
IEEE Trans. Emerg. Top. Comput. Intell., December, 2024

EasyDGL: Encode, Train and Interpret for Continuous-Time Dynamic Graph Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Ualign: pushing the limit of template-free retrosynthesis prediction with unsupervised SMILES alignment.
J. Cheminformatics, December, 2024

Towards Unified Defense for Face Forgery and Spoofing Attacks via Dual Space Reconstruction Learning.
Int. J. Comput. Vis., December, 2024

Neural Architecture Selection as a Nash Equilibrium With Batch Entanglement.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Action-aware Linguistic Skeleton Optimization Network for Non-autoregressive Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane.
IEEE Trans. Medical Imaging, September, 2024

ReHarvest: An ADC Resource-Harvesting Crossbar Architecture for ReRAM-Based DNN Accelerators.
ACM Trans. Archit. Code Optim., September, 2024

Directional Texture Editing for 3D Models.
Comput. Graph. Forum, September, 2024

Object Detection and Information Perception by Fusing YOLO-SCG and Point Cloud Clustering.
Sensors, August, 2024

StyleVR: Stylizing Character Animations With Normalizing Flows.
IEEE Trans. Vis. Comput. Graph., July, 2024

Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation.
ACM Trans. Multim. Comput. Commun. Appl., June, 2024

Break the Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Model-Based Reinforcement Learning With Isolated Imaginations.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

MTCAM: A Novel Weakly-Supervised Audio-Visual Saliency Prediction Model With Multi-Modal Transformer.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2024

SiamDMU: Siamese Dual Mask Update Network for Visual Object Tracking.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2024

DialogueNeRF: towards realistic avatar face-to-face conversation video generation.
Vis. Intell., 2024

Efficient Singular Spectrum Mode Ensemble for Extracting Wide-Band Components in Overlapping Spectral Environments.
IEEE Trans. Signal Process., 2024

Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio.
IEEE Trans. Multim., 2024

Pixel-Learnable 3DLUT With Saturation-Aware Compensation for Image Enhancement.
IEEE Trans. Multim., 2024

Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement.
IEEE Trans. Multim., 2024

Sparsely-Supervised Object Tracking.
IEEE Trans. Image Process., 2024

Task-Specific Normalization for Continual Learning of Blind Image Quality Models.
IEEE Trans. Image Process., 2024

Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2024

3D Lane Detection With Attention in Attention.
IEEE Signal Process. Lett., 2024

A deep learning system for myopia onset prediction and intervention effectiveness evaluation in children.
npj Digit. Medicine, 2024

High-resolution real-space reconstruction of cryo-EM structures using a neural field network.
Nat. Mac. Intell., 2024

Pygmtools: A Python Graph Matching Toolkit.
J. Mach. Learn. Res., 2024

Review on SLAM algorithms for Augmented Reality.
Displays, 2024

Combinatorial progressive architecture search for crowd counting.
Displays, 2024

Bridging the gap between object detection in close-up and high-resolution wide shots.
Comput. Vis. Image Underst., 2024

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations.
CoRR, 2024

Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics.
CoRR, 2024

DimOL: Dimensional Awareness as A New 'Dimension' in Operator Learning.
CoRR, 2024

PostCast: Generalizable Postprocessing for Precipitation Nowcasting via Unsupervised Blurriness Modeling.
CoRR, 2024

Revealing Directions for Text-guided 3D Face Editing.
CoRR, 2024

PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing.
CoRR, 2024

Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution.
CoRR, 2024

Discovering Message Passing Hierarchies for Mesh-Based Physics Simulation.
CoRR, 2024

Open-World Reinforcement Learning over Long Short-Term Imagination.
CoRR, 2024

ARB-LLM: Alternating Refined Binarizations for Large Language Models.
CoRR, 2024

AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction.
CoRR, 2024

Open-Vocabulary Remote Sensing Image Semantic Segmentation.
CoRR, 2024

Learning Augmentation Policies from A Model Zoo for Time Series Forecasting.
CoRR, 2024

Learning to Solve Combinatorial Optimization under Positive Linear Constraints via Non-Autoregressive Neural Networks.
CoRR, 2024

Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning.
CoRR, 2024

Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering.
CoRR, 2024

Text-Augmented Multimodal LLMs for Chemical Reaction Condition Recommendation.
CoRR, 2024

Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction.
CoRR, 2024

See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition.
CoRR, 2024

GS-Phong: Meta-Learned 3D Gaussians for Relightable Novel View Synthesis.
CoRR, 2024

E<sup>3</sup>Gen: Efficient, Expressive and Editable Avatars Generation.
CoRR, 2024

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting.
CoRR, 2024

FLoRA: Low-Rank Core Space for N-dimension.
CoRR, 2024

Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research.
CoRR, 2024

IPAD: Industrial Process Anomaly Detection Dataset.
CoRR, 2024

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization.
CoRR, 2024

NTIRE 2024 Challenge on Image Super-Resolution (⨉4): Methods and Results.
CoRR, 2024

UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment.
CoRR, 2024

Boundary Matters: A Bi-Level Active Finetuning Framework.
CoRR, 2024

A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos.
CoRR, 2024

Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents.
CoRR, 2024

ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving.
CoRR, 2024

Rethinking Classifier Re-Training in Long-Tailed Recognition: A Simple Logits Retargeting Approach.
CoRR, 2024

Few-Shot Class-Incremental Learning with Prior Knowledge.
CoRR, 2024

Vision-Informed Flow Image Super-Resolution with Quaternion Spatial Modeling and Dynamic Flow Convolution.
CoRR, 2024

Uncertainty-aware Sampling for Long-tailed Semi-supervised Learning.
CoRR, 2024


Model-Based Reinforcement Learning with Multi-task Offline Pretraining.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

SaccadeDet: A Novel Dual-Stage Architecture for Rapid and Accurate Detection in Gigapixel Images.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

<i>E</i><sup>3</sup>Gen: Efficient, Expressive and Editable Avatars Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Missing as Masking: Arbitrary Cross-Modal Feature Reconstruction for Incomplete Multimodal Brain Tumor Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries Using Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

HQ-Avatar: Towards High-Quality 3D Avatar Generation via Point-based Representation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Xformer: Hybrid X-Shaped Transformer for Image Denoising.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Recursive Generalization Transformer for Image Super-Resolution.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Breaking the Corpus Bottleneck for Multi-dialect Speech Recognition with Flexible Adapters.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

Tendency-Driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

Bridging Synthetic and Real Worlds for Pre-Training Scene Text Detectors.
Proceedings of the Computer Vision - ECCV 2024, 2024

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer.
Proceedings of the Computer Vision - ECCV 2024, 2024

Radiative Gaussian Splatting for Efficient X-Ray Novel View Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

SaccadeMOT: Enhancing Object Detection and Tracking in Gigapixel Images via Scale-Aware Density Estimation.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

ReGenNet: Towards Human Action-Reaction Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inter-X: Towards Versatile Human-Human Interaction Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Monocular Identity-Conditioned Facial Reflectance Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VidToMe: Video Token Merging for Zero-Shot Video Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


Domain Prompt Learning with Quaternion Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Partial Label Learning with a Partner.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SAM-PARSER: Fine-Tuning SAM Efficiently by Parameter Space Reconstruction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Domain-Controlled Prompt Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Robust Mesh Representation Learning via Efficient Local Structure-Aware Anisotropic Convolution.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Blind Image Quality Assessment for Pathological Microscopic Image Under Screen and Immersion Scenarios.
IEEE Trans. Medical Imaging, November, 2023

Fully context-aware image inpainting with a learned semantic pyramid.
Pattern Recognit., November, 2023

MNGNAS: Distilling Adaptive Combination of Multiple Searched Networks for One-Shot Neural Architecture Search.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Audio-visual aligned saliency model for omnidirectional video with implicit neural representation learning.
Appl. Intell., October, 2023

Self-labeling video prediction.
Displays, September, 2023

Unsupervised Learning of Graph Matching With Mixture of Modes via Discrepancy Minimization.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

A Survey on Label-Efficient Deep Image Segmentation: Bridging the Gap Between Weak Supervision and Dense Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Learning Generative RNN-ODE for Collaborative Time-Series and Event Sequence Forecasting.
IEEE Trans. Knowl. Data Eng., July, 2023

Trajectory Guided Robust Visual Object Tracking With Selective Remedy.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

Efficient Person Search: An Anchor-Free Approach.
Int. J. Comput. Vis., July, 2023

Learning Robust Deep State Space for Unsupervised Anomaly Detection in Contaminated Time-Series.
IEEE Trans. Knowl. Data Eng., June, 2023

Learning Multi-Attention Context Graph for Group-Based Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Combinatorial Learning of Robust Deep Graph Matching: An Embedding Based Approach.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Learning Multi-View Interactional Skeleton Graph for Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Guest Editorial Robust Learning of Spatio-Temporal Point Processes: Modeling, Algorithm, and Applications.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

TMM-Nets: Transferred Multi- to Mono-Modal Generation for Lupus Retinopathy Diagnosis.
IEEE Trans. Medical Imaging, April, 2023

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Continual Learning for Blind Image Quality Assessment.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A Novel Lightweight Audio-visual Saliency Model for Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Toward Visual Behavior and Attention Understanding for Augmented 360 Degree Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2023

FFFN: Frame-By-Frame Feedback Fusion Network for Video Super-Resolution.
IEEE Trans. Multim., 2023

Blind Image Quality Assessment via Cross-View Consistency.
IEEE Trans. Multim., 2023

Angel's Girl for Blind Painters: An Efficient Painting Navigation System Validated by Multimodal Evaluation Approach.
IEEE Trans. Multim., 2023

Residual Quantization for Low Bit-Width Neural Networks.
IEEE Trans. Multim., 2023

Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition.
IEEE Trans. Multim., 2023

Sequence as a Whole: A Unified Framework for Video Action Localization With Long-Range Text Query.
IEEE Trans. Image Process., 2023

A Super-High-Accuracy Attitude Measurement Method of SINS Based on PWQHN Algorithm in the High-Dynamic Maneuver Environment.
IEEE Trans. Instrum. Meas., 2023

SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Urban dynamics through the lens of human mobility.
Nat. Comput. Sci., 2023

Decoupled dynamic group equivariant filter for saliency prediction on omnidirectional image.
Neurocomputing, 2023

Human attention based movie summarization: Dataset and baseline model.
Neurocomputing, 2023

Domain Prompt Learning with Quaternion Networks.
CoRR, 2023

Binarized 3D Whole-body Human Mesh Recovery.
CoRR, 2023

Image Super-Resolution with Text Prompt Diffusion.
CoRR, 2023

EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices.
CoRR, 2023

Generalizable Person Search on Open-world User-Generated Video Content.
CoRR, 2023

Domain-Controlled Prompt Learning.
CoRR, 2023

Reflection Invariance Learning for Few-shot Semantic Segmentation.
CoRR, 2023

ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models.
CoRR, 2023

Vid2Act: Activate Offline Videos for Visual RL.
CoRR, 2023

Collaborative World Models: An Online-Offline Transfer RL Approach.
CoRR, 2023

Unsupervised Object-Centric Voxelization for Dynamic Scene Understanding.
CoRR, 2023

FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead.
CoRR, 2023

DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images.
CoRR, 2023

Model-Based Reinforcement Learning with Isolated Imaginations.
CoRR, 2023

Predictive Experience Replay for Continual Visual Control and Forecasting.
CoRR, 2023

Class-Incremental Learning Based on Anomaly Detection.
IEEE Access, 2023

Improving Masked Autoencoders by Learning Where to Mask.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

NeRF-IBVS: Visual Servo Based on NeRF for Visual Localization and Navigation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural LerPlane Representations for Fast 4D Reconstruction of Deformable Tissues.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

StockFormer: Learning Hybrid Trading Machines with Predictive Coding.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

LinSATNet: The Positive Linear Satisfiability Neural Networks.
Proceedings of the International Conference on Machine Learning, 2023

Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Discovering Temporal Patterns for Event Sequence Clustering via Policy Mixture Model (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-supervised Character-to-Character Distillation for Text Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dual Aggregation Transformer for Image Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Facial Geometric Detail Recovery via Implicit Representation.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

SIMSnn: A Weight-Agnostic ReRAM-based Search-In-Memory Engine for SNN Acceleration.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Learning of Partial Graph Matching via Differentiable Top-K.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D-Aware Face Swapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Poisson Process for Bayesian Optimization.
Proceedings of the International Conference on Automated Machine Learning, 2023

Effective Fine-tuning Method for Tibetan Low-resource Dialect Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
TANet: Target Attention Network for Video Bit-Depth Enhancement.
IEEE Trans. Multim., 2022

Dynamic Backlight Scaling Considering Ambient Luminance for Mobile Videos on LCD Displays.
IEEE Trans. Mob. Comput., 2022

Discovering Temporal Patterns for Event Sequence Clustering via Policy Mixture Model.
IEEE Trans. Knowl. Data Eng., 2022

Modeling Dynamic User Preference via Dictionary Learning for Sequential Recommendation.
IEEE Trans. Knowl. Data Eng., 2022

HazDesNet: An End-to-End Network for Haze Density Prediction.
IEEE Trans. Intell. Transp. Syst., 2022

Sequential Attention-Based Distinct Part Modeling for Balanced Pedestrian Detection.
IEEE Trans. Intell. Transp. Syst., 2022

Confusing Image Quality Assessment: Toward Better Augmented Reality Experience.
IEEE Trans. Image Process., 2022

RIHOOP: Robust Invisible Hyperlinks in Offline and Online Photographs.
IEEE Trans. Cybern., 2022

Viewing Behavior Supported Visual Saliency Predictor for 360 Degree Videos.
IEEE Trans. Circuits Syst. Video Technol., 2022

Residual-Guided Multiscale Fusion Network for Bit-Depth Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2022

Mixed-Weight Neural Bagging for Detecting $m^6A$ Modifications in SARS-CoV-2 RNA Sequencing.
IEEE Trans. Biomed. Eng., 2022

Multiscale Brain-Like Neural Network for Saliency Prediction on Omnidirectional Images.
IEEE Trans. Cogn. Dev. Syst., 2022

A novel stereo image self-inpainting network for autonomous robots.
Robotics Auton. Syst., 2022

Auto uning of price prediction models for high-frequency trading via reinforcement learning.
Pattern Recognit., 2022

DeepDRiD: Diabetic Retinopathy - Grading and Image Quality Estimation Challenge.
Patterns, 2022

Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Neural Graph Matching Network: Learning Lawler's Quadratic Assignment Problem With Extension to Hypergraph and Multiple-Graph Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep Object Tracking With Shrinkage Loss.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

DIPONet: Dual-information progressive optimization network for salient object detection.
Digit. Signal Process., 2022

Screen Content Quality Assessment: Overview, Benchmark, and Beyond.
ACM Comput. Surv., 2022

A K-variate Time Series Is Worth K Words: Evolution of the Vanilla Transformer Architecture for Long-term Multivariate Time Series Forecasting.
CoRR, 2022

A Survey on Label-efficient Deep Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction.
CoRR, 2022

Isolating and Leveraging Controllable and Noncontrollable Visual Dynamics in World Models.
CoRR, 2022

DOTIN: Dropping Task-Irrelevant Nodes for GNNs.
CoRR, 2022

Confusing Image Quality Assessment: Towards Better Augmented Reality Experience.
CoRR, 2022

Analysis Method of Strapdown Inertial Navigation Error Distribution Based on Covariance Matrix Decomposition.
CoRR, 2022

Facial Geometric Detail Recovery via Implicit Representation.
CoRR, 2022

DialogueNeRF: Towards Realistic Avatar Face-to-face Conversation Video Generation.
CoRR, 2022

A GNSS Aided Initial Alignment Method for MEMS-IMU Based on Backtracking Algorithm and Backward Filtering.
CoRR, 2022

DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering.
CoRR, 2022

Mind Your Solver! On Adversarial Attack and Defense for Combinatorial Optimization.
CoRR, 2022

L3E-HD: A Framework Enabling Efficient Ensemble in High-Dimensional Space for Language Tasks.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ZARTS: On Zero-order Optimization for Neural Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CageNeRF: Cage-based Neural Radiance Field for Generalized 3D Deformation and Animation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

M-Mix: Generating Hard Negatives via Multi-sample Mixing for Contrastive Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Learning Mixture of Neural Temporal Point Processes for Multi-dimensional Event Sequence Clustering.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields.
Proceedings of the International Conference on Machine Learning, 2022

Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Randomize and Match: Exploiting Irregular Sparsity for Energy Efficient Processing in SNNs.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

DMANET: Deep Learning-Based Differential Microphone Arrays for Multi-Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

EAutoDet: Efficient Architecture Search for Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-supervised Learning of Visual Graph Matching.
Proceedings of the Computer Vision - ECCV 2022, 2022

SATO: spiking neural network acceleration via temporal-oriented dataflow and architecture.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Align Representations with Base: A New Approach to Self-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Invisible Markers for Hidden Codes in Offline-to-online Photography.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Exploring Frequency Adversarial Attacks for Face Forgery Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Continual Predictive Learning from Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

End-to-End Reconstruction-Classification Learning for Face Forgery Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Where are the Children with Autism Looking in Reality?
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Emotional Semantic Neural Radiance Fields for Audio-Driven Talking Head.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Gesture Interaction for Gaming Control Based on an Interferometric Radar.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Contactless Cardiogram Reconstruction Based on the Wavelet Transform via Continuous-Wave Radar.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Exploring Visual Context for Weakly Supervised Person Search.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Exploiting Local Degradation Characteristics and Global Statistical Properties for Blind Quality Assessment of Tone-Mapped HDR Images.
IEEE Trans. Multim., 2021

Uncertainty-Aware Blind Image Quality Assessment in the Laboratory and Wild.
IEEE Trans. Image Process., 2021

Compression Priors Assisted Convolutional Neural Network for Fractional Interpolation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Language-Guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning.
IEEE Trans. Circuits Syst. Video Technol., 2021

Adaptive Region Proposal With Channel Regularization for Robust Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

Towards multi-scale deep features learning with correlation metric for person re-identification.
Knowl. Based Syst., 2021

Respiratory Consultant by Your Side: Affordable and Remote Intelligent Respiratory Rate and Respiratory Pattern Monitoring System.
IEEE Internet Things J., 2021

RANSP: Ranking attention network for saliency prediction on omnidirectional images.
Neurocomputing, 2021

Progressive Multi-granularity Analysis for Video Prediction.
Int. J. Comput. Vis., 2021

Structured Computational Modeling of Human Visual System for No-reference Image Quality Assessment.
Int. J. Autom. Comput., 2021

Diabetic Retinal Grading Using Attention-Based Bilinear Convolutional Neural Network and Complement Cross Entropy.
Entropy, 2021

Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid.
CoRR, 2021

Consensus Synergizes with Memory: A Simple Approach for Anomaly Segmentation in Urban Scenes.
CoRR, 2021

TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification.
CoRR, 2021

DAAS: Differentiable Architecture and Augmentation Policy Search.
CoRR, 2021

Efficient Person Search: An Anchor-Free Approach.
CoRR, 2021

Local-to-Global Self-Attention in Vision Transformers.
CoRR, 2021

Exploring Visual Context for Weakly Supervised Person Search.
CoRR, 2021

Making CNNs Interpretable by Building Dynamic Sequential Decision Forests with Top-down Hierarchy Learning.
CoRR, 2021

Saliency prediction on omnidirectional images with attention-aware feature fusion network.
Appl. Intell., 2021

Cross-Modality 3D Object Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Scene-Aware Ensemble Learning for Robust Crowd Counting.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Different Effects of Mass and Damping on Performance of Vibration and Wind Energy Harvesters.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021

Cross-Modal 3D Object Detection and Tracking for Auto-Driving.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Context-Aware Image Inpainting with Learned Semantic Priors.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Learning Spectral Dictionary for Local Representation of Mesh.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Lavs: A Lightweight Audio-Visual Saliency Prediction Model.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A Lightweight Saliency Prediction Model for Omnidirectional Images.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Learning to Track Objects from Unlabeled Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Perceptual Quality Assessment for Recognizing True and Pseudo 4k Content.
Proceedings of the IEEE International Conference on Acoustics, 2021

Combinatorial Learning of Graph Edit Distance via Dynamic Embedding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PointAugmenting: Cross-Modal Augmentation for 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking Bi-Level Optimization in Neural Architecture Search: A Gibbs Sampling Perspective.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Learning Comprehensive Motion Representation for Action Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Learning Local Neighboring Structure for Robust 3D Shape Representation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval.
IEEE Trans. Multim., 2020

STFlow: Self-Taught Optical Flow Estimation Using Pseudo Labels.
IEEE Trans. Image Process., 2020

A Multimodal Saliency Model for Videos With High Audio-Visual Correspondence.
IEEE Trans. Image Process., 2020

A Metric for Light Field Reconstruction, Compression, and Display Quality Evaluation.
IEEE Trans. Image Process., 2020

Long-Term Video Prediction via Criticization and Retrospection.
IEEE Trans. Image Process., 2020

MUGGLE: MUlti-Stream Group Gaze Learning and Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2020

A Wavelet-Predominant Algorithm Can Evaluate Quality of THz Security Image and Identify Its Usability.
IEEE Trans. Broadcast., 2020

Unsupervised learning of optical flow with patch consistency and occlusion estimation.
Pattern Recognit., 2020

Tiny-BDN: An Efficient and Compact Barcode Detection Network.
IEEE J. Sel. Top. Signal Process., 2020

Unobtrusive and Automatic Classification of Multiple People's Abnormal Respiratory Patterns in Real Time Using Deep Neural Network and Depth Camera.
IEEE Internet Things J., 2020

Fine-grained image analysis via progressive feature learning.
Neurocomputing, 2020

Generative adversarial networks for non-negative matrix factorization in temporal psycho-visual modulation.
Digit. Signal Process., 2020

Extended geometric models for stereoscopic 3D with vertical screen disparity.
Displays, 2020

DS-Net: Dynamic Spatiotemporal Network for Video Salient Object Detection.
CoRR, 2020

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradients Accumulation.
CoRR, 2020

Collaborative Learning for Faster StyleGAN Embedding.
CoRR, 2020

PAI-Conv: Permutable Anisotropic Convolutional Networks for Learning on Point Clouds.
CoRR, 2020

PAI-GCN: Permutable Anisotropic Graph Convolutional Networks for 3D Shape Representation Learning.
CoRR, 2020

Graduated Assignment for Joint Multi-Graph Matching and Clustering with Application to Unsupervised Graph Matching Network Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

MergeNAS: Merge Operations into One for Differentiable Architecture Search.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Saliency Prediction on Omnidirectional Images with Brain-Like Shallow Neural Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Video Prediction via Example Guidance.
Proceedings of the 37th International Conference on Machine Learning, 2020

Ransp: Ranking Attention Network For Saliency Prediction On Omnidirectional Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

A Multiple Attributes Image Quality Database for Smartphone Camera Photo Quality Assessment.
Proceedings of the IEEE International Conference on Image Processing, 2020

Learning To Blindly Assess Image Quality In The Laboratory And Wild.
Proceedings of the IEEE International Conference on Image Processing, 2020

Hierarchical Style-Based Networks for Motion Synthesis.
Proceedings of the Computer Vision - ECCV 2020, 2020

Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering.
Proceedings of the Computer Vision - ECCV 2020, 2020

Robust Tracking Against Adversarial Attacks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Layered Neighborhood Expansion for Incremental Multiple Graph Matching.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Kinematics Analysis for Monocular 3D Human Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semi-supervised 3D Face Representation Learning from Unconstrained Photo Collections.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning Time Series Associated Event Sequences With Recurrent Point Process Networks.
IEEE Trans. Neural Networks Learn. Syst., 2019

Multi-Channel Decomposition in Tandem With Free-Energy Principle for Reduced-Reference Image Quality Assessment.
IEEE Trans. Multim., 2019

Structure-Constrained Motion Sequence Generation.
IEEE Trans. Multim., 2019

Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images.
IEEE Trans. Multim., 2019

Spatiotemporal Symmetric Convolutional Neural Network for Video Bit-Depth Enhancement.
IEEE Trans. Multim., 2019

Deep Progressive Hashing for Image Retrieval.
IEEE Trans. Multim., 2019

Objective Quality Evaluation of Dehazed Images.
IEEE Trans. Intell. Transp. Syst., 2019

BE-CALF: Bit-Depth Enhancement by Concatenating All Level Features of DNN.
IEEE Trans. Image Process., 2019

Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer.
IEEE Trans. Image Process., 2019

Physical Password Breaking via Thermal Sequence Analysis.
IEEE Trans. Inf. Forensics Secur., 2019

Multi-level attention model for person re-identification.
Pattern Recognit. Lett., 2019

Robust Visual Tracking via Hierarchical Convolutional Features.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

EMBDN: An Efficient Multiclass Barcode Detection Network for Complicated Environments.
IEEE Internet Things J., 2019

Recognition oriented facial image quality assessment via deep convolutional neural network.
Neurocomputing, 2019

Learning transform-aware attentive network for object tracking.
Neurocomputing, 2019

End-to-end visual grounding via region proposal networks and bilinear pooling.
IET Comput. Vis., 2019

User tailored colorization using automatic scribbles and hierarchical features.
Digit. Signal Process., 2019

Cross-modality motion parameterization for fine-grained video prediction.
Comput. Vis. Image Underst., 2019

A Saliency Dataset of Head and Eye Movements for Augmented Reality.
CoRR, 2019

Robust Invisible Hyperlinks in Physical Photographs Based on 3D Rendering Attacks.
CoRR, 2019

Decoding Spiking Mechanism with Dynamic Learning on Neuron Population.
CoRR, 2019

Deep Unsupervised Clustering with Clustered Generator Model.
CoRR, 2019

Learning to Blindly Assess Image Quality in the Laboratory and Wild.
CoRR, 2019

Reinforcement Learning with Policy Mixture Model for Temporal Point Processes Clustering.
CoRR, 2019

A dataset of eye movements for the children with autism spectrum disorder.
Proceedings of the 10th ACM Multimedia Systems Conference, 2019

MC360IQA: The Multi-Channel CNN for Blind 360-Degree Image Quality Assessment.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Learning Interpretable Deep State Space Model for Probabilistic Time Series Forecasting.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Piezoelectric Wind Energy Harvester with Interaction Between Vortex-Induced Vibration and Galloping.
Proceedings of the 2019 IEEE SENSORS, Montreal, QC, Canada, October 27-30, 2019, 2019

Cross Modality Alignment of Medical Volumes using Spatio-Semantic Attentive Cycle-GAN.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Hierarchical Features Fusion for Image Aesthetics Assessment.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Advanced CNN Based Motion Compensation Fractional Interpolation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Variational Few-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Combinatorial Embedding Networks for Deep Graph Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Context Graph for Person Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Synergizing Local and Global Models for Matrix Approximation.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Efficient Quantization for Neural Networks with Binary Weights and Low Bitwidth Activations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Human Action Transfer Based on 3D Model Reconstruction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Learning Semantic-Aligned Action Representation.
IEEE Trans. Neural Networks Learn. Syst., 2018

Blind Quality Assessment Based on Pseudo-Reference Image.
IEEE Trans. Multim., 2018

IPAD: Intensity Potential for Adaptive De-Quantization.
IEEE Trans. Image Process., 2018

Blind Image Quality Estimation via Distortion Aggravation.
IEEE Trans. Broadcast., 2018

Arrow's Impossibility Theorem inspired subjective image quality assessment approach.
Signal Process., 2018

Saliency-induced reduced-reference quality index for natural scene and screen content images.
Signal Process., 2018

Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking.
Int. J. Comput. Vis., 2018

Correlation Propagation Networks for Scene Text Detection.
CoRR, 2018

Publication Popularity Modeling via Adversarial Learning of Profile-Specific Dynamic Process.
IEEE Access, 2018

SIQD: Surveillance Image Quality Database and Performance Evaluation for Objective Algorithms.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Video Prediction via Selective Sampling.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Reduced-Reference Image Quality Assessment Based on Free-Energy Principle with Multi-Channel Decomposition.
Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

Depth Structure Preserving Scene Image Generation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Decoupled Learning for Factorial Marked Temporal Point Processes.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Perceptual Quality Assessment of Omnidirectional Images.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Modeling Thermal Sequence Signal Decreasing for Dual Modal Password Breaking.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Learning to Predict where the Children with Asd Look.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Deep Regression Tracking with Shrinkage Loss.
Proceedings of the Computer Vision - ECCV 2018, 2018

Attention-GAN for Object Transfiguration in Wild Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

Fine-Grained Video Captioning for Sports Narrative.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multiple Granularity Group Interaction Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Structure Preserving Video Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Crowd Counting via Adversarial Cross-Scale Consistency Pursuit.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Conditional Generative Models for Temporal Point Processes.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Video Summarization via Semantic Attended Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling.
IEEE Trans. Multim., 2017

Deep Multimodal Distance Metric Learning Using Click Constraints for Image Ranking.
IEEE Trans. Cybern., 2017

No-Reference Quality Metric of Contrast-Distorted Images Based on Information Maximization.
IEEE Trans. Cybern., 2017

适配分辨率动态变化的低复杂度视频场景切换检测方法 (Low Complexity Scene Change Detection Algorithm for Supporting Resolution Dynamic Change).
计算机科学, 2017

Visual attention analysis and prediction on human faces.
Inf. Sci., 2017

Perceptual information hiding based on multi-channel visual masking.
Neurocomputing, 2017

An integrated system for building structural health monitoring and early warning based on an Internet of things approach.
Int. J. Distributed Sens. Networks, 2017

A self-adaptive load-dispatching control framework for device data accessing in IoT-based systems.
Int. J. Commun. Syst., 2017

Learning a no-reference quality metric for single-image super-resolution.
Comput. Vis. Image Underst., 2017

Skeleton-aided Articulated Motion Generation.
CoRR, 2017

Joint Modeling of Event Sequence and Time Series with Attentional Twin Recurrent Neural Networks.
CoRR, 2017

Terahertz Security Image Quality Assessment by No-reference Model Observers.
CoRR, 2017

Deep Binary Representation for Efficient Image Retrieval.
Adv. Multim., 2017

A Novel Text Structure Feature Extractor for Chinese Scene Text Detection and Recognition.
IEEE Access, 2017

Feature selection based on network maximal correlation.
Proceedings of the 20th International Symposium on Wireless Personal Multimedia Communications, 2017

Enhancing pulmonary nodule detection via cross-modal alignment.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Learning a convolutional neural network for fractional interpolation in HEVC inter coding.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Rain removal via residual generation cascading.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

On the Impact of Environmental Sound on Perceived Visual Quality.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Assessment of Visually Induced Motion Sickness in Immersive Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Wasserstein Learning of Deep Generative Point Process Models.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Skeleton-Aided Articulated Motion Generation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Fine-Grained Recognition via Attribute-Guided Attentive Feature Aggregation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Pedestrian Detection via Bi-directional Multi-scale Analysis.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

IVQAD 2017: An immersive video quality assessment database.
Proceedings of the International Conference on Systems, Signals and Image Processing, 2017

Predicting Human Interaction via Relative Attention Model.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Image Matching via Loopy RNN.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Selection of Good Display Mode for Terahertz Security Image via Image Quality Assessment.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Terahertz Security Image Quality Assessment by No-reference Model Observers.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Deep hash learning for efficient image retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

IPAD: Intensity potential for adaptive de-quantization.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Dual-mode imaging system for non-contact heart rate estimation during night.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Learning Mixtures of Markov Chains from Aggregate Data with Structural Constraints (Extended Abstract).
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Performance Guaranteed Network Acceleration via High-Order Residual Quantization.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Video Segmentation via Multiple Granularity Analysis.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Recurrent Modeling of Interaction Context for Collective Activity Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Subjective and objective quality assessment for color changed images.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

An eye-friendly dual-view projection system using temporal psychovisual modulation.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Modeling the Intensity Function of Point Process Via Recurrent Neural Networks.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Unsupervised Deep Learning for Optical Flow Estimation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Fixation Prediction through Multimodal Analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset.
IEEE Trans. Multim., 2016

Blind Quality Assessment of Tone-Mapped Images Via Analysis of Information, Naturalness, and Structure.
IEEE Trans. Multim., 2016

Saliency-Guided Quality Assessment of Screen Content Images.
IEEE Trans. Multim., 2016

Learning Mixtures of Markov Chains from Aggregate Data with Structural Constraints.
IEEE Trans. Knowl. Data Eng., 2016

A Reconfigurable Tangram Model for Scene Representation and Categorization.
IEEE Trans. Image Process., 2016

Analysis of Distortion Distribution for Pooling in Image Quality Prediction.
IEEE Trans. Broadcast., 2016

When Correlation Filters Meet Convolutional Neural Networks for Visual Tracking.
IEEE Signal Process. Lett., 2016

Content-weighted mean-squared error for quality assessment of compressed images.
Signal Image Video Process., 2016

Multi-Graph Matching via Affinity Optimization with Graduated Consistency Regularization.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Sketch retrieval via local dense stroke features.
Image Vis. Comput., 2016

Learning a blind quality evaluation engine of screen content images.
Neurocomputing, 2016

Factors in Finetuning Deep Model for object detection.
CoRR, 2016

Joint Prediction of Rating and Popularity for Cold-Start Item by Sentinel User Selection.
IEEE Access, 2016

Evaluation of beyond-HEVC entropy coding methods for DCT transform coefficients.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A novel scene text detection algorithm based on convolutional neural network.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Exploiting neural models for no-reference image quality assessment.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A Short Survey of Recent Advances in Graph Matching.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Visual saliency model based on minimum description length.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Live demonstration: Screen piracy protection using saturation laser attack and TPVM.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Modeling Contagious Merger and Acquisition via Point Processes with a Profile Regression Prior.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

On Modeling and Predicting Individual Paper Citation Count over Time.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A novel text structure feature extractor for Chinese scene text detection and recognition.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Blind quality assessment of compressed images via pseudo structural similarity.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

A parallel-fusion RNN-LSTM architecture for image caption generation.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Recognition Oriented Facial Image Quality Assessment via Deep Convolutional Neural Network.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Person Re-identification via Recurrent Feature Aggregation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Cascaded Interactional Targeting Network for Egocentric Video Analysis.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Temporal Action Localization with Pyramid of Score Distribution Features.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Progressively Parsing Interactional Objects for Fine Grained Action Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Distinguish True or False 4K Resolution Using Frequency Domain Analysis and Free-Energy Modelling.
Proceedings of the 7th International Conference on Cloud Computing and Big Data, 2016

A Flash Light System for Individuals with Visual Impairment Based on TPVM.
Proceedings of the 7th International Conference on Cloud Computing and Big Data, 2016

GPU accelerated high-quality video/image super-resolution.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

Review of ITU-T parametric models for compressed video quality estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

BEST: Benchmark and Evaluation of Surveillance Task.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Using Free Energy Principle For Blind Image Quality Assessment.
IEEE Trans. Multim., 2015

Single Image Superresolution Based on Gradient Profile Sharpness.
IEEE Trans. Image Process., 2015

Consistency-Driven Alternating Optimization for Multigraph Matching: A Unified Approach.
IEEE Trans. Image Process., 2015

No-Reference Image Sharpness Assessment in Autoregressive Parameter Space.
IEEE Trans. Image Process., 2015

Noise Estimation of Natural Images via Statistical Analysis and Noise Injection.
IEEE Trans. Circuits Syst. Video Technol., 2015

Spatial Error Concealment With an Adaptive Linear Predictor.
IEEE Trans. Circuits Syst. Video Technol., 2015

Automatic Contrast Enhancement Technology With Saliency Preservation.
IEEE Trans. Circuits Syst. Video Technol., 2015

Quality Assessment Considering Viewing Distance and Image Resolution.
IEEE Trans. Broadcast., 2015

Visual Saliency Detection With Free Energy Theory.
IEEE Signal Process. Lett., 2015

An Optimized Pixel-Wise Weighting Approach for Patch-Based Image Denoising.
IEEE Signal Process. Lett., 2015

Wavelet-based hybrid natural image modeling using generalized Gaussian and α-stable distributions.
J. Vis. Commun. Image Represent., 2015

Unsupervised adaptive sign language recognition based on hypothesis comparison guided cross validation and linguistic prior filtering.
Neurocomputing, 2015

A General Multi-Graph Matching Approach via Graduated Consistency-regularized Boosting.
CoRR, 2015

Real time and scene invariant crowd counting: Across a line or inside a region.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Multi-Task Multi-Dimensional Hawkes Processes for Modeling Event Sequences.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Image inpainting with adaptive linear predictor.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Learning a temporally invariant representation for visual tracking.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A Matrix Decomposition Perspective to Multiple Graph Matching.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Hierarchical Convolutional Features for Visual Tracking.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Online trendy topics detection in microblogs with selective user monitoring under cost constraints.
Proceedings of the 2015 IEEE International Conference on Communications, 2015

Cross-scene crowd counting via deep convolutional neural networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Discrete hyper-graph matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Motion Part Regularization: Improving action recognition via trajectory group selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Long-term correlation tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

A new unsupervised convolutional neural network model for Chinese scene text detection.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

CNN-based shot boundary detection and video annotation.
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

On Machine Learning towards Predictive Sales Pipeline Analytics.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Multi-View Point Registration via Alternating Optimization.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Online Pedestrian Tracking Using Ensemble Color Feature.
Proceedings of the 15th IEEE International Conference on Computer and Information Technology, 2015

2014
Robust Noise Estimation Based on Noise Injection.
J. Signal Process. Syst., 2014

Generalized Equalization Model for Image Enhancement.
IEEE Trans. Multim., 2014

Person Re-Identification Over Camera Networks Using Multi-Task Distance Metric Learning.
IEEE Trans. Image Process., 2014

Lossless Predictive Coding for Images With Bayesian Treatment.
IEEE Trans. Image Process., 2014

Inferring User Image-Search Goals Under the Implicit Guidance of Users.
IEEE Trans. Circuits Syst. Video Technol., 2014

Evaluation of Different Algorithms of Nonnegative Matrix Factorization in Temporal Psychovisual Modulation.
IEEE Trans. Circuits Syst. Video Technol., 2014

You Are What You Watch and When You Watch: Inferring Household Structures From IPTV Viewing Data.
IEEE Trans. Broadcast., 2014

Hybrid No-Reference Quality Metric for Singly and Multiply Distorted Images.
IEEE Trans. Broadcast., 2014

Separation of Weak Reflection from a Single Superimposed Image.
IEEE Signal Process. Lett., 2014

HEASK: Robust homography estimation based on appearance similarity and keypoint correspondences.
Pattern Recognit., 2014

Image quality/distortion metric based on α-stable model similarity in wavelet domain.
J. Vis. Commun. Image Represent., 2014

Behavior Informatics: A New Perspective.
IEEE Intell. Syst., 2014

Hybrid modeling of natural image in wavelet domain.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Online video object classification using fast similarity network fusion.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Ultra high definition video saliency database.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Learning to integrate local and global features for a blind image quality measure.
Proceedings of the International Conference on Smart Computing, 2014

Sound influences visual attention discriminately in videos.
Proceedings of the Sixth International Workshop on Quality of Multimedia Experience, 2014

Effective preprocessing stage in the fourier transform domain for image quality assessment.
Proceedings of the Sixth International Workshop on Quality of Multimedia Experience, 2014

Spatial error concealment with adaptive linear predictor.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Details preservation inspired blind quality metric of tone mapping methods.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Fast coding unit depth decision for HEVC.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Using image signature for effective and efficient reduced-reference image quality assessment.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

HDR2014 - A high dynamic range image quality database.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Image quality/distortion metric based on α-stable model similarity.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Deep learning network for blind image quality assessment.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

An efficient color image quality metric with local-tuned-global model.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Cost-Effective User Monitoring for Popularity Prediction of Online User-Generated Content.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Graduated Consistency-Regularized Optimization for Multi-graph Matching.
Proceedings of the Computer Vision - ECCV 2014, 2014

Blind image quality assessment for noise.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

Using JPEG and JPEG2000 compressions for fast image quality metrics based on free energy theory.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

Quaternion Phase Congruency model for edge saliency map extraction and image blur measurement.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2014

Speed up HEVC encoder by precoding with H.264.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
A New Algorithm for Inferring User Search Goals with Feedback Sessions.
IEEE Trans. Knowl. Data Eng., 2013

Reorder user's tweets.
ACM Trans. Intell. Syst. Technol., 2013

Single Image Super-resolution With Detail Enhancement Based on Local Fractal Analysis of Gradient.
IEEE Trans. Circuits Syst. Video Technol., 2013

A new psychovisual paradigm for image quality assessment: from differentiating distortion types to discriminating quality conditions.
Signal Image Video Process., 2013

Sample-based image completion using structure synthesis.
J. Vis. Commun. Image Represent., 2013

A generalized EMD with body prior for pedestrian identification.
J. Vis. Commun. Image Represent., 2013

Quaternion based optical flow estimation for robust object tracking.
Digit. Signal Process., 2013

Cost-effective node monitoring for online hot eventdetection in sina weibo microblogging.
Proceedings of the 22nd International World Wide Web Conference, 2013

Retina model inspired image quality assessment.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Lossless predictive coding with Bayesian treatment.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Adaptive high-frequency clipping for improved image quality assessment.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Brightness preserving video contrast enhancement using S-shaped Transfer function.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Measuring orderliness based on social force model in collective motions.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

A new image quality metric based on MIx-Scale transform.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2013

FISBLIM: A FIve-Step BLInd Metric for quality assessment of multiply distorted images.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2013

The SJTU 4K video sequence dataset.
Proceedings of the Fourth International Workshop on Quality of Multimedia Experience, 2013

A dual-model approach to blind quality assessment of noisy images.
Proceedings of the 30th Picture Coding Symposium, 2013

World Expo Problem and Its Mixed Integer Programming Based Solution.
Proceedings of the Behavior and Social Computing, 2013

Analysis and identification of spamming behaviors in Sina Weibo microblog.
Proceedings of the 7th Workshop on Social Network Mining and Analysis, 2013

Separation of weak reflection from a single superimposed image using gradient profile sharpness.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Hybrid image interpolation with soft-decision kernel regression.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Self-adaptive scale transform for IQA metric.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A new reduced-reference image quality assessment using structural degradation model.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Self-example based super-resolution with fractal-based gradient enhancement.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Structural similarity weighting for image quality assessment.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

No-reference image quality assessment metric by combining free energy theory and structural degradation model.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

An image quality assessment metric based on quaternion wavelet transform.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Subjective and objective quality assessment for images with contrast change.
Proceedings of the IEEE International Conference on Image Processing, 2013

Image restoration via efficient Gaussian mixture model learning.
Proceedings of the IEEE International Conference on Image Processing, 2013

Action Recognition with Actons.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Joint Optimization for Consistent Multiple Graph Matching.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Sketch Retrieval via Dense Stroke Features.
Proceedings of the British Machine Vision Conference, 2013

No-reference image blur assessment based on gradient profile sharpness.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2013

Observation of Matthew Effects in Sina Weibo microblogger.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
A Psychovisual Quality Metric in Free-Energy Principle.
IEEE Trans. Image Process., 2012

Multiscale Semilocal Interpolation With Antialiasing.
IEEE Trans. Image Process., 2012

Image Reconstruction From Random Samples With Multiscale Hybrid Parametric and Nonparametric Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2012

No-Reference Stereoscopic IQA Approach: From Nonlinear Effect to Parallax Compensation.
J. Electr. Comput. Eng., 2012

Automatic Movie Restoration Based on Wave Atom Transform and Nonparametric Model.
EURASIP J. Adv. Signal Process., 2012

Learning reconfigurable scene representation by tangram model.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Inferring user image-search goals by mining query logs with semi-supervised spectral clustering.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

New bounds on image denoising: Viewpoint of sparse representation and non-local averaging.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Robust object tracking with bidirectional corner matching and trajectory smoothness algorithm.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Nonlinear additive model based saliency map weighting strategy for image quality assessment.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Colorization Using Quaternion Algebra with Automatic Scribble Generation.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Parsing collective behaviors by hierarchical model with varying structure.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Parallel-Friendly Patch Match Based on Jump Flooding.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Spatial Detection of Line Scratch Based on Histogram.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Image super-resolution based on a novel edge sharpness prior.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

No reference measurement of contrast distortion and optimal contrast enhancement.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A new no-reference stereoscopic image quality assessment based on ocular dominance theory and degree of parallax.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Cross-Layered Hidden Markov Modeling for Surveillance Event Recognition.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

A Robust Homography Estimation Method Based on Keypoint Consensus and Appearance Similarity.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Dual-Transform Based Noise Estimation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Recognizing Human Group Behaviors with Multi-group Causalities.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2012

A Coarse to Fine Facial Key Landmark Points Locating Algorithm Based on Active Shape Model.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2012

On the Convergence of Graph Matching: Graduated Assignment Revisited.
Proceedings of the Computer Vision - ECCV 2012, 2012

Image Classification by Hierarchical Spatial Pooling with Partial Least Squares Analysis.
Proceedings of the British Machine Vision Conference, 2012

Feature Analysis of Spammers in Social Networks with Active Honeypots: A Case Study of Chinese Microblogging Networks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2012

Robust single image super-resolution based on gradient enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An improved full-reference image quality metric based on structure compensation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Time Varying Dynamic Bayesian Network for Nonstationary Events Modeling and Online Inference.
IEEE Trans. Signal Process., 2011

Integrating Visual Saliency and Consistency for Re-Ranking Image Search Results.
IEEE Trans. Multim., 2011

Adaptive Sequential Prediction of Multidimensional Signals With Applications to Lossless Image Coding.
IEEE Trans. Image Process., 2011

Image colorization using Bayesian nonlocal inference.
J. Electronic Imaging, 2011

Crowd instability analysis using velocity-field based social force model.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Image reconstruction from random samples with parametric and nonparametric modeling.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Separation of superimposed images with unknown motions using sparsity priors.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Inferring users' image-search goals with pseudo-images.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

MCM: An Efficient Geometric Constraint Method for Robust Local Feature Matching.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

Example-based image contrast enhancement.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

A fast video stabilization algorithm based on block matching and edge completion.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

A new network-based algorithm for multi-camera abnormal activity detection.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Hypothesis comparison guided cross validation for unsupervised signer adaptation.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Human identification using body prior and generalized EMD.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning dictionary via subspace segmentation for sparse representation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning sparse dictionaries with a popularity-based model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Gait identification by sparse representation.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

Building Artificial Identities in Social Network Using Semantic Information.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

An improved contrast-tone mapping algorithm for image contrast enhancement.
Proceedings of the 8th International Conference on Information, 2011

2010
Terrestrial Television Broadcasting in China: Technologies and Applications.
Proceedings of the Intelligent Multimedia Communication: Techniques and Applications, 2010

Guest Editorial: Special Issue on SoC for Multimedia Networking (SiPS 2007).
J. Signal Process. Syst., 2010

Bayesian Error Concealment With DCT Pyramid for Images.
IEEE Trans. Circuits Syst. Video Technol., 2010

Scene categorization based on heterogeneous features.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Image denoising using local tangent space alignment.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Fire Surveillance Method Based on Quaternionic Wavelet Features.
Proceedings of the Advances in Multimedia Modeling, 2010

Simultaneous deblocking and error concealment for decoded visual signal.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Part-based human gait identification under clothing and carrying condition variations.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Color transfer via local binary patterns mapping.
Proceedings of the International Conference on Image Processing, 2010

Integrating visual saliency and consistency for re-ranking image search results.
Proceedings of the International Conference on Image Processing, 2010

Bayesian error concealment with DCT pyramid.
Proceedings of the IEEE International Conference on Acoustics, 2010

Separating Reflections from a Single Image Using Spatial Smoothness and Structure Information.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

QWT: Retrospective and New Applications.
Proceedings of the Geometric Algebra Computing - in Engineering and Computer Science., 2010

2009
Robust Color Demosaicking With Adaptation to Varying Spectral Correlations.
IEEE Trans. Image Process., 2009

Robust Video Region-of-Interest Coding Based on Leaky Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2009

CamShift guided particle filter for visual tracking.
Pattern Recognit. Lett., 2009

Efficient quadtree based block-shift filtering for deblocking and deringing.
J. Vis. Commun. Image Represent., 2009

Shanghai Jiao Tong University participation in high-level feature extraction and surveillance event detection at TRECVID 2009.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Structure-Preserving Colorization Based on Quaternionic Phase Reconstruction.
Proceedings of the Advances in Multimedia Information Processing, 2009

Metric Learning for Regression Problems and Human Age Estimation.
Proceedings of the Advances in Multimedia Information Processing, 2009

Spatiotemporal Phase Congruency Based Invariant Features for Human Behavior Classification.
Proceedings of the Advances in Multimedia Information Processing, 2009

Learning distance metric for regression by semidefinite programming with application to human age estimation.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Image classification based on pyramid histogram of topics.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A lexica family with small semantic gap.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Learning super resolution with global and local constraints.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

MDL context modeling of images with application to denoising.
Proceedings of the International Conference on Image Processing, 2009

Interpolating fine texturess with fields of experts prior.
Proceedings of the International Conference on Image Processing, 2009

Sub clustering K-SVD: Size variable dictionary learning for sparse representations.
Proceedings of the International Conference on Image Processing, 2009

Event recognition with time varying Hidden Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Efficient Deblocking With Coefficient Regularization, Shape-Adaptive Filtering, and Quantization Constraint.
IEEE Trans. Multim., 2008

Cross-Dimensional Perceptual Quality Assessment for Low Bit-Rate Videos.
IEEE Trans. Multim., 2008

Multilevel Framework to Detect and Handle Vehicle Occlusion.
IEEE Trans. Intell. Transp. Syst., 2008

Efficient Image Deblocking Based on Postfiltering in Shifted Windows.
IEEE Trans. Circuits Syst. Video Technol., 2008

Three Dimensional Scalable Video Adaptation via User-End Perceptual Quality Assessment.
IEEE Trans. Broadcast., 2008

GOP-level transmission distortion modeling for mobile streaming video.
Signal Process. Image Commun., 2008

No-reference noticeable blockiness estimation in images.
Signal Process. Image Commun., 2008

Contourlet-based image adaptive watermarking.
Signal Process. Image Commun., 2008

Unified deblocking for discrete cosine transfer compressed images.
J. Electronic Imaging, 2008

Analysis of the H.264 advanced video coding standard and an associated rate control scheme.
J. Electronic Imaging, 2008

Shanghai Jiao Tong University participation in high-level feature extraction, automatic search and surveillance event detectionat TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Cross-dimensional quality assessment for low bitrate video.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Image deringing using quadtree based block-shift filtering.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Application of scalable visual sensitivity profile in image and video coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Face super-resolution using 8-connected Markov Random Fields with embedded prior.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Image error-concealment via Block-based Bilateral Filtering.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Color image watermarking using local quaternion Fourier spectral analysis.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Scalable visual sensitivity profile estimation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Local Quaternionic Gabor Binary Patterns for color face recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Learning object classes from image thumbnails through deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Improved Update Step for Scalable Video Coding in Video Surveillance.
J. VLSI Signal Process., 2007

Moving Cast Shadows Detection Using Ratio Edge.
IEEE Trans. Multim., 2007

Efficient Spatio-temporal Segmentation for Extracting Moving Objects in Video Sequences.
IEEE Trans. Consumer Electron., 2007

Quaternion wavelet phase based stereo matching for uncalibrated images.
Pattern Recognit. Lett., 2007

Spatiotemporal Gaussian mixture model to detect moving objects in dynamic scenes.
J. Electronic Imaging, 2007

Robust Region-of-Interest Scalable Coding with Leaky Prediction in H.264/AVC.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2007

Multi-modality web video categorization.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

GOP-Level Transmission Distortion Modeling for Unequal Importance Judgement.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Multi-Scale Gabor Phase-Based Stereo Matching using Graph Cuts.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Unified Framework for Removing Blocking Artifacts.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Cooperative Stereo Matching using Quaternion Wavlets and Top-Down Segmentation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Bit Allocation for Fine-Granular SNR Scalability Coding with Hierarchical B Pictures.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2D Quaternion Fourier Transform: The Spectrum Properties and its Application in Color Image Registration.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Rate-distortion Based Quantization Level Adjustment Algorithm in Block-based Video Compression.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Spatial Error Concealment Technique using Verge Points.
Proceedings of the IEEE International Conference on Acoustics, 2007

The Wyner-Ziv Rate-Distortion Function of Multivariate Gaussian Sources and Its Application in Distributed Video Coding.
Proceedings of the 2007 Data Compression Conference (DCC 2007), 2007

2006
Statistical Multiplexing based on MPEG-4 Fine Granularity Scalability Coding.
J. VLSI Signal Process., 2006

Transform domain transcoding from MPEG-2 to H.264 with interpolation drift-error compensation.
IEEE Trans. Circuits Syst. Video Technol., 2006

H/<sub>∞</sub>-optimal model for VBR video traffic prediction.
IEEE Trans. Consumer Electron., 2006

Moving vehicles segmentation based on Bayesian framework for Gaussian motion model.
Pattern Recognit. Lett., 2006

Perceptual quality and objective quality measurements of compressed videos.
J. Vis. Commun. Image Represent., 2006

Rate Control for H.264 with Two-Step Quantization Parameter Determination but Single-Pass Encoding.
EURASIP J. Adv. Signal Process., 2006

Cross-layer conditional retransmission for layered video streaming over cellular networks.
Comput. Commun., 2006

Error robustness scheme for H.264 based on LDPC code.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

GES: a new image quality assessment metric based on energy features in Gabor transform domain.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Retransmission-based error spreading for layered video streaming over wireless LANs.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Error-resilience packet scheduling for low bit-rate video streaming over wireless channels.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Moving cast shadows detection based on ratio edge.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Perceptually-adaptive Motion Compensated Temporal Filtering.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Modeling Blocking Visual Sensitivity Profile.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Unequal Iterative Decoding for Power Efficient Video Transmission.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Object Geometry Based Error Resilient Video Coding.
Proceedings of the International Conference on Image Processing, 2006

Motion Vector Smoothing for True Motion Estimation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Channel-Aware Frame Dropping for Cellular Video Streaming.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
An unequal packet loss resilience scheme for video over the Internet.
IEEE Trans. Multim., 2005

Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation.
IEEE Trans. Image Process., 2005

Motion-compensated residue preprocessing in video coding based on just-noticeable-distortion profile.
IEEE Trans. Circuits Syst. Video Technol., 2005

Rate Control for Videophone Using Local Perceptual Cues.
IEEE Trans. Circuits Syst. Video Technol., 2005

Just noticeable distortion model and its applications in video coding.
Signal Process. Image Commun., 2005

Geometrically determining the leaky bucket parameters for video streaming over constant bit-rate channels.
Signal Process. Image Commun., 2005

Perceived visual quality metric based on error spread and contrast.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

An adaptive edge-preserving artifacts removal filter for video post-processing.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Perceptual Quality Metric For Compressed Videos.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Buffer-constrained R-D Model-Based Rate Control for H.264/AVC.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Measuring the negative impact of frame dropping on perceptual visual quality.
Proceedings of the Human Vision and Electronic Imaging X, 2005

2004
A locally adaptive algorithm for measuring blocking artifacts in images and videos.
Signal Process. Image Commun., 2004

Perceptual video quality evaluation using fuzzy inference system.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Local visual perceptual clues and its use in videophone rate control.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Adaptive nonlinear diffusion processes for ringing artifacts removal on JPEG 2000 images.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Perceptually-adaptive pre-processing for motion-compensated residue in video coding.
Proceedings of the 2004 International Conference on Image Processing, 2004

Video quality metric for low bitrate compressed videos.
Proceedings of the 2004 International Conference on Image Processing, 2004

Modelling visual attention and motion effect for visual quality evaluation.
Proceedings of the 2004 International Conference on Image Processing, 2004

An effective perceptual weighting model for videophone coding.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Spatial selectivity modulated just-noticeable-distortion profile for video.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A unified architecture for real-time video-coding systems.
IEEE Trans. Circuits Syst. Video Technol., 2003

Unequal loss protection for robust transmission of motion compensated video over the internet.
Signal Process. Image Commun., 2003

Video quality assessment using neural network based on multi-feature extraction.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Perceptually adaptive hybrid video encoding based on just-noticeable-distortion profile.
Proceedings of the Visual Communications and Image Processing 2003, 2003

PSQM-based RR and NR video quality metrics.
Proceedings of the Visual Communications and Image Processing 2003, 2003

A no-reference quality metric for measuring image blur.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Quality evaluation of MPEG-4 and H.26L coded video for mobile multimedia communications.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

IR face synthesis using motion vector field.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Unequal packet loss resilience for MPEG-4 video over the Internet.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

No-reference JPEG-2000 image quality metric.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

On incorporating just-noticeable-distortion profile into motion-compensated prediction for video compression.
Proceedings of the 2003 International Conference on Image Processing, 2003

Low bit rate quality assessment based on perceptual characteristics.
Proceedings of the 2003 International Conference on Image Processing, 2003

Just-noticeable-distortion profile with nonlinear additivity model for perceptual masking in color images.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Perceptual-quality significance map (PQSM) and its application on video quality distortion metrics.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A congestion control strategy for multipoint videoconferencing.
IEEE Trans. Circuits Syst. Video Technol., 2002

Adaptive unequal error control for video over the Internet.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

A router based unequal error control scheme for video over the Internet.
Proceedings of the 2002 International Conference on Image Processing, 2002

Degressive error protection algorithm for MPEG-4 FGS video streaming.
Proceedings of the 2002 International Conference on Image Processing, 2002

Unequal error protection for motion compensated video streaming over the Internet.
Proceedings of the 2002 International Conference on Image Processing, 2002

A Novel Joint Rate Control Scheme for the Coding of Multiple Real Time Video Programs.
Proceedings of the 22nd International Conference on Distributed Computing Systems, 2002

An MPEG-4 FGS-based statistical multiplexer.
Proceedings of the Third International Workshop on Digital and Computational Video, 2002


  Loading...