Ge Li

Orcid: 0000-0003-4079-3968

Affiliations:
  • Peking University Shenzhen Graduate School, School of Electronic and Computer Engineering, Shenzhen, China


According to our database, Ge Li authored at least 249 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Deep Learning for 3D Point Clouds
Springer, ISBN: 978-981-97-9569-7, 2025

2024
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Closing the Gap Between Theory and Practice During Alternating Optimization for GANs.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

3D Point Cloud Attribute Compression Using Diffusion-Based Texture-Aware Intra Prediction.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Enlarged Motion-Aware and Frequency-Aware Network for Compressed Video Artifact Reduction.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Puncturing-Based Resource Allocation for URLLC and eMBB Services via Matching Theory and Unsupervised Deep Learning.
IEEE Trans. Veh. Technol., September, 2024

ComPoint: Can Complex-Valued Representation Benefit Point Cloud Place Recognition?
IEEE Trans. Intell. Transp. Syst., July, 2024

Depth Video Inter Coding Based on Deep Frame Generation.
IEEE Trans. Broadcast., June, 2024

Interpretable Task-inspired Adaptive Filter Pruning for Neural Networks Under Multiple Constraints.
Int. J. Comput. Vis., June, 2024

Advanced Patch-Based Affine Motion Estimation for Dynamic Point Cloud Geometry Compression.
Sensors, May, 2024

Efficient Neural Network Compression Inspired by Compressive Sensing.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Ripple Transformer: A Human-Object Interaction Backbone and a New Prediction Strategy for Smart Surveillance Devices.
IEEE Trans. Consumer Electron., February, 2024

Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions.
IEEE Trans. Multim., 2024

Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression.
IEEE Trans. Image Process., 2024

Adaptive LPU Decision for Dynamic Point Cloud Compression.
IEEE Signal Process. Lett., 2024

Category-agnostic semantic edge detection by measuring neural representation randomness.
Pattern Recognit., 2024

Deep degradation-aware up-sampling-based depth video coding.
J. Electronic Imaging, 2024

Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation.
CoRR, 2024

MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval.
CoRR, 2024

Uncertainty-aware No-Reference Point Cloud Quality Assessment.
CoRR, 2024

ViewPCGC: View-Guided Learned Point Cloud Geometry Compression.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Point Cloud Compression, Enhancement and Applications: From 3D Perception to Large Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Sketch-aided Interactive Fusion Point Cloud Place Recognition.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Instance-level Timing Learning and Prediction at Placement using Res-UNet Network.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

SPCGC: Scalable Point Cloud Geometry Compression for Machine Vision.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

MPVNN: Multi-resolution Point-Voxel Non-parametric Network for 3D Point Cloud Processing.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

PointELM: Fast Point Cloud Classification Using Deep Random Mapping Based Extreme Learning Machines.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

MFITrack: Multi-Frame Integration Strategy for Enhanced Motion-Centric Single Object Tracking.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Efficient Point Cloud Attribute Compression Framework using Attribute-Guided Graph Fourier Transform.
Proceedings of the IEEE International Conference on Acoustics, 2024

ScanPCGC: Learning-Based Lossless Point Cloud Geometry Compression using Sequential Slice Representation.
Proceedings of the IEEE International Conference on Acoustics, 2024

ST-LLM: Large Language Models Are Effective Temporal Learners.
Proceedings of the Computer Vision - ECCV 2024, 2024

Lightweight super resolution network for point cloud geometry compression.
Proceedings of the Data Compression Conference, 2024

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Point Cloud Compression - Technologies and Standardization
Springer, ISBN: 978-981-97-1956-3, 2024

2023
Block-Adaptive Point Cloud Attribute Coding With Region-Aware Optimized Transform.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Mitigating Label Noise in GANs via Enhanced Spectral Normalization.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Global-Context Aggregated Intra Prediction Network for Depth Video Coding.
IEEE Trans. Circuits Syst. II Express Briefs, August, 2023

A Thorough Benchmark and a New Model for Light Field Saliency Detection.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Toward 6G TKμ Extreme Connectivity: Architecture, Key Technologies and Experiments.
IEEE Wirel. Commun., June, 2023

Deep In-Loop Filtering via Multi-Domain Correlation Learning and Partition Constraint for Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

A Regularized Projection-Based Geometry Compression Scheme for LiDAR Point Cloud.
IEEE Trans. Circuits Syst. Video Technol., March, 2023

Image Quality Assessment-driven Reinforcement Learning for Mixed Distorted Image Restoration.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

Rate-Distortion Optimized Geometry Compression for Spinning LiDAR Point Cloud.
IEEE Trans. Multim., 2023

Learning a Compact Spatial-Angular Representation for Light Field.
IEEE Trans. Multim., 2023

Semantic Point Cloud Upsampling.
IEEE Trans. Multim., 2023

Multimodal Data Matters: Language Model Pre-Training Over Structured and Unstructured Electronic Health Records.
IEEE J. Biomed. Health Informatics, 2023

SUR-Driven Video Coding Rate Control for Jointly Optimizing Perceptual Quality and Buffer Control.
IEEE Trans. Image Process., 2023

Applying Collaborative Adversarial Learning to Blind Point Cloud Quality Measurement.
IEEE Trans. Instrum. Meas., 2023

Nonrigid Registration-Based Progressive Motion Compensation for Point Cloud Geometry Compression.
IEEE Trans. Geosci. Remote. Sens., 2023

Adaptive Annotation Distribution for Weakly Supervised Point Cloud Semantic Segmentation.
CoRR, 2023

Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding.
CoRR, 2023

One For All: Video Conversation is Feasible Without Video Instruction Tuning.
CoRR, 2023

Discriminative-region attention and orthogonal-view generation model for vehicle re-identification.
Appl. Intell., 2023

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Frequency-Aware Self-Supervised Monocular Depth Estimation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

IPFR: Identity-Preserving Face Reenactment with Enhanced Domain Adversarial Training and Multi-level Identity Priors.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LIO-PPF: Fast LiDAR-Inertial Odometry via Incremental Plane Pre-Fitting and Skeleton Tracking.
IROS, 2023

Null-Space Diffusion Sampling for Zero-Shot Point Cloud Completion.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Disentangled Feature Distillation for Light Field Super-Resolution with Degradations.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Cross-Level Guided Attention for Human-Object Interaction Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Large-Scale Spatio-Temporal Attention Based Entropy Model for Point Cloud Compression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Causality Compensated Attention for Contextual Biased Visual Recognition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AdaNIC: Towards Practical Neural Image Compression via Dynamic Transform Routing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Point is A Wave: Point-Wave Network for Place Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Surface-Sampling Based Objective Quality Assessment Metrics for Meshes.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient Hierarchical Entropy Model for Learned Point Cloud Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Accelerating Transform Algorithm Implementation for Efficient Intra Coding of 8K UHD Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition.
IEEE Trans. Multim., 2022

Multidirection and Multiscale Pyramid in Transformer for Video-Based Pedestrian Retrieval.
IEEE Trans. Ind. Informatics, 2022

Consistent Quality Oriented Rate Control in HEVC Via Balancing Intra and Inter Frame Coding.
IEEE Trans. Ind. Informatics, 2022

QINet: Decision Surface Learning and Adversarial Enhancement for Quasi-Immune Completion of Diverse Corrupted Point Clouds.
IEEE Trans. Geosci. Remote. Sens., 2022

TICNet: A Target-Insight Correlation Network for Object Tracking.
IEEE Trans. Cybern., 2022

PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport.
IEEE Trans. Circuits Syst. Video Technol., 2022

Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and Benchmark.
IEEE Trans. Circuits Syst. Video Technol., 2022

Cross-Collaborative Fusion-Encoder Network for Robust RGB-Thermal Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2022

Learning Disentangled Representation for Multi-View 3D Object Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Rate-Distortion Optimized Graph for Point Cloud Attribute Coding.
IEEE Signal Process. Lett., 2022

Deep region segmentation-based intra prediction for depth video coding.
Multim. Tools Appl., 2022

Zero-shot unsupervised image-to-image translation via exploiting semantic attributes.
Image Vis. Comput., 2022

Exploiting robust unsupervised video person re-identification.
IET Image Process., 2022

Toward 6G TKμ Extreme Connectivity: Architecture, Key Technologies and Experiments.
CoRR, 2022

Multi-direction and Multi-scale Pyramid in Transformer for Video-based Pedestrian Retrieval.
CoRR, 2022

Two heads are better than one: Enhancing medical representations by pre-training over structured and unstructured electronic health records.
CoRR, 2022

Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems.
Proceedings of the 96th Vehicular Technology Conference, 2022

ERINet: Effective Rotation Invariant Network for Point Cloud based Place Recognition.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

PointNetGeM: Simple and Efficient Point Cloud Based Network for Place Recognition.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

SparseARFM-SI: Rotary Point Cloud Place Recognition Based on Multi-Resolution and Attention Mechanism.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Near-lossless Point Cloud Geometry Compression Based on Adaptive Residual Compensation.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

A Fast Motion Estimation Method With Hamming Distance for LiDAR Point Cloud Compression.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Geometric-Aware Calibration Mechanism for Self-Supervised Depth Estimation.
Proceedings of the IEEE Smartworld, 2022

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark Under Heterogeneous AI Computing Platforms.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Learning to Share in Networked Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rate-Distortion-Guided Learning Approach with Cross-Projection Information for V-PCC Fast CU Decision.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

OpenPointCloud: An Open-Source Algorithm Library of Deep Learning Based Point Cloud Compression.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

APCCPA '22: 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

OpenHardwareVC: An Open Source Library for 8K UHD Video Coding Hardware Implementation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MOAC: Multi-level Perception Optimizer Based on Dual Augmented Cost for Structure-from-Motion.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

DKNAS: A Practical Deep Keypoint Extraction Framework Based on Neural Architecture Search.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Focus and Adjust: Progressive Refinement Network for Human Object Interaction Detection.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

TDRNet: Transformer-Based Dual-Branch Restoration Network for Geometry Based Point Cloud Compression Artifacts.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Fine-Grained Correlation Representation for Graph-Based Point Cloud Attribute Compression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Deep Geometry Post-Processing for Decompressed Point Clouds.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

End-to-End Spatial-Angular Light Field Super-Resolution Using Parallax Structure Preservation Strategy.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Context-Aware Hierarchical Transformer for Fine-Grained Video-Text Retrieval.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Pointivae: Invertible Variational Autoencoder Framework for 3D Point Cloud Generation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Cross-Type Attribute Prediction For Point Cloud Compression.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

JE²NET: Joint Exploitation and Exploration in Reinforcement Learning Based Image Restoration.
Proceedings of the IEEE International Conference on Acoustics, 2022

Attention Guided Invariance Selection for Local Feature Descriptors.
Proceedings of the IEEE International Conference on Acoustics, 2022

Flow-Based Point Cloud Completion Network with Adversarial Refinement.
Proceedings of the IEEE International Conference on Acoustics, 2022

Salient Object Detection for Point Clouds.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Contextual Debiasing for Visual Recognition with Causal Mechanisms.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Local Surface Descriptor for Geometry and Feature Preserved Mesh Denoising.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Learning Human-Object Interaction via Interactive Semantic Reasoning.
IEEE Trans. Image Process., 2021

Guest Editorial Introduction to the Special Issue on Recent Advances in Point Cloud Processing and Compression.
IEEE Trans. Circuits Syst. Video Technol., 2021

Semantic-Guided Pixel Sampling for Cloth-Changing Person Re-Identification.
IEEE Signal Process. Lett., 2021

Diverse part attentive network for video-based person re-identification.
Pattern Recognit. Lett., 2021

Learning to disentangle scenes for person re-identification.
Image Vis. Comput., 2021

GID-Net: Detecting human-object interaction with global and instance dependency.
Neurocomputing, 2021

Learning to Share in Multi-Agent Reinforcement Learning.
CoRR, 2021

Machine Learning for Multimodal Electronic Health Records-based Research: Challenges and Perspectives.
CoRR, 2021

Large-Scale Spatio-Temporal Person Re-identification: Algorithm and Benchmark.
CoRR, 2021

Low Pass Filter for Anti-aliasing in Temporal Action Localization.
CoRR, 2021

Combining Attention with Flow for Person Image Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Information-Growth Attention Network for Image Super-Resolution.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

An Improved Coarse-To-Fine Motion Estimation Scheme For Lidar Point Cloud Geometry Compression.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Deep Neural Networks for End-to-End Spatiotemporal Video Quality Prediction and Aggregation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Rethinking Training Objective For Self-Supervised Monocular Depth Estimation: Semantic Cues To Rescue.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Structure-transformed Texture-enhanced Network for Person Image Synthesis.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ATVIO: Attention Guided Visual-Inertial Odometry.
Proceedings of the IEEE International Conference on Acoustics, 2021

Nested Error Map Generation Network for No-Reference Image Quality Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2021

SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation.
IEEE Trans. Image Process., 2020

Spatial-Temporal Context-Aware Online Action Detection and Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2020

Learning channel-wise spatio-temporal representations for video salient object detection.
Neurocomputing, 2020

Toward Zero-Shot Unsupervised Image-to-Image Translation.
CoRR, 2020

Neural saliency algorithm guide bi-directional visual perception style transfer.
CAAI Trans. Intell. Technol., 2020

Fast Recolor Prediction Scheme in Point Cloud Attribute Compression.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A point cloud compression framework via spherical projection.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Point Cloud Attribute Compression via Successive Subspace Graph Transform.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Vaccine-style-net: Point Cloud Completion in Implicit Continuous Function Space.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Context-aware Attention Network for Predicting Image Aesthetic Subjectivity.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

MMNet: Multi-Stage and Multi-Scale Fusion Network for RGB-D Salient Object Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VONAS: Network Design in Visual Odometry using Neural Architecture Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Temporal-Aware SfM-Learner: Unsupervised Learning Monocular Depth and Motion from Stereo Video Clips.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Towards Loss Balance and Consistent Model in Self-supervised Monocular Depth Estimation.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Kernel Clustering On Symmetric Positive Definite Manifolds Via Double Approximated Low Rank Representation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Twinvo: Unsupervised Learning of Monocular Visual Odometry Using Bi-Direction Twin Network.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Pose Refinement: Bridging the Gap Between Unsupervised Learning and Geometric Methods for Visual Odometry.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ROIMIX: Proposal-Fusion Among Multiple Images for Underwater Object Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Regression Before Classification for Temporal Action Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Image Spatial Transformation for Person Image Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Over-Exposure Correction via Exposure and Scene Information Disentanglement.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.
ACM Trans. Intell. Syst. Technol., 2019

LECARM: Low-Light Image Enhancement Using the Camera Response Model.
IEEE Trans. Circuits Syst. Video Technol., 2019

Efficient Prediction Methods With Enhanced Spatial-Temporal Correlation for HEVC.
IEEE Trans. Circuits Syst. Video Technol., 2019

C-RPNs: Promoting object detection in real world via a cascade structure of Region Proposal Networks.
Neurocomputing, 2019

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds.
CoRR, 2019

Bi-Skip: A Motion Deblurring Network Using Self-paced Learning.
CoRR, 2019

Enhanced Intra Prediction Scheme in Point Cloud Attribute Compression.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Salient Contour-Aware Based Twice Learning Strategy for Saliency Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

StructureFlow: Image Inpainting via Structure-Aware Appearance Flow.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

AttPool: Towards Hierarchical Feature Representation in Graph Convolutional Networks via Attention Mechanism.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Boundary Information Matters More: Accurate Temporal Action Detection with Temporal Boundary Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-step Self-attention Network for Cross-modal Retrieval Based on a Limited Text Space.
Proceedings of the IEEE International Conference on Acoustics, 2019

BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization.
Proceedings of the IEEE International Conference on Acoustics, 2019

ResGAN: A Low-Level Image Processing Network to Restore Original Quality of JPEG Compressed Images.
Proceedings of the Data Compression Conference, 2019

Separable KLT for Intra Coding in Versatile Video Coding (VVC).
Proceedings of the Data Compression Conference, 2019

Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Real Photographs Denoising With Noise Domain Adaptation and Attentive Generative Adversarial Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Base-detail image inpainting.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
MPEG Internet Video Coding Standard and Its Performance Evaluation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Saliency Detection by Adaptive Channel Fusion.
IEEE Signal Process. Lett., 2018

A multilayer backpropagation saliency detection algorithm and its applications.
Multim. Tools Appl., 2018

Detecting action tubes via spatial action estimation and temporal path inference.
Neurocomputing, 2018

Multi-Mapping Image-to-Image Translation with Central Biasing Normalization.
CoRR, 2018

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.
CoRR, 2018

Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction.
CoRR, 2018

Active Temporal Action Detection in Untrimmed Videos Via Deep Reinforcement Learning.
IEEE Access, 2018

Point Clouds Attribute Compression Using Data-Adaptive Intra prediction.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Underwater Image Enhancement by the Combination of Dehazing and Color Correction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Adaptive Integration Skip Compensation Neural Networks for Removing Mixed Noise in Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

A New Accurate Image Denoising Method Based on Sparse Coding Coefficients.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Deep Pedestrian Detection Using Contextual Information and Multi-level Features.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

An Active Action Proposal Method Based on Reinforcement Learning.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Robust Salient Object Detection via Fusing Foreground and Background Priors.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Motion Aided Merge Mode For HEVC.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Innovative Saliency Guided ROI Selection Model for Panoramic Images Compression.
Proceedings of the 2018 Data Compression Conference, 2018

SingleGAN: Image-to-Image Translation by a Single-Generator Network Using Multiple Generative Adversarial Learning.
Proceedings of the Computer Vision - ACCV 2018, 2018

SAP: Self-Adaptive Proposal Model for Temporal Action Detection Based on Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU.
IEEE Trans. Multim., 2017

iAVS2: A Fast Intra-Encoding Platform for IEEE 1857.4.
IEEE Trans. Circuits Syst. Video Technol., 2017

A Bio-Inspired Multi-Exposure Fusion Framework for Low-light Image Enhancement.
CoRR, 2017

Robust Saliency Detection via Fusing Foreground and Background Priors.
CoRR, 2017

Automatic Salient Object Detection for Panoramic Images Using Region Growing and Fixation Prediction Model.
CoRR, 2017

A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning.
CoRR, 2017

A new underwater image enhancing method via color correction and illumination adjustment.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Adaptive difference modelling for background subtraction.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Attribute compression of 3D point clouds using Laplacian sparsity optimized graph transform.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Improved intra boundary filters for HEVC.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Mask-streaming CNN for pedestrian detection.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

SPOS: Deblur Image by Using Sparsity Prior and Outlier Suppression.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Unsupervised Concept Learning in Text Subspace for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Cross-media Retrieval by Learning Rich Semantic Embeddings of Multimedia.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Metric Learning with False Positive Probability - Trade Off Hard Levels in a Weighted Way.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

A joint model for action localization and classification in untrimmed video with visual attention.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Enhanced intra prediction for inter pictures.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

An Innovative Salient Object Detection Using Center-Dark Channel Prior.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Towards Automatic Wild Animal Detection in Low Quality Camera-Trap Images Using Two-Channeled Perceiving Residual Pyramid Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A New Low-Light Image Enhancement Algorithm Using Camera Response Model.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A Three-Pathway Psychobiological Framework of Salient Object Detection Using Stereoscopic Technology.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

ORGB: Offset correction in RGB color space for illumination-robust image processing.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A Multilayer Backpropagation Saliency Detection Algorithm Based on Depth Mining.
Proceedings of the Computer Analysis of Images and Patterns, 2017

A New Shadow Removal Method Using Color-Lines.
Proceedings of the Computer Analysis of Images and Patterns, 2017

A New Image Contrast Enhancement Algorithm Using Exposure Fusion Framework.
Proceedings of the Computer Analysis of Images and Patterns, 2017

A Violence Detection Approach Based on Spatio-temporal Hypergraph Transition.
Proceedings of the Computer Analysis of Images and Patterns, 2017

Salient Object Detection with Complex Scene Based on Cognitive Neuroscience.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016
A new video denoising method using texture metric and adaptive structure variance.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

An effective post quantization rate estimation for HEVC intra encoder.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A Novel Shadow-Free Feature Extractor for Real-Time Road Detection.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

An Empirical Study of Deformable Part Model with fast feature pyramid.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Robust lane marking detection using boundary-based inverse perspective mapping.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An Object-Aware Anomaly Detection and Localization in Surveillance Videos.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Searching Action Proposals via Spatial Actionness Estimation and Temporal Path Inference and Tracking.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Video super-resolution with registration-reliability regulation and adaptive total variation.
J. Vis. Commun. Image Represent., 2015

Video pre-processing with JND-based Gaussian filtering of superpixels.
Proceedings of the Visual Information Processing and Communication VI, 2015

An Illumination-Robust Approach for Feature-Based Road Detection.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Stereo matching with space-constrained cost aggregation and segmentation-based disparity refinement.
Proceedings of the Three-Dimensional Image Processing, 2015

