Thomas H. Li

Orcid: 0000-0001-6123-1265

According to our database¹, Thomas H. Li authored at least 86 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Closing the Gap Between Theory and Practice During Alternating Optimization for GANs.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., October, 2024

ComPoint: Can Complex-Valued Representation Benefit Point Cloud Place Recognition?

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., July, 2024

Garden city: A synthetic dataset and sandbox environment for analysis of pre-processing algorithms for GPS human mobility data.

[BibT_eX]

[DOI]

Thomas H. Li

Francisco Barreras

CoRR, 2024

Sketch-aided Interactive Fusion Point Cloud Place Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

ScanPCGC: Learning-Based Lossless Point Cloud Geometry Compression using Sequential Slice Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Mitigating Label Noise in GANs via Enhanced Spectral Normalization.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., August, 2023

Semantic Point Cloud Upsampling.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences.

[BibT_eX]

[DOI]

CoRR, 2023

Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

One For All: Video Conversation is Feasible Without Video Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2023

A<sup>2</sup>Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

Detecting the open-world objects with the help of the Brain.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Frequency-Aware Self-Supervised Monocular Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

IPFR: Identity-Preserving Face Reenactment with Enhanced Domain Adversarial Training and Multi-level Identity Priors.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Test-Time Adaptation for Super-Resolution with Second-Order Degradation and Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

LIO-PPF: Fast LiDAR-Inertial Odometry via Incremental Plane Pre-Fitting and Skeleton Tracking.

[BibT_eX]

[DOI]

IROS, 2023

Causality Compensated Attention for Contextual Biased Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Vision-and-Language Navigation from YouTube Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Motion Encoding for Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hard Sample Matters a Lot in Zero-Shot Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

QINet: Decision Surface Learning and Adversarial Enhancement for Quasi-Immune Completion of Diverse Corrupted Point Clouds.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2022

PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Learning Disentangled Representation for Multi-View 3D Object Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Rate-Distortion Optimized Graph for Point Cloud Attribute Coding.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

M<sup>3</sup>Video: Masked Motion Modeling for Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Geometric-Aware Calibration Mechanism for Self-Supervised Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Smartworld, 2022

Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Active Camera for Multi-Object Navigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MOAC: Multi-level Perception Optimizer Based on Dual Augmented Cost for Structure- from-Motion.

[BibT_eX]

[DOI]

Peixi Wu

Ge Li

Thomas H. Li

Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

DKNAS: A Practical Deep Keypoint Extraction Framework Based on Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Fine-Grained Correlation Representation for Graph-Based Point Cloud Attribute Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Deep Geometry Post-Processing for Decompressed Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Pointivae: Invertible Variational Autoencoder Framework for 3D Point Cloud Generation.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Attention Guided Invariance Selection for Local Feature Descriptors.

[BibT_eX]

[DOI]

Jiapeng Li

Ge Li

Thomas H. Li

Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Low Pass Filter for Anti-aliasing in Temporal Action Localization.

[BibT_eX]

[DOI]

CoRR, 2021

Combining Attention with Flow for Person Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Information-Growth Attention Network for Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Rethinking Training Objective For Self-Supervised Monocular Depth Estimation: Semantic Cues To Rescue.

[BibT_eX]

[DOI]

Keyao Li

Ge Li

Thomas H. Li

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Structure-transformed Texture-enhanced Network for Person Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ATVIO: Attention Guided Visual-Inertial Odometry.

[BibT_eX]

[DOI]

Li Liu

Ge Li

Thomas H. Li

Proceedings of the IEEE International Conference on Acoustics, 2021

SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Spatial-Temporal Context-Aware Online Action Detection and Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Neural saliency algorithm guide bi-directional visual perception style transfer.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2020

Vaccine-style-net: Point Cloud Completion in Implicit Continuous Function Space.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VONAS: Network Design in Visual Odometry using Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Temporal-Aware SfM-Learner: Unsupervised Learning Monocular Depth and Motion from Stereo Video Clips.

[BibT_eX]

[DOI]

Lanqing Zhang

Ge Li

Thomas H. Li

Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Towards Loss Balance and Consistent Model in Self-supervised Monocular Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Twinvo: Unsupervised Learning of Monocular Visual Odometry Using Bi-Direction Twin Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Pose Refinement: Bridging the Gap Between Unsupervised Learning and Geometric Methods for Visual Odometry.

[BibT_eX]

[DOI]

Lanqing Zhang

Ge Li

Thomas H. Li

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ROIMIX: Proposal-Fusion Among Multiple Images for Underwater Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Regression Before Classification for Temporal Action Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Image Spatial Transformation for Person Image Generation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Over-Exposure Correction via Exposure and Scene Information Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2019

LECARM: Low-Light Image Enhancement Using the Camera Response Model.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2019

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds.

[BibT_eX]

[DOI]

CoRR, 2019

Multi-mapping Image-to-Image Translation via Learning Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Salient Contour-Aware Based Twice Learning Strategy for Saliency Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

StructureFlow: Image Inpainting via Structure-Aware Appearance Flow.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Boundary Information Matters More: Accurate Temporal Action Detection with Temporal Boundary Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Real Photographs Denoising With Noise Domain Adaptation and Attentive Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018

Detecting action tubes via spatial action estimation and temporal path inference.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.

[BibT_eX]

[DOI]

CoRR, 2018

Active Temporal Action Detection in Untrimmed Videos Via Deep Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Access, 2018

Adaptive Integration Skip Compensation Neural Networks for Removing Mixed Noise in Image.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Deep Pedestrian Detection Using Contextual Information and Multi-level Features.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

SingleGAN: Image-to-Image Translation by a Single-Generator Network Using Multiple Generative Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

2017

Towards Automatic Wild Animal Detection in Low Quality Camera-Trap Images Using Two-Channeled Perceiving Residual Pyramid Networks.

[BibT_eX]

[DOI]

Chunbiao Zhu

Thomas H. Li

Ge Li

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Thomas H. Li

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...