2025
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models.
CoRR, May, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.
CoRR, March, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings.
Proceedings of the Design, Automation & Test in Europe Conference, 2025

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
VRT: A Video Restoration Transformer.
IEEE Trans. Image Process., 2024

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
CoRR, 2024

3D Mesh Editing using Masked LRMs.
CoRR, 2024

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds.
CoRR, 2024

BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation.
CoRR, 2024

CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations.
CoRR, 2024

EVA-Score: Evaluation of Long-form Summarization on Informativeness through Extraction and Validation.
CoRR, 2024

Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Continual Learning for Human-Machine Collaboration in VUCA Environments.
Proceedings of the Navigating Unpredictability: Collaborative Networks in Non-linear Worlds, 2024

MVDiffusion++: A Dense High-Resolution Multi-view Diffusion Model for Single or Sparse-View 3D Object Reconstruction.
Proceedings of the Computer Vision - ECCV 2024, 2024

MoVideo: Motion-Aware Video Generation with Diffusion Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians.
Proceedings of the Computer Vision - ECCV 2024, 2024

LSVOS Challenge Report: Large-Scale Complex and Long Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Continual Learning Supporting Human-Robot Collaboration.
Proceedings of the Technological Innovation for Human-Centric Systems, 2024

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Evaluating Generative Language Models in Information Extraction as Subjective Question Correction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Pyramid Attention Network for Image Restoration.
Int. J. Comput. Vis., December, 2023

A fault-tolerant optimization mechanism for spatiotemporal data analysis in flink.
World Wide Web (WWW), May, 2023

Training Neural Networks on RAW and HDR Images for Restoration Tasks.
CoRR, 2023

MoVideo: Motion-Aware Video Generation with Diffusion Models.
CoRR, 2023

NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation.
CoRR, 2023

EBSR: Enhanced Binary Neural Network for Image Super-Resolution.
CoRR, 2023

Efficient and Explicit Modelling of Image Hierarchies for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
A Machine Vision - Based Pipe Leakage Detection System for Automated Power Plant Maintenance.
Sensors, 2022

TCBERT: A Technical Report for Chinese Topic Classification BERT.
CoRR, 2022

Preliminary Study of Short-Term Visual Perceptual Training Based on Virtual Reality and Augmented Reality in Postoperative Strabismic Patients.
Cyberpsychology Behav. Soc. Netw., 2022

Recurrent Video Restoration Transformer with Guided Deformable Attention.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Image Super-Resolution With Non-Local Sparse Attention.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.
CoRR, 2020

Pyramid Attention Networks for Image Restoration.
CoRR, 2020

SHIKEBLCU at SemEval-2020 Task 2: An External Knowledge-enhanced Matrix for Multilingual and Cross-Lingual Lexical Entailment.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Neural Sparse Representation for Image Restoration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NTIRE 2020 Challenge on Image and Video Deblurring.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Super-Resolution With Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Scale-Wise Convolution for Image Restoration.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection.
CoRR, 2019

Model Aggregation Method for Data Parallelism in Distributed Real-Time Machine Learning of Smart Sensing Equipment.
IEEE Access, 2019

Video Instance Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

NTIRE 2019 Challenge on Video Super-Resolution: Methods and Results.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Challenge on Video Deblurring: Methods and Results.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

An Empirical Investigation of Efficient Spatio-Temporal Modeling in Video Restoration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Challenge on Real Image Denoising: Methods and Results.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Wide Activation for Efficient Image and Video Super-Resolution.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Learning Temporal Dynamics for Video Super-Resolution: A Deep Learning Approach.
IEEE Trans. Image Process., 2018

YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark.
CoRR, 2018

Wide Activation for Efficient and Accurate Image Super-Resolution.
CoRR, 2018

Non-Local Recurrent Network for Image Restoration.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Wide-activated Deep Residual Networks based Restoration for BPG-compressed Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Effective object detection from traffic camera videos.
Proceedings of the 2017 IEEE SmartWorld, 2017

Robust Video Super-Resolution with Learned Temporal Dynamics.
Proceedings of the IEEE International Conference on Computer Vision, 2017

NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Balanced Two-Stage Residual Networks for Image Super-Resolution.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Speaker and language factorization in DNN-based TTS synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Unsupervised speaker adaptation for DNN-based TTS synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Sequence error (SE) minimization training of neural network for voice conversion.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

TTS synthesis with bidirectional LSTM based recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker verification with deep features.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Reshaping deep neural network for fast decoding by node-pruning.
Proceedings of the IEEE International Conference on Acoustics, 2014

Stochastic data sweeping for fast DNN training.
Proceedings of the IEEE International Conference on Acoustics, 2014

2012
Development of the 2012 SJTU HVR system.
Proceedings of the International Conference on Multimodal Interaction, 2012