We stand with Ukraine

We stand with Ukraine

Yan Lu

Orcid: 0009-0002-1449-5174

Affiliations:

Microsoft Research Asia, Beijing, China
Harbin Institute of Technology, China (PhD 2003)

According to our database¹, Yan Lu authored at least 201 papers between 2000 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org
on ieeexplore.ieee.org

On csauthors.net:

Bibliography

2024

Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis.

[BibT_eX]

[DOI]

,

,

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

A Universal Optimization Framework for Learning-based Image Codec.

[BibT_eX]

[DOI]

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., January, 2024

Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2024

Uncertainty-Aware Deep Video Compression With Ensembles.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2024

A General Theory for Compositional Generalization.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

RelationVLM: Making Large Vision-Language Models Understand Visual Relations.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Slot-VLM: SlowFast Slots for Video-Language Modeling.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Breaking through the learning plateaus of in-context learning in Transformer.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Mask-Based Modeling for Neural Radiance Fields.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Low-Latency Speech Enhancement via Speech Token Generation.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Long-Term Temporal Context Gathering for Neural Video Compression.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Intra-Modal Correlation Learning for Label-Free 3D Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generative Latent Coding for Ultra-Low Bitrate Image Compression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Text Grouping Adapter: Adapting Pre-Trained Text Detector for Layout Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Micro-Doppler Effect and Sparse Representation Analysis of Underwater Targets.

[BibT_eX]

[DOI]

,

,

Sensors, October, 2023

PhaseAnti: An Anti-Interference WiFi-Based Activity Recognition System Using Interference-Independent Phase Component.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Mob. Comput., May, 2023

Temporal Context Mining for Learned Video Compression.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2023

Video Instance Segmentation by Instance Flow Assembly.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2023

Latent-Domain Predictive Neural Speech Coding.

[BibT_eX]

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Retrieval-based Video Language Model for Efficient Long Video Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

How does representation impact in-context learning: A exploration on a synthetic task.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Time-Variance Aware Real-Time Speech Enhancement.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Trajectories are Generalization Indicators.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Disentangle Propagation and Restoration for Efficient Video Recovery.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Masked Audio Modeling with CLAP and Multi-Objective Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

EVC: Towards Real-Time Neural Image Compression with Mask Decay.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Versatile Neural Processes for Learning Implicit Neural Representations.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient View Synthesis with Neural Radiance Distribution Field.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Referring Video Object Segmentation with Cyclic Structural Consensus.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Adaptive Frequency Filters As Efficient Global Token Mixers.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Real-Time Speech Enhancement with Dynamic Attention Span.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure Priors.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Contrast-PLC: Contrastive Learning for Packet Loss Concealment.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Speech Enhancement via Event-Based Query.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Dasformer: Deep Alternating Spectrogram Transformer For Multi/Single-Channel Speech Separation.

[BibT_eX]

[DOI]

,

,

,

Hesam Movassagh

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Disentangled Feature Learning for Real-Time Neural Speech Coding.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Two-shot Video Object Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Crossing the Gap: Domain Generalization for Image Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Motion Information Propagation for Neural Video Compression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Frequency Filtering for Domain Generalization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Video Compression with Diverse Contexts.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unifying Layout Generation with a Decoupled Diffusion Model.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-Fidelity and Freely Controllable Talking Head Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Active Token Mixer.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Signal Process. Lett., 2022

MonoGRNet: A General Framework for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Estimating Neural Reflectance Field from Radiance Field using Tree Structures.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Predictive Neural Speech Coding.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Online Video Instance Segmentation via Robust Context Fusion.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Test-time Batch Normalization.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

ActiveMLP: An MLP-like Architecture with Active Token Mixer.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

End-to-End Neural Audio Coding for Real-Time Communications.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Multi-view Geometry Distillation for Cloth-Changing Person ReID.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Cloth-Aware Center Cluster Loss for Cloth-Changing Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Alignment-guided Temporal Attention for Video Action Recognition.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mask-based Latent Reconstruction for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Visual Concepts Tokenization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Robust Video Object Segmentation with Adaptive Object Calibration.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression.

[BibT_eX]

[DOI]

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Neighbor Correspondence Matching for Flow-based Video Frame Synthesis.

[BibT_eX]

[DOI]

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Error-Resilient Neural Speech Coding.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Cross-Scale Vector Quantization for Scalable Neural Speech Coding.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Neural Speech Coding for Real-Time Communications.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Capture of Animatable 3D Human from Monocular Video.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Semantic-aligned Fusion Transformer for One-shot Object Detection.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Rethinking Minimal Sufficient Representation in Contrastive Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Compression-Based Feature Learning for Video Restoration.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reliable Propagation-Correction Modulation for Video Object Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Affinity Derivation for Accurate Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2021

Residual Refinement Network with Attribute Guidance for Precise Saliency Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2021

A Deep Reinforcement Learning Approach to Multiple Streams' Joint Bitrate Allocation.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2021

Deep Contextual Video Compression.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Content-Independent Online Handwriting Verification Based on Multi-Modal Fusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Geometry Uncertainty Projection Network for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Self-Supervised Video Representation Learning with Meta-Contrastive Network.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Phoneme-Based Distribution Regularization for Speech Enhancement.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

A Universal Encoder Rate Distortion Optimization Framework for Learned Compression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

SSAN: Separable Self-Attention Network for Video Representation Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Interactive Speech and Noise Modeling for Speech Enhancement.

[BibT_eX]

[DOI]

,

,

,

Sriram Srinivasan

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Weakly-supervised Temporal Action Localization by Uncertainty Modeling.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Single-stage Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2020

Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

RT-VENet: A Convolutional Network for Real-time Video Enhancement.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly Supervised 3D Object Detection from Point Clouds.

[BibT_eX]

[DOI]

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

A Hardware-Accelerated System for High Resolution Real-Time Screen Sharing.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2019

Reinforcement learning for bandwidth estimation and congestion control in real-time communications.

[BibT_eX]

[DOI]

,

,

,

,

Yasaman Hosseinkashi

,

,

Albert Sadovnikov

,

,

,

,

,

,

,

Johannes Gehrke

CoRR, 2019

IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2019

Scale Voting With Pyramidal Feature Fusion Network for Person Search.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Access, 2019

Dhff: Robust Multi-Scale Person Search by Dynamic Hierarchical Feature Fusion.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

In Defense of the Classification Loss for Person Re-Identification.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Triangulation Learning Network: From Monocular to Stereo 3D Object Detection.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Parallel In-Loop Filtering in HEVC Encoder on GPU.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Consumer Electron., 2018

Fast Video Stitching for Aerially Captured HD Videos.

[BibT_eX]

[DOI]

,

,

,

Int. J. Image Graph., 2018

Real-Time Anomaly Detection With HMOF Feature.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Weakly Supervised Local Attention Network for Fine-Grained Visual Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Real-time Anomaly Detection with HMOF Feature.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2nd International Conference on Video and Image Processing, 2018

Affinity Derivation and Graph Merge for Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.

[BibT_eX]

[DOI]

,

,

,

Michael Felsberg

,

Roman P. Pflugfelder

,

Luka Cehovin Zajc

,

,

,

,

Abdelrahman Eldesokey

,

Gustavo Fernández

,

Álvaro García-Martín

,

Álvaro Iglesias-Arias

,

A. Aydin Alatan

,

Abel González-García

,

Alfredo Petrosino

,

Alireza Memarmoghadam

,

,

,

,

Arnold W. M. Smeulders

,

Asanka G. Perera

,

,

,

,

,

Changzhen Xiong

,

,

,

,

,

,

,

,

,

,

Efstratios Gavves

,

,

Erik Velasco-Salido

,

Fahad Shahbaz Khan

,

,

,

,

Francesco Battistone

,

,

Gorthi R. K. Sai Subrahmanyam

,

Guilherme Sousa Bastos

,

,

Hamed Kiani Galoogahi

,

,

,

,

,

,

Horst Possegger

,

,

,

,

,

,

Hyung Jin Chang

,

Isabela Drummond

,

,

Jaime Spencer Martin

,

Javaan Singh Chahl

,

,

,

,

,

,

Joakim Johnander

,

João F. Henriques

,

,

Joost van de Weijer

,

Jorge Rodríguez Herranz

,

José M. Martínez

,

,

,

,

,

,

,

,

,

,

Luca Bertinetto

,

,

,

Mario Edoardo Maresca

,

Martin Danelljan

,

Ming-Hsuan Yang

,

Mohamed H. Abdelpakey

,

Mohamed S. Shehata

,

,

,

,

,

,

Pablo Vicente-Moñivar

,

,

,

Philip H. S. Torr

,

Priya Mariam Raju

,

,

,

,

,

Rafael Martin Nieto

,

Rama Krishna Sai Subrahmanyam Gorthi

,

,

,

Richard M. Everson

,

,

,

,

,

,

Shuangping Huang

,

,

,

,

Stuart Golodetz

,

,

,

,

,

Vincenzo Santopietro

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Yiannis Demiris

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Intra Block Copy for Screen Content in the Emerging AV1 Video Codec.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2018 Data Compression Conference, 2018

Feature Selective Networks for Object Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Delay-Rate-Distortion Optimization for Cloud Gaming With Hybrid Streaming.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2017

2016

A High-Fidelity and Low-Interaction-Delay Screen Sharing System.

[BibT_eX]

[DOI]

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2016

GPU-based optimization for sample adaptive offset in HEVC.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015

Layered Compression for High-Precision Depth Data.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Image Process., 2015

Introduction to the Special Section on Visual Computing in the Cloud: Cloud Gaming and Virtualization.

[BibT_eX]

[DOI]

Shervin Shirmohammadi

,

,

Dewan Tanvir Ahmed

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2015

Region-of-interest based coding scheme for synthesized video.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2015 Visual Communications and Image Processing, 2015

2014

An adaptive multi-layer low-latency transmission scheme for H.264 based screen sharing system.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

High frame rate screen video coding for screen sharing applications.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

A low latency cloud gaming system using edge preserved image homography.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

A novel cloud gaming framework using joint video and graphics streaming.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Content adaptive screen image scaling.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013

Kinect-Like Depth Data Compression.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2013

A Low-Complexity Screen Compression Scheme for Interactive Screen Sharing.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2013

Depth sensor assisted real-time gesture recognition for interactive presentation.

[BibT_eX]

[DOI]

,

,

,

,

J. Vis. Commun. Image Represent., 2013

Rate-distortion optimized block classification and bit allocation in screen video compression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Layered screen video coding leveraging hardware video codec.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Effective hand segmentation and gesture recognition for browsing web pages on a large screen.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Arbitrary-sized motion detection in screen video coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Image Processing, 2013

2012

A low-complexity screen compression scheme.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 Visual Communications and Image Processing, 2012

Layered compression for high dynamic range depth.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 Visual Communications and Image Processing, 2012

Content-aware layered compound video compression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

A low-latency transmission scheme for interactive screen sharing.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Texture-assisted Kinect depth inpainting.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-like depth denoising.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-Like Depth Compression with 2D+T Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011

Virtualized Screen: A Third Element for Cloud-Mobile Convergence.

[BibT_eX]

[DOI]

,

,

IEEE Multim., 2011

Browser-friendly hybrid codec for compound image compression.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

2010

High-Dynamic-Range Texture Compression for Rendering Systems of Different Capacities.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Vis. Comput. Graph., 2010

ReDi: an interactive virtual display system for ubiquitous devices.

[BibT_eX]

[DOI]

,

,

Proceedings of the 18th International Conference on Multimedia 2010, 2010

A proxy-based mobile web browser.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Low-cost realtime screen sharing to multiple clients.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009

Complexity-Constrained H.264 Video Encoding.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2009

A High-Performance Remote Computing Platform.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Seventh Annual IEEE International Conference on Pervasive Computing and Communications, 2009

Real-time screen image scaling and its GPU acceleration.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Conference on Image Processing, 2009

Level embedded medical image compression based on value of interest.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Conference on Image Processing, 2009

2008

Efficient Multiple-Description Image Coding Using Directional Lifting-Based Transform.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv-Based Multiview Video Coding.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2008

B-picture coding in AVS video compression standard.

[BibT_eX]

[DOI]

,

,

,

,

Signal Process. Image Commun., 2008

Three-tiered network model for image hallucination.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Image Processing, 2008

DHTC: An Effective DXTC-based HDR Texture Compression Scheme.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the EUROGRAPHICS/ACM SIGGRAPH Conference on Graphics Hardware 2008, 2008

2007

Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Multim., 2007

Real-time video coding under power constraint based on H.264 codec.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Visual Communications and Image Processing 2007, 2007

Rate-distortion optimized color quantization for compound image compression.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Visual Communications and Image Processing 2007, 2007

Distributed Video Coding with Trellis Coded Quantization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Multimedia Modeling, 2007

Distributed Video Coding with Spatial Correlation Exploited Only at the Decoder.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Enable Efficient Compound Image Compression in H.264/AVC Intra Coding.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Image Processing, 2007

2006

4-D Wavelet-Based Multiview Video Coding.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2006

Adaptive rate control for H.264.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Susanto Rahardja

,

,

J. Vis. Commun. Image Represent., 2006

Distributed video coding using wavelet.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Bit-Stream Switching in Multiple Bit-Rate Video Streaming using Wyner-Ziv Coding.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Practical Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Image Processing, 2006

Wyner-Ziv Video Coding Based on Set Partitioning in Hierarchical Tree.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Image Processing, 2006

Joint Power-Distortion Optimization on Devices with MPEG-4 AVC/H.264 Codec.

[BibT_eX]

[DOI]

,

,

Proceedings of IEEE International Conference on Communications, 2006

2005

Rate-distortion analysis for H.264/AVC video coding and its application to rate control.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Circuits Syst. Video Technol., 2005

Directional Lifting-Based Wavelet Transform for Multiple Description Image Coding with Quincunx Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Multimedia Information Processing, 2005

Scalable multiview video coding using wavelet.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Viewpoint switching in multiview video streaming.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004

Optimum End-to-End Distortion Estimation for Error Resilient Video Coding.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A Study on the Quantization Scheme in H.264/AVC and Its Application to Rate Control.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Enhanced direct mode coding for bi-predictive pictures.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Multiple modes intra-prediction in intra coding.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Context-based 2D-VLC for video coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Improved error concealment algorithms based on H.264/AVC non-normative decoder.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

New bi-prediction techniques for B pictures coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Mode mapping method for h.264/avc spatial downscaling transcoding.

[BibT_eX]

,

,

,

Proceedings of the 2004 International Conference on Image Processing, 2004

Error resilience video coding in H.264 encoder with potential distortion tracking.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2004 International Conference on Image Processing, 2004

New scaling technique for direct mode coding in B pictures.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2004 International Conference on Image Processing, 2004

2003

Efficient background video coding with static sprite generation and arbitrary-shape spatial prediction techniques.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Circuits Syst. Video Technol., 2003

Rate control for advance video coding (AVC) standard.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Latest arrival time leaky bucket for HRD constrained video coding.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A novel coefficient scanning scheme for directional spatial prediction-based image compression.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Rate control for JVT video coding scheme with HRD considerations.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2003 International Conference on Image Processing, 2003

2002

High efficient sprite coding with directional spatial prediction.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2002 International Conference on Image Processing, 2002

2001

Fast and Robust Sprite Generation for MPEG-4 Video Coding.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Multimedia Information Processing, 2001

Sprite generation for frame-based video coding.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2001 International Conference on Image Processing, 2001

2000

Human Facial Expression Recognition based on Learning Subspace Method.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Loading...