Yan Lu

Orcid: 0009-0002-1449-5174

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Harbin Institute of Technology, China (PhD 2003)


According to our database1, Yan Lu authored at least 201 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

A Universal Optimization Framework for Learning-based Image Codec.
ACM Trans. Multim. Comput. Commun. Appl., January, 2024

Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification.
IEEE Trans. Multim., 2024

Uncertainty-Aware Deep Video Compression With Ensembles.
IEEE Trans. Multim., 2024

A General Theory for Compositional Generalization.
CoRR, 2024

RelationVLM: Making Large Vision-Language Models Understand Visual Relations.
CoRR, 2024

Slot-VLM: SlowFast Slots for Video-Language Modeling.
CoRR, 2024

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement.
CoRR, 2024

Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Breaking through the learning plateaus of in-context learning in Transformer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Mask-Based Modeling for Neural Radiance Fields.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Low-Latency Speech Enhancement via Speech Token Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Long-Term Temporal Context Gathering for Neural Video Compression.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Intra-Modal Correlation Learning for Label-Free 3D Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generative Latent Coding for Ultra-Low Bitrate Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Text Grouping Adapter: Adapting Pre-Trained Text Detector for Layout Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Micro-Doppler Effect and Sparse Representation Analysis of Underwater Targets.
Sensors, October, 2023

PhaseAnti: An Anti-Interference WiFi-Based Activity Recognition System Using Interference-Independent Phase Component.
IEEE Trans. Mob. Comput., May, 2023

Temporal Context Mining for Learned Video Compression.
IEEE Trans. Multim., 2023

Video Instance Segmentation by Instance Flow Assembly.
IEEE Trans. Multim., 2023

Latent-Domain Predictive Neural Speech Coding.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Retrieval-based Video Language Model for Efficient Long Video Question Answering.
CoRR, 2023

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.
CoRR, 2023

Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API.
CoRR, 2023

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition.
CoRR, 2023

How does representation impact in-context learning: A exploration on a synthetic task.
CoRR, 2023

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators.
CoRR, 2023

Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization.
CoRR, 2023

Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification.
CoRR, 2023

MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields.
CoRR, 2023

Time-Variance Aware Real-Time Speech Enhancement.
CoRR, 2023

DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Trajectories are Generalization Indicators.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Disentangle Propagation and Restoration for Efficient Video Recovery.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Masked Audio Modeling with CLAP and Multi-Objective Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

EVC: Towards Real-Time Neural Image Compression with Mask Decay.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Versatile Neural Processes for Learning Implicit Neural Representations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient View Synthesis with Neural Radiance Distribution Field.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Referring Video Object Segmentation with Cyclic Structural Consensus.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Adaptive Frequency Filters As Efficient Global Token Mixers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Real-Time Speech Enhancement with Dynamic Attention Span.
Proceedings of the IEEE International Conference on Acoustics, 2023

Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure Priors.
Proceedings of the IEEE International Conference on Acoustics, 2023

Contrast-PLC: Contrastive Learning for Packet Loss Concealment.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Speech Enhancement via Event-Based Query.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dasformer: Deep Alternating Spectrogram Transformer For Multi/Single-Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Disentangled Feature Learning for Real-Time Neural Speech Coding.
Proceedings of the IEEE International Conference on Acoustics, 2023

Two-shot Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Crossing the Gap: Domain Generalization for Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Motion Information Propagation for Neural Video Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Frequency Filtering for Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Video Compression with Diverse Contexts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unifying Layout Generation with a Decoupled Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-Fidelity and Freely Controllable Talking Head Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Active Token Mixer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo Cancellation.
IEEE Signal Process. Lett., 2022

MonoGRNet: A General Framework for Monocular 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Estimating Neural Reflectance Field from Radiance Field using Tree Structures.
CoRR, 2022

Predictive Neural Speech Coding.
CoRR, 2022

Online Video Instance Segmentation via Robust Context Fusion.
CoRR, 2022

R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.
CoRR, 2022

Test-time Batch Normalization.
CoRR, 2022

ActiveMLP: An MLP-like Architecture with Active Token Mixer.
CoRR, 2022

End-to-End Neural Audio Coding for Real-Time Communications.
CoRR, 2022

Multi-view Geometry Distillation for Cloth-Changing Person ReID.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Cloth-Aware Center Cluster Loss for Cloth-Changing Person Re-identification.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Alignment-guided Temporal Attention for Video Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mask-based Latent Reconstruction for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Visual Concepts Tokenization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Robust Video Object Segmentation with Adaptive Object Calibration.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Neighbor Correspondence Matching for Flow-based Video Frame Synthesis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Error-Resilient Neural Speech Coding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Cross-Scale Vector Quantization for Scalable Neural Speech Coding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Neural Speech Coding for Real-Time Communications.
Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Capture of Animatable 3D Human from Monocular Video.
Proceedings of the Computer Vision - ECCV 2022, 2022

Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semantic-aligned Fusion Transformer for One-shot Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Rethinking Minimal Sufficient Representation in Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Compression-Based Feature Learning for Video Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reliable Propagation-Correction Modulation for Video Object Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Affinity Derivation for Accurate Instance Segmentation.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Residual Refinement Network with Attribute Guidance for Precise Saliency Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2021

A Deep Reinforcement Learning Approach to Multiple Streams' Joint Bitrate Allocation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Deep Contextual Video Compression.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Content-Independent Online Handwriting Verification Based on Multi-Modal Fusion.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Geometry Uncertainty Projection Network for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Self-Supervised Video Representation Learning with Meta-Contrastive Network.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Phoneme-Based Distribution Regularization for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Universal Encoder Rate Distortion Optimization Framework for Learned Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

SSAN: Separable Self-Attention Network for Video Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Interactive Speech and Noise Modeling for Speech Enhancement.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Weakly-supervised Temporal Action Localization by Uncertainty Modeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Single-stage Instance Segmentation.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization.
CoRR, 2020

RT-VENet: A Convolutional Network for Real-time Video Enhancement.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly Supervised 3D Object Detection from Point Clouds.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
A Hardware-Accelerated System for High Resolution Real-Time Screen Sharing.
IEEE Trans. Circuits Syst. Video Technol., 2019

Reinforcement learning for bandwidth estimation and congestion control in real-time communications.
CoRR, 2019

IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition.
CoRR, 2019

Scale Voting With Pyramidal Feature Fusion Network for Person Search.
IEEE Access, 2019

Dhff: Robust Multi-Scale Person Search by Dynamic Hierarchical Feature Fusion.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

In Defense of the Classification Loss for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Triangulation Learning Network: From Monocular to Stereo 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Parallel In-Loop Filtering in HEVC Encoder on GPU.
IEEE Trans. Consumer Electron., 2018

Fast Video Stitching for Aerially Captured HD Videos.
Int. J. Image Graph., 2018

Real-Time Anomaly Detection With HMOF Feature.
CoRR, 2018

Weakly Supervised Local Attention Network for Fine-Grained Visual Classification.
CoRR, 2018

Real-time Anomaly Detection with HMOF Feature.
Proceedings of the 2nd International Conference on Video and Image Processing, 2018

Affinity Derivation and Graph Merge for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Intra Block Copy for Screen Content in the Emerging AV1 Video Codec.
Proceedings of the 2018 Data Compression Conference, 2018

Feature Selective Networks for Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Delay-Rate-Distortion Optimization for Cloud Gaming With Hybrid Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2017

2016
A High-Fidelity and Low-Interaction-Delay Screen Sharing System.
ACM Trans. Multim. Comput. Commun. Appl., 2016

GPU-based optimization for sample adaptive offset in HEVC.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
Layered Compression for High-Precision Depth Data.
IEEE Trans. Image Process., 2015

Introduction to the Special Section on Visual Computing in the Cloud: Cloud Gaming and Virtualization.
IEEE Trans. Circuits Syst. Video Technol., 2015

Region-of-interest based coding scheme for synthesized video.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

2014
An adaptive multi-layer low-latency transmission scheme for H.264 based screen sharing system.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

High frame rate screen video coding for screen sharing applications.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

A low latency cloud gaming system using edge preserved image homography.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

A novel cloud gaming framework using joint video and graphics streaming.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Content adaptive screen image scaling.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Kinect-Like Depth Data Compression.
IEEE Trans. Multim., 2013

A Low-Complexity Screen Compression Scheme for Interactive Screen Sharing.
IEEE Trans. Circuits Syst. Video Technol., 2013

Depth sensor assisted real-time gesture recognition for interactive presentation.
J. Vis. Commun. Image Represent., 2013

Rate-distortion optimized block classification and bit allocation in screen video compression.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Layered screen video coding leveraging hardware video codec.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Effective hand segmentation and gesture recognition for browsing web pages on a large screen.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Arbitrary-sized motion detection in screen video coding.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
A low-complexity screen compression scheme.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Layered compression for high dynamic range depth.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Content-aware layered compound video compression.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

A low-latency transmission scheme for interactive screen sharing.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Texture-assisted Kinect depth inpainting.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-like depth denoising.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Kinect-Like Depth Compression with 2D+T Prediction.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011
Virtualized Screen: A Third Element for Cloud-Mobile Convergence.
IEEE Multim., 2011

Browser-friendly hybrid codec for compound image compression.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

2010
High-Dynamic-Range Texture Compression for Rendering Systems of Different Capacities.
IEEE Trans. Vis. Comput. Graph., 2010

ReDi: an interactive virtual display system for ubiquitous devices.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A proxy-based mobile web browser.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Low-cost realtime screen sharing to multiple clients.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Complexity-Constrained H.264 Video Encoding.
IEEE Trans. Circuits Syst. Video Technol., 2009

A High-Performance Remote Computing Platform.
Proceedings of the Seventh Annual IEEE International Conference on Pervasive Computing and Communications, 2009

Real-time screen image scaling and its GPU acceleration.
Proceedings of the International Conference on Image Processing, 2009

Level embedded medical image compression based on value of interest.
Proceedings of the International Conference on Image Processing, 2009

2008
Efficient Multiple-Description Image Coding Using Directional Lifting-Based Transform.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv-Based Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2008

B-picture coding in AVS video compression standard.
Signal Process. Image Commun., 2008

Three-tiered network model for image hallucination.
Proceedings of the International Conference on Image Processing, 2008

DHTC: An Effective DXTC-based HDR Texture Compression Scheme.
Proceedings of the EUROGRAPHICS/ACM SIGGRAPH Conference on Graphics Hardware 2008, 2008

2007
Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks.
IEEE Trans. Multim., 2007

Real-time video coding under power constraint based on H.264 codec.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Rate-distortion optimized color quantization for compound image compression.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Distributed Video Coding with Trellis Coded Quantization.
Proceedings of the Advances in Multimedia Modeling, 2007

Distributed Video Coding with Spatial Correlation Exploited Only at the Decoder.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Enable Efficient Compound Image Compression in H.264/AVC Intra Coding.
Proceedings of the International Conference on Image Processing, 2007

2006
4-D Wavelet-Based Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2006

Adaptive rate control for H.264.
J. Vis. Commun. Image Represent., 2006

Distributed video coding using wavelet.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Bit-Stream Switching in Multiple Bit-Rate Video Streaming using Wyner-Ziv Coding.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Practical Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
Proceedings of the International Conference on Image Processing, 2006

Wyner-Ziv Video Coding Based on Set Partitioning in Hierarchical Tree.
Proceedings of the International Conference on Image Processing, 2006

Joint Power-Distortion Optimization on Devices with MPEG-4 AVC/H.264 Codec.
Proceedings of IEEE International Conference on Communications, 2006

2005
Rate-distortion analysis for H.264/AVC video coding and its application to rate control.
IEEE Trans. Circuits Syst. Video Technol., 2005

Directional Lifting-Based Wavelet Transform for Multiple Description Image Coding with Quincunx Segmentation.
Proceedings of the Advances in Multimedia Information Processing, 2005

Scalable multiview video coding using wavelet.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Viewpoint switching in multiview video streaming.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004
Optimum End-to-End Distortion Estimation for Error Resilient Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A Study on the Quantization Scheme in H.264/AVC and Its Application to Rate Control.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Enhanced direct mode coding for bi-predictive pictures.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Multiple modes intra-prediction in intra coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Context-based 2D-VLC for video coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Improved error concealment algorithms based on H.264/AVC non-normative decoder.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

New bi-prediction techniques for B pictures coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Mode mapping method for h.264/avc spatial downscaling transcoding.
Proceedings of the 2004 International Conference on Image Processing, 2004

Error resilience video coding in H.264 encoder with potential distortion tracking.
Proceedings of the 2004 International Conference on Image Processing, 2004

New scaling technique for direct mode coding in B pictures.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
Efficient background video coding with static sprite generation and arbitrary-shape spatial prediction techniques.
IEEE Trans. Circuits Syst. Video Technol., 2003

Rate control for advance video coding (AVC) standard.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Latest arrival time leaky bucket for HRD constrained video coding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A novel coefficient scanning scheme for directional spatial prediction-based image compression.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Rate control for JVT video coding scheme with HRD considerations.
Proceedings of the 2003 International Conference on Image Processing, 2003

2002
High efficient sprite coding with directional spatial prediction.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Fast and Robust Sprite Generation for MPEG-4 Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2001

Sprite generation for frame-based video coding.
Proceedings of the 2001 International Conference on Image Processing, 2001

2000
Human Facial Expression Recognition based on Learning Subspace Method.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000


  Loading...