Li Song

ORCID: 0000-0002-7124-5182

Affiliations:
  • Shanghai Jiao Tong University, Institute of Image Communication and Network Engineering, Shanghai, China
  • Shanghai Jiao Tong University, Department of Electronic Engineering, China (PhD 2005)


According to our database, Li Song authored at least 246 papers between 2005 and 2024.

Bibliography

2024
Memories are One-to-Many Mapping Alleviators in Talking Face Generation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

A Character Position-Aware Compression Framework for Screen Text Image.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Depth-Guided Robust Point Cloud Fusion NeRF for Sparse Input Views.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

EffiHDR: An Efficient Framework for HDRTV Reconstruction and Enhancement in UHD Systems.
IEEE Trans. Broadcast., June, 2024

Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network.
IEEE Trans. Multim., 2024

A New People-Object Interaction Dataset and NVS Benchmarks.
CoRR, 2024

PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation.
CoRR, 2024

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration.
CoRR, 2024

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration.
CoRR, 2024

JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression.
CoRR, 2024

Pioneer: Offline Reinforcement Learning based Bandwidth Estimation for Real-Time Communication.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

HPC: Hierarchical Progressive Coding Framework for Volumetric Video.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Rate-aware Compression for NeRF-based Volumetric Video.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Visibility-Aware Human Mesh Recovery via Balancing Dense Correspondence and Probability Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LFCAVE: Interactive 3D Space with Multiple Light Field Displays.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Neural Rate Control for Learned Video Compression.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hdrtvformer: Efficient SDRTV-to-HDRTV via Affine Transformation and Spatial-Aware Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

Disentangled Clothed Avatar Generation from Text Descriptions.
Proceedings of the Computer Vision - ECCV 2024, 2024

Identity-Consistent Video De-identification via Diffusion Autoencoders.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

A Priority Aware Free Viewpoint Video Transmit Scheme Based on QUIC.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

Detailed and Controllable Old Photo Restoration with Diffusion Priors.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

LaEC: Loss-aware Earliest Completion Data Scheduler for Multi-site Parallel Downloading.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

No-reference Quality Assessment of Text-to-Image Generation.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2024

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Face De-identification: Safeguarding Identities in the Digital Era
Springer, ISBN: 978-3-031-58221-9, 2024

2023
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation.
IEEE J. Sel. Top. Signal Process., November, 2023

Multi-scale-based joint super-resolution and inverse tone-mapping with data synthesis for UHD HDR video.
Displays, September, 2023

Precise Encoding Complexity Control for Versatile Video Coding.
IEEE Trans. Broadcast., March, 2023

High-Fidelity Face Reenactment Via Identity-Matched Correspondence Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Online Video Stabilization Using IMU Sensors.
IEEE Trans. Multim., 2023

Learned Image Compression Using Cross-Component Attention Mechanism.
IEEE Trans. Image Process., 2023

Disentangled Clothed Avatar Generation from Text Descriptions.
CoRR, 2023

Implicit-explicit Integrated Representations for Multi-view Video Compression.
CoRR, 2023

Learning Dense UV Completion for Human Mesh Recovery.
CoRR, 2023

High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

SAFR: A Real-Time Communication System with Adaptive Frame Rate.
Proceedings of the 1st International Workshop on Networked AI Systems, 2023

NeRF-SDP: Efficient Generalizable Neural Radiance Field with Scene Depth Perception.
Proceedings of the ACM Multimedia Asia 2023, 2023

Achieving Privacy-Preserving Multi-View Consistency with Advanced 3D-Aware Face De-identification.
Proceedings of the ACM Multimedia Asia 2023, 2023

360-Degree Panorama Generation from Few Unregistered NFoV Images.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Content Adaptive Checkerboard Context Model for Learned Image Compression.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

PACC: Perception Aware Congestion Control for Real-time Communication.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Old-Photo Restoration with Detail- and Structure-Enhanced Cascaded Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Divide and Conquer: a Two-Step Method for High Quality Face De-identification with Model Explainability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Perceptual Video Coding Based on Spatial Masking for Medical Video Communication.
Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition, 2023

Dual-Head Fusion Network for Image Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Boosting Video Object Segmentation via Space-Time Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Freestyle Layout-to-Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient Human Rendering with Geometric and Semantic Priors.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2023

Quality of Experience Assessment for Free-viewpoint Video.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2023

2022
Wireless Multiplayer Interactive Virtual Reality Game Systems With Edge Computing: Modeling and Optimization.
IEEE Trans. Wirel. Commun., 2022

Edge-Based Video Compression Texture Synthesis Using Generative Adversarial Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

IdentityMask: Deep Motion Flow Guided Reversible Face Video De-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Ultra-Low Latency, Stable, and Scalable Video Transmission for Free-Viewpoint Video Services.
IEEE Trans. Broadcast., 2022

Multiview nonlinear discriminant structure learning for emotion recognition.
Knowl. Based Syst., 2022

IdentityDP: Differential private identification protection for face images.
Neurocomputing, 2022

L0 structure-prior assisted blur-intensity aware efficient video deblurring.
Neurocomputing, 2022

RGBD-based Real-time Volumetric Reconstruction System: Architecture Design and Implementation.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

A Large-scale Sports Tracking Dataset and Progressive Re-detection Based Sports Tracking.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Perceptual Video Coding Based on Semantic-Guided Texture Detection and Synthesis.
Proceedings of the Picture Coding Symposium, 2022

Position-based Motion Vector Prediction for Textual Image Coding.
Proceedings of the Picture Coding Symposium, 2022

A new free viewpoint video dataset and DIBR benchmark.
Proceedings of the MMSys '22: 13th ACM Multimedia Systems Conference, Athlone, Ireland, June 14, 2022

A Cloud-based Free View Solution.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Intra Encoding Complexity Control with a Time-Cost Model for Versatile Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

An Attention Based CNN with Temporal Hierarchical Deployment for AVS3 Inter In-loop Filtering.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Complexity-Oriented Per-Shot Video Coding Optimization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Generative Compression for Face Video: A Hybrid Scheme.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

A Multi-User Oriented Live Free-Viewpoint Video Streaming System Based on View Interpolation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

CNN-Based Fast CU Partitioning Algorithm for VVC Intra Coding.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

MLS-GAN: Multi-Level Semantic Guided Image Colorization.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Low-Complexity Multi-Model CNN in-Loop Filter for AVS3.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Reinforcement Learning Based Cross-Layer Congestion Control for Real-Time Communication.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

Hiding Among Your Neighbors: Face Image Privacy Protection with Differential Private k-anonymity.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

2021
VMAF Oriented Perceptual Coding Based on Piecewise Metric Coupling.
IEEE Trans. Image Process., 2021

Compression Priors Assisted Convolutional Neural Network for Fractional Interpolation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Modeling Acceleration Properties for Flexible INTRA HEVC Complexity Control.
IEEE Trans. Circuits Syst. Video Technol., 2021

An Elastic System Architecture for Edge Based Low Latency Interactive Video Applications.
IEEE Trans. Broadcast., 2021

DP-Image: Differential Privacy for Image Data in Feature Space.
CoRR, 2021

Current Frame Priors Assisted Neural Network for Intra Prediction.
IEEE Access, 2021

Mobile Edge Resource optimization for Multiplayer Interactive Virtual Reality Game.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2021

Fast and Context-Aware Framework for Space-Time Video Super-Resolution.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Deep Motion Flow Aided Face Video De-identification.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Video Compression based on Jointly Learned Down-Sampling and Super-Resolution Networks.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

HEVC VMAF-oriented Perceptual Rate Distortion Optimization using CNN.
Proceedings of the Picture Coding Symposium, 2021

Deep Face Swapping via Cross-Identity Adversarial Training.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

DVRCNN: Dark Video Post-processing Method for VVC.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Blindly Predict Image and Video Quality in the Wild.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

SVM Based Fast CU Partitioning Algorithm for VVC Intra Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

A Generative Compression Framework For Low Bandwidth Video Conference.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Video Multimethod Assessment Fusion Based Rate-Distortion Optimization for Versatile Video Coding.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Personalized and Invertible Face De-identification by Disentangled Identity Information Manipulation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dense 3D Coordinate Code Prior Guidance for High-Fidelity Face Swapping and Face Reenactment.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Region-Aware Adaptive Instance Normalization for Image Harmonization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Dual Attention Guided Gaze Target Detection in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SpaAbr: Size Prediction Assisted Adaptive Bitrate Algorithm for Scalable Video Coding Contents.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

3D-BitNet: Flow-Agnostic and Precise Network for video Bit-Depth Expansion.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Configurable Low Delay Congestion Control Scheme for Cellular Networks.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Buffer Displacement Based Online Learning Algorithm For Low Latency HTTP Adaptive Streaming.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

Video Enhancement Based on Unpaired Learning.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2021

2020
Rate Distortion Optimization: A Joint Framework and Algorithms for Random Access Hierarchical Video Coding.
IEEE Trans. Image Process., 2020

Real-time UHD video super-resolution and transcoding on heterogeneous hardware.
J. Real Time Image Process., 2020

Quality of Experience Evaluation for Streaming Video Using CGNN.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Hybrid Model for Natural Face De-Identification with Adjustable Privacy.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Deep Blind Video Quality Assessment for User Generated Videos.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

A Deep Tracking and Segmentation Approach for Soccer Videos Visual Effects.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Learning-Based Quality Enhancement For Scalable Coded Video Over Packet Lossy Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Realistic Talking Face Synthesis With Geometry-Aware Feature Transformation.
Proceedings of the IEEE International Conference on Image Processing, 2020

Hiding Private Information in Images From AI.
Proceedings of the 2020 IEEE International Conference on Communications, 2020

Toward Fine-Grained Facial Expression Manipulation.
Proceedings of the Computer Vision - ECCV 2020, 2020

TSGAN: A Two-Stream Generative Adversarial Network for Bit-Depth Expansion.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

Native Resolution Detection for 4K-UHD Videos.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

A VMAF Directed Perceptual Rate Distortion Optimization for Video Coding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2020

FACT: Fused Attention for Clothing Transfer with Generative Adversarial Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer.
IEEE Trans. Image Process., 2019

An Improved QoE Evaluation Model for HTTP Adaptive Streaming.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Deep Feature Guided Image Retargeting.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

FPGA Based Video Transcoding System with 2K-4K Super-Resolution Conversion.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Identifying and Pruning Redundant Structures for Deep Neural Networks.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

JND-based Perceptual Rate Distortion Optimization for AV1 Encoder.
Proceedings of the Picture Coding Symposium, 2019

CNN Accelerated Intra Video Coding, Where Is the Upper Bound?
Proceedings of the Picture Coding Symposium, 2019

Reinforcement Learning Based Adaptive Bitrate Algorithm for Transmitting Panoramic Videos.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

VMAF Oriented Perceptual Optimization for Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Multi-scale Generative Adversarial Learning for Facial Attribute Transfer.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2019

Advanced CNN Based Motion Compensation Fractional Interpolation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

GAN Based Multi-Exposure Inverse Tone Mapping.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Low-precision CNN Model Quantization based on Optimal Scaling Factor Estimation.
Proceedings of the 2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2019

Viewport Prediction for Panoramic Video with Multi-CNN.
Proceedings of the 2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2019

Deep Video Inverse Tone Mapping.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

2018
Guest Editorial Special Issue on Quality of Experience for Advanced Broadcast Services.
IEEE Trans. Broadcast., 2018

An improved Real-Time Video Communication System.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Rate-mixed HEVC Tile based 360 Video Streaming System.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Masking Effects Based Rate Control Scheme for High Efficiency Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

An MCMC based Efficient Parameter Selection Model for x265 Encoder.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Improving Semantic Style Transfer Using Guided Gram Matrices.
Proceedings of the Digital TV and Multimedia Communication - 15th International Forum, 2018

Motion Adaptive Intra Refresh for Low Delay HEVC Encoding.
Proceedings of the Digital TV and Multimedia Communication - 15th International Forum, 2018

Frame Interpolation via Refined Deep Voxel Flow.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

GPU Based Motion-Compensated Frame Interpolation Acceleration for Future Video Coding.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A containerized media cloud for video transcoding service.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Learning an Inverse Tone Mapping Network with a Generative Adversarial Regularizer.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Segment Constraint ABR Algorithm for HEVC Encoder.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

A No Reference Bitstream-Based Video Quality Assessment Model for H.265/HEVC and H.264/AVC.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

A Generic Distributed Scheduling Algorithm for Frame Rate Up Convert Video Transcoding.
Proceedings of the 2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2018

Video Frame Interpolation Using Recurrent Convolutional Layers.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
DRIMUX: Dynamic Rumor Influence Minimization with User Experience in Social Networks.
IEEE Trans. Knowl. Data Eng., 2017

Low Complexity Scene Change Detection Algorithm for Supporting Resolution Dynamic Change.
计算机科学 (Computer Science), 2017

Evaluation of No Reference Bitstream-based Video Quality Assessment Methods.
CoRR, 2017

Deep Binary Representation for Efficient Image Retrieval.
Adv. Multim., 2017

Learning a convolutional neural network for fractional interpolation in HEVC inter coding.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Two-stream deep encoder-decoder architecture for fully automatic video object segmentation.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

A generic method to improve no-reference image blur metric accuracy in video contents.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Weight-based bit allocation scheme for VR videos in HEVC.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Rate control model for high dynamic range video.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Low Latency MPEG-DASH System Over HTTP 2.0 and WebSocket.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2017

Deep hash learning for efficient image retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Lagrangian method based Rate-Distortion Optimization revisited for dependent video coding.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

CNN based post-processing to improve HEVC.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A lightweight distributed media processing system for UHD service.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Two-stream recurrent convolutional neural networks for video saliency estimation.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

Machine learning based VP9-to-HEVC video transcoding.
Proceedings of the 2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2017

2016
Identifying effective initiators in OSNs: from the spectral radius perspective.
Wirel. Commun. Mob. Comput., 2016

Evaluation of beyond-HEVC entropy coding methods for DCT transform coefficients.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Shot boundary detection using convolutional neural networks.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Improved intra angular prediction with novel interpolation filter and boundary filter.
Proceedings of the 2016 Picture Coding Symposium, 2016

A parallel-fusion RNN-LSTM architecture for image caption generation.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

SJTU 4K video subjective quality dataset for content adaptive bit rate estimation without encoding.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

GPU accelerated high-quality video/image super-resolution.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2016

A novel parallel-friendly rate control scheme for HEVC.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Review of ITU-T parametric models for compressed video quality estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Saliency based rate control scheme for high efficiency video coding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

BEST: Benchmark and Evaluation of Surveillance Task.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

DRIMUX: Dynamic Rumor Influence Minimization with User Experience in Social Networks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Modeling Topic-Level Academic Influence in Scientific Literatures.
Proceedings of the Scholarly Big Data: AI Perspectives, 2016

2015
An Optimized Pixel-Wise Weighting Approach for Patch-Based Image Denoising.
IEEE Signal Process. Lett., 2015

Temporal dependent bit allocation scheme for rate control in HEVC.
Proceedings of the 2015 IEEE Workshop on Signal Processing Systems, 2015

Fast depth decision with enlarged coding block sizes for HEVC intra coding of 4K ultra-HD video.
Proceedings of the 2015 IEEE Workshop on Signal Processing Systems, 2015

Small group people behavior analysis based on temporal recursive trajectory identification.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Which metric can predict coding gain of H.265/HEVC over H.264/AVC?
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

CNN-based shot boundary detection and video annotation.
Proceedings of the 2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2015

Learning based fast H.264 to H.265 transcoding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Evaluation of Different Algorithms of Nonnegative Matrix Factorization in Temporal Psychovisual Modulation.
IEEE Trans. Circuits Syst. Video Technol., 2014

Blind image quality assessment based on a new feature of nature scene statistics.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Fast coding unit depth decision for HEVC.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Are we still friends: Kernel multivariate survival analysis.
Proceedings of the IEEE Global Communications Conference, 2014

Speed up HEVC encoder by precoding with H.264.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Performance evaluation of H.265/MPEG-HEVC encoders for 4K video sequences.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Reorder user's tweets.
ACM Trans. Intell. Syst. Technol., 2013

Foreground Estimation Based on Linear Regression Model With Fused Sparsity on Outliers.
IEEE Trans. Circuits Syst. Video Technol., 2013

Shaking video synthesis for video stabilization performance assessment.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

The SJTU 4K video sequence dataset.
Proceedings of the Fourth International Workshop on Quality of Multimedia Experience, 2013

Analysis and identification of spamming behaviors in Sina Weibo microblog.
Proceedings of the 7th Workshop on Social Network Mining and Analysis, 2013

Efficient realization of parallel HEVC intra encoding.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Foreground detection: Combining background subspace learning with object smoothing model.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Paralleling variable block size motion estimation of HEVC on multi-core CPU plus GPU platform.
Proceedings of the IEEE International Conference on Image Processing, 2013

Image restoration via efficient Gaussian mixture model learning.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
Background subtraction based on phase feature and distance transform.
Pattern Recognit. Lett., 2012

New bounds on image denoising: Viewpoint of sparse representation and non-local averaging.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Measurement Algorithm for Image Structure Noise on Hardcopy.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Skew Estimation Based on Haar-Like Features.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Feature Analysis of Spammers in Social Networks with Active Honeypots: A Case Study of Chinese Microblogging Networks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2012

2011
Prioritized Flow Optimization With Multi-Path and Network Coding Based Routing for Scalable Multirate Multicasting.
IEEE Trans. Circuits Syst. Video Technol., 2011

MCM: An Efficient Geometric Constraint Method for Robust Local Feature Matching.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

Hybrid center-symmetric local pattern for dynamic background subtraction.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Foreground estimation based on robust linear regression model.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning dictionary via subspace segmentation for sparse representation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning sparse dictionaries with a popularity-based model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Building Artificial Identities in Social Network Using Semantic Information.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

2010
Distributed link-aware rate allocation for R-D optimal multiple video streaming over wireless networks.
Proceedings of the International Conference on Wireless Communications and Signal Processing, 2010

Image denoising using local tangent space alignment.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Video Coding With Key Frames Guided Super-Resolution.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Dynamic background subtraction based on spatial extended center-symmetric local binary pattern.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Background subtraction based on phase and distance transform under sudden illumination change.
Proceedings of the International Conference on Image Processing, 2010

Improving Detector of Viola and Jones through SVM.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Multi-illumination Face Recognition from a Single Training Image per Person with Sparse Representation.
Proceedings of the Computer Vision - ACCV 2010, 2010

QWT: Retrospective and New Applications.
Proceedings of the Geometric Algebra Computing - in Engineering and Computer Science, 2010

2009
Robust Video Region-of-Interest Coding Based on Leaky Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2009

Subgraphs Matching-Based Side Information Generation for Distributed Multiview Video Coding.
EURASIP J. Adv. Signal Process., 2009

Improved Intra-coding Methods for H.264/AVC.
EURASIP J. Adv. Signal Process., 2009

Structure-Preserving Colorization Based on Quaternionic Phase Reconstruction.
Proceedings of the Advances in Multimedia Information Processing, 2009

Spatiotemporal Phase Congruency Based Invariant Features for Human Behavior Classification.
Proceedings of the Advances in Multimedia Information Processing, 2009

Spatial non-stationary correlation noise modeling for Wyner-Ziv error resilience video coding.
Proceedings of the International Conference on Image Processing, 2009

Sub clustering K-SVD: Size variable dictionary learning for sparse representations.
Proceedings of the International Conference on Image Processing, 2009

Advanced H.264/AVC encoder optimizations on a TMS320DM642 digital signal processor.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009

Prioritized Flow Optimization with Generalized Routing for Scalable Multirate Multicasting.
Proceedings of IEEE International Conference on Communications, 2009

Graph Matching Based Side Information Generation for Distributed Multi-View Video Coding.
Proceedings of IEEE International Conference on Communications, 2009

Generic video coding with abstraction and detail completion.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Contourlet-based image adaptive watermarking.
Signal Process. Image Commun., 2008

Shanghai Jiao Tong University participation in high-level feature extraction, automatic search and surveillance event detection at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

2-D dual multiresolution decomposition through NUDFB and its application.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Color image watermarking using local quaternion Fourier spectral analysis.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Local Quaternionic Gabor Binary Patterns for color face recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

On Non-sequential Context Modeling with Application to Executable Data Compression.
Proceedings of the 2008 Data Compression Conference (DCC 2008), 2008

2007
Robust Region-of-Interest Scalable Coding with Leaky Prediction in H.264/AVC.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2007

Cooperative Stereo Matching using Quaternion Wavelets and Top-Down Segmentation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Bit Allocation for Fine-Granular SNR Scalability Coding with Hierarchical B Pictures.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2D Quaternion Fourier Transform: The Spectrum Properties and its Application in Color Image Registration.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
A New Deblocking Algorithm Based on Adjusted Contourlet Transform.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Context-Based Error Detection Strategy into H.264/AVC CABAC.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

2005
Contourlet Image Coding Based on Adjusted SPIHT.
Proceedings of the Advances in Multimedia Information Processing, 2005

Adaptive predict based on fading compensation for lifting-based motion compensated temporal filtering.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Study on Motion Prediction and Coding for In-Band Motion Compensated Temporal Filtering.
Proceedings of the Computational Intelligence and Security, International Conference, 2005

