Ling-Yu Duan

Orcid: 0000-0002-4491-2023

Affiliations:
  • Peking University, School of Electronic and Computer Engineering, Institute of Digital Media, Beijing, China
  • Institute for Infocomm Research, Singapore
  • University of Newcastle, Australia (PhD 2008)


According to our database1, Ling-Yu Duan authored at least 274 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

Transferable Adversarial Attacks on SAM and Its Downstream Models.
CoRR, 2024

Amodal Segmentation for Laparoscopic Surgery Video Instruments.
CoRR, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception.
CoRR, 2024

Coding for Intelligence from the Perspective of Category.
CoRR, 2024

ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks.
Proceedings of the Computer Vision - ECCV 2024, 2024

LEAD: Exploring Logit Space Evolution for Model Selection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Seeing Dark Videos via Self-Learned Bottleneck Neural Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Coarse-to-fine Disentangling Demoiréing Framework for Recaptured Screen Images.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

PS-Net: human perception-guided segmentation network for EM cell membrane.
Bioinform., August, 2023

Background Scene Recovery From an Image Looking Through Colored Glass.
IEEE Trans. Multim., 2023

Purifying Low-Light Images via Near-Infrared Enlightened Image.
IEEE Trans. Multim., 2023

Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning.
IEEE Trans. Multim., 2023

Benchmarking Single-Image Reflection Removal Algorithms.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Modeling Uncertain Feature Representation for Domain Generalization.
CoRR, 2023

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring Model Transferability through the Lens of Potential Energy.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Switchable Representation Learning Framework with Self-Compatibility.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
$A^3$-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction.
IEEE Trans. Multim., 2022

Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems.
IEEE Trans. Mob. Comput., 2022

Intrinsic Performance Influence-based Participant Contribution Estimation for Horizontal Federated Learning.
ACM Trans. Intell. Syst. Technol., 2022

Towards Low Light Enhancement With RAW Images.
IEEE Trans. Image Process., 2022

Disentangled Feature Learning Network and a Comprehensive Benchmark for Vehicle Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Switchable Representation Learning Framework with Self-compatibility.
CoRR, 2022

Bridging the Source-to-target Gap for Cross-domain Person Re-Identification with Intermediate Domains.
CoRR, 2022

Nonlinear Multi-Model Reuse.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Collaborative Scalable Visual Compression for Human-Centered Videos.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Uncertainty Modeling for Out-of-Distribution Generalization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

mc-BEiT: Multi-choice Discretization for Image BERT Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neighborhood Consensus Contrastive Learning for Backward-Compatible Representation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Attribute-wise Explainable Fashion Compatibility Modeling.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Market2Dish: Health-aware Food Recommendation.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning.
IEEE Trans. Multim., 2021

Towards Coding for Human and Machine Vision: Scalable Face Image Coding.
IEEE Trans. Multim., 2021

Dual-Refinement: Joint Label and Feature Refinement for Unsupervised Domain Adaptive Person Re-Identification.
IEEE Trans. Image Process., 2021

Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification.
IEEE Trans. Image Process., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie.
IEEE Trans. Circuits Syst. Video Technol., 2021

Face Image Reflection Removal.
Int. J. Comput. Vis., 2021

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics.
CoRR, 2021

Person Retrieval with Conv-Transformer.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Single Image Reflection Removal With Absorption Effect.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generalizable Person Re-Identification With Relevance-Aware Mixture of Experts.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Person30K: A Dual-Meta Generalization Network for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm.
IEEE Trans. Multim., 2020

Iterative Local-Global Collaboration Learning Towards One-Shot Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics.
IEEE Trans. Image Process., 2020

Toward Intelligent Sensing: Intermediate Deep Feature Compression.
IEEE Trans. Image Process., 2020

CoRRN: Cooperative Reflection Removal Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Skeleton-Based Online Action Prediction Using Scale Selection Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Feature Boosting Network For 3D Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Deep Variational and Structural Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

JDNet: A Joint-Learning Distilled Network for Mobile Visual Food Recognition.
IEEE J. Sel. Top. Signal Process., 2020

Key-Point Sequence Lossless Compression for Intelligent Video Analysis.
IEEE Multim., 2020

Network Update Compression for Federated Learning.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Pose-native Network Architecture Search for Multi-person Human Pose Estimation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Disentangled Feature Learning Network for Vehicle Re-Identification.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

An Emerging Coding Paradigm Vcm: A Scalable Coding Approach Beyond Feature And Signal.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Extending Hashing Towards Fast Re-Identification.
Proceedings of the IEEE International Conference on Image Processing, 2020

Data Representation in Hybrid Coding Framework for Feature Maps Compression.
Proceedings of the IEEE International Conference on Image Processing, 2020

Deep Product Quantization Module for Efficient Image Retrieval.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

FHDe<sup>2</sup>Net: Full High Definition Demoireing Network.
Proceedings of the Computer Vision - ECCV 2020, 2020

What Does Plate Glass Reveal About Camera Calibration?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Reflection Scene Separation From a Single Image.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Codebook-Free Compact Descriptor for Scalable Visual Search.
IEEE Trans. Multim., 2019

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos.
IEEE Trans. Multim., 2019

Embedding Adversarial Learning for Vehicle Re-Identification.
IEEE Trans. Image Process., 2019

Robust Distracter-Resistive Tracker via Learning a Multi-Component Discriminative Dictionary.
IEEE Trans. Circuits Syst. Video Technol., 2019

Multi-scale Optimal Fusion model for single image dehazing.
Signal Process. Image Commun., 2019

Learning to remove reflections from windshield images.
Signal Process. Image Commun., 2019

基于深度残差网络的HEVC压缩视频增强 (Deep Residual Network Based HEVC Compressed Videos Enhancement).
计算机科学, 2019

Front-End Smart Visual Sensing and Back-End Intelligent Analysis: A Unified Infrastructure for Economizing the Visual System of City Brain.
IEEE J. Sel. Areas Commun., 2019

Toward Knowledge as a Service Over Networks: A Deep Learning Model Communication Paradigm.
IEEE J. Sel. Areas Commun., 2019

IDeRs: Iterative dehazing method for single remote sensing image.
Inf. Sci., 2019

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond.
IEEE Multim., 2019

Compact Descriptors for Video Analysis: The Emerging MPEG Standard.
IEEE Multim., 2019

DeepShoe: An improved Multi-Task View-invariant CNN for street-to-shop shoe retrieval.
Comput. Vis. Image Underst., 2019

Hard-Aware Fashion Attribute Classification.
CoRR, 2019

Signal-Independent Separable KLT by Offline Training for Video Coding.
IEEE Access, 2019

Toward Intelligent Visual Sensing and Low-cost Analysis: A Collaborative Computing Approach.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

See Through the Windshield from Surveillance Camera.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Adaptive Feature Fusion via Graph Neural Network for Person Re-identification.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Market2Dish: A Health-aware Food Recommendation System.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Lossy Intermediate Deep Learning Feature Compression and Evaluation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Few-Shot and Many-Shot Fusion Learning in Mobile Visual Food Recognition.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

From Market to Dish: Multi-ingredient Image Recognition for Personalized Recipe Recommendation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Learning to Remove Reflections for Text Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Incorporating Category Taxonomy in Deep Reinforcement Learning Based Image Hashing.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Denoising Adversarial Networks for Rain Removal and Reflection Removal.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Fashion Recommendation on Street Images.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

SPLINE-Net: Sparse Photometric Stereo Through Lighting Interpolation and Normal Estimation Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Jointly Generate and Separate Reflections.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Mop Moiré Patterns Using MopNet.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Separable KLT for Intra Coding in Versatile Video Coding (VVC).
Proceedings of the Data Compression Conference, 2019

VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Accurate One-Stage Object Detection With AP-Loss.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploring Object Relation in Mean Teacher for Cross-Domain Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search.
IEEE Trans. Multim., 2018

Toward Intelligent Product Retrieval for TV-to-Online (T2O) Application: A Transfer Metric Learning Approach.
IEEE Trans. Multim., 2018

Query Adaptive Multiview Object Instance Search and Localization Using Sketches.
IEEE Trans. Multim., 2018

Group-Sensitive Triplet Embedding for Vehicle Reidentification.
IEEE Trans. Multim., 2018

Region-Aware Reflection Removal With Unified Content and Gradient Priors.
IEEE Trans. Image Process., 2018

Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks.
IEEE Trans. Image Process., 2018

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.
IEEE Trans. Image Process., 2018

Fast MPEG-CDVS Encoder With GPU-CPU Hybrid Computing.
IEEE Trans. Image Process., 2018

Rate-Distortion Optimized Sparse Coding With Ordered Dictionary for Image Set Compression.
IEEE Trans. Circuits Syst. Video Technol., 2018

Transfer Metric Learning: Algorithms, Applications and Outlooks.
CoRR, 2018

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing.
CoRR, 2018

Tracklet Siamese Network with Constrained Clustering for Multiple Object Tracking.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Depth Structure Preserving Scene Image Generation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-Scale Context Attention Network for Image Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Unified Generative Adversarial Framework for Image Generation and Person Re-identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

From Data to Knowledge: Deep Learning Model Compression, Transmission and Communication.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Gated Square-Root Pooling for Image Instance Retrieval.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

CRRN: Multi-Scale Guided Concurrent Reflection Removal Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SSNet: Scale Selection Network for Online 3D Action Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval.
IEEE Trans. Multim., 2017

Pruning Convolutional Neural Networks for Image Instance Retrieval.
CoRR, 2017

Skeleton Based Human Action Recognition with Global Context-Aware Attention LSTM Networks.
CoRR, 2017

Fast MPEG-CDVS Encoder with GPU-CPU Hybrid Computing.
CoRR, 2017

Incorporating Intra-Class Variance to Fine-Grained Visual Recognition.
CoRR, 2017

From Part to Whole: Who is Behind the Painting?
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Nested Invariance Pooling and RBM Hashing for Image Instance Retrieval.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

DeepHash for Image Instance Retrieval: Getting Regularization, Depth and Fine-Tuning Right.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Improving object detection with region similarity learning.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Incorporating intra-class variance to fine-grained visual recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

GPU Based fast MPEG-CDVS encoder.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A Multi-Block N-ary trie structure for exact r-neighbour search in hamming space.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep regional feature pooling for video matching.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Benchmarking Single-Image Reflection Removal Algorithms.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Compact Deep Invariant Descriptors for Video Retrieval.
Proceedings of the 2017 Data Compression Conference, 2017

Compression of Deep Neural Networks for Image Instance Retrieval.
Proceedings of the 2017 Data Compression Conference, 2017

Global Context-Aware Attention LSTM Networks for 3D Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware Descriptors.
IEEE Trans. Multim., 2016

Overview of the MPEG-CDVS Standard.
IEEE Trans. Image Process., 2016

A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

To Project More or to Quantize More: Minimize Reconstruction Bias for Learning Compact Binary Codes.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Smart query expansion scheme for CDVS based on illumination and key features.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Two-stage pooling of deep convolutional features for image retrieval.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Depth-based local feature selection for mobile visual search.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Selectively Aggregated Fisher Vectors of Query Video for Mobile Visual Search.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Adaptive Weighted Matching of Deep Convolutional Features for Painting Retrieval.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Affinity Preserving Quantization for Hashing: A Vector Quantization Approach to Compact Learn Binary Codes.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Weighted Component Hashing of Binary Aggregated Descriptors for Fast Visual Search.
IEEE Trans. Multim., 2015

Depth-Preserving Warping for Stereo Image Retargeting.
IEEE Trans. Image Process., 2015

A Low Complexity Interest Point Detector.
IEEE Signal Process. Lett., 2015

Finding the Secret of Image Saliency in the Frequency Domain.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Efficient image retrieval based mobile indoor localization.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Query-Adaptive Logo Search using Shape-Aware Descriptors.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Hamming Compatible Quantization for Hashing.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Hierarchical multi-VLAD for image retrieval.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

An efficient coding framework for compact descriptors extracted from video sequence.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Optimizing Binary Fisher Codes for Visual Search.
Proceedings of the 2015 Data Compression Conference, 2015

Overview of the MPEG CDVS Standard.
Proceedings of the 2015 Data Compression Conference, 2015

Real-Time Tracking with Selective DoP-RIEF Features for Augmented Reality.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014
Towards Mobile Document Image Retrieval for Digital Library.
IEEE Trans. Multim., 2014

Spatiotemporal Grid Flow for Video Retargeting.
IEEE Trans. Image Process., 2014

Mining Compact Bag-of-Patterns for Low Bit Rate Mobile Visual Search.
IEEE Trans. Image Process., 2014

Interactive ads recommendation with contextual search on product topic space.
Multim. Tools Appl., 2014

Compact Descriptors for Visual Search.
IEEE Multim., 2014

Component hashing of variable-length binary aggregated descriptors for fast image search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Joint optimization of JPEG quantization table and coefficient thresholding for low bitrate mobile visual search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Region-based depth-preserving stereoscopic image retargeting.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Learning to Distribute Vocabulary Indexing for Scalable Visual Search.
IEEE Trans. Multim., 2013

Estimating Visual Saliency Through Single Image Optimization.
IEEE Signal Process. Lett., 2013

Learning from mobile contexts to minimize the mobile location search latency.
Signal Process. Image Commun., 2013

Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search.
AI Mag., 2013

A local shape descriptor for mobile linedrawing retrieval.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

An Error Resilient Depth Map Coding Scheme Using Adaptive Wyner-Ziv Frame.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Mobile media communication, processing, and analysis: A review of recent advances.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Compact descriptors for mobile visual search and MPEG CDVS standardization.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A novel pair-wise image matching strategy with compact descriptors.
Proceedings of the IEEE International Conference on Image Processing, 2013

Robust fisher codes for large scale image retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

On the interoperability of local descriptors compression.
Proceedings of the IEEE International Conference on Acoustics, 2013

A hybrid pixel-block based view synthesis for multiviewpoint 3D video.
Proceedings of the 3DTV-Conference 2013: The True Vision, 2013

2012
A Generic Approach for Systematic Analysis of Sports Videos.
ACM Trans. Intell. Syst. Technol., 2012

Group-Sensitive Multiple Kernel Learning for Object Recognition.
IEEE Trans. Image Process., 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.
Int. J. Comput. Vis., 2012

Optimizing JPEG quantization table for low bit rate mobile visual search.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Motion Based Perceptual Distortion and Rate Optimization for Video Coding.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Social Image Tagging by Mining Sparse Tag Patterns from Auxiliary Data.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Learning sparse tag patterns for social image classification.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Weakly supervised topic grouping of YouTube search results.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multi-stage vector quantization towards low bit rate visual search.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Allocating images and selecting image collections for distributed visual search.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

PQ-WGLOH: A bit-rate scalable local feature descriptor.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Learning multiple codebooks for low bit rate mobile visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Predicting the effectiveness of queries for visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Pruning tree-structured vector quantizer towards low bit rate mobile visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards compact topical descriptors.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Grid-Based Retargeting with Transformation Consistency Smoothing.
Proceedings of the Advances in Multimedia Modeling, 2011

Towards low bit rate mobile visual search with multiple-channel coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.
Proceedings of the IJCAI 2011, 2011

Fast retargeting with adaptive grid optimization.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Learning the trip suggestion from landmark photos on the web.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

PKUBench: A context rich mobile visual search benchmark.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Generating vocabulary for global feature representation towards commerce image retrieval.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

When codeword frequency meets geographical location.
Proceedings of the IEEE International Conference on Acoustics, 2011

A lowbit rate vocabulary coding scheme for mobile landmark search.
Proceedings of the IEEE International Conference on Acoustics, 2011

Sorting local descriptors for lowbit rate mobile visual search.
Proceedings of the IEEE International Conference on Acoustics, 2011

Topic level sampling towards optimized locality sensitive vocabulary coding.
Proceedings of the 8th International Conference on Information, 2011

2010
Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context.
IEEE Trans. Multim., 2010

Per-Sample Multiple Kernel Approach for Visual Concept Learning.
EURASIP J. Image Video Process., 2010

AdVR: Linking Ad Video with Products or Service.
Proceedings of the Advances in Multimedia Modeling, 2010

Saliency detection based on 2D log-gabor wavelets and center bias.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video retargeting with multi-scale trajectory optimization.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Interactive Web Video Advertising with Context Analysis and Search.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

ESUR: A system for Events detection in SURveillance video.
Proceedings of the International Conference on Image Processing, 2010

Interactive service recommendation based on ad concept hierarchy.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Automatic video genre categorization and event detection techniques on large-scale sports data.
Proceedings of the 2010 conference of the Centre for Advanced Studies on Collaborative Research, 2010

2009
A New Multiple Kernel Approach for Visual Concept Learning.
Proceedings of the Advances in Multimedia Modeling, 2009

Sports video retargeting.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Consumer video retargeting: context assisted spatial-temporal grid optimization.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Automatic sports genre categorization and view-type classification over large-scale dataset.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Multiple kernel active learning for image classification.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Linking video ADS with product or service information by web search.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A generic approach to classify sports video shots and its application in event detection.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Semantic Linking between Video Ads and Web Services with Progressive Search.
Proceedings of the ICDM Workshops 2009, 2009

Group-sensitive multiple kernel learning for object categorization.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2008
Audio keywords generation for sports video analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2008

A Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams.
IEEE Trans. Multim., 2008

Digesting Commercial Clips from TV Streams.
IEEE Multim., 2008

Hierarchical movie affective content analysis based on arousal and valence features.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Personalization of media and its attention service applications.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
An algorithm to estimate mean vehicle speed from MPEG Skycam video.
Multim. Tools Appl., 2007

Automatic TV Logo Detection, Tracking and Removal in Broadcast Video.
Proceedings of the Advances in Multimedia Modeling, 2007

TV ad video categorization with probabilistic latent concept learning.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Robust Commercial Retrieval in Video Streams.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Nonparametric motion characterization for robust classification of camera motion patterns.
IEEE Trans. Multim., 2006

A Semantic Image Category for Structuring TV Broadcast Video Streams.
Proceedings of the Advances in Multimedia Information Processing, 2006

Live sports event detection based on broadcast video and web-casting text.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Local Motion Analysis and Its Application in Video based Swimming Style Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

TV Commercial Classification by using Multi-Modal Textual Information.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Robust Method for TV Logo Tracking in Video Streams.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Mid-Level Scene Change Representation Via Audiovisual Alignment.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
A unified framework for semantic shot classification in sports video.
IEEE Trans. Multim., 2005

Shot-Level Camera Motion Estimation Based on a Parametric Model.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Automatic generation of personalized music sports video.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

A unified framework for semantic shot representation of sports video.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Periodicity Detection of Local Motion.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

A Mid-level Visual Concept Generation Framework for Sports Analysis.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Replay Scene Classification in Soccer Video Using Web Broadcast Text.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
Fast and Robust Short Video Clip Search for Copy Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

HMM-Based Audio Keyword Generation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Audio keyword generation for sports video analysis.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Fast and robust video clip search using index structure.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model with applications to camera motion pattern classification.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Fast and robust short video clip search using an index structure.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Mean shift based nonparametric motion characterization.
Proceedings of the 2004 International Conference on Image Processing, 2004

Mean shift based video segment representation and applications to replay detection.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Semantic Shot Classification in Sports Video.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Nonparametric color characterization using mean shift.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A mid-level representation framework for semantic sports video analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Robust moving video object segmentation in the MPEG compressed domain.
Proceedings of the 2003 International Conference on Image Processing, 2003

A fusion scheme of visual and auditory modalities for event detection in sports video.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Shot Classification of Sports Video Based on Features in Motion Vector Field.
Proceedings of the Advances in Multimedia Information Processing, 2002

Foreground Segmentation Using Motion Vectors in Sports Video.
Proceedings of the Advances in Multimedia Information Processing, 2002

A unified framework for semantic shot classification in sports videos.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Clear face analysis from MPEG compressed video.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002


  Loading...