Yonghong Tian

Orcid: 0000-0002-2978-5935

Affiliations:
  • Peking University, Beijing, School of Electronics Engineering and Computer Science, National Engineering Laboratory for Video Technology, China
  • Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China (PhD 2005)
  • University of Electronic Science and Technology, Chengdu, China (former)


According to our database1, Yonghong Tian authored at least 391 papers between 2001 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

Event-Enhanced Snapshot Mosaic Hyperspectral Frame Deblurring.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

2024
Sequential Trajectory Data Publishing With Adaptive Grid-Based Weighted Differential Privacy.
IEEE Trans. Knowl. Data Eng., December, 2024

Sustainable Distributed Adaptive Platoon in Multi-Agent Mobile-Edge Computing Networks for Lane Reduction Scenario.
IEEE Trans. Intell. Transp. Syst., November, 2024

Training-Free Transformer Architecture Search With Zero-Cost Proxy Guided Evolution.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Uncovering the Over-Smoothing Challenge in Image Super-Resolution: Entropy-Based Quantification and Contrastive Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Event-Based Monocular Depth Estimation With Recurrent Transformers.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Multirate Progressive Entropy Model for Learned Image Compression.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Network model with internal complexity bridges artificial intelligence and neuroscience.
Nat. Comput. Sci., August, 2024

Brain-Inspired Computing: A Systematic Survey and Future Trends.
Proc. IEEE, June, 2024

Parsing Objects at a Finer Granularity: A Survey.
Mach. Intell. Res., June, 2024

Unsupervised Deraining: Where Asymmetric Contrastive Learning Meets Self-Similarity.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Universal Object Detection with Large Vision Model.
Int. J. Comput. Vis., April, 2024

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows.
IEEE Trans. Cybern., March, 2024

Pick-and-Place Transform Learning for Fast Multi-View Clustering.
IEEE Trans. Image Process., 2024

Sensitivity Decouple Learning for Image Compression Artifacts Reduction.
IEEE Trans. Image Process., 2024

The Role of Class Information in Model Inversion Attacks Against Image Deep Learning Classifiers.
IEEE Trans. Dependable Secur. Comput., 2024

Self-architectural knowledge distillation for spiking neural networks.
Neural Networks, 2024

ETTFS: An Efficient Training Framework for Time-to-First-Spike Neuron.
CoRR, 2024

Spatial-Temporal Search for Spiking Neural Networks.
CoRR, 2024

Is Parameter Collision Hindering Continual Learning in LLMs?
CoRR, 2024

CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection.
CoRR, 2024

Multi-granularity Score-based Generative Framework Enables Efficient Inverse Design of Complex Organics.
CoRR, 2024

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis.
CoRR, 2024

Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms.
CoRR, 2024

Time-Dependent VAE for Building Latent Factor from Visual Neural Activity with Complex Dynamics.
CoRR, 2024

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions.
CoRR, 2024

Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition.
CoRR, 2024

SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition.
CoRR, 2024

EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images.
CoRR, 2024

Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and Methods.
CoRR, 2024

Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition.
CoRR, 2024

State Space Model for New-Generation Network Alternative to Transformers: A Survey.
CoRR, 2024

MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control.
CoRR, 2024

QKFormer: Hierarchical Spiking Transformer using Q-K Attention.
CoRR, 2024

Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline.
CoRR, 2024

Noisy Spiking Actor Network for Exploration.
CoRR, 2024

TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation.
CoRR, 2024

ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing.
CoRR, 2024

Fully Spiking Actor Network with Intra-layer Connections for Reinforcement Learning.
CoRR, 2024

Deep peak property learning for efficient chiral molecules ECD spectra prediction.
CoRR, 2024

CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras.
CoRR, 2024

Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket.
CoRR, 2024

A fuzzy logic constrained particle swarm optimization algorithm for industrial design problems.
Appl. Soft Comput., 2024

Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

High-Performance Temporal Reversible Spiking Neural Networks with O(L) Training Memory and O(1) Inference Cost.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Optimal ANN-SNN Conversion with Group Neurons.
Proceedings of the IEEE International Conference on Acoustics, 2024

Temporal Contrastive Learning for Spiking Neural Networks.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

HiFi-123: Towards High-Fidelity One Image to 3D Content Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Event Stream-Based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Event-Based Visible and Infrared Fusion via Multi-Task Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Adaptive Discovering and Merging for Incremental Novel Class Discovery.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Distilling a Powerful Student Model via Online Knowledge Distillation.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control.
IEEE Trans. Knowl. Data Eng., November, 2023

MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

SODFormer: Streaming Object Detection With Transformer Using Events and Frames.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Running ahead of evolution - AI-based simulation for predicting future high-risk SARS-CoV-2 variants.
Int. J. High Perform. Comput. Appl., November, 2023

Carrying Out CNN Channel Pruning in a White Box.
IEEE Trans. Neural Networks Learn. Syst., October, 2023

Dual Adaptive Representation Alignment for Cross-Domain Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Attention Spiking Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.
Mach. Intell. Res., August, 2023

A Hybrid Spiking Neurons Embedded LSTM Network for Multivariate Time Series Learning Under Concept-Drift Environment.
IEEE Trans. Knowl. Data Eng., July, 2023

Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

Semi-Supervised CT Lesion Segmentation Using Uncertainty-Based Data Pairing and SwapMix.
IEEE Trans. Medical Imaging, May, 2023

Asynchronous Spatiotemporal Spike Metric for Event Cameras.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

Picking Up Quantization Steps for Compressed Image Classification.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

Nonlinear Transforms in Learned Image Compression From a Communication Perspective.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

1xN Pattern for Pruning Convolutional Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

From Pose to Part: Weakly-Supervised Pose Evolution for Human Part Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking.
IEEE Trans. Multim., 2023

Learning Super-Resolution Reconstruction for High Temporal Resolution Spike Stream.
IEEE Trans. Circuits Syst. Video Technol., 2023

Ultra-High Temporal Resolution Visual Reconstruction From a Fovea-Like Spike Camera via Spiking Neuron Model.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Population-Based Evolutionary Gaming for Unsupervised Person Re-identification.
Int. J. Comput. Vis., 2023

Machine Mindset: An MBTI Exploration of Large Language Models.
CoRR, 2023

Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition.
CoRR, 2023

SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence.
CoRR, 2023

HiFi-123: Towards High-fidelity One Image to 3D Content Generation.
CoRR, 2023

SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition.
CoRR, 2023

Deep recurrent spiking neural networks capture both static and dynamic representations of the visual cortex under movie stimuli.
CoRR, 2023

Auto-Spikformer: Spikformer Architecture Search.
CoRR, 2023

Album Storytelling with Iterative Story-aware Captioning and Large Language Models.
CoRR, 2023

Probabilistic Modeling: Proving the Lottery Ticket Hypothesis in Spiking Neural Network.
CoRR, 2023

Enhancing the Performance of Transformer-based Spiking Neural Networks by SNN-optimized Downsampling with Precise Gradient Backpropagation.
CoRR, 2023

Parallel Spiking Neurons with High Efficiency and Long-term Dependencies Learning Ability.
CoRR, 2023

Spikingformer: Spike-driven Residual Learning for Transformer-based Spiking Neural Network.
CoRR, 2023

Training Full Spike Neural Networks via Auxiliary Accumulation Pathway.
CoRR, 2023

Spike-driven Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hierarchical Adaptive Value Estimation for Multi-modal Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Knowledge Prompt-tuning for Sequential Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HumVis: Human-Centric Visual Analysis System.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Event-Diffusion: Event-Based Image Reconstruction and Restoration with Diffusion Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LocLoc: Low-level Cues and Local-area Guides for Weakly Supervised Object Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Dynamic Belief for Decentralized Multi-Agent Cooperative Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Spikformer: When Spiking Neural Network Meets Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Unified Framework for Soft Threshold Pruning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Sparse Neural Networks with Identity Layers.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Stabilizing Visual Reinforcement Learning via Asymmetric Interactive Cooperation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Meta Architecture for Point Cloud Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Spiking Neural Networks with High Representation Similarity Model Visual Pathways of Macaque and Mouse.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Optimized separable convolution: Yet another efficient convolution operator.
AI Open, January, 2022

Tracking by Joint Local and Global Search: A Target-Aware Attention-Based Approach.
IEEE Trans. Neural Networks Learn. Syst., 2022

Filter Sketch for Network Pruning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection.
IEEE Trans. Multim., 2022

Asynchronous Spatio-Temporal Memory Network for Continuous Event-Based Object Detection.
IEEE Trans. Image Process., 2022

Revealing Fine Structures of the Retinal Receptive Field by Deep-Learning Networks.
IEEE Trans. Cybern., 2022

Neural System Identification With Spike-Triggered Non-Negative Matrix Factorization.
IEEE Trans. Cybern., 2022

Fast Class-Wise Updating for Online Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Adversarial Reciprocal Points Learning for Open Set Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Million-scale Object Detection with Large Vision Model.
CoRR, 2022

Event-based Monocular Dense Depth Estimation with Recurrent Transformers.
CoRR, 2022

Meta Architecure for Point Cloud Analysis.
CoRR, 2022

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric.
CoRR, 2022

Annotation Efficient Person Re-Identification with Diverse Cluster-Based Pair Selection.
CoRR, 2022

Transformations in Learned Image Compression from a Modulation Perspective.
CoRR, 2022

Deep Reinforcement Learning with Spiking Q-learning.
CoRR, 2022

1000x Faster Camera and Machine Vision with Ordinary Devices.
CoRR, 2022

Spectrum Random Masking for Generalization in Image-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Visible Surface Area Estimation for Irregular Objects.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

State Transition of Dendritic Spines Improves Learning of Sparse Spiking Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

Temporal Up-Sampling for Asynchronous Events.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Learning Stereo Depth Estimation with Bio-Inspired Spike Cameras.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Modeling The Detection Capability Of High-Speed Spiking Cameras.
Proceedings of the IEEE International Conference on Acoustics, 2022

Masked Autoencoders for Point Cloud Self-supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

PowerGear: Early-Stage Power Estimation in FPGA HLS via Heterogeneous Edge-Centric GNNs.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

Training-free Transformer Architecture Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Architecture Search with Representation Mutual Information.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Deraining: Where Contrastive Learning Meets Self-similarity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Fine-Grained Object Classification via Self-Supervised Pose Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Event-based Video Reconstruction via Potential-assisted Spiking Neural Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ConformerDTI: Local Features Coupling Global Representations for Drug-Target Interaction Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Retinomorphic Object Detection in Asynchronous Visual Streams.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Motion-Aware Structured Matrix Factorization for Foreground Detection in Complex Scenes.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Part-Guided Relational Transformers for Fine-Grained Visual Recognition.
IEEE Trans. Image Process., 2021

Salient Object Detection With Purificatory Mechanism and Structural Similarity Loss.
IEEE Trans. Image Process., 2021

Hyperspectral Image Restoration: Where Does the Low-Rank Property Exist.
IEEE Trans. Geosci. Remote. Sens., 2021

Hybrid Coding of Spatiotemporal Spike Data for a Bio-Inspired Camera.
IEEE Trans. Circuits Syst. Video Technol., 2021

Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Joint segmentation and detection of COVID-19 via a sequential region generation network.
Pattern Recognit., 2021

MIGO-NAS: Towards Fast and Generalizable Neural Architecture Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Ordinal Multi-Task Part Segmentation With Recurrent Prior Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Prioritized Subnet Sampling for Resource-Adaptive Supernet Training.
CoRR, 2021

An Information Theory-inspired Strategy for Automatic Network Pruning.
CoRR, 2021

Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion.
CoRR, 2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021

Carrying out CNN Channel Pruning in a White Box.
CoRR, 2021

Lottery Jackpots Exist in Pre-trained Models.
CoRR, 2021

Distilling a Powerful Student Model via Online Knowledge Distillation.
CoRR, 2021

Spike-based Residual Blocks.
CoRR, 2021

Variationally and Intrinsically motivated reinforcement learning for decentralized traffic signal control.
CoRR, 2021

PNPDet: Efficient Few-shot Detection without Forgetting via Plug-and-Play Sub-networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Deep Residual Learning in Spiking Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Annotation-Efficient Untrimmed Video Action Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Retinomorphic Sensing: A Novel Paradigm for Future Multimedia Computing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

How to Learn a Domain-Adaptive Event Simulator?
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learnable Oriented-Derivative Network for Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Optimal ANN-SNN Conversion for Fast and Accurate Inference in Deep Spiking Neural Networks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Pruning of Deep Spiking Neural Networks through Gradient Rewiring.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Adaptive Multi-Scale Semantic Fusion Network For Zero-Shot Learning.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Generate And Adjust: A Novel Framework For Semi-Supervised Pedestrian Attribute Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Heterogeneous Relational Complement for Vehicle Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ReCU: Reviving the Dead Weights in Binary Neural Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Incorporating Learnable Membrane Time Constant to Enhance Learning of Spiking Neural Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic Cameras.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Allocating DNN Layers Computation Between Front-End Devices and The Cloud Server for Video Big Data Processing.
Proceedings of the IEEE International Conference on Acoustics, 2021

Collaborative Intelligence: Challenges and Opportunities.
Proceedings of the IEEE International Conference on Acoustics, 2021

Short Video Performance Evaluation of AV1 Coding Tools.
Proceedings of the 31st Data Compression Conference, 2021

Reducing Image Compression Artifacts for Deep Neural Networks.
Proceedings of the 31st Data Compression Conference, 2021

High-Speed Image Reconstruction Through Short-Term Plasticity for Spiking Cameras.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Matching on Sets: Conquer Occluded Person Re-identification Without Alignment.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Convolutional Long Short-Term Memory Neural Network Based Prediction Model.
Int. J. Comput. Commun. Control, August, 2020

GAN-Driven Personalized Spatial-Temporal Private Data Sharing in Cyber-Physical Social Systems.
IEEE Trans. Netw. Sci. Eng., 2020

Adaptation-Oriented Feature Projection for One-Shot Action Recognition.
IEEE Trans. Multim., 2020

Guest Editorial Multimedia Computing With Interpretable Machine Learning.
IEEE Trans. Multim., 2020

Model-Guided Multi-Path Knowledge Aggregation for Aerial Saliency Prediction.
IEEE Trans. Image Process., 2020

Joint Coding of Local and Global Deep Features in Videos for Visual Search.
IEEE Trans. Image Process., 2020

Probabilistic inference of binary Markov random fields in spiking neural networks through mean-field approximation.
Neural Networks, 2020

Reconstruction of natural visual scenes from neural spikes with deep neural networks.
Neural Networks, 2020

Distortion-Adaptive Salient Object Detection in 360° Omnidirectional Images.
IEEE J. Sel. Top. Signal Process., 2020

Urban Multimedia Computing: Emerging Methods in Multimedia Computing for Urban Data Analysis and Applications.
IEEE Multim., 2020

Intrinsic Relationship Reasoning for Small Object Detection.
CoRR, 2020

Revisiting Mid-Level Patterns for Distant-Domain Few-Shot Recognition.
CoRR, 2020

SEKD: Self-Evolving Keypoint Detection and Description.
CoRR, 2020

Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection.
CoRR, 2020

Filter Sketch for Network Pruning.
CoRR, 2020

Global Co-occurrence Feature Learning and Active Coordinate System Conversion for Skeleton-based Action Recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cooperative Bi-path Metric for Few-shot Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Discriminative Spatial Feature Learning for Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Masked Face Recognition with Generative Data Augmentation and Domain Constrained Ranking.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Masked Face Recognition with Latent Part Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multimedia Intelligence: When Multimedia Meets Artificial Intelligence.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Anonymous Model Pruning for Compressing Deep Neural Networks.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Learning Compact Networks via Similarity-Aware Channel Pruning.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Prune it Yourself: Automated Pruning by Multiple Level Sensitivity.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

End-Edge-Cloud Collaborative System: A Video Big Data Processing and Analysis Architecture.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

R-SiamNet: ROI-Align Pooling Baesd Siamese Network for Object Tracking.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

BCData: A Large-Scale Dataset and Benchmark for Cell Detection and Counting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Channel Pruning via Automatic Structure Search.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multiple Expert Brainstorming for Domain Adaptive Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Open Set Network with Discriminative Reciprocal Points.
Proceedings of the Computer Vision - ECCV 2020, 2020

Binary Representation and High Efficient Compression of 3D CNN Features for Action Recognition.
Proceedings of the Data Compression Conference, 2020

Retina-Like Visual Image Reconstruction via Spiking Neural Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Rethinking Performance Estimation in Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HRank: Filter Pruning Using High-Rank Feature Map.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Accurate Low Bit-Width Quantization with Multiple Phase Adaptations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multiscale video sequence matching for near-duplicate detection and retrieval.
Multim. Tools Appl., 2019

Spike Coding for Dynamic Vision Sensor in Intelligent Driving.
IEEE Internet Things J., 2019

Residual-Based Post-Processing for HEVC.
IEEE Multim., 2019

Hierarchical Deep Cosegmentation of Primary Objects in Aerial Videos.
IEEE Multim., 2019

Salient Object Detection with Purificatory Mechanism and Structural Similarity Loss.
CoRR, 2019

Exploring Reciprocal Attention for Salient Object Detection by Cooperative Learning.
CoRR, 2019

P-ODN: Prototype based Open Deep Network for Open Set Recognition.
CoRR, 2019

Reconstruction of Natural Visual Scenes from Neural Spikes with Deep Neural Networks.
CoRR, 2019

Skeleton-Based 3D Object Retrieval Using Retina-Like Feature Descriptor.
IEEE Access, 2019

3D Human Skeleton Data Compression for Action Recognition.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Bi-directional Re-ranking for Person Re-identification.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Semi-Siamese Network for Content-Based Video Relevance Prediction.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Learning Local Feature Descriptor with Motion Attribute For Vision-based Localization.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

2D LiDAR Map Prediction via Estimating Motion Flow with GRU.
Proceedings of the International Conference on Robotics and Automation, 2019

A Retina-Inspired Sampling Method for Visual Texture Reconstruction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Event-Based Vision Enhanced: A Joint Detection Framework in Autonomous Driving.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Multi-Class Part Parsing With Joint Boundary-Semantic Awareness.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Selectivity or Invariance: Boundary-Aware Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Efficient and Fast Coefficient Sign Inference for Video Coding.
Proceedings of the Data Compression Conference, 2019

Spike Coding: Towards Lossy Compression for Dynamic Vision Sensor.
Proceedings of the Data Compression Conference, 2019

An Efficient Coding Method for Spike Camera Using Inter-Spike Intervals.
Proceedings of the Data Compression Conference, 2019

Part-Regularized Near-Duplicate Vehicle Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Joint Semantic and Latent Attribute Modelling for Cross-Class Transfer Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

PA-Search: Predicting units adaptive motion search for surveillance video coding.
Comput. Vis. Image Underst., 2018

Selectivity or Invariance: Boundary-aware Salient Object Detection.
CoRR, 2018

How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction.
CoRR, 2018

Characterizing Neuronal Circuits with Spike-triggered Non-negative Matrix Factorization.
CoRR, 2018

Winner-Take-All as Basic Probabilistic Inference Unit of Neuronal Circuits.
CoRR, 2018

Primary Object Segmentation in Aerial Videos via Hierarchical Temporal Slicing and Co-Segmentation.
CoRR, 2018

A simple blind-denoising filter inspired by electrically coupled photoreceptors in the retina.
CoRR, 2018

Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Cross-Domain Adversarial Feature Learning for Sketch Re-identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fast Compressed Domain Copy Detection with Motion Vector Imaging.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Multi-Pose Learning based Head-Shoulder Re-identification.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Hierarchical Temporal Memory Enhanced One-Shot Distance Learning for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

SFCM: Learn a Pooling Kernel for Weakly Supervised Object Localization.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

ODN: Opening the Deep Network for Open-Set Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Temporal Attentive Network for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Spike Coding for Dynamic Vision Sensors.
Proceedings of the 2018 Data Compression Conference, 2018

Toward Efficient Simultaneous Detection and Segmentation.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Deep Transfer Learning for Person Re-Identification.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Learning Discriminative Subspaces on Random Contrasts for Image Saliency Analysis.
IEEE Trans. Neural Networks Learn. Syst., 2017

Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN.
IEEE Trans. Multim., 2017

Rate-Performance-Loss Optimization for Inter-Frame Deep Feature Coding From Videos.
IEEE Trans. Image Process., 2017

Towards human-like and transhuman perception in AI 2.0: a review.
Frontiers Inf. Technol. Electron. Eng., 2017

Learning a Repression Network for Precise Vehicle Search.
CoRR, 2017

YoTube: Searching Action Proposal via Recurrent and Static Regression Networks.
CoRR, 2017

A fast skip and direction adaptive search algorithm for Sub-Pixel Motion Estimation on HEVC.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with mixed supervised losses for image search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with multi-task learning for large-scale instance-level vehicle search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Search video action proposal with recurrent and static YOLO.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Long-Term Dependencies for Action Recognition with a Biologically-Inspired Deep Network.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Spike Camera and Its Coding Methods.
Proceedings of the 2017 Data Compression Conference, 2017

Image Saliency Analysis Based on Retina Simulation.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016
Guest Editors' Introduction.
Int. J. Semantic Comput., 2016

Measuring Visual Surprise Jointly from Intrinsic and Extrinsic Contexts for Image Saliency Estimation.
Int. J. Comput. Vis., 2016

Ubiquitous Multimedia: Emerging Research on Multimedia Computing.
IEEE Multim., 2016

Fixed-point Gaussian Mixture Model for analysis-friendly surveillance video coding.
Comput. Vis. Image Underst., 2016

shuttleNet: A biologically-inspired RNN with loop connection and parameter sharing.
CoRR, 2016

Joint Network based Attention for Action Recognition.
CoRR, 2016

Deep Transfer Learning for Person Re-identification.
CoRR, 2016

CNN vs. SIFT for Image Retrieval: Alternative or Complementary?
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Joint Learning of Semantic and Latent Attributes.
Proceedings of the Computer Vision - ECCV 2016, 2016

Unsupervised Cross-Dataset Transfer Learning for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Relative Distance Learning: Tell the Difference between Similar Vehicles.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

High-Efficiency Coding for Shaking Surveillance Videos Based on Global Motion Compensation.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
TASC: A Transformation-Aware Soft Cascading Approach for Multimodal Video Copy Detection.
ACM Trans. Inf. Syst., 2015

Guest Editorial Multimedia: The Biggest Big Data.
IEEE Trans. Multim., 2015

Image saliency estimation via random walk guided by informativeness and latent signal correlations.
Signal Process. Image Commun., 2015

Robust multiple cameras pedestrian detection with multi-view Bayesian network.
Pattern Recognit., 2015

Finding the Secret of Image Saliency in the Frequency Domain.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Complementary Saliency Priors for Foreground Object Segmentation in Complex Scenes.
Int. J. Comput. Vis., 2015

Multimedia Big Data.
IEEE Multim., 2015

Quality-progressive coding for high bit-rate background frames on surveillance videos.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Detecting abnormal behaviors in surveillance videos based on fuzzy clustering and multiple Auto-Encoders.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Image deblurring using robust sparsity priors.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Detecting Rare Actions and Events from Surveillance Big Data with Bag of Dynamic Trajectories.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Performance Evaluation for AVS2 Scene Video Coding Techniques.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Swiss-System Based Cascade Ranking for Gait-Based Person Re-Identification.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Can We Beat DDoS Attacks in Clouds?
IEEE Trans. Parallel Distributed Syst., 2014

Optimizing the Hierarchical Prediction and Coding in HEVC for Surveillance and Conference Videos With Background Modeling.
IEEE Trans. Image Process., 2014

Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding.
IEEE Trans. Image Process., 2014

Guest Editorial.
J. Multim., 2014

Visual Saliency with Statistical Priors.
Int. J. Comput. Vis., 2014

The IEEE 1857 Standard: Empowering Smart Video Surveillance Systems.
IEEE Intell. Syst., 2014

Representing Visual Objects in HEVC Coding Loop.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2014

A refined object detection method based on HTM.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Background-foreground division based search for motion estimation in surveillance video coding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Multi-view gait recognition with incomplete training data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Video picture-in-picture detection using spatio-temporal slicing.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Quality Assessment for Comparing Image Enhancement Algorithms.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Content-based copy detection through multimodal feature representation and temporal pyramid matching.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Fast and Efficient Transcoding Based on Low-Complexity Background Modeling and Adaptive Block Classification.
IEEE Trans. Multim., 2013

Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2013

Estimating Visual Saliency Through Single Image Optimization.
IEEE Signal Process. Lett., 2013

Video Copy-Detection and Localization with a Scalable Cascading Framework.
IEEE Multim., 2013

IEEE 1857: Boosting Video Applications in CPSS.
IEEE Intell. Syst., 2013

A coding unit classification based AVC-to-HEVC transcoding with background modeling for surveillance videos.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Wavelet based smoke detection method with RGB Contrast-image and shape constrain.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Surveillance video coding with quadtree partition based ROI extraction.
Proceedings of the 30th Picture Coding Symposium, 2013

Single underwater image enhancement with a new optical model.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A background proportion adaptive Lagrange multiplier selection method for surveillance video on HEVC.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

MPLBoost-based mixture model for effective human detection with Deformable Part Model.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Pair-wise event detection using cubic features and sequence discriminant learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Overview of the IEEE 1857 surveillance groups.
Proceedings of the IEEE International Conference on Image Processing, 2013

A system based on sequence learning for event detection in surveillance video.
Proceedings of the IEEE International Conference on Image Processing, 2013

Hierarchical-and-Adaptive Bit-Allocation with Selective Background Prediction for High Efficiency Video Coding (HEVC).
Proceedings of the 2013 Data Compression Conference, 2013

2012
Group-Sensitive Multiple Kernel Learning for Object Recognition.
IEEE Trans. Image Process., 2012

Societally connected multimedia across cultures.
J. Zhejiang Univ. Sci. C, 2012

Low-complexity and high-efficiency background modeling for surveillance video coding.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

An efficient surveillance coding method based on a timely and bit-saving background updating model.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

PKU-NEC @TRECVID2012 SED : Uneven-Sequence Based Event Detection in Surveillance Video.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Robust and discriminative image authentication based on standard model feature.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Macro-Block-Level Selective Background Difference Coding for Surveillance Video.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Video Copy Detection Using a Soft Cascade of Multimodal Features.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

A Fast and Performance-Maintained Transcoding Method Based on Background Modeling for Surveillance Video.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Multi-camera Pedestrian Detection with Multi-view Bayesian Network Model.
Proceedings of the British Machine Vision Conference, 2012

Single and Multiple View Detection, Tracking and Video Analysis in Crowded Environments.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

An Efficient Background Reconstruction Based Coding Method for Surveillance Videos Captured by Moving Camera.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Automatic Webcam-Based Human Heart Rate Measurements Using Laplacian Eigenmap.
Proceedings of the Computer Vision, 2012

2011
Multi-Task Rank Learning for Visual Saliency Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2011

Salient region detection and segmentation for general object recognition and image understanding.
Sci. China Inf. Sci., 2011

PKU-IDM @TRECVID2011 CBCD: Content-Based Copy Detection with Cascade of Multimodal Features and Temporal Pyramid Matching.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

PKU-NEC @TRECVID2011 SED: Sequence-Based Event Detection in Surveillance Video.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Augmenting Image Processing with Social Tag Mining for Landmark Recognition.
Proceedings of the Advances in Multimedia Modeling, 2011

A multimodal video copy detection approach with sequential pyramid matching.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Selective eigenbackgrounds method for background subtraction in crowed scenes.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

iULib: where UDL and Wikipedia could meet.
Proceedings of the Imaging and Printing in a Web 2.0 World II, 2011

Robust and discriminative image authentication based on sparse coding.
Proceedings of the 2011 IEEE Consumer Communications and Networking Conference, 2011

2010
Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context.
IEEE Trans. Multim., 2010

Cost-Sensitive Rank Learning From Positive and Unlabeled Data for Visual Saliency Estimation.
IEEE Signal Process. Lett., 2010

Salient object extraction for user-targeted video content association.
J. Zhejiang Univ. Sci. C, 2010

A ranking SVM based fusion model for cross-media meta-search engine.
J. Zhejiang Univ. Sci. C, 2010

Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video.
Int. J. Comput. Vis., 2010

Per-Sample Multiple Kernel Approach for Visual Concept Learning.
EURASIP J. Image Video Process., 2010

Vlogging: A survey of videoblogging technology on the web.
ACM Comput. Surv., 2010

Social Multimedia Computing.
Computer, 2010

Mediaprinting: Identifying Multimedia Content for Digital Rights Management.
Computer, 2010

PKU@TRECVID2010: Pair-Wise Event Detection in Surveillance Video.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Automatic interesting object extraction from images using complementary saliency maps.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Saliency detection based on 2D log-gabor wavelets and center bias.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Dynamic multi-cue tracking with detection responses association.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video retargeting with multi-scale trajectory optimization.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

ESUR: A system for Events detection in SURveillance video.
Proceedings of the International Conference on Image Processing, 2010

2009
PKU@TRECVID2009: Single-Actor and Pair-Activity Event Detection in surveillance Video.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

DCT-Based Videoprinting on Saliency-Consistent Regions for Detecting Video Copies with Text Insertion.
Proceedings of the Advances in Multimedia Information Processing, 2009

A New Multiple Kernel Approach for Visual Concept Learning.
Proceedings of the Advances in Multimedia Modeling, 2009

Multiple kernel active learning for image classification.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A dataset and evaluation methodology for visual saliency in video.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Group-sensitive multiple kernel learning for object categorization.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Content-Based Video Semantic Analysis.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

Semantic Classification and Annotation of Images.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

2008
Multi-polarity text segmentation using graph theory.
Proceedings of the International Conference on Image Processing, 2008

PKU at ImageCLEF 2008: Experiments with Query Extension Techniques for Text-Based and Content-Based Image Retrieval.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008) , 2008

Large-Scale Cross-Media Retrieval of WikipediaMM Images with Textual and Visual Query Expansion.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

2007
Towards multi-granularity multi-facet e-book retrieval.
Proceedings of the 16th International Conference on World Wide Web, 2007

2006
Learning Contextual Dependency Network Models for Link-Based Classification.
IEEE Trans. Knowl. Data Eng., 2006

Latent linkage semantic kernels for collective classification of link data.
J. Intell. Inf. Syst., 2006

Context-based statistical relational learning.
AI Commun., 2006

Improving the Image Retrieval Results Via Topic Coverage Graph.
Proceedings of the Advances in Multimedia Information Processing, 2006

Diversifying the image retrieval results.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Keyphrase Extraction Using Semantic Networks Structure Analysis.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Robust Collective Classification with Contextual Dependency Network Models.
Proceedings of the Advanced Data Mining and Applications, Second International Conference, 2006

Semantic Scoring Based on Small-World Phenomenon for Feature Selection in Text Mining.
Proceedings of the Advanced Data Mining and Applications, Second International Conference, 2006

2004
Two-phase Web site classification based on Hidden Markov Tree models.
Web Intell. Agent Syst., 2004

Context-Based Classification for Link Data.
Proceedings of the Advances in Web-Based Learning, 2004

2003
Two-Phase Web Site Classification Based on Hidden Markov Tree Models.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

2002
Quantitatively evaluating the influence of online social interactions in the community-assisted digital library.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

2001
The Development of a Mobile Decision Support System.
J. Interconnect. Networks, 2001


  Loading...