Yuxin Peng

Orcid: 0000-0003-3772-0872

According to our database1, Yuxin Peng authored at least 254 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Compositional Prompting for Anti-Forgetting in Domain Incremental Learning.
Int. J. Comput. Vis., December, 2024

Exemplar-Free Lifelong Person Re-identification via Prompt-Guided Adaptive Knowledge Consolidation.
Int. J. Comput. Vis., November, 2024

EOGT: Video Anomaly Detection with Enhanced Object Information and Global Temporal Dependency.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback.
ACM Trans. Multim. Comput. Commun. Appl., June, 2024

I2C: Invertible Continuous Codec for High-Fidelity Variable-Rate Image Compression.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Negatives Make a Positive: An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Decoupled domain-specific and domain-conditional representation learning for cross-domain recommendation.
Inf. Process. Manag., March, 2024

Recognizing wearable upper-limb rehabilitation gestures by a hybrid multi-feature neural network.
Eng. Appl. Artif. Intell., January, 2024

MAAN: Memory-Augmented Auto-Regressive Network for Text-Driven 3D Indoor Scene Generation.
IEEE Trans. Multim., 2024

HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition.
IEEE Trans. Multim., 2024

Image Super-Resolution via Efficient Transformer Embedding Frequency Decomposition With Restart.
IEEE Trans. Image Process., 2024

Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model.
IEEE Trans. Image Process., 2024

MECOM: A Meta-Completion Network for Fine-Grained Recognition With Incomplete Multi-Modalities.
IEEE Trans. Image Process., 2024

SIM-OFE: Structure Information Mining and Object-Aware Feature Enhancement for Fine-Grained Visual Categorization.
IEEE Trans. Image Process., 2024

Shank-RIO: A Shank-Mounted Ranging-Inertial Odometry for Gait Analysis and Positioning in Complex Environment.
IEEE Trans. Instrum. Meas., 2024

DMA: Dual Modality-Aware Alignment for Visible-Infrared Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

Omnidirectional and Size-Adaptive Soft Bending Sensor for Accurate Human Joint Motion Monitoring.
IEEE Trans. Ind. Electron., 2024

CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RelScene: A Benchmark and baseline for Spatial Relations in text-driven 3D Scene Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mitigate Catastrophic Remembering via Continual Knowledge Purification for Noisy Lifelong Person Re-Identification.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

InsVP: Efficient Instance Visual Prompting from Image Itself.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Progressive Prototype Evolving for Dual-Forgetting Mitigation in Non-Exemplar Online Continual Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FineFMPL: Fine-grained Feature Mining Prompt Learning for Few-Shot Class Incremental Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Semantic-Aware Human Object Interaction Image Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Firzen: Firing Strict Cold-Start Items with Frozen Heterogeneous and Homogeneous Graphs for Recommendation.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Training-Free Video Temporal Grounding Using Large-Scale Pre-trained Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Conditional Multi-modal Prompts for Zero-Shot HOI Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

FineSports: A Multi-Person Hierarchical Sports Video Dataset for Fine-Grained Action Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Distribution-Aware Knowledge Prototyping for Non-Exemplar Lifelong Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FineParser: A Fine-Grained Spatio-Temporal Action Parser for Human-Centric Action Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FCS: Feature Calibration and Separation for Non-Exemplar Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DART: Dual-Modal Adaptive Online Prompting and Knowledge Retention for Test-Time Adaptation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Comprehensive Visual Grounding for Video Description.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Continual Vision-Language Retrieval via Dynamic Knowledge Rectification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Performance-Based Iterative Learning Control for Task-Oriented Rehabilitation: A Pilot Study in Robot-Assisted Bilateral Training.
IEEE Trans. Cogn. Dev. Syst., December, 2023

Smart Public Transportation Sensing: Enhancing Perception and Data Management for Efficient and Safety Operations.
Sensors, November, 2023

Attribute-Aware Deep Hashing With Self-Consistency for Large-Scale Fine-Grained Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Disentangled Graph Neural Networks for Session-Based Recommendation.
IEEE Trans. Knowl. Data Eng., August, 2023

DCR-ReID: Deep Component Reconstruction for Cloth-Changing Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

CAT: a coarse-to-fine attention tree for semantic change detection.
Vis. Intell., 2023

MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model.
CoRR, 2023

MB-HGCN: A Hierarchical Graph Convolutional Network for Multi-behavior Recommendation.
CoRR, 2023

Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation.
CoRR, 2023

Multi-Behavior Recommendation with Cascading Graph Convolution Networks.
Proceedings of the ACM Web Conference 2023, 2023

MV-Diffusion: Motion-aware Video Diffusion Model.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Efficiency-optimized Video Diffusion Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi Hybrid Extractor Network for 3D Human Pose Estimation.
Proceedings of the IEEE International Conference on Image Processing, 2023

Uncover the Body: Occluded Person Re-identification via Masked Image Modeling.
Proceedings of the Image and Graphics - 12th International Conference, 2023

DensityLayout: Density-Conditioned Layout GAN for Visual-Textual Presentation Designs.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fusing Drama Therapy and Cognitive Behavioral Therapy in a Virtual Reality Setting: An Innovative Strategy for Tackling Maladaptive Lifestyle Habits.
Proceedings of the Eleventh International Symposium of Chinese CHI, 2023

Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Phrase-Level Temporal Relationship Mining for Temporal Sentence Localization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Dual-View 3D Reconstruction via Learning Correspondence and Dependency of Point Cloud Regions.
IEEE Trans. Image Process., 2022

Unsupervised Visual-Textual Correlation Learning With Fine-Grained Semantic Alignment.
IEEE Trans. Cybern., 2022

MARS: Learning Modality-Agnostic Representation for Scalable Cross-Media Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022

Fine-Grained Image Analysis With Deep Learning: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Semantic association enhancement transformer with relative position for image captioning.
Multim. Tools Appl., 2022

Hand gesture recognition framework using a lie group based spatio-temporal recurrent network with multiple hand-worn motion sensors.
Inf. Sci., 2022

Learning conditional photometric stereo with high-resolution features.
Comput. Vis. Media, 2022

Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report.
CoRR, 2022

Prototype-based classifier learning for long-tailed visual recognition.
Sci. China Inf. Sci., 2022

Learn from Unlabeled Videos for Near-duplicate Video Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Weakly Supervised Video Anomaly Detection with Temporal and Abnormal Information.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Automatic Recognition of Verb-Complement Separable Words Based on BCC.
Proceedings of the Chinese Lexical Semantics - 23rd Workshop, 2022

Global Contextual Complementary Network for Multi-View Stereo.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Visual-Textual Hybrid Sequence Matching for Joint Reasoning.
IEEE Trans. Cybern., 2021

Multi-Level Knowledge Injecting for Visual Commonsense Reasoning.
IEEE Trans. Circuits Syst. Video Technol., 2021

A Flexible Pressure Sensor with Ink Printed Porous Graphene for Continuous Cardiovascular Status Monitoring.
Sensors, 2021

Hierarchical Visual-Textual Knowledge Distillation for Life-Long Correlation Learning.
Int. J. Comput. Vis., 2021

2020
Sequential Cross-Modal Hashing Learning via Multi-scale Correlation Mining.
ACM Trans. Multim. Comput. Commun. Appl., 2020

RCE-HIL: Recognizing Cross-media Entailment with Heterogeneous Interactive Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval.
IEEE Trans. Multim., 2020

CKD: Cross-Task Knowledge Distillation for Text-to-Image Synthesis.
IEEE Trans. Multim., 2020

Deep Reinforcement Learning for Image Hashing.
IEEE Trans. Multim., 2020

Video Captioning With Object-Aware Spatio-Temporal Correlation and Aggregation.
IEEE Trans. Image Process., 2020

MAVA: Multi-Level Adaptive Visual-Textual Alignment by Cross-Media Bi-Attention Mechanism.
IEEE Trans. Image Process., 2020

SCH-GAN: Semi-Supervised Cross-Modal Hashing by Generative Adversarial Network.
IEEE Trans. Cybern., 2020

MHTN: Modal-Adversarial Hybrid Transfer Network for Cross-Modal Retrieval.
IEEE Trans. Cybern., 2020

Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis.
IEEE Trans. Circuits Syst. Video Technol., 2020

Quintuple-Media Joint Correlation Learning With Deep Compression and Regularization.
IEEE Trans. Circuits Syst. Video Technol., 2020

Reinforced Cross-Media Correlation Learning by Context-Aware Bidirectional Translation.
IEEE Trans. Circuits Syst. Video Technol., 2020

Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph.
IEEE Trans. Circuits Syst. Video Technol., 2020

Guest Editorial Introduction to the Special Section on Representation Learning for Visual Content Understanding.
IEEE Trans. Circuits Syst. Video Technol., 2020

Fine-Grained Visual-Textual Representation Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Fuzzy logic compliance adaptation for an assist-as-needed controller on the Gait Rehabilitation Exoskeleton (GAREX).
Robotics Auton. Syst., 2020

A robot-assisted bilateral upper limb training strategy with subject-specific workspace: A pilot study.
Robotics Auton. Syst., 2020

Subject-specific compliance control of an upper-limb bilateral robotic system.
Robotics Auton. Syst., 2020

DV-Net: Dual-view network for 3D reconstruction by fusing multiple sets of gated control point clouds.
Pattern Recognit. Lett., 2020

Attribute hierarchy based multi-task learning for fine-grained image classification.
Neurocomputing, 2020

PKU_WICT at TRECVID 2020: Instance Search Task.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

A Smart User Authentication Approach using Sensing Seat.
Proceedings of the 16th IEEE International Conference on Automation Science and Engineering, 2020

2019
CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Show and Tell in the Loop: Cross-Modal Circular Correlation Learning.
IEEE Trans. Multim., 2019

TPCKT: Two-Level Progressive Cross-Media Knowledge Transfer.
IEEE Trans. Multim., 2019

A Novel Stick-Slip Piezoelectric Actuator Based on a Triangular Compliant Driving Mechanism.
IEEE Trans. Ind. Electron., 2019

SSDH: Semi-Supervised Deep Hashing for Large Scale Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2019

Two-Stream Collaborative Learning With Spatial-Temporal Attention for Video Classification.
IEEE Trans. Circuits Syst. Video Technol., 2019

Fast Fine-Grained Image Classification via Weakly Supervised Discriminative Localization.
IEEE Trans. Circuits Syst. Video Technol., 2019

Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-Grained Visual Categorization.
Int. J. Comput. Vis., 2019

Adaptive Trajectory Tracking Control of a Parallel Ankle Rehabilitation Robot With Joint-Space Force Distribution.
IEEE Access, 2019

PKU_ICST at TRECVID 2019: Instance Search Task.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Hierarchical Vision-Language Alignment for Video Captioning.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

A New Benchmark and Approach for Fine-grained Cross-media Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

IRC-GAN: Introspective Recurrent Convolutional GAN for Text-to-video Generation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Object-Aware Aggregation With Bidirectional Temporal Graph for Video Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Query-Adaptive Image Retrieval by Deep-Weighted Hashing.
IEEE Trans. Multim., 2018

CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network.
IEEE Trans. Multim., 2018

Modality-Specific Cross-Modal Similarity Measurement With Recurrent Attention Network.
IEEE Trans. Image Process., 2018

Object-Part Attention Model for Fine-Grained Image Classification.
IEEE Trans. Image Process., 2018

An Overview of Cross-Media Retrieval: Concepts, Methodologies, Benchmarks, and Challenges.
IEEE Trans. Circuits Syst. Video Technol., 2018

IEEE Access Special Section Editorial: Recent Advantages of Computer Vision.
IEEE Access, 2018

PKU_ICST at TRECVID 2018: Instance Search Task.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Multi-attention Guided Activation Propagation in CNNs.
Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Text-to-image Synthesis via Symmetrical Distillation Networks.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Multi-Scale Correlation for Sequential Cross-modal Hashing Learning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Life-long Cross-media Correlation Learning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Coarse Label Refined Knowledge Reasoning for Fine-Grained Visual Categorization.
Proceedings of the Intelligence Science and Big Data Engineering, 2018

Better and Faster: Knowledge Transfer from Multiple Self-supervised Learning Tasks via Graph Distillation for Video Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Visual Data Synthesis via GAN for Zero-Shot Video Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Cross-media Multi-level Alignment with Relation Attention Network.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Cross-modal Bidirectional Translation via Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Dual Adversarial Networks for Zero-shot Cross-media Retrieval.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Progressive Cross-Media Correlation Learning.
Proceedings of the Image and Graphics Technologies and Applications, 2018

Deep Cross-Media Knowledge Transfer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Stacking VAE and GAN for Context-aware Text-to-Image Generation.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Unsupervised Generative Adversarial Cross-Modal Hashing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Cross-media similarity metric learning with unified deep networks.
Multim. Tools Appl., 2017

Cross-media analysis and reasoning: advances and directions.
Frontiers Inf. Technol. Electron. Eng., 2017

A framework of mining semantic-based probabilistic event relations for complex activity recognition.
Inf. Sci., 2017

Discriminative latent semantic feature learning for pedestrian detection.
Neurocomputing, 2017

Exploiting distinctive topological constraint of local feature matching for logo image recognition.
Neurocomputing, 2017

Cross-media retrieval by exploiting fine-grained correlation at entity level.
Neurocomputing, 2017

CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning.
CoRR, 2017

Visual-textual Attention Driven Fine-grained Representation Learning.
CoRR, 2017

CCL: Cross-modal Correlation Learning with Multi-grained Fusion by Hierarchical Network.
CoRR, 2017

Object-Part Attention Driven Discriminative Localization for Fine-grained Image Classification.
CoRR, 2017

Fine-graind Image Classification via Combining Vision and Language.
CoRR, 2017

PKU_ICST at TRECVID 2017: Instance Search Task.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Cross-modal Common Representation Learning by Hybrid Transfer Network.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Saliency-guided video classification via adaptively weighted learning.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Cross-modal deep metric learning with multi-task regularization.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Zero-Shot Cross-Media Retrieval with External Knowledge.
Proceedings of the Internet Multimedia Computing and Service, 2017

Attention-Sharing Correlation Learning for Cross-Media Retrieval.
Proceedings of the Image and Graphics - 9th International Conference, 2017

Fine-Grained Image Classification via Combining Vision and Language.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Weakly Supervised Learning of Part Selection Model with Spatial Constraints for Fine-Grained Image Classification.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph Regularization.
IEEE Trans. Circuits Syst. Video Technol., 2016

Mining intricate temporal rules for recognizing complex activities of daily living under uncertainty.
Pattern Recognit., 2016

Complex activity recognition using time series pattern dictionary learned from ubiquitous sensors.
Inf. Sci., 2016

Query-adaptive Image Retrieval by Deep Weighted Hashing.
CoRR, 2016

SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval.
CoRR, 2016

PKU-ICST at TRECVID 2016: Instance Search Task.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Logo Recognition via Improved Topological Constraint.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Cross-Media Retrieval via Semantic Entity Projection.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Cross-Media Shared Representation by Hierarchical Learning with Multiple Deep Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Cross-Media Retrieval by Multimodal Representation Fusion with Deep Networks.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

Group Cost-Sensitive Boosting for Multi-Resolution Pedestrian Detection.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Boosted Multi-Task Model for Pedestrian Detection With Occlusion Handling.
IEEE Trans. Image Process., 2015

Sensor-based human activity recognition system with a multilayered model using time series shapelets.
Knowl. Based Syst., 2015

PKU-ICST at TRECVID 2015: Instance Search Task.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Effects of compliant and flexible trunks on peak-power of a lizard-inspired robot.
Proceedings of the 2015 IEEE International Conference on Robotics and Biomimetics, 2015

A Hierarchical Pachinko Allocation Model for Social Sentiment Mining.
Proceedings of the Knowledge Science, Engineering and Management, 2015

The application of two-level attention models in deep convolutional neural network for fine-grained image classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Adaptive Sampling with Optimal Cost for Class-Imbalance Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization.
IEEE Trans. Circuits Syst. Video Technol., 2014

Graph-based multimodal semi-supervised image classification.
Neurocomputing, 2014

PKU-ICST at TRECVID 2014: Instance Search Task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Weakly-Supervised Image Parsing via Constructing Semantic Graphs and Hypergraphs.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Semantic Graph Construction for Weakly-Supervised Image Parsing.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Cross-View Feature Learning for Scalable Social Image Analysis.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Exploiting Semantic and Visual Context for Effective Video Annotation.
IEEE Trans. Multim., 2013

Latent semantic learning with structured sparse representation for human action recognition.
Pattern Recognit., 2013

Cross-media retrieval by intra-media and inter-media correlation mining.
Multim. Syst., 2013

L<sub>1</sub>-graph construction using structured sparsity.
Neurocomputing, 2013

Vocabulary hierarchy optimisation based on spatial context and category information.
Int. J. Multim. Intell. Secur., 2013

Exhaustive and Efficient Constraint Propagation: A Graph-Based Learning Approach and Its Applications.
Int. J. Comput. Vis., 2013

A temporal context model for boosting video annotation.
Sci. China Inf. Sci., 2013

PKU_ICST at TRECVID2013 : Instance Search Task.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Learning Descriptive Visual Representation by Semantic Regularized Matrix Factorization.
Proceedings of the IJCAI 2013, 2013

Multimodal semi-supervised image classification by combining tag refinement, graph-based learning and support vector regression.
Proceedings of the IEEE International Conference on Image Processing, 2013

Cross-media retrieval by cluster-based correlation analysis.
Proceedings of the IEEE International Conference on Image Processing, 2013

Heterogeneous Metric Learning with Joint Graph Regularization for Cross-Media Retrieval.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Unified Constraint Propagation on Multi-View Data.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
PKU-ICST @TRECVID2012: Known-item Search Task.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Effective Heterogeneous Similarity Measure with Nearest Neighbors for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Visual Vocabulary Optimization with Spatial Context for Image Annotation and Classification.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

PDSS: patch-descriptor-similarity space for effective face verification.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Image annotation by semantic sparse recoding of visual content.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Tri-space and ranking based heterogeneous similarity measure for cross-media retrieval.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Heterogeneous Constraint Propagation with Constrained Sparse Representation.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Cross-modality correlation propagation for cross-media retrieval.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Contextual Kernel and Spectral Methods for Learning the Semantics of Images.
IEEE Trans. Image Process., 2011

Combining multiple clusterings using fast simulated annealing.
Pattern Recognit. Lett., 2011

Robust Image Analysis by L1-Norm Semi-supervised Learning
CoRR, 2011

Exhaustive and Efficient Constraint Propagation: A Semi-Supervised Learning Perspective and Its Applications
CoRR, 2011

Mining concept relationship in temporal context for effective video annotation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Web video search by mutual boosting between the inside and outside text of video.
Proceedings of the 2011 Joint International Conference on Digital Libraries, 2011

Spectral learning of latent semantics for action recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Latent Semantic Learning by Efficient Sparse Coding with Hypergraph Regularization.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Symmetric Graph Regularized Constraint Propagation.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Gaussian mixture learning via robust competitive agglomeration.
Pattern Recognit. Lett., 2010

Image categorization via robust pLSA.
Pattern Recognit. Lett., 2010

Story-Based Retrieval by Learning and Measuring the Concept-Based and Content-Based Similarity.
Proceedings of the Advances in Multimedia Modeling, 2010

Refining video annotation by exploiting inter-shot context.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

AdaOUBoost: adaptive over-sampling and under-sampling to boost the concept learning in large scale imbalanced data sets.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Effective Multi-level Image Representation for Image Categorization.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
PKU-ICST at TRECVID2009: High Level Feature Extraction and Search.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Semantic concept annotation based on audio PLSA model.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Audio retrieval by segment-based manifold-ranking.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Using Multiple Frame Integration for the Text Recognition of Video.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

2008
A Semi-supervised Learning Algorithm on Gaussian Mixture with Automatic Model Selection.
Neural Process. Lett., 2008

Peking University at TRECVID 2008: High Level Feature Extraction.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Unsupervised learning of finite mixtures using entropy regularization and its application to image segmentation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

From Comparing Clusterings to Combining Clusterings.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
OM-based video shot retrieval by one-to-one matching.
Multim. Tools Appl., 2007

Color-Based Text Extraction for the Image.
Proceedings of the Advances in Multimedia Information Processing, 2007

Color-based clustering for text detection and extraction in image.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

A Theoretical Approach to Construct Highly Discriminative Features with Application in AdaBoost.
Proceedings of the Computer Vision, 2007

2006
Clip-based similarity measure for query-dependent clip retrieval and video summarization.
IEEE Trans. Circuits Syst. Video Technol., 2006

Using Earth Mover's Distance for Audio Clip Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2006

Audio similarity measure by graph modeling and matching.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

2005
A New Retrieval Model Based on TextTiling for Document Similarity Search.
J. Comput. Sci. Technol., 2005

A New Re-ranking Method for Generic Chinese Text Summarization and Its Evaluation.
Proceedings of the Digital Libraries: Implementing Strategies and Sharing Experiences, 2005

Hot Event Detection and Summarization by Graph Modeling and Matching.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

EMD-Based Video Clip Retrieval by Many-to-Many Matching.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

The earth mover's distance as a semantic measure for document similarity.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
Clip-based similarity measure for hierarchical video retrieval.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

A Measure Based on Optimal Matching in Graph Theory for Document Similarity.
Proceedings of the Information Retrieval Technology, Asia Information Retrieval Symposium, 2004

2003
Video clip retrieval by maximal matching and optimal matching in graph theory.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003


  Loading...