Min Xu

Orcid: 0000-0001-9581-8849

Affiliations:
  • University of Technology Sydney, School of Computing and Communications, Sydney, NSW, Australia
  • University of Newcastle, Callaghan, NSW, Australia (PhD 2010)
  • National University of Singapore, Singapore (former)


According to our database1, Min Xu authored at least 181 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Unsupervised Part Discovery via Dual Representation Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Learning Graph Representations Through Learning and Propagating Edge Features.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

Vital Sign Monitoring in Dynamic Environment via mmWave Radar and Camera Fusion.
IEEE Trans. Mob. Comput., May, 2024

SSFG: Stochastically Scaling Features and Gradients for Regularizing Graph Convolutional Networks.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Towards High-Quality Photorealistic Image Style Transfer.
IEEE Trans. Multim., 2024

A privacy-preserving framework with multi-modal data for cross-domain recommendation.
Knowl. Based Syst., 2024

CAA: Class-Aware Affinity calculation add-on for semantic segmentation.
Knowl. Based Syst., 2024

Center-bridged Interaction Fusion for hyperspectral and LiDAR classification.
Neurocomputing, 2024

Federated Prototype-based Contrastive Learning for Privacy-Preserving Cross-domain Recommendation.
CoRR, 2024

Federated User Preference Modeling for Privacy-Preserving Cross-Domain Recommendation.
CoRR, 2024

Differential Encoding for Improved Representation Learning over Graphs.
CoRR, 2024

Conditional Local Feature Encoding for Graph Neural Networks.
CoRR, 2024

Neighbour-level Message Interaction Encoding for Improved Representation Learning on Graphs.
CoRR, 2024

RandAlign: A Parameter-Free Method for Regularizing Graph Convolutional Networks.
CoRR, 2024

Causal Disentanglement for Regulating Social Influence Bias in Social Recommendation.
CoRR, 2024

A Learnable Agent Collaboration Network Framework for Personalized Multimodal AI Search Engine.
Proceedings of the 2nd International Workshop on Deep Multimodal Generation and Retrieval, 2024

2023
A Parametrical Model for Instance-Dependent Label Noise.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Adversarial Heterogeneous Graph Neural Network for Robust Recommendation.
IEEE Trans. Comput. Soc. Syst., October, 2023

Robust Face Alignment via Inherent Relation Learning and Uncertainty Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Single-Target Real-Time Passive WiFi Tracking.
IEEE Trans. Mob. Comput., June, 2023

Learning enhanced features and inferring twice for fine-grained image classification.
Multim. Tools Appl., April, 2023

Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning.
IEEE Trans. Multim., 2023

Multiscale Emotion Representation Learning for Affective Image Recognition.
IEEE Trans. Multim., 2023

Multimodal Hyperspectral Image Classification via Interconnected Fusion.
CoRR, 2023

Patch-shuffle-based semi-supervised segmentation of bone computed tomography via consistent learning.
Biomed. Signal Process. Control., 2023

Dataset Pruning: Reducing Training Data by Examining Generalization Influence.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
PID Control for Output Synchronization of Multiple Output Coupled Complex Networks.
IEEE Trans. Netw. Sci. Eng., 2022

Deep-IRTarget: An Automatic Target Detector in Infrared Imagery Using Dual-Domain Feature Extraction and Allocation.
IEEE Trans. Multim., 2022

Category attention transfer for efficient fine-grained visual categorization.
Pattern Recognit. Lett., 2022

Bridging the Gap Between Few-Shot and Many-Shot Learning via Distribution Calibration.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Accurate AoA Estimation for RFID Tag Array With Mutual Coupling.
IEEE Internet Things J., 2022

Distribution-Aware Margin Calibration for Semantic Segmentation in Images.
Int. J. Comput. Vis., 2022

An efficient multitask neural network for face alignment, head pose estimation and face tracking.
Expert Syst. Appl., 2022

Combination of Images and Point Clouds in a Generative Adversarial Network for Upsampling Crack Point Clouds.
IEEE Access, 2022

Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network.
Proceedings of the International Conference on Machine Learning, 2022

Objects in Semantic Topology.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Weakly Supervised Emotion Intensity Prediction for Recomi/tmi40.htmlgnition of Emotions in Images.
IEEE Trans. Multim., 2021

Context-Dependent Propagating-Based Video Recommendation in Multimodal Heterogeneous Information Networks.
IEEE Trans. Multim., 2021

Computer Vision-Assisted 3D Object Localization via COTS RFID Devices and a Monocular Camera.
IEEE Trans. Mob. Comput., 2021

4-D Flight Trajectory Prediction With Constrained LSTM Network.
IEEE Trans. Intell. Transp. Syst., 2021

Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2021

Graph neural networks with multiple kernel ensemble attention.
Knowl. Based Syst., 2021

LGAttNet: Automatic micro-expression detection using dual-stream local and global attentions.
Knowl. Based Syst., 2021

Knowledge Graph enhanced Neural Collaborative Filtering with Residual Recurrent Network.
Neurocomputing, 2021

GEME: Dual-stream multi-task GEnder-based micro-expression recognition.
Neurocomputing, 2021

Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data.
IEEE Multim., 2021

Knowledge graph enhanced neural collaborative recommendation.
Expert Syst. Appl., 2021

Single-Target Real-Time Passive WiFi Tracking.
CoRR, 2021

Estimating Instance-dependent Label-noise Transition Matrix using DNNs.
CoRR, 2021

SSFG: Stochastically Scaling Features and Gradients for Regularizing Graph Convolution Networks.
CoRR, 2021

Free Lunch for Few-shot Learning: Distribution Calibration.
Proceedings of the 9th International Conference on Learning Representations, 2021

Recognizing 3D Orientation of a Two-RFID-Tag Labeled Object in Multipath Environments Using Deep Transfer Learning.
Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, 2021

Edge-enhanced Instance Segmentation of Wrist CT via a Semi-Automatic Annotation Database Construction Method.
Proceedings of the 2021 Digital Image Computing: Techniques and Applications, 2021

Single-View 3D Object Reconstruction From Shape Priors in Memory.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Image to Modern Chinese Poetry Creation via a Constrained Topic-aware Model.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Recall What You See Continually Using GridLSTM in Image Captioning.
IEEE Trans. Multim., 2020

A Distributed and Anonymous Data Collection Framework Based on Multilevel Edge Computing Architecture.
IEEE Trans. Ind. Informatics, 2020

Multi-camera multi-player tracking with deep player identification in sports video.
Pattern Recognit., 2020

Learning Multi-level Deep Representations for Image Emotion Classification.
Neural Process. Lett., 2020

Manifold feature integration for micro-expression recognition.
Multim. Syst., 2020

Improving the generalization performance of deep networks by dual pattern learning with adversarial adaptation.
Knowl. Based Syst., 2020

LSTM-Cubic A*-based auxiliary decision support system in air traffic management.
Neurocomputing, 2020

Multi-camera 3D ball tracking framework for sports video.
IET Image Process., 2020

Meta3D: Single-View 3D Object Reconstruction from Shape Priors in Memory.
CoRR, 2020

A Survey on Machine Learning Techniques for Cyber Security in the Last Decade.
IEEE Access, 2020

RF-Mirror: Mitigating Mutual Coupling Interference in Two-Tag Array Labeled RFID Systems.
Proceedings of the 17th Annual IEEE International Conference on Sensing, 2020

Characterizing the Landscape of COVID-19 Themed Cyberattacks and Defenses.
Proceedings of the IEEE International Conference on Intelligence and Security Informatics, 2020

Data-Driven Characterization and Detection of COVID-19 Themed Malicious Websites.
Proceedings of the IEEE International Conference on Intelligence and Security Informatics, 2020

Multi-camera Sports Players 3D Localization with Identification Reasoning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Infrared Target Detection Using Intensity Saliency And Self-Attention.
Proceedings of the IEEE International Conference on Image Processing, 2020

2019
Error Concealment for Cloud-Based and Scalable Video Coding of HD Videos.
IEEE Trans. Cloud Comput., 2019

RF-Focus: Computer Vision-assisted Region-of-interest RFID Tag Recognition and Localization in Multipath-prevalent Environments.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2019

Multi-modal multi-view Bayesian semantic embedding for community question answering.
Neurocomputing, 2019

Multi-level region-based Convolutional Neural Network for image emotion classification.
Neurocomputing, 2019

Facial Component-Landmark Detection With Weakly-Supervised LR-CNN.
IEEE Access, 2019

Synthetic IR Image Refinement Using Adversarial Learning With Bidirectional Mappings.
IEEE Access, 2019

Strawberry Verticillium Wilt Detection Network Based on Multi-Task Learning and Attention.
IEEE Access, 2019

Road Vehicle Detection and Classification Using Magnetic Field Measurement.
IEEE Access, 2019

AAANE: Attention-Based Adversarial Autoencoder for Multi-scale Network Embedding.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

Improving Micro-expression Recognition Accuracy Using Twofold Feature Extraction.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Squeezed Bilinear Pooling for Fine-Grained Visual Categorization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Airborne Object Detection Using Hyperspectral Imaging: Deep Learning Review.
Proceedings of the Computational Science and Its Applications - ICCSA 2019, 2019

2018
Recognition of Emotions in User-Generated Videos With Kernelized Features.
IEEE Trans. Multim., 2018

A Joint Framework for QoS and QoE for Video Transmission over Wireless Multimedia Sensor Networks.
IEEE Trans. Mob. Comput., 2018

A survey: facial micro-expression recognition.
Multim. Tools Appl., 2018

Generating affective maps for images.
Multim. Tools Appl., 2018

Appearance features in Encoding Color Space for visual surveillance.
Neurocomputing, 2018

Dual Pattern Learning Networks by Empirical Dual Prediction Risk Minimization.
CoRR, 2018

Single Image Rain Removal via a Simplified Residual Dense Network.
IEEE Access, 2018

ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

LSTM-based Flight Trajectory Prediction.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

RF-MVO: Simultaneous 3D Object Localization and Camera Trajectory Recovery Using RFID Devices and a 2D Monocular Camera.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

3D Multiview Basketball Players Detection and Localization Based on Probabilistic Occupancy.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Fine-Grained Categorization by Deep Part-Collaboration Convolution Net.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

2017
Who Are Your "Real" Friends: Analyzing and Distinguishing Between Offline and Online Friendships From Social Multimedia Data.
IEEE Trans. Multim., 2017

Hierarchically Supervised Deconvolutional Network for Semantic Video Segmentation.
Pattern Recognit., 2017

User relationship strength modeling for friend recommendation on Instagram.
Neurocomputing, 2017

Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Image Based Facial Micro-Expression Recognition Using Deep Learning on Small Datasets.
Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

2016
Improving Visual Saliency Computing With Emotion Intensity.
IEEE Trans. Neural Networks Learn. Syst., 2016

Frame Interpolation for Cloud-Based Mobile Video Streaming.
IEEE Trans. Multim., 2016

Adaptive Content Condensation Based on Grid Optimization for Thumbnail Image Generation.
IEEE Trans. Circuits Syst. Video Technol., 2016

A unified model sharing framework for moving object detection.
Signal Process., 2016

ActiveAd: A novel framework of linking ad videos to online products.
Neurocomputing, 2016

Modelling Temporal Information Using Discrete Fourier Transform for Video Classification.
CoRR, 2016

Modelling Temporal Information Using Discrete Fourier Transform for Recognizing Emotions in User-generated Videos.
CoRR, 2016

Learning Multi-level Deep Representations for Image Emotion Classification.
CoRR, 2016

Person re-identification via rich color-gradient feature.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Modeling temporal information using discrete fourier transform for recognizing emotions in user-generated videos.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Multi-scale blocks based image emotion classification using multiple instance learning.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
A camera motion histogram descriptor for video shot classification.
Multim. Tools Appl., 2015

NIF-based seam carving for image resizing.
Multim. Syst., 2015

Survey of Error Concealment techniques: Research directions and open issues.
Proceedings of the 2015 Picture Coding Symposium, 2015

Community Detection Based on Links and Node Features in Social Networks.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

A Survey of Applying Machine Learning Techniques for Credit Rating: Existing Models and Open Issues.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Hand gesture recognition for a virtual mouse application using geometric feature of finger's trajectories.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

2014
Mobile Landmark Search with 3D Models.
IEEE Trans. Multim., 2014

CAMHID: Camera Motion Histogram Descriptor and Its Application to Cinematographic Shot Classification.
IEEE Trans. Circuits Syst. Video Technol., 2014

A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization.
Signal Process., 2014

A three-level framework for affective content analysis and its case studies.
Multim. Tools Appl., 2014

A Hybrid Image Retargeting Approach via Combining Seam Carving and Grid Warping.
J. Multim., 2014

Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Estimate Gaze Density by Incorporating Emotion.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

A Multiple Features Distance Preserving (MFDP) Model for Saliency Detection.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

2013
Context-Aware Video Retargeting via Graph Model.
IEEE Trans. Multim., 2013

Hierarchical affective content analysis in arousal and valence dimensions.
Signal Process., 2013

Graph-Guided Fusion Penalty Based Sparse Coding for Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Semantically-Based Human Scanpath Estimation with HMMs.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Enhanced 3-D Modeling for Landmark Image Classification.
IEEE Trans. Multim., 2012

Content on demand video adaptation based on MPEG-21 digital item adaptation.
EURASIP J. Wirel. Commun. Netw., 2012

Accurate Pedestrian Counting System Based on Local Features.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

3D Pose Estimation of Front Vehicle Towards a Better Driver Assistance System.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Vehicle Type Classification Using PCA with Self-Clustering.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Bag of features using sparse coding for gender classification.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

On splitting dataset: Boosting Locally Adaptive Regression Kernels for car localization.
Proceedings of the 12th International Conference on Control Automation Robotics & Vision, 2012

Shot Classification Using Domain Specific Features for Movie Management.
Proceedings of the Database Systems for Advanced Applications, 2012

Fusing Warping, Cropping, and Scaling for Optimal Image Thumbnail Generation.
Proceedings of the Computer Vision, 2012

Efficient Clothing Retrieval with Semantic-Preserving Visual Phrases.
Proceedings of the Computer Vision, 2012

2011
Landmark recognition and retrieval: from 2D to 3D.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Using context saliency for movie shot classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Cascade-Based License Plate Localization with Line Segment Features and Haar-Like Features.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

2010
Using Scripts for Affective Content Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Visual Attention Based Motion Object Detection and Trajectory Tracking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

A close-up detection method for movies.
Proceedings of the International Conference on Image Processing, 2010

Visual attention based small object segmentation in natual images.
Proceedings of the International Conference on Image Processing, 2010

Adaptive local hyperplanes for MTV affective analysis.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Learning priors for super-resolution in video sequence.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

2009
Affective content analysis by mid-level representation in multiple modalities.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Microscopic image segmentation based on color pixels classification.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008
Audio keywords generation for sports video analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Affective Content Detection by Using Timing Features and Fuzzy Clustering.
Proceedings of the Advances in Multimedia Information Processing, 2008

Automatic Colonic Polyp Detection by the Mapping Using Regional Unit Sphere.
Proceedings of the 2008 International Conference on Multimedia and Ubiquitous Engineering (MUE 2008), 2008

Comparison analysis on supervised learning based solutions for sports video categorization.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Hierarchical movie affective content analysis based on arousal and valence features.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

2007
Efficient sampling of training set in large and noisy multimedia data.
ACM Trans. Multim. Comput. Commun. Appl., 2007

2006
Multimodal Semantic Analysis and Annotation for Basketball Video.
EURASIP J. Adv. Signal Process., 2006

Efficient data reduction in multimedia data.
Appl. Intell., 2006

Affective content detection in sitcom using subtitle and audio.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Event on demand with MPEG-21 video adaptation system.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

An Event-Driven Sports Video Adaptation for the MPEG-21 DIA Framework.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

2005
A unified framework for semantic shot classification in sports video.
IEEE Trans. Multim., 2005

Affective content analysis in comedy and horror videos by audio emotional event detection.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

EASIER Sampling for Audio Event Identification.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
HMM-Based Audio Keyword Generation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Audio keyword generation for sports video analysis.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Nonparametric motion model with applications to camera motion pattern classification.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Mean shift based nonparametric motion characterization.
Proceedings of the 2004 International Conference on Image Processing, 2004

Mean shift based video segment representation and applications to replay detection.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Semantic Shot Classification in Sports Video.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Nonparametric color characterization using mean shift.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A mid-level representation framework for semantic sports video analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Creating audio keywords for event detection in soccer video.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A fusion scheme of visual and auditory modalities for event detection in sports video.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Foreground Segmentation Using Motion Vectors in Sports Video.
Proceedings of the Advances in Multimedia Information Processing, 2002

A unified framework for semantic shot classification in sports videos.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002


  Loading...