Junyu Gao

Orcid: 0000-0001-6000-8168

Affiliations:
  • Northwestern Polytechnical University, OPTIMAL, Xi'an, China


According to our database1, Junyu Gao authored at least 84 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Real-Time Text Detection With Similar Mask in Traffic, Industrial, and Natural Scenes.
IEEE Trans. Intell. Transp. Syst., January, 2025

Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2025

Distance-aware network for physical-world object distribution estimation and counting.
Pattern Recognit., 2025

Memory-enhanced hierarchical transformer for video paragraph captioning.
Neurocomputing, 2025

H3T: Hierarchical Transferable Transformer with TokenMix for Unsupervised Domain Adaptation.
Expert Syst. Appl., 2025

2024
Learning Long-Range Relationships for Temporal Aircraft Anomaly Detection.
IEEE Trans. Aerosp. Electron. Syst., October, 2024

Text kernel calculation for arbitrary shape text detection.
Vis. Comput., April, 2024

SSIR: Spatial shuffle multi-head self-attention for Single Image Super-Resolution.
Pattern Recognit., April, 2024

An End-to-End Contrastive License Plate Detector.
IEEE Trans. Intell. Transp. Syst., January, 2024

FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation.
IEEE Trans. Image Process., 2024

Single-Stream Extractor Network With Contrastive Pre-Training for Remote-Sensing Change Captioning.
IEEE Trans. Geosci. Remote. Sens., 2024

Integrating SAM With Feature Interaction for Remote Sensing Change Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Alignment and Fusion Using Distinct Sensor Data for Multimodal Aerial Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

Enhancing Unimodal Features Matters: A Multimodal Framework for Building Extraction.
IEEE Trans. Geosci. Remote. Sens., 2024

Contrastive Tokens and Label Activation for Remote Sensing Weakly Supervised Semantic Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2024

Balanced Density Regression Network for Remote Sensing Object Counting.
IEEE Trans. Geosci. Remote. Sens., 2024

NWPU-MOC: A Benchmark for Fine-Grained Multicategory Object Counting in Aerial Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Center-enhanced video captioning model with multimodal semantic alignment.
Neural Networks, 2024

Audio-visual representation learning for anomaly events detection in crowds.
Neurocomputing, 2024

RRTrN: A lightweight and effective backbone for scene text recognition.
Expert Syst. Appl., 2024

Multivariate time series classification with crucial timestamps guidance.
Expert Syst. Appl., 2024

SignEye: Traffic Sign Interpretation from Vehicle First-Person View.
CoRR, 2024

Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection.
CoRR, 2024

Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera.
CoRR, 2024

Quantum-inspired Interpretable Deep Learning Architecture for Text Sentiment Analysis.
CoRR, 2024

A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot.
CoRR, 2024

StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation.
CoRR, 2024

Text-only Synthesis for Image Captioning.
CoRR, 2024

U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation.
CoRR, 2024

Like Humans to Few-Shot Learning through Knowledge Permeation of Vision and Text.
CoRR, 2024

Dynamic Proxy Domain Generalizes the Crowd Localization by Better Binary Segmentation.
CoRR, 2024

NWPU-MOC: A Benchmark for Fine-grained Multi-category Object Counting in Aerial Images.
CoRR, 2024

SamLP: A Customized Segment Anything Model for License Plate Detection.
CoRR, 2024

A Descriptive Basketball Highlight Dataset for Automatic Commentary Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Domain-Adaptive Crowd Counting via High-Quality Image Translation and Density Reconstruction.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Boosting One-Stage License Plate Detector via Self-Constrained Contrastive Aggregation.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Crowd Localization From Gaussian Mixture Scoped Knowledge and Scoped Teacher.
IEEE Trans. Image Process., 2023

Holistic Mutual Representation Enhancement for Few-Shot Remote Sensing Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2023

Exploring Hard Samples in Multiview for Few-Shot Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2023

LGNet: Location-Guided Network for Road Extraction From Satellite Images.
IEEE Trans. Geosci. Remote. Sens., 2023

NAS-Kernel: Learning Suitable Gaussian Kernel for Remote-Sensing Object Counting.
IEEE Geosci. Remote. Sens. Lett., 2023

Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators.
CoRR, 2023

Imbalanced Aircraft Data Anomaly Detection.
CoRR, 2023

2022
Multitask Attention Network for Lane Detection and Fitting.
IEEE Trans. Neural Networks Learn. Syst., 2022

Neuron Linear Transformation: Modeling the Domain Shift for Crowd Counting.
IEEE Trans. Neural Networks Learn. Syst., 2022

Video Crowd Localization With Multifocus Gaussian Neighborhood Attention and a Large-Scale Benchmark.
IEEE Trans. Image Process., 2022

Density-Aware Curriculum Learning for Crowd Counting.
IEEE Trans. Cybern., 2022

Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images.
Remote. Sens., 2022

Congested crowd instance localization with dilated convolutional swin transformer.
Neurocomputing, 2022

Counting Like Human: Anthropoid Crowd Counting on Modeling the Similarity of Objects.
CoRR, 2022

MAFNet: A Multi-Attention Fusion Network for RGB-T Crowd Counting.
CoRR, 2022

DR.VIC: Decomposition and Reasoning for Video Individual Counting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Feature-Aware Adaptation and Density Alignment for Crowd Counting in Video Surveillance.
IEEE Trans. Cybern., 2021

NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Learning to detect anomaly events in crowd scenes from synthetic data.
Neurocomputing, 2021

Pixel-Wise Crowd Understanding via Synthetic Data.
Int. J. Comput. Vis., 2021

Audio-visual Representation Learning for Anomaly Events Detection in Crowds.
CoRR, 2021

LDC-Net: A Unified Framework for Localization, Detection and Counting in Dense Crowds.
CoRR, 2021

Unsupervised Domain Adaptive Learning via Synthetic Data for Person Re-identification.
CoRR, 2021

Video Crowd Localization with Multi-focus Gaussian Neighbor Attention and a Large-Scale Benchmark.
CoRR, 2021

Multi-Domain Synchronous Refinement Network for Unsupervised Cross-Domain Person Re-Identification.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
PCC Net: Perspective Crowd Counting via Spatial Convolutional Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Learning Independent Instance Maps for Crowd Localization.
CoRR, 2020

Ambient Sound Helps: Audiovisual Crowd Counting in Extreme Conditions.
CoRR, 2020

CNN-based Density Estimation and Crowd Counting: A Survey.
CoRR, 2020

NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting.
CoRR, 2020

Hyperspectral Image Classification With CapsNet and Markov Random Fields.
IEEE Access, 2020

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pixel-Level Self-Paced Learning For Super-Resolution.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Focus on Semantic Consistency for Cross-Domain Crowd Understanding.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Flow Base Bi-path Network for Cross-Scene Video Crowd Understanding in Aerial View.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020


Multi-feature Counting of Dense Crowd Image Based on Multi-column Convolutional Neural Network.
Proceedings of the 5th International Conference on Computer and Communication Systems, 2020

2019
Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes.
IEEE Trans. Image Process., 2019

SCAR: Spatial-/channel-wise attention regression networks for crowd counting.
Neurocomputing, 2019

Domain-adaptive Crowd Counting via Inter-domain Features Segregation and Gaussian-prior Reconstruction.
CoRR, 2019

Feature-aware Adaptation and Structured Density Alignment for Crowd Counting in Video Surveillance.
CoRR, 2019

C^3 Framework: An Open-source PyTorch Code for Crowd Counting.
CoRR, 2019

Convolutional Regression Network for Multi-Oriented Text Detection.
IEEE Access, 2019

Learning From Synthetic Data for Crowd Counting in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Embedding Structured Contour and Location Prior in Siamesed Fully Convolutional Networks for Road Detection.
IEEE Trans. Intell. Transp. Syst., 2018

A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling.
IEEE Trans. Intell. Transp. Syst., 2018


  Loading...