Haojin Yang

Orcid: 0000-0002-8733-5772

Affiliations:
  • Hasso Plattner Institute (HPI), Potsdam, Germany


According to our database, Haojin Yang authored at least 105 papers between 2011 and 2024.

Bibliography

2024
Optimal Parameter Design and Microrobotic Navigation Control of Parallel-Mobile-Coil Systems.
IEEE Trans. Autom. Sci. Eng., January, 2024

Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification.
Trans. Mach. Learn. Res., 2024

A flexible BERT model enabling width- and depth-dynamic inference.
Comput. Speech Lang., 2024

SeCoKD: Aligning Large Language Models for In-Context Learning with Fewer Shots.
CoRR, 2024

Feature Distribution Shift Mitigation with Contrastive Pretraining for Intrusion Detection.
CoRR, 2024

ImbaGCD: Imbalanced Generalized Category Discovery.
CoRR, 2024

Generalized Categories Discovery for Long-tailed Recognition.
CoRR, 2024

Guided Cluster Aggregation: A Hierarchical Approach to Generalized Category Discovery.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Otem-IGCD: An Optimal Transport-based EM Framework for Imbalanced Generalized Category Discovery.
Proceedings of the International Joint Conference on Neural Networks, 2024

Enhancing Optimization Robustness in 1-Bit Neural Networks Through Stochastic Sign Descent.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Magnetically Actuated Continuum Medical Robots: A Review.
Adv. Intell. Syst., June, 2023

Supervised Knowledge May Hurt Novel Class Discovery Performance.
Trans. Mach. Learn. Res., 2023

Towards Optimization-Friendly Binary Neural Network.
Trans. Mach. Learn. Res., 2023

Scaled Prompt-Tuning for Few-Shot Natural Language Generation.
CoRR, 2023

SMKD: Selective Mutual Knowledge Distillation.
Proceedings of the International Joint Conference on Neural Networks, 2023

Flexible BERT with Width- and Depth-dynamic Inference.
Proceedings of the International Joint Conference on Neural Networks, 2023

QuadMag: A Mobile-Coil System With Enhanced Magnetic Actuation Efficiency and Dexterity.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

DocLangID: Improving Few-Shot Training to Identify the Language of Historical Documents.
Proceedings of the 7th International Workshop on Historical Document Imaging and Processing, 2023

Boosting Bert Subnets with Neural Grafting.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Magnetic Micro-Driller System for Nasolacrimal Duct Recanalization.
IEEE Robotics Autom. Lett., 2022

Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket.
CoRR, 2022

Empirical Evaluation of Post-Training Quantization Methods for Language Tasks.
CoRR, 2022

A Closer Look at Novel Class Discovery from the Labeled Set.
CoRR, 2022

Mobile Ultrasound Tracking and Magnetic Control for Long-Distance Endovascular Navigation of Untethered Miniature Robots against Pulsatile Flow.
Adv. Intell. Syst., 2022

Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

2021
Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data.
CoRR, 2021

BoolNet: Minimizing The Energy Consumption of Binary Neural Networks.
CoRR, 2021

Not All Knowledge Is Created Equal.
CoRR, 2021

Evaluating Post-Training Compression in GANs using Locality-Sensitive Hashing.
CoRR, 2021

MeliusNet: An Improved Network Architecture for Binary Neural Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Hybrid Magnetic Force and Torque Actuation of Miniature Helical Robots Using Mobile Coils to Accelerate Blood Clot Removal.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Denoising AutoEncoder Based Delete and Generate Approach for Text Style Transfer.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

AsymmNet: Towards Ultralight Convolution Neural Networks Using Asymmetrical Bottlenecks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

One Model to Reconstruct Them All: A Novel Way to Use the Stochastic Noise in StyleGAN.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Recurrent generative adversarial network for learning imbalanced medical image semantic segmentation.
Multim. Tools Appl., 2020

MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
CoRR, 2020

microbatchGAN: Stimulating Diversity with Multi-Adversarial Discrimination.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

BMXNet 2: An Open Source Framework for Low-bit Networks - Reproducing, Understanding, Designing and Showcasing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Best Student Forcing: A Simple Training Mechanism in Adversarial Language Generation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Synthetic Data for the Analysis of Archival Documents: Handwriting Determination.
Proceedings of the Digital Image Computing: Techniques and Applications, 2020

2019
Deep representation learning for multimedia data analysis.
2019

KISS: Keeping It Simple for Scene Text Recognition.
CoRR, 2019

Back to Simplicity: How to Train Accurate BNNs from Scratch?
CoRR, 2019

Conditional Generative Adversarial Refinement Networks for Unbalanced Medical Image Semantic Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Learning imbalanced semantic segmentation through cross-domain relations of multi-agent generative adversarial networks.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

Training Accurate Binary Neural Networks from Scratch.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

BinaryDenseNet: Developing an Architecture for Binary Neural Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
Image Captioning with Deep Bidirectional LSTMs and Multi-Task Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Automatic Online Lecture Highlighting Based on Multimedia Analysis.
IEEE Trans. Learn. Technol., 2018

Training Competitive Binary Neural Networks from Scratch.
CoRR, 2018

Multi-Task Generative Adversarial Network for Handling Imbalanced Clinical Data.
CoRR, 2018

Conditional Generative Refinement Adversarial Networks for Unbalanced Medical Image Semantic Segmentation.
CoRR, 2018

Learning to Train a Binary Neural Network.
CoRR, 2018

Dropout-GAN: Learning from a Dynamic Ensemble of Discriminators.
CoRR, 2018

voxel-GAN: Adversarial Framework for Learning Imbalanced Brain Tumor Segmentation.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

Instance Tumor Segmentation using Multitask Convolutional Neural Network.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Generative Adversarial Framework for Learning Multiple Clinical Tasks.
Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Whole Heart and Great Vessel Segmentation with Context-aware of Generative Adversarial Networks.
Proceedings of the Bildverarbeitung für die Medizin 2018 - Algorithmen - Systeme, 2018

LoANs: Weakly Supervised Object Detection with Localizer Assessor Networks.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deep Learning for Medical Image Analysis.
CoRR, 2017

Brain Abnormality Detection by Deep Convolutional Neural Network.
CoRR, 2017

STN-OCR: A single Neural Network for Text Detection and Text Recognition.
CoRR, 2017

Traversal-Free Word Vector Evaluation in Analogy Space.
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017

BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Conditional Adversarial Network for Semantic Segmentation of Brain Tumor.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2017

Deep Neural Network with l2-Norm Unit for Brain Lesions Detection.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Language Identification Using Deep Convolutional Recurrent Neural Networks.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Automatic Lecture Subtitle Generation and How It Helps.
Proceedings of the 17th IEEE International Conference on Advanced Learning Technologies, 2017

2016
A deep semantic framework for multimodal representation learning.
Multim. Tools Appl., 2016

SceneTextReg: A Real-Time Video OCR System.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Image Captioning with Deep Bidirectional LSTMs.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Punctuation Prediction for Unsegmented Transcript Based on Word Vector.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Signature Embedding: Writer Independent Offline Signature Verification with Deep Metric Learning.
Proceedings of the Advances in Visual Computing - 12th International Symposium, 2016

Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploring multimodal video representation for action recognition.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Real-Time Action Recognition in Surveillance Videos Using ConvNets.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Action Recognition in Surveillance Video Using ConvNets and Motion History Image.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2016, 2016

Pre-Course Key Segment Analysis of Online Lecture Videos.
Proceedings of the 16th IEEE International Conference on Advanced Learning Technologies, 2016

Sentence-Level Automatic Lecture Highlighting Based on Acoustic Analysis.
Proceedings of the 2016 IEEE International Conference on Computer and Information Technology, 2016

2015
Table Detection from Slide Images.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Concept-Based Multimodal Learning for Topic Generation.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

An Improved System For Real-Time Scene Text Recognition.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Adaptive E-Lecture Video Outline Extraction Based on Slides Analysis.
Proceedings of the Advances in Web-Based Learning - ICWL 2015, 2015

Deep Semantic Mapping for Cross-Modal Retrieval.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Visual-Textual Late Semantic Fusion Using Deep Neural Network for Document Categorization.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Does Multilevel Semantic Representation Improve Text Categorization?
Proceedings of the Database and Expert Systems Applications, 2015

Reward-based Intermittent Reinforcement in Gamification for E-learning.
Proceedings of the CSEDU 2015, 2015

2014
Content Based Lecture Video Retrieval Using Speech and Video Text Information.
IEEE Trans. Learn. Technol., 2014

A framework for improved video text detection and recognition.
Multim. Tools Appl., 2014

The Automated Generation and Further Application of Tree-Structure Outline for Lecture Videos with Synchronized Slides.
Int. J. Technol. Educ. Mark., 2014

Improving text recognition by distinguishing scene and overlay text.
Proceedings of the Seventh International Conference on Machine Vision, 2014

2013
Automatic video indexing and retrieval using video OCR technology.
PhD thesis, 2013

Next Generation Tele-Teaching: Latest Recording Technology, User Engagement and Automatic Metadata Retrieval.
Proceedings of the Human Factors in Computing and Informatics, 2013

Lecture video segmentation by automatically analyzing the synchronized slides.
Proceedings of the ACM Multimedia Conference, 2013

Lecture Video Browsing Using Multimodal Information Resources.
Proceedings of the Advances in Web-Based Learning - ICWL 2013, 2013

Evaluating the Digital Manuscript Functionality - User Testing for Lecture Video Annotation Features.
Proceedings of the Advances in Web-Based Learning - ICWL 2013, 2013

2012
A skeleton based binarization approach for video text recognition.
Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Open Up Cultural Heritage in Video Archives with Mediaglobe.
Proceedings of the 12th International Conference on Innovative Internet Community Systems (I²CS 2012), 2012

An Automated Analysis and Indexing Framework for Lecture Video Portal.
Proceedings of the Advances in Web-Based Learning - ICWL 2012, 2012

Automated Extraction of Lecture Outlines from Lecture Videos - A Hybrid Solution for Lecture Video Indexing.
Proceedings of the CSEDU 2012, 2012

2011
Lecture Video Indexing and Analysis Using Video OCR Technology.
J. Multim. Process. Technol., 2011

Lecture Video Indexing and Analysis Using Video OCR Technology.
Proceedings of the Seventh International Conference on Signal-Image Technology and Internet-Based Systems, 2011

Automatic Lecture Video Indexing Using Video OCR Technology.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

German Speech Recognition: A Solution for the Analysis and Processing of Lecture Recordings.
Proceedings of the 10th IEEE/ACIS International Conference on Computer and Information Science, 2011
