Haojin Yang

Comput. Speech Lang., 2024

SeCoKD: Aligning Large Language Models for In-Context Learning with Fewer Shots.

[BibT_eX]

[DOI]

Weixing Wang

CoRR, 2024

Feature Distribution Shift Mitigation with Contrastive Pretraining for Intrusion Detection.

[BibT_eX]

[DOI]

Cristian Bermudez Serna

Carmen Mas Machuca

CoRR, 2024

ImbaGCD: Imbalanced Generalized Category Discovery.

[BibT_eX]

[DOI]

CoRR, 2024

Generalized Categories Discovery for Long-tailed Recognition.

[BibT_eX]

[DOI]

Ziyun Li

CoRR, 2024

Guided Cluster Aggregation: A Hierarchical Approach to Generalized Category Discovery.

[BibT_eX]

[DOI]

Jona Otholt

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Otem-IGCD: An Optimal Transport-based EM Framework for Imbalanced Generalized Category Discovery.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2024

Enhancing Optimization Robustness in 1-Bit Neural Networks Through Stochastic Sign Descent.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Magnetically Actuated Continuum Medical Robots: A Review.

[BibT_eX]

[DOI]

Adv. Intell. Syst., June, 2023

Supervised Knowledge May Hurt Novel Class Discovery Performance.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Towards Optimization-Friendly Binary Neural Network.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Scaled Prompt-Tuning for Few-Shot Natural Language Generation.

[BibT_eX]

[DOI]

CoRR, 2023

SMKD: Selective Mutual Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2023

Flexible BERT with Width- and Depth-dynamic Inference.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2023

QuadMag: A Mobile-Coil System With Enhanced Magnetic Actuation Efficiency and Dexterity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

DocLangID: Improving Few-Shot Training to Identify the Language of Historical Documents.

[BibT_eX]

[DOI]

Proceedings of the 7th International Workshop on Historical Document Imaging and Processing, 2023

Boosting Bert Subnets with Neural Grafting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Magnetic Micro-Driller System for Nasolacrimal Duct Recanalization.

[BibT_eX]

[DOI]

Kelvin Kam Lung Chong

Chi Pui Pang

Li Zhang

IEEE Robotics Autom. Lett., 2022

Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket.

[BibT_eX]

[DOI]

CoRR, 2022

Empirical Evaluation of Post-Training Quantization Methods for Language Tasks.

[BibT_eX]

[DOI]

CoRR, 2022

A Closer Look at Novel Class Discovery from the Labeled Set.

[BibT_eX]

[DOI]

CoRR, 2022

Mobile Ultrasound Tracking and Magnetic Control for Long-Distance Endovascular Navigation of Untethered Miniature Robots against Pulsatile Flow.

[BibT_eX]

[DOI]

Adv. Intell. Syst., 2022

Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Pattern Recognition, 2022

2021

Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data.

[BibT_eX]

[DOI]

CoRR, 2021

BoolNet: Minimizing The Energy Consumption of Binary Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Not All Knowledge Is Created Equal.

[BibT_eX]

[DOI]

Neil Martin Robertson

David A. Clifton

CoRR, 2021

Evaluating Post-Training Compression in GANs using Locality-Sensitive Hashing.

[BibT_eX]

[DOI]

Gonçalo Mordido

CoRR, 2021

MeliusNet: An Improved Network Architecture for Binary Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Hybrid Magnetic Force and Torque Actuation of Miniature Helical Robots Using Mobile Coils to Accelerate Blood Clot Removal.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Denoising AutoEncoder Based Delete and Generate Approach for Text Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

AsymmNet: Towards Ultralight Convolution Neural Networks Using Asymmetrical Bottlenecks.

[BibT_eX]

[DOI]

Zhen Shen

Yucheng Zhao

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

One Model to Reconstruct Them All: A Novel Way to Use the Stochastic Noise in StyleGAN.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Recurrent generative adversarial network for learning imbalanced medical image semantic segmentation.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2020

MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?

[BibT_eX]

[DOI]

CoRR, 2020

microbatchGAN: Stimulating Diversity with Multi-Adversarial Discrimination.

[BibT_eX]

[DOI]

Gonçalo Mordido

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

BMXNet 2: An Open Source Framework for Low-bit Networks - Reproducing, Understanding, Designing and Showcasing.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Best Student Forcing: A Simple Training Mechanism in Adversarial Language Generation.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Synthetic Data for the Analysis of Archival Documents: Handwriting Determination.

[BibT_eX]

[DOI]

Proceedings of the Digital Image Computing: Techniques and Applications, 2020

2019

Deep representation learning for multimedia data analysis

[BibT_eX]

[DOI]

, 2019

KISS: Keeping It Simple for Scene Text Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

Back to Simplicity: How to Train Accurate BNNs from Scratch?

[BibT_eX]

[DOI]

CoRR, 2019

Conditional Generative Adversarial Refinement Networks for Unbalanced Medical Image Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Learning imbalanced semantic segmentation through cross-domain relations of multi-agent generative adversarial networks.

[BibT_eX]

[DOI]

Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

Training Accurate Binary Neural Networks from Scratch.

[BibT_eX]

[DOI]

Joseph Bethge

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

BinaryDenseNet: Developing an Architecture for Binary Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018

Image Captioning with Deep Bidirectional LSTMs and Multi-Task Learning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2018

Automatic Online Lecture Highlighting Based on Multimedia Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Learn. Technol., 2018

Training Competitive Binary Neural Networks from Scratch.

[BibT_eX]

[DOI]

CoRR, 2018

Multi-Task Generative Adversarial Network for Handling Imbalanced Clinical Data.

[BibT_eX]

[DOI]

CoRR, 2018

Conditional Generative Refinement Adversarial Networks for Unbalanced Medical Image Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Train a Binary Neural Network.

[BibT_eX]

[DOI]

CoRR, 2018

Dropout-GAN: Learning from a Dynamic Ensemble of Discriminators.

[BibT_eX]

[DOI]

Gonçalo Mordido

CoRR, 2018

voxel-GAN: Adversarial Framework for Learning Imbalanced Brain Tumor Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2018

Instance Tumor Segmentation using Multitask Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Generative Adversarial Framework for Learning Multiple Clinical Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Digital Image Computing: Techniques and Applications, 2018

Whole Heart and Great Vessel Segmentation with Context-aware of Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Bildverarbeitung für die Medizin 2018 - Algorithmen - Systeme, 2018

LoANs: Weakly Supervised Object Detection with Localizer Assessor Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition.

[BibT_eX]

[DOI]

Christian Bartz

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Deep Learning for Medical Image Analysis.

[BibT_eX]

[DOI]

CoRR, 2017

Brain Abnormality Detection by Deep Convolutional Neural Network.

[BibT_eX]

[DOI]

CoRR, 2017

STN-OCR: A single Neural Network for Text Detection and Text Recognition.

[BibT_eX]

[DOI]

Christian Bartz

CoRR, 2017

Traversal-Free Word Vector Evaluation in Analogy Space.

[BibT_eX]

[DOI]

Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017

BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Conditional Adversarial Network for Semantic Segmentation of Brain Tumor.

[BibT_eX]

[DOI]

Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2017

Deep Neural Network with l2-Norm Unit for Brain Lesions Detection.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Language Identification Using Deep Convolutional Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Automatic Lecture Subtitle Generation and How It Helps.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Advanced Learning Technologies, 2017

2016

A deep semantic framework for multimodal representation learning.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

SceneTextReg: A Real-Time Video OCR System.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Image Captioning with Deep Bidirectional LSTMs.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Punctuation Prediction for Unsegmented Transcript Based on Word Vector.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Signature Embedding: Writer Independent Offline Signature Verification with Deep Metric Learning.

[BibT_eX]

[DOI]

Hannes Rantzsch

Proceedings of the Advances in Visual Computing - 12th International Symposium, 2016

Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploring multimodal video representation for action recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Real-Time Action Recognition in Surveillance Videos Using ConvNets.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Action Recognition in Surveillance Video Using ConvNets and Motion History Image.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2016, 2016

Pre-Course Key Segment Analysis of Online Lecture Videos.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Conference on Advanced Learning Technologies, 2016

Sentence-Level Automatic Lecture Highlighting Based on Acoustic Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Computer and Information Technology, 2016

2015

Table Detection from Slide Images.

[BibT_eX]

[DOI]

Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Concept-Based Multimodal Learning for Topic Generation.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

An Improved System For Real-Time Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Adaptive E-Lecture Video Outline Extraction Based on Slides Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Web-Based Learning - ICWL 2015, 2015

Deep Semantic Mapping for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Visual-Textual Late Semantic Fusion Using Deep Neural Network for Document Categorization.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Does Multilevel Semantic Representation Improve Text Categorization?

[BibT_eX]

[DOI]

Proceedings of the Database and Expert Systems Applications, 2015

Reward-based Intermittent Reinforcement in Gamification for E-learning.

[BibT_eX]

[DOI]

Sheng Luo

Proceedings of the CSEDU 2015, 2015

2014

Content Based Lecture Video Retrieval Using Speech and Video Text Information.

[BibT_eX]

[DOI]

IEEE Trans. Learn. Technol., 2014

A framework for improved video text detection and recognition.

[BibT_eX]

[DOI]

Bernhard Quehl

Multim. Tools Appl., 2014

The Automated Generation and Further Application of Tree-Structure Outline for Lecture Videos with Synchronized Slides.

[BibT_eX]

[DOI]

Int. J. Technol. Educ. Mark., 2014

Improving text recognition by distinguishing scene and overlay text.

[BibT_eX]

[DOI]

Bernhard Quehl

Proceedings of the Seventh International Conference on Machine Vision, 2014

2013

Automatic video indexing and retrieval using video OCR technology.

[BibT_eX]

[DOI]

PhD thesis, 2013

Next Generation Tele-Teaching: Latest Recording Technology, User Engagement and Automatic Metadata Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Human Factors in Computing and Informatics, 2013

Lecture video segmentation by automatically analyzing the synchronized slides.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Lecture Video Browsing Using Multimodal Information Resources.

[BibT_eX]

[DOI]

Proceedings of the Advances in Web-Based Learning - ICWL 2013, 2013

Evaluating the Digital Manuscript Functionality - User Testing for Lecture Video Annotation Features.

[BibT_eX]

[DOI]

Franka Grünewald

Proceedings of the Advances in Web-Based Learning - ICWL 2013, 2013

2012

A skeleton based binarization approach for video text recognition.

[BibT_eX]

[DOI]

Bernhard Quehl

Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Open Up Cultural Heritage in Video Archives with Mediaglobe.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Innovative Internet Community Systems (I<sup>2</sup>CS 2012), 2012

An Automated Analysis and Indexing Framework for Lecture Video Portal.

[BibT_eX]

[DOI]

Christoph Oehlke

Proceedings of the Advances in Web-Based Learning - ICWL 2012, 2012

Automated Extraction of Lecture Outlines from Lecture Videos - A Hybrid Solution for Lecture Video Indexing.

[BibT_eX]

Franka Gruenewald

Proceedings of the CSEDU 2012, 2012

2011

Lecture Video Indexing and Analysis Using Video OCR Technology.

[BibT_eX]

[DOI]

J. Multim. Process. Technol., 2011

Lecture Video Indexing and Analysis Using Video OCR Technology.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Signal-Image Technology and Internet-Based Systems, 2011

Automatic Lecture Video Indexing Using Video OCR Technology.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

German Speech Recognition: A Solution for the Analysis and Processing of Lecture Recordings.

[BibT_eX]

[DOI]

Christoph Oehlke