2024

Visual Grounding with Attention-Driven Constraint Balancing.

Weitai Kang

Luowei Zhou

Junyi Wu

Changchang Sun

Yan Yan

CoRR, 2024

AssistGUI: Task-Oriented PC Graphical User Interface Automation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Temporally guided articulated hand pose tracking in surgical videos.

Int. J. Comput. Assist. Radiol. Surg., January, 2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.

Trans. Mach. Learn. Res., 2023

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation.

CoRR, 2023

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn.

CoRR, 2023

MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling.

CoRR, 2022

Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks.

CoRR, 2022

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks.

CoRR, 2022

Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training.

Computer Vision - ECCV 2022, 2022

DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment.

Computer Vision - ECCV 2022, 2022

RegionCLIP: Region-based Language-Image Pretraining.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

BEVT: BERT Pretraining of Video Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CLIP-Event: Connecting Text and Images with Event Structures.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Florence: A New Foundation Model for Computer Vision.

CoRR, 2021

CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning.

CoRR, 2021

VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation.

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cluster-Former: Clustering-based Sparse Transformer for Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding.

CoRR, 2020

Unified Vision-Language Pre-Training for Image Captioning and VQA.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Influential Factors and Correction Method of Furfural Content in Transformer Oil.

IEEE Access, 2019

Grounded Video Description.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition.

Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018

Dynamic Graph Modules for Modeling Higher-Order Interactions in Activity Recognition.

CoRR, 2018

End-to-End Dense Video Captioning With Masked Transformer.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction.

Luowei Zhou

Nathan Louis

Jason J. Corso

Proceedings of the British Machine Vision Conference 2018, 2018

Towards Automatic Learning of Procedures From Web Instructional Videos.

Luowei Zhou

Chenliang Xu

Jason J. Corso

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Online Condition Monitoring for Both IGBT Module and DC-Link Capacitor of Power Converter Based on Short-Circuit Current Simultaneously.

IEEE Trans. Ind. Electron., 2017

Analysis and Design of a Single-Stage Isolated AC-DC LED Driver With a Voltage Doubler Rectifier.

IEEE Trans. Ind. Electron., 2017

Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer.

IEEE Trans. Cybern., 2017

ProcNets: Learning to Segment Procedures in Untrimmed and Unconstrained Videos.

Luowei Zhou

Chenliang Xu

Jason J. Corso

CoRR, 2017

Watch What You Just Said: Image Captioning with Text-Conditional Attention.

Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

A method of active junction temperature control for IGBT.

Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

2016

Monitoring chip fatigue in an IGBT module based on grey relational analysis.

Microelectron. Reliab., 2016

Lifetime estimation for IGBT modules in wind turbine power converter system considering ambient temperature.

Microelectron. Reliab., 2016

Image Caption Generation with Text-Conditional Semantic Attention.

CoRR, 2016

2015

Improvement of Stability and Power Factor in PCM Controlled Boost PFC Converter With Hybrid Dynamic Compensation.

IEEE Trans. Circuits Syst. I Regul. Pap., 2015

Corrigendum to "Junction temperature management of IGBT module in power electronic converters" [Microelectron. Reliab. 54 (2014) 2788-2795].

Microelectron. Reliab., 2015

Modeling and analysis of magnetically coupled resonant wireless power transfer system with rectifier bridge LED load.

Int. J. Circuit Theory Appl., 2015

Filter-based perturbation control of low-frequency oscillation in voltage-mode H-bridge DC-AC inverter.

Int. J. Circuit Theory Appl., 2015

Control of sub-harmonic oscillation in peak current mode buck converter with dynamic resonant perturbation.

Int. J. Circuit Theory Appl., 2015

Multi-agent Reinforcement Learning with Sparse Interactions by Negotiation and Knowledge Transfer.

CoRR, 2015

2014

Junction temperature management of IGBT module in power electronic converters.

Microelectron. Reliab., 2014

2013

Investigation of gate voltage oscillations in an IGBT module after partial bond wires lift-off.

Luowei Zhou

Shengqi Zhou

Mingwei Xu

Microelectron. Reliab., 2013

2012

DC Link Active Power Filter for Three-Phase Diode Rectifier.

IEEE Trans. Ind. Electron., 2012

2011

Non-invasive chaos control of DC-DC converter and its optimization.

Int. J. Circuit Theory Appl., 2011

2009

Double-Frequency Buck Converter.

Xiong Du

Luowei Zhou

Heng-Ming Tai

IEEE Trans. Ind. Electron., 2009