2024
Visual Grounding with Attention-Driven Constraint Balancing.
CoRR, 2024
AssistGUI: Task-Oriented PC Graphical User Interface Automation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Temporally guided articulated hand pose tracking in surgical videos.
Int. J. Comput. Assist. Radiol. Surg., January, 2023
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.
Trans. Mach. Learn. Res., 2023
ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation.
CoRR, 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn.
CoRR, 2023
MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling.
CoRR, 2022
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks.
CoRR, 2022
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks.
CoRR, 2022
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022
DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment.
Proceedings of the Computer Vision - ECCV 2022, 2022
RegionCLIP: Region-based Language-Image Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
BEVT: BERT Pretraining of Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
CLIP-Event: Connecting Text and Images with Event Structures.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Florence: A New Foundation Model for Computer Vision.
CoRR, 2021
CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning.
CoRR, 2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Cluster-Former: Clustering-based Sparse Transformer for Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding.
CoRR, 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Influential Factors and Correction Method of Furfural Content in Transformer Oil.
IEEE Access, 2019
Grounded Video Description.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition.
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
Dynamic Graph Modules for Modeling Higher-Order Interactions in Activity Recognition.
CoRR, 2018
End-to-End Dense Video Captioning With Masked Transformer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction.
Proceedings of the British Machine Vision Conference 2018, 2018
Towards Automatic Learning of Procedures From Web Instructional Videos.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Online Condition Monitoring for Both IGBT Module and DC-Link Capacitor of Power Converter Based on Short-Circuit Current Simultaneously.
IEEE Trans. Ind. Electron., 2017
Analysis and Design of a Single-Stage Isolated AC-DC LED Driver With a Voltage Doubler Rectifier.
IEEE Trans. Ind. Electron., 2017
Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer.
IEEE Trans. Cybern., 2017
ProcNets: Learning to Segment Procedures in Untrimmed and Unconstrained Videos.
CoRR, 2017
Watch What You Just Said: Image Captioning with Text-Conditional Attention.
Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
A method of active junction temperature control for IGBT.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017
2016
Monitoring chip fatigue in an IGBT module based on grey relational analysis.
Microelectron. Reliab., 2016
Lifetime estimation for IGBT modules in wind turbine power converter system considering ambient temperature.
Microelectron. Reliab., 2016
Image Caption Generation with Text-Conditional Semantic Attention.
CoRR, 2016
2015
Improvement of Stability and Power Factor in PCM Controlled Boost PFC Converter With Hybrid Dynamic Compensation.
IEEE Trans. Circuits Syst. I Regul. Pap., 2015
Corrigendum to "Junction temperature management of IGBT module in power electronic converters" [Microelectron. Reliab. 54 (2014) 2788-2795].
Microelectron. Reliab., 2015
Modeling and analysis of magnetically coupled resonant wireless power transfer system with rectifier bridge LED load.
Int. J. Circuit Theory Appl., 2015
Filter-based perturbation control of low-frequency oscillation in voltage-mode H-bridge DC-AC inverter.
Int. J. Circuit Theory Appl., 2015
Control of sub-harmonic oscillation in peak current mode buck converter with dynamic resonant perturbation.
Int. J. Circuit Theory Appl., 2015
Multi-agent Reinforcement Learning with Sparse Interactions by Negotiation and Knowledge Transfer.
CoRR, 2015
2014
Junction temperature management of IGBT module in power electronic converters.
Microelectron. Reliab., 2014
2013
Investigation of gate voltage oscillations in an IGBT module after partial bond wires lift-off.
Microelectron. Reliab., 2013
2012
DC Link Active Power Filter for Three-Phase Diode Rectifier.
IEEE Trans. Ind. Electron., 2012
2011
Non-invasive chaos control of DC-DC converter and its optimization.
Int. J. Circuit Theory Appl., 2011
2009
Double-Frequency Buck Converter.
IEEE Trans. Ind. Electron., 2009