Bang Yang
Orcid: 0000-0003-2019-0377
According to our database1,
Bang Yang
authored at least 36 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Zero-Shot Temporal Action Detection by Learning Multimodal Prompts and Text-Enhanced Actionness.
IEEE Trans. Circuits Syst. Video Technol., November, 2024
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024
CoRR, 2024
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework.
CoRR, 2024
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs.
CoRR, 2024
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Forward-transmission based distributed fiber sensing compatible with C+L unidirectional communication systems.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024
MAKEN: Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024
KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024
PCLmed: Champion Solution for ImageCLEFmedical 2024 Caption Prediction Challenge via Medical Vision-Language Foundation Models.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024
C2RG: Parameter-efficient Adaptation of 3D Vision and Language Foundation Model for Coronary CTA Report Generation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Image Process., 2023
npj Digit. Medicine, 2023
Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.
CoRR, 2023
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework.
CoRR, 2023
CoRR, 2023
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
PCLmed at ImageCLEFmedical 2023: Customizing General-Purpose Foundation Models for Medical Report Generation.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
2021
CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning.
CoRR, 2021
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2019
2013
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013
2009
Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009
Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009
2008
Proceedings of the Fourth International Conference on Natural Computation, 2008