Bang Yang
Orcid: 0000-0003-2019-0377
  According to our database1,
  Bang Yang
  authored at least 39 papers
  between 2008 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation.
    
  
    CoRR, June, 2025
    
  
Aligning, Autoencoding and Prompting Large Language Models for Novel Disease Reporting.
    
  
    IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
    
  
Low-complexity SOP-based vibration broadband sensing and efficient recognition for stable IM/DD optical interconnects in data centers.
    
  
    J. Opt. Commun. Netw., 2025
    
  
  2024
Zero-Shot Temporal Action Detection by Learning Multimodal Prompts and Text-Enhanced Actionness.
    
  
    IEEE Trans. Circuits Syst. Video Technol., November, 2024
    
  
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation.
    
  
    IEEE Trans. Pattern Anal. Mach. Intell., August, 2024
    
  
    CoRR, 2024
    
  
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework.
    
  
    CoRR, 2024
    
  
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs.
    
  
    CoRR, 2024
    
  
    Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
    
  
Forward-transmission based distributed fiber sensing compatible with C+L unidirectional communication systems.
    
  
    Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024
    
  
MAKEN: Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.
    
  
    Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024
    
  
KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
PCLmed: Champion Solution for ImageCLEFmedical 2024 Caption Prediction Challenge via Medical Vision-Language Foundation Models.
    
  
    Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024
    
  
C2RG: Parameter-efficient Adaptation of 3D Vision and Language Foundation Model for Coronary CTA Report Generation.
    
  
    Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
    
  
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
    
  
    Proceedings of the Findings of the Association for Computational Linguistics, 2024
    
  
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning.
    
  
    Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
    
  
  2023
    IEEE Trans. Image Process., 2023
    
  
    npj Digit. Medicine, 2023
    
  
Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.
    
  
    CoRR, 2023
    
  
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework.
    
  
    CoRR, 2023
    
  
    CoRR, 2023
    
  
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.
    
  
    Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
    
  
PCLmed at ImageCLEFmedical 2023: Customizing General-Purpose Foundation Models for Medical Report Generation.
    
  
    Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023
    
  
    Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
    
  
Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels.
    
  
    Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
    
  
  2022
    Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
    
  
    Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
    
  
    Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
    
  
    Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
    
  
  2021
CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning.
    
  
    CoRR, 2021
    
  
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning.
    
  
    Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
    
  
    Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
    
  
  2020
Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning.
    
  
    Proceedings of the 25th International Conference on Pattern Recognition, 2020
    
  
  2019
  2013
    Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013
    
  
  2009
    Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009
    
  
    Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009
    
  
  2008
    Proceedings of the Fourth International Conference on Natural Computation, 2008