Bang Yang

Orcid: 0000-0003-2019-0377

According to our database¹, Bang Yang authored at least 39 papers between 2008 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation.

[BibT_eX]

[DOI]

CoRR, June, 2025

Aligning, Autoencoding and Prompting Large Language Models for Novel Disease Reporting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Low-complexity SOP-based vibration broadband sensing and efficient recognition for stable IM/DD optical interconnects in data centers.

[BibT_eX]

[DOI]

J. Opt. Commun. Netw., 2025

2024

Zero-Shot Temporal Action Detection by Learning Multimodal Prompts and Text-Enhanced Actionness.

[BibT_eX]

[DOI]

Asif Raza

Bang Yang

Yuexian Zou

IEEE Trans. Circuits Syst. Video Technol., November, 2024

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework.

[BibT_eX]

[DOI]

CoRR, 2024

WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs.

[BibT_eX]

[DOI]

CoRR, 2024

Fake-GPT: Detecting Fake Image via Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Forward-transmission based distributed fiber sensing compatible with C+L unidirectional communication systems.

[BibT_eX]

[DOI]

Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024

MAKEN: Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

PCLmed: Champion Solution for ImageCLEFmedical 2024 Caption Prediction Challenge via Medical Vision-Language Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

C2RG: Parameter-efficient Adaptation of 3D Vision and Language Foundation Model for Coronary CTA Report Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Concept-Aware Video Captioning: Describing Videos With Effective Prior Information.

[BibT_eX]

[DOI]

Bang Yang

Meng Cao

Yuexian Zou

IEEE Trans. Image Process., 2023

A medical multimodal large language model for future pandemics.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2023

Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework.

[BibT_eX]

[DOI]

CoRR, 2023

Customizing General-Purpose Foundation Models for Medical Report Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PCLmed at ImageCLEFmedical 2023: Customizing General-Purpose Foundation Models for Medical Report Generation.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Adaptive Curriculum Learning for Video Captioning.

[BibT_eX]

[DOI]

Shanhao Li

Bang Yang

Yuexian Zou

IEEE Access, 2022

CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter.

[BibT_eX]

[DOI]

Bang Yang

Tong Zhang

Yuexian Zou

Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Consensus-Guided Keyword Targeting for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Graph-in-Graph Network for Automatic Gene Ontology Description Generation.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021

CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning.

[BibT_eX]

[DOI]

Bang Yang

Yuexian Zou

CoRR, 2021

O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Non-Autoregressive Coarse-to-Fine Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning.

[BibT_eX]

[DOI]

Bang Yang

Yuexian Zou

Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019

Non-Autoregressive Video Captioning with Iterative Refinement.

[BibT_eX]

[DOI]

Bang Yang

Fenglin Liu

Yuexian Zou

CoRR, 2019

2013

A novel water-drop power generation system based on ICPF actuator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

2009

Identification of Flow-Routing Sequence from DEMs Based on Quicksort.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009

Meteorological Drought Forecasting Using Markov Chain Model.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009

2008

Rainfall-Runoff Modeling at Daily Scale with Artificial Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Natural Computation, 2008

Bang Yang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...