Jinfa Huang
Orcid: 0000-0002-0081-4106
According to our database, Jinfa Huang authored at least 34 papers between 1987 and 2025.
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
ACM Trans. Intell. Syst. Technol., June, 2025
CoRR, June, 2025
Aligning, Autoencoding and Prompting Large Language Models for Novel Disease Reporting.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation.
CoRR, May, 2025
TACO: Enhancing Multimodal In-context Learning via Task Mapping-Guided Sequence Configuration.
CoRR, May, 2025
QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension.
CoRR, March, 2025
A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis.
npj Digit. Medicine, 2025
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
IEEE Trans. Image Process., 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
CoRR, 2022
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
2020
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-Lingual Metaphor in Memes.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020
LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
1987
Proceedings of the European Conference on Speech Technology, 1987