Ting Yu

Orcid: 0000-0001-6918-3157

Affiliations:
  • Hangzhou Normal University, China


According to our database1, Ting Yu authored at least 16 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
CloudCap3D: enhancing 3D in-scene descriptions via point cloud integration and efficient text filtering.
Multim. Syst., August, 2026

SwiftCraft3D: semantic-enhanced multi-view prompting for efficient and high-fidelity text-to-3D generation.
Vis. Comput., January, 2026

2025
Semi-Supervised RGB-D Hand Gesture Recognition via Mutual Learning of Self-Supervised Models.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025

WP-CMA: Waypoint Prediction for Cross-modal Alignment of Vision-and-Language Navigation in Continuous Environments.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
3D human pose estimation with multi-hypotheses gated transformer.
Multim. Syst., December, 2024

Token-Mixer: Bind Image and Text in One Embedding Space for Medical Image Reporting.
IEEE Trans. Medical Imaging, November, 2024

A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering.
IEEE Trans. Image Process., 2024

Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering.
CoRR, 2024

2021
Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks.
IEEE Trans. Circuits Syst. Video Technol., 2021

2020
Compositional Attention Networks With Two-Stream Fusion for Video Question Answering.
IEEE Trans. Image Process., 2020

Multi-task Compositional Network for Visual Relationship Detection.
Int. J. Comput. Vis., 2020

An Efficient Degraded Deductive Fault Simulator for Small-Delay Defects.
IEEE Access, 2020

2019
On Exploring Undetermined Relationships for Visual Relationship Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019


  Loading...