Sixiao Zheng

Orcid: 0000-0001-8324-1528

According to our database, Sixiao Zheng authored at least 17 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Mind-of-Director: Multi-modal Agent-Driven Film Previsualization via Collaborative Decision-Making.
CoRR, March, 2026

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control.
CoRR, January, 2026

2025
TriVLA: A Triple-System-Based Unified Vision-Language-Action Model for General Robot Control.
CoRR, July, 2025

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation.
CoRR, February, 2025

A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Vision Transformers: From Semantic Segmentation to Dense Prediction.
Int. J. Comput. Vis., December, 2024

TemporalStory: Enhancing Consistency in Story Visualization using Spatial-Temporal Attention.
CoRR, 2024

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT.
CoRR, 2024

2023
Clustering by the Probability Distributions From Extreme Value Theory.
IEEE Trans. Artif. Intell., April, 2023

2022
Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective.
CoRR, 2022

HunYuan_tvr for Text-Video Retrievial.
CoRR, 2022

2021
NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Incrementally Zero-Shot Detection by an Extreme Value Analyzer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2018
Could Interaction with Social Robots Facilitate Joint Attention of Children with Autism Spectrum Disorder?
CoRR, 2018

