Zixin Guo

Orcid: 0000-0002-7088-2331

According to our database, Zixin Guo authored at least 26 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2026
Multi-segment Pulse Charging Strategy for Prolonging the Service Life of Semi-solid-State Batteries.
IEEE Trans. Ind. Electron., July, 2026

InstanceRSR: Real-World Super-Resolution via Instance-Aware Representation Alignment.
CoRR, March, 2026

Imagine How To Change: Explicit Procedure Modeling for Change Captioning.
CoRR, March, 2026

Type-Aware Retrieval-Augmented Generation with Dependency Closure for Solver-Executable Industrial Optimization Modeling.
CoRR, March, 2026

SeekUI: Predicting Visual Search Behavior on Graphical User Interfaces with a Reward-Augmented Vision Language Model.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

2025
Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening.
Vis. Intell., 2025

Prompt-based Weakly-supervised Vision-language Pre-training.
Pattern Recognit. Lett., 2025

FastTalker: An unified framework for generating speech and conversational gestures from text.
Neurocomputing, 2025

Valor32k-AVQA v2.0: Open-Ended Audio-Visual Question Answering Dataset and Benchmark.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Learning to Describe Implicit Changes: Noise-robust Pre-training for Image Difference Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
A Deep Convolutional Neural Network-Based Method for Self-Piercing Rivet Joint Defect Detection.
J. Comput. Inf. Sci. Eng., April, 2024

Impact of Design Decisions in Scanpath Modeling.
Proc. ACM Hum. Comput. Interact., 2024

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

Optimal trajectory control of the wall polishing robot.
Proceedings of the 2024 4th International Conference on Internet of Things and Machine Learning, 2024

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling.
Proceedings of the Computer Vision - ECCV 2024, 2024

FastTalker: Jointly Generating Speech and Conversational Gestures from Text.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Diffusion-Based Multimodal Video Captioning.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
An In-Situ Low Temperature-Mechanical Coupling Test System for Battery Materials.
IEEE Trans. Instrum. Meas., 2023

PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Designing a Robot for Enhancing Attention of Office Workers with the Heavily Use of Screen.
Proceedings of the Design, User Experience, and Usability, 2023

2022
CLIP4IDC: CLIP for Image Difference Captioning.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Post-Attention Modulator for Dense Video Captioning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

2021
EdgeKeeper: a trusted edge computing framework for ubiquitous power Internet of Things.
Frontiers Inf. Technol. Electron. Eng., 2021

Global Fusion Attention for Vision and Language Understanding (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
PicSOM Experiments in TRECVID 2020.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

2013
Graph-based multiple instance learning for action recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013

