Shiyi Zhang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Meta-CoT: Enhancing Granularity and Generalization in Image Editing.
CoRR, April, 2026

Generative Visual Chain-of-Thought for Image Editing.
CoRR, March, 2026

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation.
CoRR, February, 2026

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO.
CoRR, February, 2026

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts.
CoRR, January, 2026

Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing.
CoRR, January, 2026

An LLM-Driven Multi-Agent Simulation Framework for Coupled Epidemic-Economic Dynamics.
Inf., 2026

KnowFC: Navigating Knowledge Conflicts in Large Language Model-based Fact-Checking.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

2025
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation.
CoRR, December, 2025

FLAG3D++: A Benchmark for 3D Fitness Activity Comprehension With Language Instruction.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization.
CoRR, November, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference.
CoRR, September, 2025

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

RED-Net: Radiomics-Enhanced Diffusion Network for MRI-to-PET Cross-Modality Image Synthesis.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

KV-Edit: Training-Free Image Editing for Precise Background Preservation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

HAVIR: Hierarchical Vision to Image Reconstruction Using Clip-Guided Versatile Diffusion.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024
Comparing location-specific and location-open social media data: methodological lessons from a study of blaming of minorities on Twitter during the COVID-19 pandemic.
J. Comput. Soc. Sci., December, 2024

Redefining the Game: MVAE-DFDPnet's Low-Dimensional Embeddings for Superior Drug-Protein Interaction Predictions.
IEEE J. Biomed. Health Informatics, July, 2024

The Effect of Audiovisual Spatial Design on User Experience of Bare-Hand Interaction in VR.
Int. J. Hum. Comput. Interact., June, 2024

Design and Implementation of a Visual Logging and Automatic Modeling Tool for Camp Distribution Connection based on Deep Learning Algorithms.
Scalable Comput. Pract. Exp., 2024

Have we found a solution for health misinformation? A ten-year systematic review of health misinformation literature 2013-2022.
Int. J. Medical Informatics, 2024

ColorFlow: Retrieval-Augmented Image Sequence Colorization.
CoRR, 2024

Sustainable Traffic Management: A Framework Utilizing Energy Consumption Patterns.
Proceedings of the International Conference on Electrical, 2024

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Narrative Action Evaluation with Prompt-Guided Multimodal Interaction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Research on user experience of the video game difficulty based on flow theory and fNIRS.
Behav. Inf. Technol., April, 2023

A Blockchain Framework for Preserving Music Intellectual Property Rights.
Int. J. Commun. Networks Inf. Secur., 2023

Two-Layer Generation Expansion Planning Based on Flexibility Balance.
IEEE Access, 2023

LOGO: A Long-Form Video Dataset for Group Action Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-Camera Vehicle Tracking System for AI City Challenge 2022.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
A vision-based fast base frame calibration method for coordinated mobile manipulators.
Robotics Comput. Integr. Manuf., 2021

A Robotic Electrochemical Biosensor Based on Kinetic Electronics Technique.
Proceedings of the 2021 IEEE Sensors, Sydney, Australia, October 31 - Nov. 3, 2021, 2021

2020
Fault diagnosis of rotating machinery based on time-frequency image feature extraction.
J. Intell. Fuzzy Syst., 2020

A Visual Servoing Method based on Point Cloud.
Proceedings of the 2020 IEEE International Conference on Real-time Computing and Robotics, 2020

2019
A Fast Two-Phase Monte Carlo Method for Constructing Polar Codes With Arbitrary Binary Kernel.
IEEE Access, 2019

Collaborative Attention Network for Natural Language Inference.
Proceedings of the Communications, Signal Processing, and Systems, 2019

Similar Cluster Based Continuous Bag-of-Words for Word Vector Training.
Proceedings of the Communications, Signal Processing, and Systems, 2019

2018
Simplified Successive Cancellation Decoding of Polar Codes With Medium-Dimensional Binary Kernels.
IEEE Access, 2018

2017
Construction and Filtration of Lightweight Formalized MDS Matrices.
IACR Cryptol. ePrint Arch., 2017

On the Successive Cancellation Decoding of Polar Codes with Arbitrary Linear Binary Kernels.
CoRR, 2017

On the Construction of the 4 x 4 Lightest Circulant MDS Matrices.
Proceedings of the 2017 International Conference on Cryptography, Security and Privacy, 2017

2016
On the Construction of the lightest Circulant MDS Matrices.
IACR Cryptol. ePrint Arch., 2016

New construction of single-cycle T-function families.
IACR Cryptol. ePrint Arch., 2016


  Loading...