Zeyi Huang

According to our database, Zeyi Huang authored at least 37 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP.
CoRR, July, 2025

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection.
CoRR, May, 2025

T2I-ConBench: Text-to-Image Benchmark for Continual Post-training.
CoRR, May, 2025

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts.
CoRR, April, 2025

Do Vision Models Develop Human-Like Progressive Difficulty Understanding?
CoRR, March, 2025

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents.
CoRR, February, 2025

An Investigation on LLMs' Visual Understanding Ability Using SVG for Image-Text Bridging.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Action Detail Matters: Refining Video Recognition with Local Action Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Ascend HiFloat8 Format for Deep Learning.
CoRR, 2024

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding.
CoRR, 2023

Identification of a Novel Model for Predicting the Prognosis and Immune Response Based on Genes Related to Ferroptosis and Disulfidptosis in Liver Hepatocellular Carcinoma.
Proceedings of 2023 International Conference on Medical Imaging and Computer-Aided Diagnosis, 2023

Identification of Hub Biomarkers and Immune Cell Infiltration Characteristics in Ulcerative Colitis by Bioinformatics Analysis and Machine Learning.
Proceedings of 2023 International Conference on Medical Imaging and Computer-Aided Diagnosis, 2023

A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Expeditious Saliency-guided Mix-up through Random Gradient Thresholding.
CoRR, 2022

The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization.
CoRR, 2022

Toward learning human-aligned cross-domain robust models by countering misaligned features.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

On the Integration of Self-Attention and Convolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

The Two Dimensions of Worst-case Training and Their Integrated Effect for Out-of-domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features.
CoRR, 2021

Not All Images are Worth 16x16 Words: Dynamic Vision Transformers with Adaptive Sequence Length.
CoRR, 2021

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Squared ℓ₂ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations.
CoRR, 2020

Improving Object Detection with Inverted Attention.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Comprehensive Attention Self-Distillation for Weakly-Supervised Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Self-challenging Improves Cross-Domain Generalization.
Proceedings of the Computer Vision - ECCV 2020, 2020

High-Frequency Component Helps Explain the Generalization of Convolutional Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multiple Anchor Learning for Visual Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Discriminative Feature Learning With Consistent Attention Regularization for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2017
Large Margin Object Tracking with Circulant Feature Maps.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2015
Remaining Useful Life Prediction for a Nonlinear Heterogeneous Wiener Process Model With an Adaptive Drift.
IEEE Trans. Reliab., 2015

2014
A new descriptor resistant to affine transformation and monotonic intensity change.
Comput. Vis. Image Underst., 2014

