Xuanyu Zhang

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model.
CoRR, July, 2025

VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning.
CoRR, June, 2025

Magistral.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, June, 2025

DiffLLE: Diffusion-based Domain Calibration for Weak Supervised Low-light Image Enhancement.
Int. J. Comput. Vis., May, 2025

AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection.
CoRR, May, 2025

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models.
CoRR, May, 2025

Q-Insight: Understanding Image Quality via Visual Reinforcement Learning.
CoRR, March, 2025

GaussianSeal: Rooting Adaptive Watermarks for 3D Gaussian Generation Model.
CoRR, March, 2025

Self-supervised Scalable Deep Compressed Sensing.
Int. J. Comput. Vis., February, 2025

Adaptive large neighborhood search for autonomous electric vehicle scheduling in airport baggage transport service.
Comput. Oper. Res., 2025

Smoothed dynamic scheduling of aircraft engines with off-site warehouse.
Adv. Eng. Informatics, 2025

SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Extracting the Essence and Discarding the Dross: Enhancing Code Generation with Contrastive Execution Feedback.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

FinMoE: A MoE-based Large Chinese Financial Language Model.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

An Asynchronous RISC-V-based SNN Processor with Custom ISA Extensions for Programmable On-Chip Learning.
Proceedings of the 29th IEEE International Symposium on Asynchronous Circuits and Systems, 2025

Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Progressive Content-Aware Coded Hyperspectral Snapshot Compressive Imaging.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Understanding Layer Significance in LLM Alignment.
CoRR, 2024

Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling.
CoRR, 2024

Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation.
CoRR, 2024

Diffusion-Based Hierarchical Image Steganography.
CoRR, 2024

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection.
CoRR, 2024

Recurrent Drafter for Fast Speculative Decoding in Large Language Models.
CoRR, 2024

A Heterogeneous Dynamic Convolutional Neural Network for Image Super-resolution.
CoRR, 2024

DAPT: A Dual Attention Framework for Parameter-Efficient Continual Learning of Large Language Models.
CoRR, 2024

Image super-resolution via dynamic network.
CAAI Trans. Intell. Technol., 2024

GS-Hider: Hiding Messages into 3D Gaussian Splatting.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

V<sup>2</sup>A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Improving Factual Consistency in Abstractive Summarization with Sentence Structure Pruning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Tsetlin Machine for Sentiment Analysis and Spam Review Detection in Chinese.
Algorithms, February, 2023

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement.
CoRR, 2023

Trajectory Generation and Tracking based on Energy Minimization for a Four-Link Brachiation Robot.
CoRR, 2023

CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains.
CoRR, 2023

XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters.
CoRR, 2023

Self-QA: Unsupervised Knowledge Guided Language Model Alignment.
CoRR, 2023

Progressive Content-aware Coded Hyperspectral Compressive Imaging.
CoRR, 2023

CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Relation Extraction Based on Dual-Path Graph Convolutional Networks.
Proceedings of the 2023 IEEE International Conferences on Internet of Things (iThings) and IEEE Green Computing & Communications (GreenCom) and IEEE Cyber, 2023

Generating Extractive Answers: Gated Recurrent Memory Reader for Conversational Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Adaptive Attention for Sparse-based Long-sequence Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
LVE-S2D: Low-Light Video Enhancement From Static to Dynamic.
IEEE Trans. Circuits Syst. Video Technol., 2022

Clickbait detection on WeChat: A deep model integrating semantic and syntactic information.
Knowl. Based Syst., 2022

Just ClozE! A Fast and Simple Method for Evaluating the Factual Consistency in Abstractive Summarization.
CoRR, 2022

Generative Adversarial Networks for Image Super-Resolution: A Survey.
CoRR, 2022

TranS: Transition-based Knowledge Graph Embedding with Synthetic Relation Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Instance-Guided Prompt Learning for Few-Shot Text Matching.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DeepVT: Deep View-Temporal Interaction Network for News Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Combining Explicit Entity Graph with Implicit Text Information for News Recommendation.
Proceedings of the Companion of The Web Conference 2021, 2021

Position-Augmented Transformers with Entity-Aligned Mesh for TextVQA.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

WCD: A New Chinese Online Social Media Dataset for Clickbait Analysis and Detection.
Proceedings of the 7th IEEE International Conference on Network Intelligence and Digital Content, 2021

Oriented Object Detection with Fine-Grained Enhancement and Angle Constraint.
Proceedings of the 16th International Conference on Computer Science & Education, 2021

DML: Dynamic Multi-Granularity Learning for BERT-Based Document Reranking.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
FTCLNet: Convolutional LSTM with Fourier Transform for Vulnerability Detection.
Proceedings of the 19th IEEE International Conference on Trust, 2020

Rception: Wide and Deep Interaction Networks for Machine Reading Comprehension (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CFGNN: Cross Flow Graph Neural Networks for Question Answering on Complex Tables.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Spectral Pooling Based CNN Model Compression for Light Field Depth-Estimation.
Proceedings of the Image and Graphics Technologies and Applications, 2019

MC\^2: Multi-perspective Convolutional Cube for Conversational Machine Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019


  Loading...