Zhaozhuo Xu

Orcid: 0000-0001-9555-1830

According to our database1, Zhaozhuo Xu authored at least 81 papers between 2016 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
OAT-Rephrase: Optimization-Aware Training Data Rephrasing for Zeroth-Order LLM Fine-Tuning.
CoRR, June, 2025

Mitigating Non-IID Drift in Zeroth-Order Federated LLM Fine-Tuning with Transferable Sparsity.
CoRR, June, 2025

DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic.
CoRR, May, 2025

VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation.
CoRR, May, 2025

Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models.
CoRR, April, 2025

Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025

ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Parity LLM Data Valuation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Taming Language Models for Text-attributed Graph Learning with Decoupled Aggregation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Compression-Aware Computing for Scalable and Sustainable AI.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Fox-1 Technical Report.
CoRR, 2024

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs.
CoRR, 2024

Weighted Diversified Sampling for Efficient Data-Driven Single-Cell Gene-Gene Interaction Discovery.
CoRR, 2024

SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching.
CoRR, 2024

Measuring Copyright Risks of Large Language Model via Partial Information Probing.
CoRR, 2024

Sirius: Contextual Sparsity with Correction for Efficient LLMs.
CoRR, 2024

PolyRouter: A Multi-LLM Querying System.
CoRR, 2024

TorchOpera: A Compound AI System for LLM Safety.
CoRR, 2024

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity.
CoRR, 2024

LLM Multi-Agent Systems: Challenges and Open Problems.
CoRR, 2024

SIRIUS : Contexual Sparisty with Correction for Efficient LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GNNs Also Deserve Editing, and They Need It More Than Once.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Soft Prompt Recovers Compressed LLMs, Transferably.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TVE: Learning Meta-attribution for Transferable Vision Explainer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Knowledge Graphs Can be Learned with Just Intersection Features.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Do LLMs Know to Respect Copyright Notice?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

In Defense of Structural Sparse Adapters for Concurrent LLM Serving.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TensorOpera Router: A Multi-Model Router for Efficient LLM Inference.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

QUEST: Efficient Extreme Multi-Label Text Classification with Large Language Models on Commodity Hardware.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Token-wise Influential Training Data Retrieval for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Multi-Domain Fusion Graph Network for Semi-Supervised PolSAR Image Classification.
Remote. Sens., January, 2023

LETA: Learning Transferable Attribution for Generic Vision Explainer.
CoRR, 2023

Zen: Near-Optimal Sparse Tensor Synchronization for Distributed DNN Training.
CoRR, 2023

Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt.
CoRR, 2023

A Theoretical Analysis Of Nearest Neighbor Search On Approximate Near Neighbor Graph.
CoRR, 2023

Graph Self-supervised Learning via Proximity Distribution Minimization.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

One-Pass Distribution Sketch for Measuring Data Heterogeneity in Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

A Tale of Two Efficient Value Iteration Algorithms for Solving Linear MDPs with Large Action Space.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Composite Sequential Network With POA Attention for PolSAR Image Analysis.
IEEE Trans. Geosci. Remote. Sens., 2022

DA2Net: Distraction-Attention-Driven Adversarial Network for Robust Remote Sensing Image Scene Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

A General Feature Paradigm for Unsupervised Cross-Domain PolSAR Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory.
CoRR, 2022

Sublinear Time Algorithm for Online Weighted Bipartite Matching.
CoRR, 2022

Accelerating Frank-Wolfe Algorithm using Low-Dimensional and Adaptive Data Structures.
CoRR, 2022

Proximity Graph Maintenance for Fast Online Nearest Neighbor Search.
CoRR, 2022

Speeding Up Sparsification using Inner Product Search Data Structures.
CoRR, 2022

HALOS: Hashing Large Output Space for Cheap Inference.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Adaptive and Dynamic Multi-Resolution Hashing for Pairwise Summations.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Satellite Images and Deep Learning to Identify Discrepancy in Mailing Addresses with Applications to Census 2020 in Houston.
CoRR, 2021

PairConnect: A Compute-Efficient MLP Alternative to Attention.
CoRR, 2021

Sublinear Least-Squares Value Iteration via Locality Sensitive Hashing.
CoRR, 2021

Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion.
CoRR, 2021

Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Locality Sensitive Teaching.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Raw Nav-merge Seismic Data to Subsurface Properties with MLP based Multi-Modal Information Unscrambler.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Norm Adjusted Proximity Graph for Fast Inner Product Retrieval.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Climbing the WOL: Training for Cheaper Inference.
CoRR, 2020

Fast Item Ranking under Neural Network based Measures.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

2019
Dynamic Fractal Texture Analysis for PolSAR Land Cover Classification.
IEEE Trans. Geosci. Remote. Sens., 2019

Detection, Tracking, and Geolocation of Moving Vehicle From UAV Using Monocular Camera.
IEEE Access, 2019

SAR-to-Optical Image Translation Using Supervised Cycle-Consistent Adversarial Networks.
IEEE Access, 2019

Möbius Transformation for Fast Inner Product Search on Graph.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Class Activation Mapping Guided Adversarial Training Method for Land-Use Classification and Object Detection.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

On Efficient Retrieval of Top Similarity Vectors.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
A DLM-LSTM Framework for North-South Land Deformation Trend Analysis from Low-Cost GPS Sensor Time Series.
J. Sensors, 2018

2017
Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery.
Remote. Sens., 2017

Unified management and control of heterogeneous water quality measuring devices via edge computing nodes.
Proceedings of the 2017 IEEE SENSORS, Glasgow, United Kingdom, October 29, 2017

2016
Raspberry Pi Based Intelligent Wireless Sensor Node for Localized Torrential Rain Monitoring.
J. Sensors, 2016


  Loading...