Xing Hu

Orcid: 0009-0003-4510-898X

Affiliations:
  • Houmo AI, China
  • Hohai University, College of Computer and Information, Nanjing, China (former)


According to our database1, Xing Hu authored at least 23 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MGVQ: Synergizing Multi-dimensional Sensitivity-Aware and Gradient-Hessian Fusion for Vector Quantization.
CoRR, May, 2026

TORQ: Two-Level Orthogonal Rotation for MXFP4 Quantization.
CoRR, May, 2026

KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models.
CoRR, February, 2026

SAES-SVD: Self-Adaptive Suppression of Accumulated and Local Errors for SVD-based LLM Compression.
CoRR, February, 2026

NLI:Non-uniform Linear Interpolation Approximation of Nonlinear Operations for Efficient LLMs Inference.
CoRR, February, 2026

FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

VAEVQ: Enhancing Discrete Visual Tokenization Through Variational Modeling.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models.
CoRR, October, 2025

Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection.
CoRR, February, 2025

MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization.
CoRR, February, 2025

OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting.
CoRR, January, 2025

A 22-nm 64-kB lightning-like hybrid computing-in-memory macro with a compressed adder tree and analog-storage quantizers for transformer and CNNs.
Sci. China Inf. Sci., 2025

MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Static Quantization.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Post-training quantization for re-parameterization via coarse & fine weight splitting.
J. Syst. Archit., February, 2024

I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models.
CoRR, 2024

34.3 A 22nm 64kb Lightning-Like Hybrid Computing-in-Memory Macro with a Compressed Adder Tree and Analog-Storage Quantizers for Transformer and CNNs.
Proceedings of the IEEE International Solid-State Circuits Conference, 2024

2022
3D Object Detection Based on Multi-scale Feature Fusion and Contrastive Learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2022


  Loading...