Jiaming Tang

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
π<sub>0.7</sub>: a Steerable Generalist Robotic Foundation Model with Emergent Capabilities.
CoRR, April, 2026

MEM: Multi-Scale Embodied Memory for Vision Language Action Models.
CoRR, March, 2026

2025
AgentBay: A Hybrid Interaction Sandbox for Seamless Human-AI Intervention in Agentic Systems.
CoRR, December, 2025

Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding.
CoRR, December, 2025

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference.
CoRR, December, 2025

Multi-Perspective Semantic Segmentation of Ground Penetrating Radar Images for Pavement Subsurface Objects.
IEEE Trans. Intell. Transp. Syst., September, 2025

Passivity-based synchronous control of Markov jump systems with actuator saturation.
Frontiers Inf. Technol. Electron. Eng., September, 2025

Twilight: Adaptive Attention Sparsity with Hierarchical Top-<i>p</i> Pruning.
CoRR, February, 2025

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

Transitive Array: An Efficient GEMM Accelerator with Result Reuse.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

BioLedger: A Token Incentive-Based Framework for the Secure Sharing of Genetic Data in Consortium Blockchain.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024
AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration.
GetMobile Mob. Comput. Commun., December, 2024

DCRMTA: Unbiased Causal Representation for Multi-touch Attribution.
CoRR, 2024

A brain topography graph embedded convolutional neural network for EEG-based motor imagery classification.
Biomed. Signal Process. Control., 2024

AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Analyzing Corporate Privacy Policies using AI Chatbots.
Proceedings of the 2024 ACM on Internet Measurement Conference, 2024

QUEST: Query-Aware Sparsity for Efficient Long-Context LLM Inference.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration.
CoRR, 2023

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

2022
Low Compaction Level Detection of Newly Constructed Asphalt Pavement Based on Regional Index.
Sensors, 2022

Crack Unet: Crack Recognition Algorithm Based on Three-Dimensional Ground Penetrating Radar Images.
Sensors, 2022

Efficient object detection method based on aerial optical sensors for remote sensing.
Displays, 2022

2021
Balanced Dual-Band Superconducting Filter Using Stepped-Impedance Resonators With High Band-to-Band Isolation and Wide Stopband.
IEEE Trans. Circuits Syst. II Express Briefs, 2021

2020
Compact Wide-Stopband Dual-Band Balanced Filter Using an Electromagnetically Coupled SIR Pair With Controllable Transmission Zeros and Bandwidths.
IEEE Trans. Circuits Syst., 2020

