Hongfei Xue

Orcid: 0000-0001-9691-9668

According to our database, Hongfei Xue authored at least 67 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
CLLAP: Contrastive Learning-based LiDAR-Augmented Pretraining for Enhanced Radar-Camera Fusion.
CoRR, April, 2026

HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models.
CoRR, April, 2026

LiveGesture: Streamable Co-Speech Gesture Generation Model.
CoRR, April, 2026

FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection.
CoRR, April, 2026

KHMP: Frequency-Domain Kalman Refinement for High-Fidelity Human Motion Prediction.
CoRR, March, 2026

Monocular Models are Strong Learners for Multi-View Human Mesh Recovery.
CoRR, March, 2026

OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs.
CoRR, March, 2026

Send Less, Perceive More: Masked Quantized Point Cloud Communication for Loss-Tolerant Collaborative Perception.
CoRR, February, 2026

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem.
CoRR, January, 2026

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era.
CoRR, January, 2026

Crystal Generation using the Fully Differentiable Pipeline and Latent Space Optimization.
CoRR, January, 2026

Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion Prior.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Lifelong Domain Adaptive 3D Human Pose Estimation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

WenetSpeech-Yue: A Large-Scale Cantonese Speech Corpus with Multi-dimensional Annotation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation.
CoRR, November, 2025

Easy Turn: Integrating Acoustic and Linguistic Modalities for Robust Turn-Taking in Full-Duplex Spoken Dialogue Systems.
CoRR, September, 2025

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing.
CoRR, September, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.
CoRR, August, 2025

The TEA-ASLP System for Multilingual Conversational Speech Recognition and Speech Diarization in MLC-SLM 2025 Challenge.
CoRR, July, 2025

mmHand: Toward Pixel-Level-Accuracy Hand Localization Using a Single Commodity mmWave Device.
IEEE Internet Things J., June, 2025

AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment.
CoRR, June, 2025

DanceMosaic: High-Fidelity Dance Generation with Multimodal Editability.
CoRR, April, 2025

BioPose: Biomechanically-Accurate 3D Pose Estimation from Monocular Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

RAM-Hand: Robust Acoustic Multi-Hand Pose Reconstruction Using a Microphone Array.
Proceedings of the 23rd ACM Conference on Embedded Networked Sensor Systems, 2025

Argus: Multi-View Egocentric Human Mesh Reconstruction Based on Stripped-Down Wearable mmWave Add-on.
Proceedings of the 23rd ACM Conference on Embedded Networked Sensor Systems, 2025

Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Physical Backdoor Attacks against mmWave-based Human Activity Recognition.
Proceedings of the 45th IEEE International Conference on Distributed Computing Systems, 2025

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-Based Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MaskControl: Spatio-Temporal Control for Masked Motion Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

mmCooper: A Multi-Agent Multi-Stage Communication-Efficient and Collaboration-Robust Cooperative Perception Framework.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FreqPure: A High-Frequency Preservation Diffusion-Based Purification Method for Protective Perturbation Removal.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GenHMR: Generative Human Mesh Recovery.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Prompt Learning for Multimodal Intent Recognition with Modal Alignment Perception.
Cogn. Comput., November, 2024

Towards Smartphone-based 3D Hand Pose Reconstruction Using Acoustic Signals.
ACM Trans. Sens. Networks, September, 2024

MMHMR: Generative Masked Modeling for Hand Mesh Recovery.
CoRR, 2024

ControlMM: Controllable Masked Motion Generation.
CoRR, 2024

Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text.
CoRR, 2024

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

mmCLIP: Boosting mmWave-based Zero-shot HAR via Signal-Text Alignment.
Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

Malicious Attacks against Multi-Sensor Fusion in Autonomous Driving.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

E-Chat: Emotion-Sensitive Spoken Dialogue System with Large Language Models.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Breakthrough from Nuance and Inconsistency: Enhancing Multimodal Sarcasm Detection with Context-Aware Self-Attention Fusion and Word Weight Calculation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Towards Robust mmWave-based Human Activity Recognition using Large Simulated Dataset for Model Pretraining.
Proceedings of the IEEE International Conference on Big Data, 2024

2023
Towards Generalized mmWave-based Human Pose Estimation through Signal Augmentation.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Macular: A Multi-Task Adversarial Framework for Cross-Lingual Natural Language Understanding.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

TileMask: A Passive-Reflection-based Attack against mmWave Radar Object Detection in Autonomous Driving.
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
M⁴esh: mmWave-Based 3D Human Mesh Construction for Multiple Subjects.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

Fusing Global and Local Features for Generalized AI-Synthesized Image Detection.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
mmMesh: towards 3D real-time dynamic human mesh construction using millimeter-wave.
Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

2020
DeepMV: Multi-View Deep Learning for Device-Free Human Activity Recognition.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2020

Towards 3D human pose construction using wifi.
Proceedings of the MobiCom '20: The 26th Annual International Conference on Mobile Computing and Networking, 2020

2019
DeepFusion: A Deep Learning Framework for the Fusion of Heterogeneous Sensory Data.
Proceedings of the Twentieth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2019

On the Estimation of Treatment Effect with Text Covariates.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Metric Learning: The Generalization Analysis and an Adaptive Algorithm.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Towards Environment Independent Device Free Human Activity Recognition.
Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018

A novel channel-aware attention framework for multi-channel EEG seizure detection via multi-view deep learning.
Proceedings of the 2018 IEEE EMBS International Conference on Biomedical & Health Informatics, 2018

2016
Risk Factor Analysis Based on Deep Learning Models.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

