Di Wang

Orcid: 0009-0003-2330-6854

Affiliations:
  • Tencent Inc., Hunyuan, Machine Learning Platform Department, Large Language Model Department, China


According to our database1, Di Wang authored at least 28 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent.
CoRR, December, 2025

Robust Beamforming for Secure and Covert Communications Against Location Uncertainty.
IEEE Wirel. Commun. Lett., November, 2025

Towards a Comprehensive Scaling Law of Mixture-of-Experts.
CoRR, September, 2025

Reinforcement Learning on Pre-Training Data.
CoRR, September, 2025

Proximal Supervised Fine-Tuning.
CoRR, August, 2025

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again.
CoRR, July, 2025

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels.
CoRR, July, 2025

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material.
CoRR, June, 2025

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model.
CoRR, May, 2025

TransMamba: Flexibly Switching between Transformer and Mamba.
CoRR, March, 2025

BeamVQ: Beam Search with Vector Quantization to Mitigate Data Scarcity in Physical Spatiotemporal Forecasting.
CoRR, February, 2025

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation.
CoRR, January, 2025

Hydraulis: Balancing Large Transformer Model Training via Co-designing Parallel Strategies and Data Assignment.
Proc. ACM Manag. Data, 2025

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Scaling Laws for Floating-Point Quantization Training.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Autonomy-of-Experts Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

The Security Threat of Compressed Projectors in Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

HMoE: Heterogeneous Mixture of Experts for Language Modeling.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
HunyuanVideo: A Systematic Framework For Large Video Generative Models.
CoRR, 2024

More Expressive Attention with Negative Weights.
CoRR, 2024

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation.
CoRR, 2024

HMoE: Heterogeneous Mixture of Experts for Language Modeling.
CoRR, 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding.
CoRR, 2024

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2021
PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021


  Loading...