Chong Ruan

Orcid: 0009-0000-7896-2558

According to our database1, Chong Ruan authored at least 27 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures.
CoRR, May, 2025

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition.
CoRR, April, 2025

Inference-Time Scaling for Generalist Reward Modeling.
CoRR, April, 2025

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.
CoRR, February, 2025

Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling.
CoRR, January, 2025

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning.
CoRR, January, 2025

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
DeepSeek-V3 Technical Report.
CoRR, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding.
CoRR, 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.
CoRR, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence.
CoRR, 2024

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data.
CoRR, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model.
CoRR, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding.
CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
CoRR, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
An automatic vulnerability classification framework based on BiGRU-TextCNN.
Proceedings of the International Neural Network Society Workshop on Deep Learning Innovations and Applications, 2023

2018
Meteor++: Incorporating Copy Knowledge into Machine Translation Evaluation.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Research on the influence and measurement of harmonic in power supply system for super capacitor tram.
Proceedings of the 2018 Annual IEEE International Systems Conference, 2018

Sparse Word Representation for RNN Language Models on Cellphones.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

2017
Optimize Hierarchical Softmax with Word Similarity Knowledge.
POLIBITS, 2017

2016
Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2014
Block-based multiscale error concealment using low-rank completion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014


  Loading...