We stand with Ukraine

We stand with Ukraine

Weijie Kong

Orcid: 0000-0003-1700-4801

According to our database¹, Weijie Kong authored at least 30 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation.

[DOI]

,

,

,

,

,

,

Jiangfeng Xiong

,

,

,

,

,

CoRR, May, 2026

Precise: SDE-Consistent Stochastic Sampling for RL Post-Training of Flow-Matching Models.

[DOI]

,

,

,

,

,

,

Jiangfeng Xiong

,

,

,

CoRR, May, 2026

AffordVLA: Injecting Affordance Representations into Vision-Language-Action Models via Implicit Feature Alignment.

[DOI]

,

,

,

CoRR, May, 2026

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning.

[DOI]

,

,

,

,

Jiangfeng Xiong

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

Manifold-Aware Exploration for Reinforcement Learning in Video Generation.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

IG-RFT: An Interaction-Guided RL Framework for VLA Models in Long-Horizon Robotic Manipulation.

[DOI]

,

,

,

CoRR, February, 2026

Robotic Bin Packing via Hierarchical Reinforcement Learning.

[DOI]

,

,

,

,

,

IEEE Trans Autom. Sci. Eng., 2026

2025

Global and Local Semantic Completion Learning for Vision-Language Pre-Training.

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

AffPose: An Integrated RGB-Based Framework for Simultaneous Pose Estimation and Affordance Detection in Robotic Tool Manipulation.

[DOI]

,

,

,

,

,

IEEE Robotics Autom. Lett., October, 2025

HunyuanImage 3.0 Technical Report.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Jiangfeng Xiong

,

,

,

,

,

,

,

,

,

,

,

,

,

Zhengguang Zhou

,

,

,

,

,

,

,

,

,

,

,

Tianxiang Zheng

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Bin Packing Optimization via Deep Reinforcement Learning.

[DOI]

,

,

,

IEEE Robotics Autom. Lett., March, 2025

Symmetric Taper Fiber Cleaving for Centered Waist-Inserted FPI: Temperature-Compensated High-Sensitivity Strain Sensor.

[DOI]

,

,

,

,

,

,

,

Symmetry, 2025

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions With Multi-Level Representations.

[DOI]

,

,

,

,

,

IEEE Access, 2025

2024

HunyuanVideo: A Systematic Framework For Large Video Generative Models.

[DOI]

,

,

,

,

,

,

Jiangfeng Xiong

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

2023

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022.

[DOI]

Kevin Qinghong Lin

,

Alex Jinpeng Wang

,

,

,

,

Eric Zhongcong Xu

,

,

,

,

,

,

,

,

,

,

Mike Zheng Shou

CoRR, 2022

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.

[DOI]

Kevin Qinghong Lin

,

Alex Jinpeng Wang

,

,

Eric Zhongcong Xu

,

,

,

,

,

,

,

,

Mike Zheng Shou

CoRR, 2022

Egocentric Video-Language Pretraining.

[DOI]

Kevin Qinghong Lin

,

Alex Jinpeng Wang

,

,

,

,

Eric Zhongcong Xu

,

,

,

,

,

,

,

,

,

,

Mike Zheng Shou

CoRR, 2022

HunYuan_tvr for Text-Video Retrievial.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2022

Egocentric Video-Language Pretraining.

[DOI]

Kevin Qinghong Lin

,

,

,

,

,

Eric Zhongcong Xu

,

,

,

,

,

,

,

,

,

,

Mike Zheng Shou

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2020

Regression Before Classification for Temporal Action Detection.

[DOI]

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization.

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection.

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Deep Pedestrian Detection Using Contextual Information and Multi-level Features.

[DOI]

,

,

,

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector.

[DOI]

,

,

,

,

,

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017

The particle filter based on random number searching algorithm for parameter estimation.

[DOI]

,

,

,

Commun. Stat. Simul. Comput., 2017

2014

Automated Test Approach Based on All Paths Covered Optimal Algorithm and Sequence Priority Selected Algorithm.

[DOI]

,

,

,

IEEE Trans. Intell. Transp. Syst., 2014

2013

A New DHT Supporting Multi-attribute Queries for Grid Information Services.

[DOI]

,

,

,

,

,

,

Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

Loading...