Weijie Kong

ORCID: 0000-0003-1700-4801

According to our database, Weijie Kong authored at least 30 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation.
CoRR, May, 2026

Precise: SDE-Consistent Stochastic Sampling for RL Post-Training of Flow-Matching Models.
CoRR, May, 2026

AffordVLA: Injecting Affordance Representations into Vision-Language-Action Models via Implicit Feature Alignment.
CoRR, May, 2026

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning.
CoRR, March, 2026

Manifold-Aware Exploration for Reinforcement Learning in Video Generation.
CoRR, March, 2026

IG-RFT: An Interaction-Guided RL Framework for VLA Models in Long-Horizon Robotic Manipulation.
CoRR, February, 2026

Robotic Bin Packing via Hierarchical Reinforcement Learning.
IEEE Trans. Autom. Sci. Eng., 2026

2025
Global and Local Semantic Completion Learning for Vision-Language Pre-Training.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

AffPose: An Integrated RGB-Based Framework for Simultaneous Pose Estimation and Affordance Detection in Robotic Tool Manipulation.
IEEE Robotics Autom. Lett., October, 2025

HunyuanImage 3.0 Technical Report.
CoRR, September, 2025

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions.
CoRR, September, 2025

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model.
CoRR, May, 2025

Bin Packing Optimization via Deep Reinforcement Learning.
IEEE Robotics Autom. Lett., March, 2025

Symmetric Taper Fiber Cleaving for Centered Waist-Inserted FPI: Temperature-Compensated High-Sensitivity Strain Sensor.
Symmetry, 2025

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions With Multi-Level Representations.
IEEE Access, 2025

2024
HunyuanVideo: A Systematic Framework For Large Video Generative Models.
CoRR, 2024

2023
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022.
CoRR, 2022

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
CoRR, 2022

Egocentric Video-Language Pretraining.
CoRR, 2022

HunYuan_tvr for Text-Video Retrievial.
CoRR, 2022

Egocentric Video-Language Pretraining.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2020
Regression Before Classification for Temporal Action Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019
BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Deep Pedestrian Detection Using Contextual Information and Multi-level Features.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector.
Proceedings of the 2018 ACM Multimedia Conference, 2018

2017
The particle filter based on random number searching algorithm for parameter estimation.
Commun. Stat. Simul. Comput., 2017

2014
Automated Test Approach Based on All Paths Covered Optimal Algorithm and Sequence Priority Selected Algorithm.
IEEE Trans. Intell. Transp. Syst., 2014

2013
A New DHT Supporting Multi-attribute Queries for Grid Information Services.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

