Hong Chen

Orcid: 0000-0002-0943-2286

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, Beijing National Research Center for Information Science and Technology, Beijing, China


According to our database, Hong Chen authored at least 55 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
ScenarioDiff: Text-to-video Generation with Dynamic Transformations of Scene Conditions.
Int. J. Comput. Vis., July, 2025

Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning.
CoRR, June, 2025

Automated Disentangled Sequential Recommendation with Large Language Models.
ACM Trans. Inf. Syst., March, 2025

VideoDreamer: Customized Multi-Subject Text-to-Video Generation With Disen-Mix Finetuning on Language-Video Foundation Models.
IEEE Trans. Multim., 2025

Aligning Large Multimodal Model with Sequential Recommendation via Content-Behavior Guidance.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Identity-Text Video Corpus Grounding.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Behavior Importance-Aware Graph Neural Architecture Search for Cross-Domain Recommendation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Disentangled Representation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Dynamic Spatio-Temporal Graph Reasoning for VideoQA With Self-Supervised Event Recognition.
IEEE Trans. Image Process., 2024

Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond.
CoRR, 2024

Multi-sentence Video Grounding for Long Video Generation.
CoRR, 2024

Curriculum Learning: Theories, Approaches, Applications, Tools, and Future Directions in the Era of Large Language Models.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Neighbor Does Matter: Curriculum Global Positive-Negative Sampling for Vision-Language Pre-training.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DisenStudio: Customized Multi-Subject Text-to-Video Generation with Disentangled Spatial Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Curriculum Learning for Multimedia in the Era of Large Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Large Language Model with Curriculum Reasoning for Visual Concept Recognition.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

CurBench: Curriculum Learning Benchmark.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Disentangled Continual Graph Neural Architecture Search with Invariant Modular Supernet.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

VTimeLLM: Empower LLM to Grasp Video Moments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Disentangled Representation Learning for Recommendation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos.
CoRR, 2023

LLM4VG: Large Language Models Evaluation for Video Grounding.
CoRR, 2023

Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search.
CoRR, 2023

VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning.
CoRR, 2023

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation.
CoRR, 2023

Cross-domain Recommendation with Behavioral Importance Perception.
Proceedings of the ACM Web Conference 2023, 2023

Multi-task Graph Neural Architecture Search with Task-aware Collaboration and Curriculum.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Joint Data-Task Generation for Auxiliary Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Global-Local GraphFormer: Towards Better Understanding of User Intentions in Sequential Recommendation.
Proceedings of the ACM Multimedia Asia 2023, 2023

Intra- and Inter-Modal Curriculum for Multimodal Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Diff4Rec: Sequential Recommendation with Curriculum-scheduled Diffusion Augmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

TIVA-KG: A Multimodal Knowledge Graph with Text, Image, Video and Audio.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mixup-Augmented Temporally Debiased Video Grounding with Content-Location Disentanglement.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Disentangled Representation Learning for Multimedia.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Adaptive Disentangled Transformer for Sequential Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation.
Proceedings of the International Conference on Machine Learning, 2023

Curriculum Multi-Negative Augmentation for Debiased Video Grounding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification.
CoRR, 2022

Module-Aware Optimization for Auxiliary Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CurML: A Curriculum Machine Learning Library.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Curriculum-NAS: Curriculum Weight-Sharing Neural Architecture Search.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

AVQA: A Dataset for Audio-Visual Question Answering on Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dynamic Spatio-Temporal Modular Network for Video Question Answering.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Large-Scale Graph Neural Architecture Search.
Proceedings of the International Conference on Machine Learning, 2022

Auxiliary Learning with Joint Task and Data Scheduling.
Proceedings of the International Conference on Machine Learning, 2022

NeurIPS’22 Cross-Domain MetaDL competition: Design and baseline results.
Proceedings of the ECML/PKDD Workshop on Meta-Knowledge Transfer, 2022

2021
Curriculum Disentangled Recommendation with Noisy Multi-feedback.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Multimodal Disentangled Representation for Recommendation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

