Donglin Di

Orcid: 0000-0002-2270-3378

According to our database1, Donglin Di authored at least 64 papers between 2019 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Mode Hypergraph Neural Network.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis.
CoRR, August, 2025

DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation.
CoRR, August, 2025

MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention.
CoRR, August, 2025

LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning.
CoRR, July, 2025

SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment.
CoRR, July, 2025

QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation.
CoRR, July, 2025

Spatially Gene Expression Prediction using Dual-Scale Contrastive Learning.
CoRR, June, 2025

Memory-Augmented Incomplete Multimodal Survival Prediction via Cross-Slide and Gene-Attentive Hypergraph Learning.
CoRR, June, 2025

Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion.
CoRR, June, 2025

ChronoTailor: Harnessing Attention Guidance for Fine-Grained Video Virtual Try-On.
CoRR, June, 2025

On Weak-to-Strong Generalization and f-Divergence.
CoRR, June, 2025

GrainBrain: Multiview Identification and Stratification of Defective Grain Kernels.
IEEE Trans. Ind. Informatics, May, 2025

Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph.
Int. J. Comput. Vis., May, 2025

Hypergraph Tversky-Aware Domain Incremental Learning for Brain Tumor Segmentation with Missing Modalities.
CoRR, May, 2025

Multimodal Cancer Survival Analysis via Hypergraph Learning with Cross-Modality Rebalance.
CoRR, May, 2025

Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models.
CoRR, April, 2025

Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow.
CoRR, March, 2025

Semantic Latent Motion for Portrait Video Generation.
CoRR, March, 2025

UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation.
CoRR, February, 2025

Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion.
CoRR, January, 2025

TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Manipulation.
IEEE Trans. Multim., 2025

ToCoAD: Two-Stage Contrastive Learning for Industrial Anomaly Detection.
IEEE Trans. Instrum. Meas., 2025

An Open-Set Semi-Supervised Contrastive Learning for Bearing Fault Diagnosis.
IEEE Trans. Instrum. Meas., 2025

Hypergraph BiFormer for Semantic Segmentation of High-Resolution Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2025

Multi-modal hypergraph contrastive learning for medical image segmentation.
Pattern Recognit., 2025

MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GRPose: Learning Graph Relations for Human Image Generation with Pose Priors.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

MV-VTON: Multi-View Virtual Try-On with Diffusion Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Divide-Aggregate Heterogeneous Hypergraph for large-scale user intention detection.
Knowl. Based Syst., 2024

UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control.
CoRR, 2024

Revitalizing Reconstruction Models for Multi-class Anomaly Detection via Class-Aware Contrastive Learning.
CoRR, 2024

TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt.
CoRR, 2024

Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch.
CoRR, 2024

FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset.
CoRR, 2024

GRPose: Learning Graph Relations for Human Image Generation with Pose Priors.
CoRR, 2024

Real Face Video Animation Platform.
CoRR, 2024

One-Shot Pose-Driving Face Animation Platform.
CoRR, 2024

TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation.
CoRR, 2024

Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Boundary-Guided Learning for Gene Expression Prediction in Spatial Transcriptomics.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

2023
A sampling method based on forecasting and combinatorial optimization for high performance A/B testing.
Frontiers Comput. Sci., December, 2023

Generating Hypergraph-Based High-Order Representations of Whole-Slide Histopathological Images for Survival Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch.
ACM Trans. Asian Low Resour. Lang. Inf. Process., April, 2023

StfMLP: Spatiotemporal Fusion Multilayer Perceptron for Remote-Sensing Images.
IEEE Geosci. Remote. Sens. Lett., 2023

Dual attentional transformer for video visual relation prediction.
Neurocomputing, 2023

Scene Style Text Editing.
CoRR, 2023

Self-Supervised Cross-Language Scene Text Editing.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Big-Hypergraph Factorization Neural Network for Survival Prediction From Whole Slide Image.
IEEE Trans. Image Process., 2022

SERNet: Squeeze and Excitation Residual Network for Semantic Segmentation of High-Resolution Remote Sensing Images.
Remote. Sens., 2022

AGNet: An Attention-Based Graph Network for Point Cloud Classification and Segmentation.
Remote. Sens., 2022

Multi-Scale U-Shape MLP for Hyperspectral Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

Context-Aware Attentional Graph U-Net for Hyperspectral Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

Binary Neural Network for Multispectral Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2022

GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
geoGAT: Graph Model Based on Attention Mechanism for Geographic Text Classification.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

Hypergraph learning for identification of COVID-19 with CT imaging.
Medical Image Anal., 2021

2020
Hypergraph Learning for Identification of COVID-19 with CT Imaging.
CoRR, 2020

Ranking-Based Survival Prediction on Histopathological Whole-Slide Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

2019
Relation Understanding in Videos: A Grand Challenge Overview.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Annotating Objects and Relations in User-Generated Videos.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Multiple Hypothesis Video Relation Detection.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

A Neural Network Approach to Verb Phrase Ellipsis Resolution.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019


  Loading...