Yang Liu

Orcid: 0000-0002-9423-9252

Affiliations:
  • Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
  • Xidian University, School of Telecommunications Engineering, Xi'an, China (PhD 2019)


According to our database, Yang Liu authored at least 52 papers between 2016 and 2026.

Bibliography

2026
Video-Based Reward Modeling for Computer-Use Agents.
CoRR, March, 2026

Structure-preserving contrastive graph clustering with dual-channel label alignment.
Neural Networks, 2026

2025
MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models.
CoRR, October, 2025

SaFeR-VLM: Toward Safety-aware Fine-grained Reasoning in Multimodal Models.
CoRR, October, 2025

ODMixer: Fine-Grained Spatial-Temporal MLP for Metro Origin-Destination Prediction.
IEEE Trans. Knowl. Data Eng., September, 2025

Learning to See and Act: Task-Aware View Planning for Robotic Manipulation.
CoRR, August, 2025

AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving.
CoRR, June, 2025

DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba.
CoRR, June, 2025

Incentivizing LLMs to Self-Verify Their Answers.
CoRR, June, 2025

Skywork Open Reasoner 1 Technical Report.
CoRR, May, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.
CoRR, April, 2025

Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench.
CoRR, April, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base.
CoRR, March, 2025

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering.
CoRR, March, 2025

Cross-Modal Causal Representation Learning for Radiology Report Generation.
IEEE Trans. Image Process., 2025

Learn 3D VQA Better with Active Selection and Reannotation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Dual-Level Facilitated Multi-View Contrastive Graph Clustering.
Proceedings of the 31st IEEE International Conference on Parallel and Distributed Systems, 2025

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Cross-modal Causal Relation Alignment for Video Question Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HyperCRS: Hypergraph-Aware Multi-Grained Preference Learning to Burst Filter Bubbles in Conversational Recommendation System.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Progressive Multi-Iteration Registration-Fusion Co-Optimization Network for Unregistered Hyperspectral Image Super-Resolution.
IEEE Trans. Geosci. Remote. Sens., 2024

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization.
CoRR, 2024

VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis.
CoRR, 2024

InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction.
CoRR, 2024

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs.
CoRR, 2024

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models - The Story Goes On.
CoRR, 2024

Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI.
CoRR, 2024

Fine-grained Spatial-temporal MLP Architecture for Metro Origin-Destination Prediction.
CoRR, 2024

Multimodal Embodied Interactive Agent for Cafe Scene.
CoRR, 2024

Diversity Matters: User-Centric Multi-Interest Learning for Conversational Movie Recommendation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024

Confidence-oriented Contrastive Graph Clustering.
Proceedings of the International Joint Conference on Neural Networks, 2024

Self-contradictory reasoning evaluation and detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Urban regional function guided traffic flow prediction.
Inf. Sci., July, 2023

Hybrid-Order Representation Learning for Electricity Theft Detection.
IEEE Trans. Ind. Informatics, 2023

VCD: Visual Causality Discovery for Cross-Modal Question Reasoning.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Visual Causal Scene Refinement for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-Supervised Contrastive Learning for Audio-Visual Action Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2023

2022
TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning.
IEEE Trans. Image Process., 2022

Causal Reasoning Meets Visual Representation Learning: A Prospective Study.
Int. J. Autom. Comput., 2022

Audio-Visual Contrastive Learning for Self-supervised Action Recognition.
CoRR, 2022

2021
Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.
IEEE Trans. Image Process., 2021

Temporal Contrastive Graph for Self-supervised Video Representation Learning.
CoRR, 2021

2020
Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition.
IEEE Trans. Image Process., 2020

A Cloud Detection Method Using Convolutional Neural Network Based on Gabor Transform and Attention Mechanism with Dark Channel Subnet for Remote Sensing Image.
Remote. Sens., 2020

2019
Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2019

2018
Global Temporal Representation Based CNNs for Infrared Action Recognition.
IEEE Signal Process. Lett., 2018

Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition.
Complex., 2018

2017
A Non-Greedy Algorithm for L1-Norm LDA.
IEEE Trans. Image Process., 2017

Adaptive maximum margin analysis for image recognition.
Pattern Recognit., 2017

2016
Combining Multiple Features for Cross-Domain Face Sketch Recognition.
Proceedings of the Biometric Recognition - 11th Chinese Conference, 2016

