Yao Hu
Orcid: 0009-0006-1274-7111Affiliations:
- Xiaohongshu Inc., Beijing, China
According to our database1,
Yao Hu
authored at least 119 papers
between 2012 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, October, 2025
HyMiRec: A Hybrid Multi-interest Learning Framework for LLM-based Sequential Recommendation.
CoRR, October, 2025
CoRR, October, 2025
DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision.
CoRR, October, 2025
RealBench: A Chinese Multi-image Understanding Benchmark Close to Real-world Scenarios.
CoRR, September, 2025
CoRR, September, 2025
FireRedChat: A Pluggable, Full-Duplex Voice Interaction System with Cascaded and Semi-Cascaded Implementations.
CoRR, September, 2025
SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment.
CoRR, September, 2025
CoRR, September, 2025
Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms.
CoRR, August, 2025
CoRR, July, 2025
CoRR, July, 2025
CoRR, July, 2025
Flux-Sculptor: Text-Driven Rich-Attribute Portrait Editing through Decomposed Spatial Flow Control.
CoRR, July, 2025
CoRR, June, 2025
Plan Your Travel and Travel with Your Plan: Wide-Horizon Planning and Evaluation via LLM.
CoRR, June, 2025
MT<sup>3</sup>: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning.
CoRR, May, 2025
Redefining Machine Translation on Social Network Services with Large Language Models.
CoRR, April, 2025
IEEE Trans. Neural Networks Learn. Syst., March, 2025
CoRR, March, 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection.
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration.
CoRR, January, 2025
DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors.
CoRR, January, 2025
Scalable Overload-Aware Graph-Based Index Construction for 10-Billion-Scale Vector Similarity Search.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network.
Proceedings of the Nineteenth ACM Conference on Recommender Systems, 2025
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025
Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
ZigZagKV: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Int. J. Comput. Vis., November, 2024
Int. J. Comput. Vis., November, 2024
TOMGPT: Reliable Text-Only Training Approach for Cost-Effective Multi-modal Large Language Model.
ACM Trans. Knowl. Discov. Data, August, 2024
PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation.
Neurocomputing, 2024
CoRR, 2024
ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval.
CoRR, 2024
Benchmarking Large Language Models for Conversational Question Answering in Multi-instructional Documents.
CoRR, 2024
Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance.
CoRR, 2024
Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning.
CoRR, 2024
From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition.
CoRR, 2024
Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior.
CoRR, 2024
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Knowledge-Enhanced Multi-perspective Incongruity Perception Network for Multimodal Sarcasm Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the IEEE International Conference on Image Processing, 2024
Proceedings of the IEEE International Conference on Data Mining, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Optimizing traffic efficiency via a reinforcement learning approach based on time allocation.
Int. J. Mach. Learn. Cybern., October, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
IEEE Trans. Image Process., 2022
Proceedings of the Web and Big Data - 6th International Joint Conference, 2022
2021
ACM Trans. Multim. Comput. Commun. Appl., 2021
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Linking the Characters: Video-oriented Social Graph Generation via Hierarchical-cumulative GCN.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
2016
ACM Trans. Multim. Comput. Commun. Appl., 2016
Online robust principal component analysis via truncated nuclear norm regularization.
Neurocomputing, 2016
Neurocomputing, 2016
2015
Neurocomputing, 2015
Proceedings of the Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques, 2015
2014
IEEE Trans. Cybern., 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
IEEE Trans. Pattern Anal. Mach. Intell., 2013
Proceedings of the Intelligence Science and Big Data Engineering, 2013
A Unified Approximate Nearest Neighbor Search Scheme by Combining Data Structure and Hashing.
Proceedings of the IJCAI 2013, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
2012
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012