Siliang Tang
Orcid: 0000-0002-7356-9711Affiliations:
- Zhejiang University, College of Computer Science and Technology, Hangzhou, China
- National University of Ireland, Maynooth, Ireland (PhD 2012)
According to our database1,
Siliang Tang
authored at least 222 papers
between 2004 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
IEEE Trans. Neural Networks Learn. Syst., August, 2025
CoRR, June, 2025
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities.
CoRR, June, 2025
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models.
CoRR, June, 2025
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL.
CoRR, June, 2025
CoRR, June, 2025
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation.
CoRR, June, 2025
CoRR, April, 2025
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark.
CoRR, March, 2025
SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models.
CoRR, March, 2025
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation.
CoRR, March, 2025
CoRR, March, 2025
IEEE Trans. Neural Networks Learn. Syst., February, 2025
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation.
CoRR, February, 2025
IEEE Trans. Multim., 2025
EvidenceMap: Learning evidence analysis to unleash the power of small language models for biomedical question answering.
Artif. Intell. Medicine, 2025
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs.
Proceedings of the ACM on Web Conference 2025, 2025
MM-CARP: Multimodal Model with Cross-Modal Retrieval-Augmented and Visual Region Perception.
Proceedings of the MultiMedia Modeling, 2025
LLAUS: A High-Quality Instruction-Tuned Large Vision Language Assistant for UltraSound.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
ITERATE: Image-Text Enhancement, Retrieval, and Alignment for Transmodal Evolution with LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
ChatMap: Mining Human Thought Processes for Customer Service Chatbots via Multi-Agent Collaboration.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Ask Questions With Double Hints: Visual Question Generation With Answer-Awareness and Region-Reference.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Unleash the Power of Inconsistency-Based Semi-Supervised Active Learning by Dynamic Programming of Curriculum Learning.
IEEE Trans. Knowl. Data Eng., November, 2024
RustGraph: Robust Anomaly Detection in Dynamic Graphs by Jointly Learning Structural-Temporal Dependency.
IEEE Trans. Knowl. Data Eng., July, 2024
Neurocomputing, 2024
Neurocomputing, 2024
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation.
CoRR, 2024
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework.
CoRR, 2024
Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness.
CoRR, 2024
CoRR, 2024
Align<sup>2</sup>LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation.
CoRR, 2024
CoRR, 2024
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making.
CoRR, 2024
IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization.
CoRR, 2024
CoRR, 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation.
CoRR, 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.
CoRR, 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
CoRR, 2024
GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning.
Proceedings of the ACM on Web Conference 2024, 2024
MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning.
Proceedings of the ACM on Web Conference 2024, 2024
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Unified Generative and Discriminative Training for Multi-modal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023
Attribute-driven streaming edge partitioning with reconciliations for distributed graph neural network training.
Neural Networks, August, 2023
IEEE Trans. Multim., 2023
Single image super-resolution based on progressive fusion of orientation-aware features.
Pattern Recognit., 2023
Neurocomputing, 2023
Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model.
CoRR, 2023
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration.
CoRR, 2023
Proceedings of the Semantic Web - ISWC 2023, 2023
Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
FedAA: Using Non-sensitive Modalities to Improve Federated Learning while Preserving Image Privacy.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the International Conference on Machine Learning, 2023
Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Reasoning Makes Good Annotators : An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Structure-Aware Group Discrimination with Adaptive-View Graph Encoder: A Fast Graph Contrastive Learning Framework.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Image Process., 2022
Citation Trajectory Prediction via Publication Influence Representation Using Temporal Knowledge Graph.
CoRR, 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the International Conference on Machine Learning, 2022
QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Neural Networks Learn. Syst., 2021
Tell and guess: cooperative learning for natural image caption generation with hierarchical refined attention.
Multim. Tools Appl., 2021
Frontiers Inf. Technol. Electron. Eng., 2021
CoRR, 2021
Alleviate Representation Overlapping in Class Incremental Learning by Contrastive Class Concentration.
CoRR, 2021
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the Advances in Information Retrieval, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE Trans. Multim., 2020
MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution.
IEEE Trans. Multim., 2020
Hybrid embedding and joint training of stacked encoder for opinion question machine reading comprehension.
Frontiers Inf. Technol. Electron. Eng., 2020
Inf. Process. Manag., 2020
CoRR, 2020
CoRR, 2020
Deep Sequential Feature Learning in Clinical Image Classification of Infectious Keratitis.
CoRR, 2020
CoRR, 2020
CoRR, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Alleviate Dataset Shift Problem in Fine-grained Entity Typing with Virtual Adversarial Training.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Generating Natural Language Adversarial Examples on a Large Scale with Generative Models.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
IEEE Trans. Vis. Comput. Graph., 2019
Cascaded Deep Networks With Multiple Receptive Fields for Infrared Image Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., 2019
Deep Neural Network for Fast and Accurate Single Image Super-Resolution via Channel-Attention-based Fusion of Orientation-aware Features.
CoRR, 2019
Proceedings of the 2019 Text Analysis Conference, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Frontiers Inf. Technol. Electron. Eng., 2018
Proceedings of the 2018 Text Analysis Conference, 2018
Multi-modal Sequence to Sequence Learning with Content Attention for Hotspot Traffic Speed Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
2017
IEEE Trans. Knowl. Data Eng., 2017
Flickr group recommendation with auxiliary information in heterogeneous information networks.
Multim. Syst., 2017
Frontiers Inf. Technol. Electron. Eng., 2017
Proceedings of the 2017 Text Analysis Conference, 2017
Proceedings of the 2017 Text Analysis Conference, 2017
ENCORE: External Neural Constraints Regularized Distant Supervision for Relation Extraction.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
2016
IEEE Trans. Circuits Syst. Video Technol., 2016
Proceedings of the 2016 Text Analysis Conference, 2016
Proceedings of the 2016 Text Analysis Conference, 2016
Proceedings of the 2016 Text Analysis Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
2015
Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization.
IEEE Trans. Multim., 2015
IEEE Trans. Image Process., 2015
Pattern Recognit. Lett., 2015
Proceedings of the 2015 Text Analysis Conference, 2015
Proceedings of the 2015 Text Analysis Conference, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the 10th IEEE Conference on Visual Analytics Science and Technology, 2015
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015
2014
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Geo-informative discriminative image representation by semi-supervised hierarchical topic modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
2013
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013
2012
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012
2009
A context quality management infrastructure for complex ubiquitous environment.
Comput. Syst. Sci. Eng., 2009
2004
Proceedings of the Grid and Cooperative Computing, 2004