Siliang Tang

Orcid: 0000-0002-7356-9711

According to our database1, Siliang Tang authored at least 157 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.
CoRR, 2024

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
CoRR, 2024

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning.
CoRR, 2024

Efficient Tuning and Inference for Large Language Models on Textual Graphs.
CoRR, 2024

Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Attribute-driven streaming edge partitioning with reconciliations for distributed graph neural network training.
Neural Networks, August, 2023

Cross-Modal Data Augmentation for Tasks of Different Modalities.
IEEE Trans. Multim., 2023

Single image super-resolution based on progressive fusion of orientation-aware features.
Pattern Recognit., 2023

Graph neural networks meet with distributed graph partitioners and reconciliations.
Neurocomputing, 2023

Language Model is a Branch Predictor for Simultaneous Machine Translation.
CoRR, 2023

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data.
CoRR, 2023

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.
CoRR, 2023

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
CoRR, 2023

GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning.
CoRR, 2023

Improving Vision Anomaly Detection with the Guidance of Language Modality.
CoRR, 2023

ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval.
CoRR, 2023

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model.
CoRR, 2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023

MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning.
CoRR, 2023

Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document.
CoRR, 2023

Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration.
CoRR, 2023

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions.
CoRR, 2023

Meta-augmented Prompt Tuning for Better Few-shot Learning.
CoRR, 2023

Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding.
CoRR, 2023

SGL-PT: A Strong Graph Learner with Graph Prompt Tuning.
CoRR, 2023

A Study on ReLU and Softmax in Transformer.
CoRR, 2023

Negative Sampling with Adaptive Denoising Mixup for Knowledge Graph Embedding.
Proceedings of the Semantic Web - ISWC 2023, 2023

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

FedAA: Using Non-sensitive Modalities to Improve Federated Learning while Preserving Image Privacy.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Continual Vision-Language Representation Learning with Off-Diagonal Information.
Proceedings of the International Conference on Machine Learning, 2023

Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Reasoning Makes Good Annotators : An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Structure-Aware Group Discrimination with Adaptive-View Graph Encoder: A Fast Graph Contrastive Learning Framework.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

SkillQG: Learning to Generate Question for Reading Comprehension Assessment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

KICE: A Knowledge Consolidation and Expansion Framework for Relation Extraction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.
IEEE Trans. Image Process., 2022

NAP: Neural architecture search with pruning.
Neurocomputing, 2022

DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention.
CoRR, 2022

Distilling Task-specific Logical Rules from Large Pre-trained Models.
CoRR, 2022

Citation Trajectory Prediction via Publication Influence Representation Using Temporal Knowledge Graph.
CoRR, 2022

BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval.
CoRR, 2022

Fine-Grained Semantically Aligned Vision-Language Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Hybrid Behavior Patterns for Multimedia Recommendation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile.
Proceedings of the International Conference on Machine Learning, 2022

QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning to Learn by Jointly Optimizing Neural Architecture and Weights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Feeding What You Need by Understanding What You Learned.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Local-Global Memory Neural Network for Medication Prediction.
IEEE Trans. Neural Networks Learn. Syst., 2021

Tell and guess: cooperative learning for natural image caption generation with hierarchical refined attention.
Multim. Tools Appl., 2021

Visual knowledge: an attempt to explore machine creativity.
Frontiers Inf. Technol. Electron. Eng., 2021

Self-Supervised Class Incremental Learning.
CoRR, 2021

Federated Self-Supervised Contrastive Learning via Ensemble Similarity Distillation.
CoRR, 2021

Alleviate Representation Overlapping in Class Incremental Learning by Contrastive Class Concentration.
CoRR, 2021

To be a fast adaptive learner: using game history to defeat opponents.
CoRR, 2021

MGD-GAN: Text-to-Pedestrian Generation Through Multi-grained Discrimination.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Learning to Generate Visual Questions with Noisy Supervision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Improving Weakly Supervised Object Localization via Causal Intervention.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Clustering-Augmented Multi-instance Learning for Neural Relation Extraction.
Proceedings of the Advances in Information Retrieval, 2021

Grounded, Controllable and Debiased Image Completion With Lexical Semantics.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Consensus Graph Representation Learning for Better Grounded Image Captioning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Disentangled Motif-aware Graph Learning for Phrase Grounding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Frame Augmented Alternating Attention Network for Video Question Answering.
IEEE Trans. Multim., 2020

MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution.
IEEE Trans. Multim., 2020

Hybrid embedding and joint training of stacked encoder for opinion question machine reading comprehension.
Frontiers Inf. Technol. Electron. Eng., 2020

Video question answering via grounded cross-attention network learning.
Inf. Process. Manag., 2020

Run Away From your Teacher: Understanding BYOL by a Novel Self-Supervised Approach.
CoRR, 2020

MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination.
CoRR, 2020

Deep Sequential Feature Learning in Clinical Image Classification of Infectious Keratitis.
CoRR, 2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.
CoRR, 2020

Quda: Natural Language Queries for Visual Data Analytics.
CoRR, 2020

Grounded and Controllable Image Completion by Incorporating Lexical Semantics.
CoRR, 2020

Relational Graph Learning for Grounded Video Description Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Photo Stream Question Answer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Alleviate Dataset Shift Problem in Fine-grained Entity Typing with Virtual Adversarial Training.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Generating Natural Language Adversarial Examples on a Large Scale with Generative Models.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020


Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural-DINF: A Neural Network based Framework for Measuring Document Influence.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Rethinking the Bottom-Up Framework for Query-Based Video Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
VPModel: High-Fidelity Product Simulation in a Virtual-Physical Environment.
IEEE Trans. Vis. Comput. Graph., 2019

Cascaded Deep Networks With Multiple Receptive Fields for Infrared Image Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., 2019

Deep Neural Network for Fast and Accurate Single Image Super-Resolution via Channel-Attention-based Fusion of Orientation-aware Features.
CoRR, 2019

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2019.
Proceedings of the 2019 Text Analysis Conference, 2019

Posterior-regularized REINFORCE for Instance Selection in Distant Supervision.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Walking with MIND: Mental Imagery eNhanceD Embodied QA.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Informative Visual Storytelling with Cross-modal Rules.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning Dynamic Context Augmentation for Global Entity Linking.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Orientation-Aware Deep Neural Network for Real Image Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019


KCAT: A Knowledge-Constraint Typing Annotation Tool.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Cross-Relation Cross-Bag Attention for Distantly-Supervised Relation Extraction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Temporality-enhanced knowledgememory network for factoid question answering.
Frontiers Inf. Technol. Electron. Eng., 2018

Entity mention aware document representation.
Inf. Sci., 2018

Two Step Joint Model for Drug Drug Interaction Extraction.
Proceedings of the 2018 Text Analysis Conference, 2018

Multi-modal Sequence to Sequence Learning with Content Attention for Hotspot Traffic Speed Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Reading Document and Answering Question via Global Attentional Inference.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
Temporal Interaction and Causal Influence in Community-Based Question Answering.
IEEE Trans. Knowl. Data Eng., 2017

Flickr group recommendation with auxiliary information in heterogeneous information networks.
Multim. Syst., 2017

Disambiguating named entities with deep supervised learning via crowd labels.
Frontiers Inf. Technol. Electron. Eng., 2017

The Y_dcd_zju Slot Filling System for TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

The ZHI-EDL System for Entity Discovery and Linking at TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

ENCORE: External Neural Constraints Regularized Distant Supervision for Relation Extraction.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Learning Deep Contextual Attention Network for Narrative Photo Stream Captioning.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

NITE: A Neural Inductive Teaching Framework for Domain Specific NER.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Aspect Learning for Multimedia Summarization via Nonparametric Bayesian.
IEEE Trans. Circuits Syst. Video Technol., 2016

LSTM-in-LSTM for generating long descriptions of images.
Comput. Vis. Media, 2016

Sentences Embedding for Slot Filling via Convolutional Neural Networks.
Proceedings of the 2016 Text Analysis Conference, 2016

ZJU Participation in TAC 2016 EDL task.
Proceedings of the 2016 Text Analysis Conference, 2016

The ijk System for EAL at TAC KBP 2016 Event Track.
Proceedings of the 2016 Text Analysis Conference, 2016

Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization.
IEEE Trans. Multim., 2015

Probabilistic Word Selection via Topic Modeling.
IEEE Trans. Knowl. Data Eng., 2015

Cross-Modal Learning to Rank via Latent Joint Representation.
IEEE Trans. Image Process., 2015

The classification of multi-modal data with hidden conditional random field.
Pattern Recognit. Lett., 2015

Combining MIML and Distant Supervision for KBP Slot Filling.
Proceedings of the 2015 Text Analysis Conference, 2015

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2015.
Proceedings of the 2015 Text Analysis Conference, 2015

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multi-modal Retrieval via Deep Textual-Visual Correlation Learning.
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

Sketch the Storyline with CHARCOAL: A Non-Parametric Approach.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

HTMVS: Visualizing hierarchical topics and their evolution.
Proceedings of the 10th IEEE Conference on Visual Analytics Science and Technology, 2015

Flickr group recommendation via heterogeneous information networks.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

2014
Sparse Multi-Modal Hashing.
IEEE Trans. Multim., 2014

Hashing with List-Wise learning to rank.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Cross-Media Hashing with Neural Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Cross-media hashing with kernel regression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Geo-informative discriminative image representation by semi-supervised hierarchical topic modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
A low rank structural large margin method for cross-modal ranking.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

πLDA: document clustering with selective structural constraints.
Proceedings of the ACM Multimedia Conference, 2013

Supervised Nonnegative Tensor Factorization with Maximum-Margin Constraint.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Image Ranking via Attribute Boosted Hypergraph.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Supervised cross-collection topic modeling.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Logistic Tensor Regression for Classification.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

2009
A context quality management infrastructure for complex ubiquitous environment.
Comput. Syst. Sci. Eng., 2009

2004
Virtual Battlefield Attack-Defense Countermeasure Simulation on the Grid.
Proceedings of the Grid and Cooperative Computing, 2004


  Loading...