Yang Yang

Orcid: 0000-0002-5070-4511

Affiliations:
  • University of Electronic Science and Technology of China, Center for Future Media, Chengdu, China
  • National University of Singapore, Singapore (former)
  • University of Queensland, Brisbane, Australia (PhD 2012)


According to our database1, Yang Yang authored at least 311 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey.
IEEE Trans. Knowl. Data Eng., January, 2024

Semantics Disentangling for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2024

Coreset Learning-Based Sparse Black-Box Adversarial Attack for Video Recognition.
IEEE Trans. Inf. Forensics Secur., 2024

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning.
CoRR, 2024

Weakly-Supervised Mirror Detection via Scribble Annotations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information Maximization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Independency Adversarial Learning for Cross-Modal Sound Separation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Composition-Aware Image Steganography Through Adversarial Self-Generated Supervision.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Visual Embedding Augmentation in Fourier Domain for Deep Metric Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Less is Better: Exponential Loss for Cross-Modal Matching.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Hypercomplex context guided interaction modeling for scene graph generation.
Pattern Recognit., September, 2023

On the Imaginary Wings: Text-Assisted Complex-Valued Fusion Network for Fine-Grained Visual Classification.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Self-Supervised Discriminative Feature Learning for Deep Multi-View Clustering.
IEEE Trans. Knowl. Data Eng., July, 2023

Relation-mining self-attention network for skeleton-based human action recognition.
Pattern Recognit., July, 2023

Category Alignment Adversarial Learning for Cross-Modal Retrieval.
IEEE Trans. Knowl. Data Eng., May, 2023

Interpretable Signed Link Prediction With Signed Infomax Hyperbolic Graph.
IEEE Trans. Knowl. Data Eng., April, 2023

Region Attention Enhanced Unsupervised Cross-Domain Facial Emotion Recognition.
IEEE Trans. Knowl. Data Eng., April, 2023

Improving Rumor Detection by Promoting Information Campaigns With Transformer-Based Generative Adversarial Learning.
IEEE Trans. Knowl. Data Eng., March, 2023

Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation.
IEEE Trans. Multim., 2023

Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval.
IEEE Trans. Multim., 2023

Quaternion Relation Embedding for Scene Graph Generation.
IEEE Trans. Multim., 2023

Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment.
IEEE Trans. Image Process., 2023

Physics Guided Remote Sensing Image Synthesis Network for Ship Detection.
IEEE Trans. Geosci. Remote. Sens., 2023

Quaternion Representation Learning for cross-modal matching.
Knowl. Based Syst., 2023

Generalized Damping Torque Analysis of Ultra-Low Frequency Oscillation in the Jerk Space.
CoRR, 2023

Solving Math Word Problems with Reexamination.
CoRR, 2023

CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis.
CoRR, 2023

Focusing on Relevant Responses for Multi-modal Rumor Detection.
CoRR, 2023

Faster Video Moment Retrieval with Point-Level Supervision.
CoRR, 2023

Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era.
CoRR, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
CoRR, 2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement.
CoRR, 2023

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding.
CoRR, 2023

A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
CoRR, 2023

Multimodal Apology: Using WebXR to Repair Trust with Virtual Companion.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Cross-modal Consistency Learning with Fine-grained Fusion Network for Multimodal Fake News Detection.
Proceedings of the ACM Multimedia Asia 2023, 2023

Precise Target-Oriented Attack against Deep Hashing-based Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Noise-Robust Continual Test-Time Domain Adaptation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Task-Adversarial Adaptation for Multi-modal Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multimodal Physiological Signals Fusion for Online Emotion Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Open-Scenario Domain Adaptive Object Detection in Autonomous Driving.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DCEL: Deep Cross-modal Evidential Learning for Text-Based Person Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Faster Video Moment Retrieval with Point-Level Supervision.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-modality Representation Interactive Learning for Multimodal Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Zero-shot Sketch-based Image Retrieval with Adaptive Balanced Discriminability and Generalizability.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Multi-granularity Separation Network for Text-Based Person Retrieval with Bidirectional Refinement Regularization.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Region-Aware Semantic Consistency for Unsupervised Domain-Adaptive Semantic Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Unsupervised Sounding Pixel Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Multilateral Semantic Relations Modeling for Image Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-level Storage Optimization for Intermediate Data in AI Model Training.
Proceedings of the Databases Theory and Applications, 2023

2022
Universal adversarial perturbations generative network.
World Wide Web, 2022

Virtual Reality Aided High-Quality 3D Reconstruction by Remote Drones.
ACM Trans. Internet Techn., 2022

Mind the Remainder: Taylor's Theorem View on Recurrent Neural Networks.
IEEE Trans. Neural Networks Learn. Syst., 2022

One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework.
IEEE Trans. Multim., 2022

Push & Pull: Transferable Adversarial Examples With Attentive Attack.
IEEE Trans. Multim., 2022

Localization of Networks on 3D Terrain Surfaces.
IEEE Trans. Mob. Comput., 2022

Answer Again: Improving VQA With Cascaded-Answering Model.
IEEE Trans. Knowl. Data Eng., 2022

Modeling Two-Stream Correspondence for Visual Sound Separation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Entity Slot Filling for Visual Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Universal Weighting Metric Learning for Cross-Modal Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Transferring Inter-Class Correlation for Teacher-Student frameworks with flexible models.
Knowl. Based Syst., 2022

Semantic guided knowledge graph for large-scale zero-shot learning.
J. Vis. Commun. Image Represent., 2022

Domain adaptive state representation alignment for reinforcement learning.
Inf. Sci., 2022

Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning.
CoRR, 2022

Medium Transmission Map Matters for Learning to Restore Real-World Underwater Images.
CoRR, 2022

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark Under Heterogeneous AI Computing Platforms.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Point to Rectangle Matching for Image Text Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Rethinking Open-World Object Detection in Autonomous Driving Scenarios.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ARRA: Absolute-Relative Ranking Attack against Image Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Non-Autoregressive Cross-Modal Coherence Modelling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Accelerated Sign Hunter: A Sign-based Black-box Attack via Branch-Prune Strategy and Stabilized Hierarchical Search.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

NAS-StegNet: Lightweight Image Steganography Networks via Neural Architecture Search.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Cross-Modal Hybrid Feature Fusion for Image-Sentence Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Radial Graph Convolutional Network for Visual Question Generation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Exploiting Subspace Relation in Semantic Labels for Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., 2021

Collaborative Learning for Extremely Low Bit Asymmetric Hashing.
IEEE Trans. Knowl. Data Eng., 2021

Adversarial Attack Against Urban Scene Segmentation for Autonomous Vehicles.
IEEE Trans. Ind. Informatics, 2021

Adaptive Component Embedding for Domain Adaptation.
IEEE Trans. Cybern., 2021

Arbitrary-View Human Action Recognition: A Varying-View RGB-D Action Dataset.
IEEE Trans. Circuits Syst. Video Technol., 2021

Challenging tough samples in unsupervised domain adaptation.
Pattern Recognit., 2021

View-invariant action recognition via Unsupervised AttentioN Transfer (UANT).
Pattern Recognit., 2021

Arbitrary-view human action recognition via novel-view action generation.
Pattern Recognit., 2021

Learning a Weighted Classifier for Conditional Domain Adaptation.
Knowl. Based Syst., 2021

Toward Effective Intrusion Detection Using Log-Cosh Conditional Variational Autoencoder.
IEEE Internet Things J., 2021

A Cognitive Memory-Augmented Network for Visual Anomaly Detection.
IEEE CAA J. Autom. Sinica, 2021

ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval.
CoRR, 2021

VHCN : A Hybrid Data Center Network Based on VLC Link.
Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Hybrid Event / Melbourne, VIC, Australia, December 14 - 17, 2021, 2021

A Rate-based Drone Control with Adaptive Origin Update in Telexistence.
Proceedings of the IEEE Virtual Reality and 3D User Interfaces, 2021

Hierarchical Composition Learning for Composed Query Image Retrieval.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Semantic Enhanced Cross-modal GAN for Zero-shot Learning.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Multi-scale Dynamic Network for Temporal Action Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Towards Unbiased Covid-19 Lesion Localisation And Segmentation Via Weakly Supervised Learning.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Attention-Based Relation Reasoning Network for Video-Text Retrieval.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Region Semantically Aligned Network for Zero-Shot Learning.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Adaptive Knowledge Driven Regularization for Deep Neural Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Balanced Open Set Domain Adaptation via Centroid Alignment.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Cross-Modal Attention With Semantic Consistence for Image-Text Matching.
IEEE Trans. Neural Networks Learn. Syst., 2020

The Disruptions of 5G on Data-Driven Technologies and Applications.
IEEE Trans. Knowl. Data Eng., 2020

Self-weighted Multi-view Fuzzy Clustering.
ACM Trans. Knowl. Discov. Data, 2020

Sparse Graph Connectivity for Image Segmentation.
ACM Trans. Knowl. Discov. Data, 2020

PML-LocNet: Improving Object Localization With Prior-Induced Multi-View Learning Network.
IEEE Trans. Image Process., 2020

Deep-Like Hashing-in-Hash for Visual Retrieval: An Embarrassingly Simple Method.
IEEE Trans. Image Process., 2020

A Context Knowledge Map Guided Coarse-to-Fine Action Recognition.
IEEE Trans. Image Process., 2020

Scalable Deep Hashing for Large-Scale Social Image Retrieval.
IEEE Trans. Image Process., 2020

Graph Convolutional Network Hashing.
IEEE Trans. Cybern., 2020

Ternary Adversarial Networks With Self-Supervision for Zero-Shot Cross-Modal Retrieval.
IEEE Trans. Cybern., 2020

Bidirectional Discrete Matrix Factorization Hashing for Image Search.
IEEE Trans. Cybern., 2020

Optimal Projection Guided Transfer Hashing for Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2020

A Survey of Human Action Analysis in HRI Applications.
IEEE Trans. Circuits Syst. Video Technol., 2020

Leveraging unpaired out-of-domain data for image captioning.
Pattern Recognit. Lett., 2020

Learning explicitly transferable representations for domain adaptation.
Neural Networks, 2020

Discovering attractive segments in the user-generated video streams.
Inf. Process. Manag., 2020

Interpretable Signed Link Prediction with Signed Infomax Hyperbolic Graph.
CoRR, 2020

Scene graph generation via multi-relation classification and cross-modal attention coordinator.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Graph-based variational auto-encoder for generalized zero-shot learning.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Self-supervised adversarial learning for cross-modal retrieval.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Multi-level expression guided attention network for referring expression comprehension.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Semantic feature augmentation for fine-grained visual categorization with few-sample training.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Learning Optimization-based Adversarial Perturbations for Attacking Sequential Recognition Models.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Text-Embedded Bilinear Model for Fine-Grained Visual Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Hierarchical Bi-Directional Feature Perception Network for Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Incomplete Cross-modal Retrieval with Dual-Aligned Variational Autoencoders.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Class-Center Involved Triplet Loss for Skin Disease Classification on Imbalanced Data.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

CC-LSTM: Cross and Conditional Long-Short Time Memory for Video Captioning.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Fooled by Imagination: Adversarial Attack to Image Captioning Via Perturbation in Complex Domain.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Universal Weighting Metric Learning for Cross-Modal Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Probability Weighted Compact Feature for Domain Adaptive Retrieval.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning from the Past: Continual Meta-Learning with Bayesian Graph Neural Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Embedding and predicting the event at early stage.
World Wide Web, 2019

Hierarchical Multi-Clue Modelling for POI Popularity Prediction with Heterogeneous Tourist Information.
IEEE Trans. Knowl. Data Eng., 2019

More is Better: Precise and Detailed Image Captioning Using Online Positive Recall and Missing Concepts Mining.
IEEE Trans. Image Process., 2019

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.
IEEE Trans. Image Process., 2019

Collective Reconstructive Embeddings for Cross-Modal Hashing.
IEEE Trans. Image Process., 2019

Describing Video With Attention-Based Bidirectional LSTM.
IEEE Trans. Cybern., 2019

Multi-scale aggregation network for temporal action proposals.
Pattern Recognit. Lett., 2019

Web-based SBLR method of multimedia tools for computer-aided drawing.
Multim. Tools Appl., 2019

Word-to-region attention network for visual question answering.
Multim. Tools Appl., 2019

Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network.
Neurocomputing, 2019

Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling.
CoRR, 2019

CANZSL: Cycle-Consistent Adversarial Networks for Zero-Shot Learning from Natural Language.
CoRR, 2019

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation.
CoRR, 2019

EncryptGAN: Image Steganography with Domain Transform.
CoRR, 2019

ReshapeGAN: Object Reshaping by Providing A Single Reference Image.
CoRR, 2019

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition.
CoRR, 2019

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval.
CoRR, 2019

Statistical Karyotype Analysis Using CNN and Geometric Optimization.
IEEE Access, 2019

A 3D-CNN and LSTM Based Multi-Task Learning Architecture for Action Recognition.
IEEE Access, 2019

A 6-DOF Telexistence Drone Controlled by a Head Mounted Display.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

Adversarial Category Alignment Network for Cross-domain Sentiment Classification.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Residual Graph Convolutional Networks for Zero-Shot Learning.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

CRA-Net: Composed Relation Attention Network for Visual Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Alleviating Feature Confusion for Generative Zero-shot Learning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Attention Transfer (ANT) Network for View-invariant Action Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Exploring Nonnegative and Low-Rank Correlation for Noise-Resistant Spectral Clustering.
Proceedings of the Web and Big Data - Third International Joint Conference, 2019

Look across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

From Zero-Shot Learning to Cold-Start Recommendation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MR-NET: Exploiting Mutual Relation for Visual Relationship Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Localize Objects with Noisy Labeled Instances.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Social Media Harvesting.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Video Captioning by Adversarial LSTM.
IEEE Trans. Image Process., 2018

Hashing with Angular Reconstructive Embeddings.
IEEE Trans. Image Process., 2018

Recurrent attention network using spatial-temporal relations for action recognition.
Signal Process., 2018

One-shot learning based pattern transition map for action early recognition.
Signal Process., 2018

Zero-shot learning via discriminative representation extraction.
Pattern Recognit. Lett., 2018

Robust discrete code modeling for supervised hashing.
Pattern Recognit., 2018

Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Semantic binary coding for visual recognition via joint concept-attribute modelling.
Multim. Tools Appl., 2018

Hierarchical topology based hand pose estimation from a single depth image.
Multim. Tools Appl., 2018

Stroke-based stylization by learning sequential drawing examples.
J. Vis. Commun. Image Represent., 2018

Learning binary codes with local and inner data structure.
Neurocomputing, 2018

Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition.
CoRR, 2018

Domain Invariant Subspace Learning for Cross-Modal Retrieval.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Visual Spatial Attention Network for Relationship Detection.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Modal-adversarial Semantic Learning Network for Extendable Cross-modal Retrieval.
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

Dual Learning for Visual Question Generation.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

ML-LocNet: Improving Object Localization with Multi-view Learning Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

Index and Retrieve Multimedia Data: Cross-Modal Hashing by Learning Subspace Relation.
Proceedings of the Database Systems for Advanced Applications, 2018

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Asymmetric Binary Coding for Image Search.
IEEE Trans. Multim., 2017

Discrete Nonnegative Spectral Clustering.
IEEE Trans. Knowl. Data Eng., 2017

A Data-Driven and Optimal Bus Scheduling Model With Time-Dependent Traffic and Demand.
IEEE Trans. Intell. Transp. Syst., 2017

Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval.
IEEE Trans. Image Process., 2017

Hierarchical Latent Concept Discovery for Video Event Detection.
IEEE Trans. Image Process., 2017

Robust Web Image Annotation via Exploring Multi-Facet and Structural Knowledge.
IEEE Trans. Image Process., 2017

Semi-Paired Discrete Hashing: Learning Latent Hash Codes for Semi-Paired Cross-View Retrieval.
IEEE Trans. Cybern., 2017

Erratum to: Multi-view feature selection and classification for Alzheimer's Disease Diagnosis.
Multim. Tools Appl., 2017

Multi-view feature selection and classification for Alzheimer's Disease diagnosis.
Multim. Tools Appl., 2017

Special issue on "visual semantic analysis with weak supervision".
Multim. Syst., 2017

Captioning Videos Using Large-Scale Image Corpus.
J. Comput. Sci. Technol., 2017

Editorial: Good practices in multimedia modeling.
Neurocomputing, 2017

Supervised hashing with adaptive discrete optimization for multimedia retrieval.
Neurocomputing, 2017

Classification by Retrieval: Binarizing Data and Classifiers.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Event Early Embedding: Predicting Event Volume Dynamics at Early Stage.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

POI Popularity Prediction via Hierarchical Fusion of Multiple Social Clues.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Cosmetic-vis: sample-based 3D facial editor for cosmetic medical visualization.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2017

Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adversarial Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep Asymmetric Pairwise Hashing.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Cross-Domain Image Retrieval with Attention Modeling.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Attending to Visual Attributes and Linguistic Knowledge for Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Transductive Visual-Semantic Embedding for Zero-shot Learning.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

CFM@MediaEval 2017 Retrieving Diverse Social Images Task via Re-ranking and Hierarchical Clustering.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

BMC@MediaEval 2017 Multimedia Satellite Task via Regression Random Forest.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Attribute hashing for zero-shot image retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Asymmetric sparse hashing.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Unsupervised cross-modal retrieval through adversarial learning.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Exploiting Concept Correlation with Attributes for Semantic Binary Representation Learning.
Proceedings of the Internet Multimedia Computing and Service, 2017

Preserving-Ignoring Transformation Based Index for Approximate k Nearest Neighbor Search.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Significance Analysis for Typical Objects in mmWave Urban Railway Propagation Environment.
Proceedings of the 2017 IEEE Globecom Workshops, Singapore, December 4-8, 2017, 2017

WebPainter: Collaborative Stroke-Based Rendering Through HTML5 and WebGL.
Proceedings of the E-Learning and Games - 11th International Conference, 2017

Semi-Supervised Network Embedding.
Proceedings of the Database Systems for Advanced Applications, 2017

Matrix Tri-Factorization with Manifold Regularizations for Zero-Shot Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Jointly Modeling Static Visual Appearance and Temporal Pattern for Unsupervised Video Hashing.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Deep Semantic Indexing Using Convolutional Localization Network with Region-Based Visual Attention for Image Database.
Proceedings of the Databases Theory and Applications, 2017

A Deep Approach for Multi-modal User Attribute Modeling.
Proceedings of the Databases Theory and Applications, 2017

Efficient Supervised Hashing via Exploring Local and Inner Data Structure.
Proceedings of the Databases Theory and Applications, 2017

One-Step Spectral Clustering via Dynamically Learning Affinity Matrix and Subspace.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Deep Fusion of Multiple Semantic Cues for Complex Event Recognition.
IEEE Trans. Image Process., 2016

Detecting Densely Distributed Graph Patterns for Fine-Grained Image Categorization.
IEEE Trans. Image Process., 2016

Web Video Event Recognition by Semantic Analysis From Ubiquitous Documents.
IEEE Trans. Image Process., 2016

A Fast Optimization Method for General Binary Code Learning.
IEEE Trans. Image Process., 2016

Face image classification by pooling raw features.
Pattern Recognit., 2016

Face recognition using linear representation ensembles.
Pattern Recognit., 2016

Binary code learning via optimal class representations.
Neurocomputing, 2016

L<sub>2, p</sub>-norm and sample constraint based feature selection and classification for AD diagnosis.
Neurocomputing, 2016

Binary representation learning in computer vision.
Neurocomputing, 2016

Face identification with second-order pooling in single-layer networks.
Neurocomputing, 2016

Combining multi-representation for multimedia event detection using co-training.
Neurocomputing, 2016

Binary Subspace Coding for Query-by-Image Video Retrieval.
CoRR, 2016

Learning Binary Codes and Binary Weights for Efficient Classification.
CoRR, 2016

Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering.
CoRR, 2016

Bidirectional Long-Short Term Memory for Video Description.
CoRR, 2016

Cross-modal Retrieval with Label Completion.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Bidirectional Long-Short Term Memory for Video Description.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Zero-Shot Hashing via Transferring Supervised Knowledge.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Discriminant Cross-modal Hashing.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

A Unified Framework for Discrete Spectral Clustering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

An Online Approach for Direction-Based Trajectory Compression with Error Bound Guarantee.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

Dynamic User Attribute Discovery on Social Media.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

2015
Social event identification and ranking on flickr.
World Wide Web, 2015

Compact Image Fingerprint Via Multiple Kernel Hashing.
IEEE Trans. Multim., 2015

Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base.
IEEE Trans. Multim., 2015

Corrections to "Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss".
IEEE Trans. Multim., 2015

Tag Features for Geo-Aware Image Classification.
IEEE Trans. Multim., 2015

Multimedia Summarization for Social Events in Microblog Stream.
IEEE Trans. Multim., 2015

Robust Multiview Feature Learning for RGB-D Image Understanding.
ACM Trans. Intell. Syst. Technol., 2015

Multitask Spectral Clustering by Exploring Intertask Correlation.
IEEE Trans. Cybern., 2015

Robust Discrete Spectral Hashing for Large-Scale Image Semantic Indexing.
IEEE Trans. Big Data, 2015

Learning Visual Semantic Relationships for Efficient Visual Retrieval.
IEEE Trans. Big Data, 2015

Supervised feature learning via l<sub>2</sub>-norm regularized logistic regression for 3D object recognition.
Neurocomputing, 2015

Learning for visual semantic understanding in big data.
Neurocomputing, 2015

Multimedia Social Event Detection in Microblog.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Learning Features from Large-Scale, Noisy and Social Image-Tag Collection.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Visual Coding in a Semantic Hierarchy.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Semi-supervised Coupled Dictionary Learning for Cross-modal Retrieval in Internet Images and Texts.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multi-view Semi-supervised Learning for Web Image Annotation.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Exploring Viewable Angle Information in Georeferenced Video Search.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Spatial-aware Multimodal Location Estimation for Social Images.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

What are Popular: Exploring Twitter Features for Event Detection, Tracking and Visualization.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Learning Binary Codes for Maximum Inner Product Search.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Attribute-Augmented Semantic Hierarchy: Towards a Unified Framework for Content-Based Image Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss.
IEEE Trans. Multim., 2014

On the Influence Propagation of Web Videos.
IEEE Trans. Knowl. Data Eng., 2014

Robust (Semi) Nonnegative Graph Embedding.
IEEE Trans. Image Process., 2014

Robust Hashing With Local Models for Approximate Similarity Search.
IEEE Trans. Cybern., 2014

Gradient-domain-based enhancement of multi-view depth video.
Inf. Sci., 2014

Predicting trending messages and diffusion participants in microblogging network.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

WeMash: An Online System for Web Video Mashup.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

One of a Kind: User Profiling by Social Curation.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

EventEye: Monitoring Evolving Events from Tweet Streams.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Image Tagging with Social Assistance.
Proceedings of the International Conference on Multimedia Retrieval, 2014

UQ-DKE's Participation at MediaEval 2014 Placing Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

2013
Semantic Annotation for Visual Data
PhD thesis, 2013

Effective transfer tagging from image to video.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Discriminative Nonnegative Spectral Clustering with Out-of-Sample Extension.
IEEE Trans. Knowl. Data Eng., 2013

Self-taught dimensionality reduction on the high-dimensional small-sized data.
Pattern Recognit., 2013

Local image tagging via graph regularized joint group sparsity.
Pattern Recognit., 2013

Imagilar: A Real-Time Image Similarity Search System on Mobile Platform.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Spatio-temporal Event Modeling and Ranking.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Inter-media hashing for large-scale retrieval from heterogeneous data sources.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Robust Semantic Video Indexing by Harvesting Web Images.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval.
Proceedings of the ACM Multimedia Conference, 2013

Multimedia summarization for trending topics in microblogs.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Automatic tagging by exploring tag information capability and correlation.
World Wide Web, 2012

Robust cross-media transfer for visual event detection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
Mining multi-tag association for image tagging.
World Wide Web, 2011

UQMSG Experiments for TRECVID 2011.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Transfer tagging from image to video.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Tag localization with spatial correlations and joint group sparsity.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Skin Region Tracking Using Hybrid Color Model and Gradient Vector Flow.
Proceedings of the 2010 International Conference on Machine Vision and Human-machine Interface, 2010

2009
A micro niche evolutionary algorithm with lower-dimensional-search crossover for optimisation problems with constraints.
Int. J. Bio Inspired Comput., 2009

An orthogonal multi-objective evolutionary algorithm with lower-dimensional crossover.
Proceedings of the IEEE Congress on Evolutionary Computation, 2009


  Loading...