Jinqiao Wang

Image Vis. Comput., 2026

SQL-Checker: Error Detection and Labeling for Text-to-SQL with Interpretability Analysis.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

HB-Mamba: Hierarchical Bi-directional State Space Modeling for LiDAR Semantic Segmentation in Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

PASs-MoE: Mitigating Misaligned Co-drift among Router and Experts via Pathway Activation Subspaces for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Semi-supervised Learning for Detector-free Multi-person Pose Estimation.

[BibT_eX]

[DOI]

Mach. Intell. Res., December, 2025

ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, December, 2025

UniBYD: A Unified Framework for Learning Robotic Manipulation Across Embodiments Beyond Imitation of Human Demonstrations.

[BibT_eX]

[DOI]

CoRR, December, 2025

PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning.

[BibT_eX]

[DOI]

CoRR, November, 2025

PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments.

[BibT_eX]

[DOI]

CoRR, October, 2025

From Seeing to Predicting: A Vision-Language Framework for Trajectory Forecasting and Controlled Video Generation.

[BibT_eX]

[DOI]

CoRR, October, 2025

MLLM-CBench:A Comprehensive Benchmark for Continual Instruction Tuning of Multimodal LLMs with Chain-of-Thought Reasoning Analysis.

[BibT_eX]

[DOI]

CoRR, August, 2025

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval.

[BibT_eX]

[DOI]

CoRR, August, 2025

Scaling Linear Attention with Sparse State Expansion.

[BibT_eX]

[DOI]

CoRR, July, 2025

FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation.

[BibT_eX]

[DOI]

CoRR, June, 2025

VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories?

[BibT_eX]

[DOI]

CoRR, June, 2025

GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking.

[BibT_eX]

[DOI]

CoRR, June, 2025

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Generation of surgical reports for lymph node dissection during laparoscopic gastric cancer surgery based on artificial intelligence.

[BibT_eX]

[DOI]

Int. J. Comput. Assist. Radiol. Surg., May, 2025

MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, April, 2025

An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval.

[BibT_eX]

[DOI]

CoRR, March, 2025

Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, March, 2025

LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning.

[BibT_eX]

[DOI]

CoRR, March, 2025

A Benchmark for Crime Surveillance Video Analysis with Large Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

Learning enhancing modality-invariant features for visible-infrared person re-identification.

[BibT_eX]

[DOI]

Int. J. Mach. Learn. Cybern., January, 2025

MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark.

[BibT_eX]

[DOI]

CoRR, January, 2025

FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization.

[BibT_eX]

[DOI]

CoRR, January, 2025

Optimization of Prompt Learning via Multi-Knowledge Representation for Vision-Language Models.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

ADFormer: Generalizable Few-Shot Anomaly Detection With Dual CNN-Transformer Architecture.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2025

AMITA: Attribute-Guided Masked Image-Text Alignment for Multi-Label Image Representation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2025

DSTA: Reinforcing Vision-Language Understanding for Scene-Text VQA With Dual-Stream Training Approach.

[BibT_eX]

[DOI]

Yingtao Tan

IEEE Signal Process. Lett., 2025

Enhancing Visual Aligning and Grounding for Aerial Vision-and-Dialog Navigation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2025

BlingDiff: High-Fidelity Virtual Jewelry Try-On with Detail-Optimized Diffusion.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Workshop on Rich Media With Generative AI, 2025

MaSA: Mamba-Based Global Feature Selective Aggregator for Efficient Lane Detection.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Referring Expression Instance Retrieval and A Strong End-to-End Baseline.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Make "V" and "Q" Inseparable: Deliberately Dual-Channel Adversarial Learning for Robust Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

FLARE: A Framework for Stellar Flare Forecasting Using Stellar Physical Properties and Historical Records.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Semantic-aware Fine-grained Point Augmentation for 3D Multi-modal Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Systematic Outliers in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dual-Chain Reasoning: Enhancing Multimodal Document VQA Through Positive and Negative Reasoning Paths.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 13th International Conference, 2025

TEI-Face: A Temporal Expression and Identity Stability Oriented Face Swapping.

[BibT_eX]

[DOI]

Biying Li

Zhiwei Liu

Proceedings of the Image and Graphics - 13th International Conference, 2025

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LINK: Adaptive Modality Interaction for Audio-Visual Video Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Extracting Sparse Specialist Models from Generalist Models.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SACPlace: Multi-Agent Deep Reinforcement Learning for Symmetry-Aware Analog Circuit Placement.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference, 2025

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Synthetic Data is an Elegant GIFT for Continual Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

sxFusion: A Novel Single-Cell Clustering Tool Based on Feature Fusion and Co-Optimization of Low-Rank Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Enhancing Chain of Thought Prompting in Large Language Models via Reasoning Patterns.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

AAformer: Auto-Aligned Transformer for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2024

Efficient Masked Autoencoders With Self-Consistency.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Multi-Model Style-Aware Diffusion Learning for Semantic Image Synthesis.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Relation-Associated Instructions & Hallucination Benchmark.

[BibT_eX]

[DOI]

Dataset, July, 2024

Structural Dependence Learning Based on Self-attention for Face Alignment.

[BibT_eX]

[DOI]

Mach. Intell. Res., June, 2024

Dual-Path Transformer for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2024

Objformer: Boosting 3D object detection via instance-wise interaction.

[BibT_eX]

[DOI]

Pattern Recognit., February, 2024

Artificial intelligence for automatic surgical phase recognition of laparoscopic gastrectomy in gastric cancer.

[BibT_eX]

[DOI]

Int. J. Comput. Assist. Radiol. Surg., February, 2024

A fast mask synthesis method for face recognition.

[BibT_eX]

[DOI]

Kaiwen Guo

Chaoyang Zhao

Vis. Intell., 2024

MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

Pixel-Level Contrastive Pretrainer for Industrial Image Representation.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2024

EFCPose: End-to-End Multi-Person Pose Estimation With Fully Convolutional Heads.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2024

ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2024

Learning facial structural dependency in 3D aligned space for face alignment.

[BibT_eX]

[DOI]

Biying Li

Zhiwei Liu

Image Vis. Comput., 2024

SlowFastFormer for 3D human pose estimation.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2024

Friend or Foe? Harnessing Controllable Overfitting for Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Monocular Lane Detection Based on Deep Learning: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2024

MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion.

[BibT_eX]

[DOI]

CoRR, 2024

Recurrent Context Compression: Efficiently Expanding the Context Window of LLM.

[BibT_eX]

[DOI]

CoRR, 2024

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons.

[BibT_eX]

[DOI]

CoRR, 2024

Pattern-Aware Chain-of-Thought Prompting in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PM-VIS: High-Performance Box-Supervised Video Instance Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring.

[BibT_eX]

[DOI]

CoRR, 2024

FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Auto DragGAN: Editing the Generative Image Manifold in an Autoregressive Manner.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Estate: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2024

Multimodal Mamba: A Versatile Multimodal Model for Seamless Integration into Diverse Downstream Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2024 13th International Conference on Computing and Pattern Recognition, 2024

MSAOT: Enhancing Video Object Segmentation with Memory-Selective Updates and High-Quality Mask Regeneration.

[BibT_eX]

[DOI]

Proceedings of the 2024 13th International Conference on Computing and Pattern Recognition, 2024

PFDM: Parser-Free Virtual Try-On via Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

BFRFormer: Transformer-Based Generator for Real-World Blind Face Restoration.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Griffon: Spelling Out All Object Locations at Any Granularity with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

The BRAVO Semantic Segmentation Challenge Results in UNCV2024.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Self-Supervised Representation Learning from Arbitrary Scenarios.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Set of Effective Strategies for Optimized Road Damage Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Big Data, 2024

Contrastive Learning with Information Compensation for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 14th Asian Control Conference, 2024

Knowledge Distillation Dealing with Sample-Wise Long-Tail Problem.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2024, 2024

AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fluctuation-Based Adaptive Structured Pruning for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Progressive Direction-Aware Pose Grammar for Human Pose Estimation.

[BibT_eX]

[DOI]

Rama Krishna Sai Subrahmanyam Gorthi

IEEE Trans. Biom. Behav. Identity Sci., October, 2023

Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., April, 2023

Pseudo Label Rectification With Joint Camera Shift Adaptation and Outlier Progressive Recycling for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., March, 2023

Human Parsing With Part-Aware Relation Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Pruning-aware Sparse Regularization for Network Pruning.

[BibT_eX]

[DOI]

Int. J. Autom. Comput., 2023

Mitigating Hallucination in Visual Language Models with Visual Supervision.

[BibT_eX]

[DOI]

CoRR, 2023

Continual Instruction Tuning for Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2023

Surgical Temporal Action-aware Network with Sequence Regularization for Phase Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model.

[BibT_eX]

[DOI]

CoRR, 2023

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network.

[BibT_eX]

[DOI]

CoRR, 2023

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion.

[BibT_eX]

[DOI]

CoRR, 2023

FastBCSD: Fast and Efficient Neural Network for Binary Code Similarity Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Fast Segment Anything.

[BibT_eX]

[DOI]

CoRR, 2023

FreConv: Frequency Branch-and-Integration Convolutional Networks.

[BibT_eX]

[DOI]

CoRR, 2023

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Masked Autoencoders with Self-Consistency.

[BibT_eX]

[DOI]

CoRR, 2023

Uncertainty-Aware Boundary Attention Network for Real-Time Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Instance-Proxy Loss for Semi-supervised Learning with Coarse Labels.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Surgical Video Captioning with Mutual-Modal Concept Alignment.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

ShiftFormer: Spatial-Temporal Shift Operation in Video Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

FreConv: Frequency Branch-and-Integration Convolutional Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Explicit Attention Modeling for Pedestrian Attribute Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal Action-aware Network with Sequence Regularization for Phase Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022

Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

Multi-Granularity Mutual Learning Network for Object Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2022

Grammar-Induced Wavelet Network for Human Parsing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Dynamic Orthogonal Projection Constrained Discriminative Tracking.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Fine-Grained Human-Centric Tracklet Segmentation with Single Frame Supervision.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.

[BibT_eX]

[DOI]

CoRR, 2022

Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2022

Part-Aware Self-Supervised Pre-Training for Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2022

PruneFaceDet: Pruning lightweight face detection network by sparsity training.

[BibT_eX]

[DOI]

Cogn. Comput. Syst., 2022

Global Patch Cross-Attention for Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

When Skeleton Meets Appearance: Adaptive Appearance Information Enhancement for Skeleton Based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Transfering Low-Frequency Features for Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Regularizing Vector Embedding in Bottom-Up Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Ensemble of One-Stage and Two-Stage Detectors Approach for Road Damage Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Big Data, 2022

2021

Antidecay LSTM for Siamese Tracking With Adversarial Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Siamese Regression Tracking With Reinforced Template Updating.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Semi-Supervised Scene Text Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Enhanced Bounding Box Estimation with Distribution Calibration for Visual Tracking.

[BibT_eX]

[DOI]

Sensors, 2021

STN-enhanced message passing guided by adversarial learning for human pose estimation.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Macro-micro mutual learning inside compositional model for human pose estimation.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Unsupervised cycle-consistent person pose transfer.

[BibT_eX]

[DOI]

Neurocomputing, 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation.

[BibT_eX]

[DOI]

CoRR, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

Fast Kernelized Correlation Filter without Boundary Effect.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

High-Performance Discriminative Tracking with Target-Aware Feature Embeddings.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

MST: Masked Self-Supervised Transformer for Visual Representation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DPT: Deformable Patch-based Transformer for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Attention-Guided Knowledge Distillation for Efficient Single-Stage Detector.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

High-Performance Discriminative Tracking with Transformers.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Multiple Object Tracking With Single Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive Class Suppression Loss for Long-Tail Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Consistent-Separable Feature Representation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Recall What You See Continually Using GridLSTM in Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

A Comparison of Correlation Filter-Based Trackers and Struck Trackers.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

An end-to-end exemplar association for unsupervised person Re-identification.

[BibT_eX]

[DOI]

Neural Networks, 2020

Food det: Detecting foods in refrigerator with supervised transformer network.

[BibT_eX]

[DOI]

Neurocomputing, 2020

Siamese Deformable Cross-Correlation Network for Real-Time Visual Tracking.

[BibT_eX]

[DOI]

Neurocomputing, 2020

Semantic-spatial fusion network for human parsing.

[BibT_eX]

[DOI]

Neurocomputing, 2020

A novel data augmentation scheme for pedestrian detection with attribute preserving GAN.

[BibT_eX]

[DOI]

Neurocomputing, 2020

Progressive rectification network for irregular text recognition.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2020

Siamese Attentive Graph Tracking.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Task Decoupled Knowledge Distillation For Lightweight Face Detectors.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Unsupervised Domain Adaptive Re-Identification with Feature Adversarial Learning and Self-similarity Clustering.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

High-Speed And Accurate Scale Estimation For Visual Tracking With Gaussian Process Regression.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

PruneFaceDet: Pruning Lightweight Face Detection Network by Sparsity Training.

[BibT_eX]

[DOI]

Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

Identity-Guided Human Semantic Parsing for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Occlusion-Aware Siamese Network for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Feature Embeddings for Discriminant Model Based Tracking.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Blended Grammar Network for Human Parsing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Adaptive Variance Based Label Distribution Learning for Facial Age Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Large Batch Optimization for Object Detection: Training COCO in 12 minutes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Part-Aware Context Network for Human Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Progressive Bi-C3D Pose Grammar for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Dynamic Collaborative Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Multi-Correlation Filters With Triangle-Structure Constraints for Object Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Two-Level Attention Network With Multi-Grain Ranking Loss for Vehicle Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Feature Distilled Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2019

Adversarial Deep Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2019

Pixelwise Deep Sequence Learning for Moving Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2019

Real-Time Multi-Scale Face Detector on Embedded Devices.

[BibT_eX]

[DOI]

Sensors, 2019

Elite Loss for scene text detection.

[BibT_eX]

[DOI]

Neurocomputing, 2019

Reading scene text with fully convolutional sequence modeling.

[BibT_eX]

[DOI]

Neurocomputing, 2019

Adversarial image generation by combining content and style.

[BibT_eX]

[DOI]

IET Image Process., 2019

Class Regularization: Improve Few-shot Image Classification by Reducing Meta Shift.

[BibT_eX]

[DOI]

CoRR, 2019

Learning Features with Differentiable Closed-Form Solver for Tracking.

[BibT_eX]

[DOI]

CoRR, 2019

Color-Sensitive Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Mask Guided Knowledge Distillation for Single Shot Detector.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Bi-Directional Message Passing Based Scanet for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Vehicle Re-Identification with Refined Part Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Pose-Weighted Gan for Photorealistic Face Frontalization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Cascade Attention Network for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.

[BibT_eX]

[DOI]

Abdelrahman Eldesokey

Alireza Memarmoghadam

Ardhendu Shekhar Tripathi

Arnold W. M. Smeulders

Joni-Kristian Kämäräinen

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Fast-deepKCF Without Boundary Effect.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FLDet: A CPU Real-time Joint Face and Landmark Detector.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Biometrics, 2019

In Defense of Color Names for Small-Scale Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Biometrics, 2019

Learning Discriminative and Complementary Patches for Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection.

[BibT_eX]

[DOI]

Neil Martin Robertson

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Gate-based Bidirectional Interactive Decoding Network for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018

Appearance features in Encoding Color Space for visual surveillance.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Recurrent Calibration Network for Irregular Text Recognition.

[BibT_eX]

[DOI]

CoRR, 2018

High Speed Kernelized Correlation Filters without Boundary Effect.

[BibT_eX]

[DOI]

CoRR, 2018

Multi-view pedestrian captioning with an attention topic CNN model.

[BibT_eX]

[DOI]

Comput. Ind., 2018

Domain Adaptation Tracker With Global and Local Searching.

[BibT_eX]

[DOI]

IEEE Access, 2018

Learning Robust Gaussian Process Regression for Visual Tracking.

[BibT_eX]

[DOI]

Linyu Zheng

Ming Tang

Gorthi R. K. Sai Subrahmanyam

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Tree Hierarchical CNNs for Object Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Dense Chained Attention Network for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.

[BibT_eX]

[DOI]

Abdelrahman Eldesokey

Gustavo Fernández

Álvaro García-Martín

Álvaro Iglesias-Arias

A. Aydin Alatan

Abel González-García

Alfredo Petrosino

Alireza Memarmoghadam

Andrea Vedaldi

Andrej Muhic

Anfeng He

Arnold W. M. Smeulders

Guilherme Sousa Bastos

Haibin Ling

Hamed Kiani Galoogahi

Jorge Rodríguez Herranz

Mario Edoardo Maresca

Martin Danelljan

Ming-Hsuan Yang

Mohamed H. Abdelpakey

Pablo Vicente-Moñivar

Rama Krishna Sai Subrahmanyam Gorthi

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

High-Speed Tracking With Multi-Kernel Correlation Filters.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Improved Single Shot Object Detector Using Enhanced Features and Predicting Heads.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Progressive Cognitive Human Parsing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Learning discriminative context models for concurrent collective activity recognition.

[BibT_eX]

[DOI]

Chaoyang Zhao

Multim. Tools Appl., 2017

Automatic group activity annotation for mobile videos.

[BibT_eX]

[DOI]

Multim. Syst., 2017

On the Relations of Correlation Filter Based Trackers and Struck.

[BibT_eX]

[DOI]

CoRR, 2017

Reading Scene Text with Attention Convolutional Sequence Modeling.

[BibT_eX]

[DOI]

CoRR, 2017

Fast Deep Matting for Portrait Animation on Mobile Phone.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

DenseTracker: A multi-task dense network for visual tracking.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Joint background reconstruction and foreground segmentation via a two-stage convolutional neural network.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Deep embedding network for robust age estimation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Joint Visual Context for Pedestrian Captioning.

[BibT_eX]

[DOI]

Proceedings of the Internet Multimedia Computing and Service, 2017

Automatic Watermeter Digit Recognition on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the Internet Multimedia Computing and Service, 2017

CoupleNet: Coupling Global Structure with Local Parts for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Adaptive Receptive Fields for Deep Image Parsing Network.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Improving Visual Saliency Computing With Emotion Intensity.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2016

Multi-View 3D Object Retrieval With Deep Embedding Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Adaptive Content Condensation Based on Grid Optimization for Thumbnail Image Generation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Real-time people counting for indoor scenes.

[BibT_eX]

[DOI]

Signal Process., 2016

A unified model sharing framework for moving object detection.

[BibT_eX]

[DOI]

Signal Process., 2016

ActiveAd: A novel framework of linking ad videos to online products.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Multiple deep features learning for object retrieval in surveillance videos.

[BibT_eX]

[DOI]

Haiyun Guo

IET Comput. Vis., 2016

Clustering based ensemble correlation tracking.

[BibT_eX]

[DOI]

Guibo Zhu

Comput. Vis. Image Underst., 2016

Learning weighted part models for object tracking.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2016

WHU-NERCMS at TRECVID2016: Instance Search Task.

[BibT_eX]

[DOI]

Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Robust Crowd Segmentation and Counting in Indoor Scenes.

[BibT_eX]

[DOI]

Ren Yang

Huazhong Xu

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Natural image classification driven by human brain activity.

[BibT_eX]

[DOI]

Proceedings of the Medical Imaging 2016: Biomedical Applications in Molecular, Structural, and Functional Imaging, San Diego, California, United States, 27 February, 2016

Scale-Adaptive Low-Resolution Person Re-Identification via Learning a Discriminating Surface.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Person re-identification via rich color-gradient feature.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Boosted local classifiers for visual tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Multi-scale blocks based image emotion classification using multiple instance learning.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Extensive Comparison of Visual Features for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Deep People Counting with Faster R-CNN and Correlation Tracking.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Scale-Adaptive Deconvolutional Regression Network for Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2016, 2016

Piecewise Video Condensation for Complex Scenes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

MC-HOG Correlation Tracking with Saliency Proposal.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Weighted Part Context Learning for Visual Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Image Tag Refinement With View-Dependent Concept Representations.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Finding logos in real-world images with point-context representation-based region search.

[BibT_eX]

[DOI]

Jianlong Fu

Multim. Syst., 2015

A Real-Time People Counting Approach in Indoor Environment.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Mobile Media Thumbnailing.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Learning sharable models for robust background subtraction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Color names learning using convolutional neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Learning deep compact descriptor with bagging auto-encoders for object retrieval.

[BibT_eX]

[DOI]

Haiyun Guo

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Multiple features based shared models for background subtraction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

60 Hz self-tuning background modeling.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Concurrent group activity classification with context modeling.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Relaxing from Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Collaborative Correlation Tracking.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2015, 2015

3D Object Retrieval with Multimodal Views.

[BibT_eX]

[DOI]

Proceedings of the 8th Eurographics Workshop on 3D Object Retrieval, 2015

2014

Bilayer Sparse Topic Model for Scene Analysis in Imbalanced Surveillance Videos.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Spatiotemporal Grid Flow for Video Retargeting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Spatiotemporal Group Context for Pedestrian Counting.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2014

A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization.

[BibT_eX]

[DOI]

Signal Process., 2014

Sparse representation for robust abnormality detection in crowded scenes.

[BibT_eX]

[DOI]

Pattern Recognit., 2014

Key observation selection-based effective video synopsis for camera network.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2014

A three-level framework for affective content analysis and its case studies.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2014

Interactive ads recommendation with contextual search on product topic space.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2014

A Hybrid Image Retargeting Approach via Combining Seam Carving and Grid Warping.

[BibT_eX]

[DOI]

J. Multim., 2014

Online video synopsis of structured motion.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Group latent factor model for recommendation with multiple user behaviors.

[BibT_eX]

[DOI]

Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Curvature Filter and Normal Clustering Based Approach to Detecting Cylinder on 3D Medical Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Estimate Gaze Density by Incorporating Emotion.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Discriminative Context Models for Collective Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

A coarse-to-fine logo recognition method in video streams.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Recommendation on Flickr by combining community user ratings and item importance.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Object tracking with part-based discriminative context models.

[BibT_eX]

[DOI]

Guibo Zhu

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Part Context Learning for Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2014

Clustering Ensemble Tracking.

[BibT_eX]

[DOI]

Guibo Zhu

Proceedings of the Computer Vision - ACCV 2014, 2014

Learning a Representative and Discriminative Part Model with Deep Convolutional Features for Scene Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

What Visual Attributes Characterize an Object Class?

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Exploiting content relevance and social relevance for personalized ad recommendation on internet TV.

[BibT_eX]

[DOI]

Bo Wang

ACM Trans. Multim. Comput. Commun. Appl., 2013

Context-Aware Video Retargeting via Graph Model.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

Dynamic scene understanding by improved sparse topical coding.

[BibT_eX]

[DOI]

Pattern Recognit., 2013

Multiple Hypotheses Based Spatial-Temporal Association for Stable Pedestrian Counting.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Collaborative Tracking: Dynamically Fusing Short-Term Trackers and Long-Term Detector.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Subspace learning based active learning for image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Improving scene classification with weakly spatial symmetry information.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

Brand Image Detection in Broadcast Video Streams.

[BibT_eX]

[DOI]

Xian Wang

Proceedings of the Seventh International Conference on Image and Graphics, 2013

Classification Related Manifold Dimension Estimation with Restricted Boltzmann Machine.

[BibT_eX]

[DOI]

Kezhen Teng

Proceedings of the Seventh International Conference on Image and Graphics, 2013

2012

Enhanced 3-D Modeling for Landmark Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Real-Time Probabilistic Covariance Tracking With Efficient Model Update.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

Real-time multiple object instances detection.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Key observation selection for effective video synopsis.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Anomaly detection in crowded scene via appearance and dynamics joint modeling.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object-centered narratives for video surveillance.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multiple features fusion for crowd density estimation.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Bag of features using sparse coding for gender classification.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Point-context descriptor based region search for logo recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Fast seam carving with strip constraints.

[BibT_eX]

[DOI]

Lianchao Cao

Lifang Wu

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Weighted Interaction Force Estimation for Abnormality Detection in Crowd Scenes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2012, 2012

Fusing Warping, Cropping, and Scaling for Optimal Image Thumbnail Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2012

Efficient Clothing Retrieval with Semantic-Preserving Visual Phrases.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2012

2011

Boosting part-sense multi-feature learners toward effective object detection.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2011

Adaptive Model for Robust Pedestrian Counting.

[BibT_eX]

[DOI]

Jingjing Liu

Proceedings of the Advances in Multimedia Modeling, 2011

Grid-Based Retargeting with Transformation Consistency Smoothing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2011

Landmark recognition and retrieval: from 2D to 3D.

[BibT_eX]

[DOI]

Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Fast retargeting with adaptive grid optimization.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Using context saliency for movie shot classification.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Specific vehicle detection and tracking in road environment.

[BibT_eX]

[DOI]

Proceedings of the ICIMCS 2011, 2011

Global Trajectory Construction across Multi-cameras via Graph Matching.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Image and Graphics, 2011

Video Reshuffling with Narratives toward Effective Video Browsing.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Image and Graphics, 2011

2010

People Detection by Boosting Features in Nonlinear Subspace.

[BibT_eX]

[DOI]

Jie Yang

Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Visual Attention Model Based Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Personalized Sports Video Customization for Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

AdVR: Linking Ad Video with Products or Service.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

Landmark image classification using 3D point clouds.

[BibT_eX]

[DOI]

Xian Xiao

Changsheng Xu

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Effective logo retrieval with adaptive local feature selection.

[BibT_eX]

[DOI]

Jianlong Fu

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Fast feature selection and training for AdaBoost-based concept detection with large scale datasets.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Interactive Web Video Advertising with Context Analysis and Search.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Interactive service recommendation based on ad concept hierarchy.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

A improved silhouette tracking approach integrating particle filter with graph cuts.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Multi-level trajectory modeling for video copy detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2010, 2010

2009

IVA-NLPR-IA-CAS TRECVID 2009: High LevelFeatures Extraction.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Based on the Reinforcement Learning Association Rules Recommendation Study.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Semantics, Knowledge and Grid, 2009

A Hierarchical Semantics-Matching Approach for Sports Video Annotation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2009

Sports video retargeting.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Consumer video retargeting: context assisted spatial-temporal grid optimization.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Linking video ADS with product or service information by web search.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Context saliency based image summarization.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Learning local features for object categorization.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boosted forest for human detection.

[BibT_eX]

[DOI]

Chengli Xie

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Spatial pyramid based histogram representation for visual tracking with partial occlusion.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Semantic Linking between Video Ads and Web Services with Progressive Search.

[BibT_eX]

[DOI]

Proceedings of the ICDM Workshops 2009, 2009

Real-time visual tracking via Incremental Covariance Tensor Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Robust Bayesian tracking on Riemannian manifolds via fragments-based representation.

[BibT_eX]

[DOI]

Yi Wu

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

A Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2008

Digesting Commercial Clips from TV Streams.

[BibT_eX]

[DOI]

IEEE Multim., 2008

A Spatial-Temporal-Scale Registration Approach for Video Copy Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2008

Boosting relative spaces for categorizing objects with large intra-class variation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Hand posture recognition with co-training.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Online video advertising based on user's attention relavancy computing.

[BibT_eX]

[DOI]

Yikai Fang