Xiangyang Xue

Orcid: 0000-0002-4897-9209

Affiliations:
  • Fudan University, Shanghai, China


According to our database1, Xiangyang Xue authored at least 387 papers between 1999 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Unsupervised Object-Centric Learning From Multiple Unspecified Viewpoints.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Cost-Sensitive GNN-Based Imbalanced Learning for Mobile Social Network Fraud Detection.
IEEE Trans. Comput. Soc. Syst., April, 2024

Subclassified Loss: Rethinking Data Imbalance From Subclass Perspective for Semantic Segmentation.
IEEE Trans. Intell. Veh., January, 2024

FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View.
CoRR, 2024

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability.
CoRR, 2024

Towards Generative Abstract Reasoning: Completing Raven's Progressive Matrix via Rule Abstraction and Selection.
CoRR, 2024

Exploring One-Shot Semi-supervised Federated Learning with Pre-trained Diffusion Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Federated Adaptive Prompt Tuning for Multi-Domain Collaborative Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
When, Where and How Does it Fail? A Spatial-Temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving.
IEEE Trans. Vis. Comput. Graph., December, 2023

H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Compositional Scene Representation Learning via Reconstruction: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

One-shot Federated Learning without server-side training.
Neural Networks, July, 2023

Dynamic Graph Message Passing Networks.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Location-Guided LiDAR-Based Panoptic Segmentation for Autonomous Driving.
IEEE Trans. Intell. Veh., February, 2023

Multi-view Shape Generation for a 3D Human-like Body.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Rethinking Local and Global Feature Representation for Dense Prediction.
Pattern Recognit., 2023

Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients.
CoRR, 2023

One-Shot Federated Learning with Classifier-Guided Diffusion Models.
CoRR, 2023

OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data.
CoRR, 2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model.
CoRR, 2023

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective.
CoRR, 2023

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification.
CoRR, 2023

Abstracting Concept-Changing Rules for Solving Raven's Progressive Matrix Problems.
CoRR, 2023

OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning.
CoRR, 2023

Privacy-Preserving Collaborative Chinese Text Recognition with Federated Learning.
CoRR, 2023

Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model.
CoRR, 2023

GAT-COBO: Cost-Sensitive Graph Neural Network for Telecom Fraud Detection.
CoRR, 2023

Semantic Neural Decoding via Cross-Modal Generation.
CoRR, 2023

Learning Versatile 3D Shape Generation with Improved AR Models.
CoRR, 2023

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation.
CoRR, 2023

Vocabulary-informed Zero-shot and Open-set Learning.
CoRR, 2023

ImpDet: Exploring Implicit Fields for 3D Object Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Visual Exploration and Planning of the Automated Material Handling System for Smart Factory in the Immersive Environment.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Weakly-Supervised Text Instance Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Scene Text Segmentation with Text-Focused Transformers.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language Guided Robotic Grasping with Fine-Grained Instructions.
IROS, 2023

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection.
Proceedings of the International Joint Conference on Neural Networks, 2023

Towards Accurate Video Text Spotting with Text-wise Semantic Reasoning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Orientation-Independent Chinese Text Recognition in Scene Images.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

TextFormer: Component-aware Text Segmentation with Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Cross-domain Federated Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Compositional Law Parsing with Latent Random Functions.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Foreign Object Detection Based on Compositional Scene Modeling.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Metaverse: Perspectives from graphics, interactions and visualization.
Vis. Informatics, 2022

Visual Evaluation for Autonomous Driving.
IEEE Trans. Vis. Comput. Graph., 2022

Exploring Efficient Few-shot Adaptation for Vision Transformers.
Trans. Mach. Learn. Res., 2022

SGM3D: Stereo Guided Monocular 3D Object Detection.
IEEE Robotics Autom. Lett., 2022

AGO-Net: Association-Guided 3D Point Cloud Object Detection Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

HandO: a hybrid 3D hand-object reconstruction model for unknown objects.
Multim. Syst., 2022

Learning the Compositional Domains for Generalized Zero-shot Learning.
Comput. Vis. Image Underst., 2022

Chinese Character Recognition with Radical-Structured Stroke Trees.
CoRR, 2022

Compositional Scene Modeling with Global Object-Centric Representations.
CoRR, 2022

Cross-domain Federated Adaptive Prompt Tuning for CLIP.
CoRR, 2022

Domain Discrepancy Aware Distillation for Model Aggregation in Federated Learning.
CoRR, 2022

Dynamic Graph Message Passing Networks for Visual Recognition.
CoRR, 2022

Style Spectroscope: Improve Interpretability and Controllability through Fourier Analysis.
CoRR, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.
CoRR, 2022

DEMoS: a deep learning-based ensemble approach for predicting the molecular subtypes of gastric adenocarcinomas from histopathological images.
Bioinform., 2022

Chinese Character Recognition with Augmented Character Profile Matching.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Local Slot Attention for Vision and Language Navigation.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Learning 6-DoF Object Poses to Grasp Category-Level Objects by Language Instructions.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Towards Scalable and Fast Distributionally Robust Optimization for Data-Driven Deep Learning.
Proceedings of the IEEE International Conference on Data Mining, 2022

High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent Space.
Proceedings of the IEEE International Conference on Acoustics, 2022

RCLane: Relay Chain Prediction for Lane Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling.
Proceedings of the Computer Vision - ECCV 2022, 2022

DST: Dynamic Substitute Training for Data-free Black-box Attack.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Density-preserving Deep Point Cloud Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Co-attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-identification.
Proceedings of the Computer Vision - ACCV 2022, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.
Proceedings of the Computer Vision - ACCV 2022, 2022

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Text Gestalt: Stroke-Aware Scene Text Image Super-resolution.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Periphery-aware COVID-19 diagnosis with contrastive representation enhancement.
Pattern Recognit., 2021

Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection.
Int. J. Comput. Vis., 2021

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study.
CoRR, 2021

SGM3D: Stereo Guided Monocular 3D Object Detection.
CoRR, 2021

The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk Screening by Eye-region Manifestations.
CoRR, 2021

DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation.
CoRR, 2021

Rapid COVID-19 Risk Screening by Eye-region Manifestations.
CoRR, 2021

A Generic Object Re-identification System for Short Videos.
CoRR, 2021

Syntax-guided text generation via graph neural network.
Sci. China Inf. Sci., 2021

Temporal Context Aggregation for Video Retrieval with Contrastive Learning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Progressive Coordinate Transforms for Monocular 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Image Local Autoregressive Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Symbolic Representation Learning for Image Captioning.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Distance Restricted Transformer Encoder for Multi-Label Classification.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Depth-Guided AdaIN and Shift Attention Network for Vision-And-Language Navigation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Delving into Data: Effectively Substitute Training for Black-box Attack.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Compositional Representation for 4D Captures With Neural ODE.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scene Text Telescope: Text-Focused Scene Image Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking local and global feature representation for semantic segmentation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Knowledge-Guided Object Discovery with Acquired Deep Impressions.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Raven's Progressive Matrices Completion with Latent Gaussian Process Priors.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.
IEEE Trans. Multim., 2020

M$^3$Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening From CT Imaging.
IEEE J. Biomed. Health Informatics, 2020

Pose-Guided Person Image Synthesis in the Non-Iconic Views.
IEEE Trans. Image Process., 2020

Learning to Score Figure Skating Sport Videos.
IEEE Trans. Circuits Syst. Video Technol., 2020

Object Detection from Scratch with Deep Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging.
CoRR, 2020

A New Screening Method for COVID-19 based on Ocular Feature Recognition by Machine Learning Tools.
CoRR, 2020

Context Encoding for Video Retrieval with Contrastive Learning.
CoRR, 2020

MOTS: Multiple Object Tracking for General Categories Based On Few-Shot Method.
CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.
CoRR, 2020

Is normalization indispensable for training deep neural network?
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

3DCFS: Fast and Robust Joint 3D Semantic-Instance Segmentation via Coupled Feature Selection.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Learnable Higher-Order Representation for Action Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

BERT-ATTACK: Adversarial Attack Against BERT Using BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DeepSFM: Structure from Motion via Deep Bundle Adjustment.
Proceedings of the Computer Vision - ECCV 2020, 2020

Neural Pose Transfer by Spatially Adaptive Instance Normalization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Long-Term Cloth-Changing Person Re-identification.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Joint Parsing and Generation for Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Scale Self-Attention for Text Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Feature Deformation Meta-Networks in Image Captioning of Novel Objects.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-Level Semantic Feature Augmentation for One-Shot Learning.
IEEE Trans. Image Process., 2019

Low-Rank and Locality Constrained Self-Attention for Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Fast Color Constancy with Patch-wise Bright Pixels.
CoRR, 2019

Question Guided Modular Routing Networks for Visual Question Answering.
CoRR, 2019

Star-Transformer.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation.
Proceedings of the 36th International Conference on Machine Learning, 2019

CODA: Counting Objects via Scale-Aware Adversarial Density Adaption.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Parasitic GAN for Semi-Supervised Brain Tumor Segmentation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

SSF-DAN: Separated Semantic Feature Based Domain Adaptation Network for Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards Instance-Level Image-To-Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spatial Mixture Models with Learnable Deep Priors for Perceptual Grouping.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

MEAL: Multi-Model Ensemble via Adversarial Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Arbitrary-Oriented Scene Text Detection via Rotation Proposals.
IEEE Trans. Multim., 2018

Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.
IEEE Trans. Multim., 2018

Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content.
IEEE Signal Process. Mag., 2018

Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Stacked multichannel autoencoder - an efficient way of learning from synthetic data.
Multim. Tools Appl., 2018

High Order Neural Networks for Video Classification.
CoRR, 2018

Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective.
CoRR, 2018

Top-Down Tree Structured Text Generation.
CoRR, 2018

SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners.
CoRR, 2018

Semantic Feature Augmentation in Few-shot Learning.
CoRR, 2018

Learning to score and summarize figure skating sport videos.
CoRR, 2018

Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

ExFuse: Enhancing Feature Fusion for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pose-Normalized Image Generation for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Dual Skipping Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids.
CoRR, 2017

DeepSkeleton: Skeleton Map for 3D Human Pose Regression.
CoRR, 2017

Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization.
CoRR, 2017

Recent Advances in Zero-shot Recognition.
CoRR, 2017

Weakly-supervised Transfer for 3D Human Pose Estimation in the Wild.
CoRR, 2017

Semi-Latent GAN: Learning to generate and modify facial images from attributes.
CoRR, 2017

A Jointly Learned Deep Architecture for Facial Attribute Analysis and Face Detection in the Wild.
CoRR, 2017

Learning to Generate and Edit Hairstyles.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Frame-Transformer Emotion Classification Network.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Patch Image Based LSMR Method for Moving Point Target Detection.
Proceedings of the Intelligence Science I, 2017

Evolving boxes for fast vehicle detection.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Iterative object and part transfer for fine-grained recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach.
Proceedings of the IEEE International Conference on Computer Vision, 2017

DSOD: Learning Deeply Supervised Object Detectors from Scratch.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Multi-scale Deep Learning Architectures for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Weakly Supervised Dense Video Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017


2016
Flexible multi-task learning with latent task grouping.
Neurocomputing, 2016

Multiple task learning with flexible structure regularization.
Neurocomputing, 2016

Low-Rank and Sparse Decomposition Based Frame Difference Method for Small Infrared Target Detection in Coastal Surveillance.
IEICE Trans. Inf. Syst., 2016

Facial Landmark Localization by Part-Aware Deep Convolutional Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Face Recognition via Active Annotation and Learning.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Model-Based Deep Hand Pose Estimation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Online video tracking using collaborative convolutional networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Robust online visual tracking via a temporal ensemble framework.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Regional Gating Neural Networks for Multi-label Image Classification.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling.
IEEE Trans. Image Process., 2015

Fusing Multi-Stream Deep Networks for Video Classification.
CoRR, 2015

Learning to Point and Count.
CoRR, 2015

Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Evaluating Two-Stream CNN for Video Classification.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Multiple Granularity Descriptors for Fine-Grained Categorization.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Weakly supervised semantic segmentation for social images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Cross-Modal Image Clustering via Canonical Correlation Analysis.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
A Graph Minor Perspective to Multicast Network Coding.
IEEE Trans. Inf. Theory, 2014

Bounding the Advantage of Multicast Network Coding in General Network Models.
IEEE Trans. Commun., 2014

Cost-Sensitive Multi-View Learning Machine.
Int. J. Pattern Recognit. Artif. Intell., 2014

Do More Dropouts in Pool5 Feature Maps for Better Object Detection.
CoRR, 2014

Addressing cold start in recommender systems: a semi-supervised co-training algorithm.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Leveraging Color Harmony and Spatial Context for Aesthetic Assessment of Photographs.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained Visual Categorization.
Proceedings of the Computer Vision - ECCV 2014, 2014

Semantic Segmentation Using Multiple Graphs with Block-Diagonal Constraints.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Predicting Emotions in User-Generated Videos.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Query-Adaptive Image Search With Hash Codes.
IEEE Trans. Multim., 2013

A Segmentation and Graph-Based Video Sequence Matching Method for Video Copy Detection.
IEEE Trans. Knowl. Data Eng., 2013

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation.
IEEE Trans. Speech Audio Process., 2013

An efficient Kernel-based matrixized least squares support vector machine.
Neural Comput. Appl., 2013

Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

A graph minor perspective to network coding: Connecting algebraic coding with network topologies.
Proceedings of the IEEE INFOCOM 2013, Turin, Italy, April 14-19, 2013, 2013

Sparse Reconstruction for Weakly Supervised Semantic Segmentation.
Proceedings of the IJCAI 2013, 2013

Multi-View Embedding Learning for Incompletely Labeled Data.
Proceedings of the IJCAI 2013, 2013

Automatic Name-Face Alignment to Enable Cross-Media News Retrieval.
Proceedings of the IJCAI 2013, 2013

Multiple Task Learning Using Iteratively Reweighted Least Square.
Proceedings of the IJCAI 2013, 2013

An Adaptive Query Prototype Modeling Method for Image Search Reranking.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Understanding and Predicting Interestingness of Videos.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation.
IEEE Trans. Image Process., 2012

Gradient Ordinal Signature and Fixed-Point Embedding for Efficient Near-Duplicate Video Detection.
IEEE Trans. Circuits Syst. Video Technol., 2012

A simplified multi-class support vector machine with reduced dual optimization.
Pattern Recognit. Lett., 2012

A covariance-free iterative algorithm for distributed principal component analysis on vertically partitioned data.
Pattern Recognit., 2012

Inverse matrix-free incremental proximal support vector machine.
Decis. Support Syst., 2012

Groupwise Constrained Reconstruction for Subspace Clustering
CoRR, 2012

A Double-Ranking Strategy for Long-Tail Product Recommendation.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Smart Information Network: A Testbed Architecture for Future Internet.
Proceedings of the Testbeds and Research Infrastructure. Development of Networks and Communities, 2012

Leveraging Exemplar and Saliency Model for Image Search Reranking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Semi-supervised multi-instance multi-label learning for video annotation task.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

A fast video event recognition system and its application to video search.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Min-cost multicast networks in Euclidean space.
Proceedings of the 2012 IEEE International Symposium on Information Theory, 2012

On benefits of network coding in bidirected networks and hyper-networks.
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012

A Novel and Adaptive Method for Image Search Reranking.
Proceedings of the Advances on Digital Television and Wireless Multimedia Communications, 2012

Groupwise Constrained Reconstruction for Subspace Clustering.
Proceedings of the 29th International Conference on Machine Learning, 2012

Learning Hybrid Part Filters for Scene Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Trajectory-Based Modeling of Human Actions with Motion Reference Points.
Proceedings of the Computer Vision - ECCV 2012, 2012

Learning attention map from images.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Parallel proximal support vector machine for high-dimensional pattern classification.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Semantic context learning with large-scale weakly-labeled image set.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Real-Time, Adaptive, and Locality-Based Graph Partitioning Method for Video Scene Clustering.
IEEE Trans. Circuits Syst. Video Technol., 2011

A Hybrid Probabilistic Model for Unified Collaborative and Content-Based Image Tagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Refining local descriptors by embedding semantic information for visual categorization.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Automatic image annotation with weakly labeled dataset.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards content-based audio fragment authentication.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Ensemble multi-instance multi-label learning approach for video annotation task.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Ensemble approach based on conditional random field for multi-label image and video annotation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Level influence of spatial pyramid matching in object classification.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Multi-Kernel Multi-Label Learning with Max-Margin Concept Network.
Proceedings of the IJCAI 2011, 2011

Fusion of Multiple Features and Supervised Learning for Chinese OOV Term Detection and POS Guessing.
Proceedings of the IJCAI 2011, 2011

Learning Inter-Related Statistical Query Translation Models for English-Chinese Bi-Directional CLIR.
Proceedings of the IJCAI 2011, 2011

Cross-Domain Collaborative Filtering over Time.
Proceedings of the IJCAI 2011, 2011

Correlative multi-label multi-instance image annotation.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Salient Object Detection using concavity context.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Transfer active learning.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Tracking User-Preference Varying Speed in Collaborative Filtering.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Constructions of cryptographically significant boolean functions using primitive polynomials.
IEEE Trans. Inf. Theory, 2010

Semi-automatic dynamic auxiliary-tag-aided image annotation.
Pattern Recognit., 2010

Normalized dimensionality reduction using nonnegative matrix factorization.
Neurocomputing, 2010

Fudan University at TRECVID 2010 : Semantic Indexing.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Robust music identification based on low-order zernike moment in the compressed domain.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Robust audio identification for MP3 popular music.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Achieving O(1) IP lookup on GPU-based software routers.
Proceedings of the ACM SIGCOMM 2010 Conference on Applications, 2010

The architecture and recognition algorithm in Haibao perceptual development robot.
Proceedings of the 2010 IEEE International Conference on Robotics and Biomimetics, 2010

A novel audio fingerprinting method robust to time scale modification and pitch shifting.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Semantic video indexing by fusing explicit and implicit context spaces.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Bilingual query translation and expansion for supporting more effective cross-language image retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Robust hashing for music copyright protection by combining beat segmentation and chroma.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

How context helps: A discriminative codeword selection method for object detection.
Proceedings of the International Conference on Image Processing, 2010

SVD-SIFT for web near-duplicate image detection.
Proceedings of the International Conference on Image Processing, 2010

Quick matting: A matting method based on pixel spread and propagation.
Proceedings of the International Conference on Image Processing, 2010

Maximizing Growth Codes Utility in Large-Scale Wireless Sensor Networks.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Fusion of Multiple Features and Ranking SVM for Web-based English-Chinese OOV Term Translation.
Proceedings of the COLING 2010, 2010

Structured max-margin learning for multi-label image annotation.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

An effective method for video genre classification.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Transfer incremental learning for pattern classification.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
New Balanced Boolean Functions with Good Cryptographic Properties.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Web image retrieval reranking with multi-view clustering.
Proceedings of the 18th International Conference on World Wide Web, 2009

Fudan University at TRECVID 2009 High Level Feature Extraction and Copy Detection.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Tree-structured data regeneration with network coding in distributed storage systems.
Proceedings of the 17th International Workshop on Quality of Service, 2009

Temporal context as cortical spatial codes.
Proceedings of the International Joint Conference on Neural Networks, 2009

Matrix-based Kernel Principal Component analysis for large-scale data set.
Proceedings of the International Joint Conference on Neural Networks, 2009

Can Movies and Books Collaborate? Cross-Domain Collaborative Filtering for Sparsity Reduction.
Proceedings of the IJCAI 2009, 2009

Transfer learning for collaborative filtering via a rating-matrix generative model.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Content and context-based multi-label image annotation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

English-Chinese Bi-Directional OOV Translation based on Web Mining and Supervised Learning.
Proceedings of the ACL 2009, 2009

Incorporating Spatial Correlogram into Bag-of-Features Model for Scene Categorization.
Proceedings of the Computer Vision, 2009

2008
Incorporating feature hierarchy and boosting to achieve more effective classifier training and concept-oriented video summarization and skimming.
ACM Trans. Multim. Comput. Commun. Appl., 2008

The design of video segmentation-aided VCR support for P2P VoD systems.
IEEE Trans. Consumer Electron., 2008

Metric learning by discriminant neighborhood embedding.
Pattern Recognit., 2008

Multilayer in-place learning networks for modeling functional layers in the laminar cortex.
Neural Networks, 2008

Fudan University at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Detecting Interesting Regions in Photographs - How Metadata Can Help.
Proceedings of the Advances in Multimedia Information Processing, 2008

Collaborative and content-based image labeling.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

An Improved Generalized Discriminant Analysis for Large-Scale Data Set.
Proceedings of the Seventh International Conference on Machine Learning and Applications, 2008

CODED IP: On the Feasibility of IP-Layer Network Coding.
Proceedings of the 17th International Conference on Computer Communications and Networks, 2008

Swifter: Chunked Network Coding for Peer-to-Peer Content Distribution.
Proceedings of IEEE International Conference on Communications, 2008

Scene segmentation based on video structure and spectral methods.
Proceedings of the 10th International Conference on Control, 2008

Fudan University: hierarchical video retrieval with adaptive multi-modal fusion.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
A robust incremental learning framework for accurate skin region segmentation in color images.
Pattern Recognit., 2007

A Multilayer in-Place Learning Network for Development of General Invariances.
Int. J. Humanoid Robotics, 2007

ACVoD: a peer-to-peer based video-on-demand scheme in broadband residential access networks.
Int. J. Ad Hoc Ubiquitous Comput., 2007

News Video Retrieval by Learning Multimodal Semantic Information.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007

A Semi-automatic Feature Selecting Method for Sports Video Highlight Annotation.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007

Mining Large-Scale News Video Database Via Knowledge Visualization.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007

Fudan University at TRECVID 2007.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Semantic Information Extraction of Video Based on Ontology and Inference.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

Local Dual Closed Loop Model Based Bayesian Face Tracking.
Proceedings of the Advances in Multimedia Information Processing, 2007

Incremetal Spatio-Temporal Feature Extraction and Retrieval for Large Video Database.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

The Multilayer In-Place Learning Network for the Development of General Invariances and Multi-Task Learning.
Proceedings of the International Joint Conference on Neural Networks, 2007

Optimal dimensionality of metric space for classification.
Proceedings of the Machine Learning, 2007

Support cluster machine.
Proceedings of the Machine Learning, 2007

Efficient Feature Extraction for Image Classification.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Design of a Fairness Guarantee Mechanism Based on Network Measurement.
Proceedings of the Tenth IEEE International Symposium on High Assurance Systems Engineering (HASE 2007), 2007

Salient Object Detection on Large-Scale Video Data.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Localized audio watermarking technique robust against time-scale modification.
IEEE Trans. Multim., 2006

Hierarchical Indexing Structure for Efficient Similarity Search in Video Retrieval.
IEEE Trans. Knowl. Data Eng., 2006

Discriminant neighborhood embedding for classification.
Pattern Recognit., 2006

Null Foley-Sammon transform.
Pattern Recognit., 2006

Efficient Sports Video Retrieval Based on Index Structure.
J. Comput. Res. Dev., 2006

Post-Refinement of Shot Boundary Detection Based on Manifold Feature.
J. Comput. Res. Dev., 2006

Fudan University at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

In-Place Learning for Positional and Scale Invariance.
Proceedings of the International Joint Conference on Neural Networks, 2006

A Load Balance Based On-Demand Routing Protocol for Mobile Ad-Hoc Networks.
Proceedings of the Computational Science, 2006

Quotient Set-based Nonlinear Manifold for Image Restoration.
Proceedings of the Ninth International Conference on Control, 2006

An Efficient Early Termination Algorithm of Intra Prediction for H.264/AVC.
Proceedings of the Ninth International Conference on Control, 2006

2005
InsightVideo: toward hierarchical video content organization for efficient browsing, summarization and retrieval.
IEEE Trans. Multim., 2005

Enhanced shot boundary detection using video text information.
IEEE Trans. Consumer Electron., 2005

A New Spectral-Based Approach to Query-by-Humming for MP3 Songs Database.
Proceedings of the Second World Enformatika Conference, 2005

Fudan University at TRECVID 2005.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

An efficient approach for video information retrieval.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2005, 2005

A Novel Peer-to-Peer Intrusion Detection System.
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005

Simulating Large-Scale Traffic Aggregation in an Automatic Switched Optical Network.
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005

VoD Service Model and Performance Evaluation on the China's High Performance Broadband Information Network (3Tnet).
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005

Efficient Video Clip Retrieval Using Index Structure.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Region-based Pornographic Image Detection.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Spectral Images and Features Co-Clustering with Application to Content-based Image Retrieval.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Efficient rate control for MPEG-2 to H.264/AVC transcoding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Effective shot boundary classification using video spatial-temporal information.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Program segmentation for TV videos.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

A Buffer-Driven Network-Adaptive Multicast Rate Control Approach for Internet DTV.
Proceedings of the Information Networking, 2005

A Mobile Agent-Based P2P Autonomous Security Hole Discovery System.
Proceedings of the Advances in Natural Computation, First International Conference, 2005

Loop-Based Topology Maintenance in Wireless Sensor Networks.
Proceedings of the Networking and Mobile Computing, Third International Conference, 2005

Loop-based topology maintenance and route discovery for wireless sensor networks.
Proceedings of the Global Telecommunications Conference, 2005. GLOBECOM '05, St. Louis, Missouri, USA, 28 November, 2005

Content-Based Image and Video Indexing and Retrieval.
Proceedings of the Cognitive Systems, Joint Chinese-German Workshop, Shanghai, 2005

A Mobile Agent-based P2P Model for Autonomous Security Hole Discovery.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

A Comparison of End-to-end Performance Over Three Multicast Sending Rate Control Schemes For Internet DTV.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

A Novel Architecture for Video-on-Demand Services.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

2004
A fast video clip retrieval algorithm based on VA-file.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Gabor Features Based Method Using HDR (G-HDR) for Multiview Face Recognition.
Proceedings of the Advances in Biometric Person Authentication, 2004

Novel Video Error Concealment Using Shot Boundary Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Vote-Based Clustering Algorithm in Mobile Ad Hoc Networks.
Proceedings of the Information Networking, 2004

Musical Genre Classification by Instrumental Features.
Proceedings of the 2004 International Computer Music Conference, 2004

A New Efficient Approach to Query by Humming.
Proceedings of the 2004 International Computer Music Conference, 2004

Effective video text detection using line features.
Proceedings of the 8th International Conference on Control, 2004

Improved shot boundary detection method based on text edges.
Proceedings of the 8th International Conference on Control, 2004

Efficient identification of speakers in news video based on shot segmentation.
Proceedings of the 8th International Conference on Control, 2004

Improved Robust Watermarking in DCT Domain for Color Images.
Proceedings of the 18th International Conference on Advanced Information Networking and Applications (AINA 2004), 2004

2003
An Audio Watermarking Technique That Is Robust Against Random Cropping.
Comput. Music. J., 2003

Audio Watermarking Based on Statistical Feature in Wavelet Domain.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

An effective and simple relevance feedback algorithm for image retrieval.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Content Based Localized Robust Audio Watermarking.
Proceedings of the Interactive Multimedia on Next Generation Networks, 2003

Audio Watermarking Based on Music Content Analysis: Robust against Time Scale Modification.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

A Novel Feature-Based Robust Audio Watermarking for Copyright Protection.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

Multi-channel Data Hiding Scheme for Color Images.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

An Optimized Multi-bits Blind Watermarking Scheme.
Proceedings of the Information and Communications Security, 5th International Conference, 2003

Robust Spatial Data Hiding for Color Images.
Proceedings of the Communications and Multimedia Security, 2003

An Improved Dynamic Priority Queue for Multimedia Network Communications.
Proceedings of the 17th International Conference on Advanced Information Networking and Applications (AINA'03), 2003

A Mobile Multicast Algorithm Using Agents for Mobile Ad-hoc Users.
Proceedings of the 17th International Conference on Advanced Information Networking and Applications (AINA'03), 2003

2002
Angle-Tree: a new index structure for high-dimensional point data.
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002

Qualitative Camera Motion Classification for Content-Based Video Indexing.
Proceedings of the Advances in Multimedia Information Processing, 2002

Semi-automatic Video Content Annotation.
Proceedings of the Advances in Multimedia Information Processing, 2002

2001
Automatic Scene Detection in News Program by Integrating Visual Feature and Rules.
Proceedings of the Advances in Multimedia Information Processing, 2001

2000
Index point data using algebraic lattice.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

1999
A new way to reduce candidate blocks for block matching motion estimation.
Proceedings of the ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications, 1999


  Loading...