Wen Gao

Affiliations:
  • Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
  • Peking University, School of Electrical Engineering & Computer Science
  • University of Tokyo, Japan (PhD 1991)


According to our database1, Wen Gao authored at least 1,360 papers between 1995 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

SVT-AVS3: An Open-Source High-Performance AVS3 Encoder With Scalable Video Technology.
IEEE Trans. Multim., 2024

Cross Modal Compression With Variable Rate Prompt.
IEEE Trans. Multim., 2024

2023
Multimodal driver distraction detection using dual-channel network of CNN and Transformer.
Expert Syst. Appl., December, 2023

Magnification-arbitrary depth super-resolution with multiscale consistency deformable alignment.
Displays, December, 2023

LPHOG: A Line Feature and Point Feature Combined Rotation Invariant Method for Heterologous Image Registration.
Remote. Sens., September, 2023

Semantic-Aware Visual Decomposition for Image Coding.
Int. J. Comput. Vis., September, 2023

Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.
Mach. Intell. Res., August, 2023

DMVC: Decomposed Motion Modeling for Learned Video Compression.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

Toward 6G $\text{TK}\mu$ Extreme Connectivity: Architecture, Key Technologies and Experiments.
IEEE Wirel. Commun., June, 2023

Multiscale Attention Fusion for Depth Map Super-Resolution Generative Adversarial Networks.
Entropy, June, 2023

The Waterloo Streaming Quality-of-Experience Database-IV.
Dataset, May, 2023

Sequential Hierarchical Learning with Distribution Transformation for Image Super-Resolution.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

AMSA: Adaptive Multimodal Learning for Sentiment Analysis.
ACM Trans. Multim. Comput. Commun. Appl., 2023

A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

RealVR: Efficient, Economical, and Quality-of- Experience-Driven VR Video System Based on MPEG OMAF.
IEEE Trans. Multim., 2023

Textural and Directional Information Based Offset In-Loop Filtering in AVS3.
IEEE Trans. Multim., 2023

Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion.
IEEE Trans. Multim., 2023

Residual Quantization for Low Bit-Width Neural Networks.
IEEE Trans. Multim., 2023

Differential Weight Quantization for Multi-Model Compression.
IEEE Trans. Multim., 2023

STAM: A SpatioTemporal Attention Based Memory for Video Prediction.
IEEE Trans. Multim., 2023

Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-Local Attention.
IEEE Trans. Image Process., 2023

Learned Image Compression Using Cross-Component Attention Mechanism.
IEEE Trans. Image Process., 2023

Driver Emotion Recognition With a Hybrid Attentional Multimodal Fusion Framework.
IEEE Trans. Affect. Comput., 2023

Multiscale and multidirection depth map super resolution with semantic inference.
IET Image Process., 2023

AI Alignment: A Comprehensive Survey.
CoRR, 2023

MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding.
CoRR, 2023

A Survey on Temporal Knowledge Graph Completion: Taxonomy, Progress, and Prospects.
CoRR, 2023

CSSL-RHA: Contrastive Self-Supervised Learning for Robust Handwriting Authentication.
CoRR, 2023

SpikeCodec: An End-to-end Learned Compression Framework for Spiking Camera.
CoRR, 2023

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation.
CoRR, 2023

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey.
CoRR, 2023

Monocular 3D Object Detection With Motion Feature Distillation.
IEEE Access, 2023

Immersive experience via 6DoF adaptive streaming.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

A Near Lossless Learned Image Coding Network Quantization Approach for Cross-Platform Inference.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

HumVis: Human-Centric Visual Analysis System.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

NUCQ: Non-Uniform Conditional Quantization for Learned Image Compression.
Proceedings of the IEEE International Conference on Image Processing, 2023

Communication power supply design based on PFC and LLC.
Proceedings of the 4th International Conference on Computer Engineering and Intelligent Control, 2023

An Adaptive Intra-frame Quantization Parameter Derivation Model Jointing with Inter-frame Analysis.
Proceedings of the Data Compression Conference, 2023

An Efficient Rate Control Scheme for Video Compression in Low-latency Interoperable Interfaces.
Proceedings of the Data Compression Conference, 2023

Learning to Compress Unmanned Aerial Vehicle (UAV) Captured Video: Benchmark and Analysis.
Proceedings of the Data Compression Conference, 2023

Rate-Distortion Optimization for Cross Modal Compression.
Proceedings of the Data Compression Conference, 2023

2022
Multiobjective Oriented Task Scheduling in Heterogeneous Mobile Edge Computing Networks.
IEEE Trans. Veh. Technol., 2022

NR-CNN: Nested-Residual Guided CNN In-loop Filtering for Video Coding.
ACM Trans. Multim. Comput. Commun. Appl., 2022

A Bayesian Quality-of-Experience Model for Adaptive Streaming Videos.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Towards Analysis-Friendly Face Representation With Scalable Feature and Texture Compression.
IEEE Trans. Multim., 2022

Iterative Network for Image Super-Resolution.
IEEE Trans. Multim., 2022

Implicitly Selected Transform for AVS3.
IEEE Trans. Image Process., 2022

An Autofocus Approach for UAV-Based Ultrawideband Ultrawidebeam SAR Data With Frequency-Dependent and 2-D Space-Variant Motion Errors.
IEEE Trans. Geosci. Remote. Sens., 2022

Enhanced Surveillance Video Compression With Dual Reference Frames Generation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Scalable Intra Coding Optimization for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2022

A Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression.
IEEE Trans. Circuits Syst. Video Technol., 2022

Neural Network-Based Enhancement to Inter Prediction for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2022

Cross-SRN: Structure-Preserving Super-Resolution Network With Cross Convolution.
IEEE Trans. Circuits Syst. Video Technol., 2022

FPX-NIC: An FPGA-Accelerated 4K Ultra-High-Definition Neural Video Coding System.
IEEE Trans. Circuits Syst. Video Technol., 2022

A Real-Time Ultra-High Definition Video Decoder of AVS3 on Heterogeneous Systems.
IEEE Trans. Circuits Syst. Video Technol., 2022

Reconstructing Clear Image for High-Speed Motion Scene With a Retina-Inspired Spike Camera.
IEEE Trans. Computational Imaging, 2022

Pose-Guided Representation Learning for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

SMR: Satisfied Machine Ratio Modeling for Machine Recognition-Oriented Image and Video Compression.
CoRR, 2022

Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression.
CoRR, 2022

Toward 6G TKμ Extreme Connectivity: Architecture, Key Technologies and Experiments.
CoRR, 2022

STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video Prediction.
CoRR, 2022

Textural-Structural Joint Learning for No-Reference Super-Resolution Image Quality Assessment.
CoRR, 2022

Learning Weighting Map for Bit-Depth Expansion within a Rational Range.
CoRR, 2022

STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond.
CoRR, 2022

Evolution of AVS video coding standards: twenty years of innovation and development.
Sci. China Inf. Sci., 2022

Classification of Giemsa staining chromosome using input-aware deep convolutional neural network with integrated uncertainty estimates.
Biomed. Signal Process. Control., 2022

Consistency-Contrast Learning for Conceptual Coding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Review of Personalized Health Navigation for Drivers.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Monocular-Vision-Based Positioning Method for UAV Formation.
Proceedings of the 21st International Symposium on Communications and Information Technologies, 2022

Location-Aware Feature Selection Network for Multi-Oriented Scene Text Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

End-to-End Image Compression via Attention-Guided Information-Preserving Module.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Multi-Scale end-to-End Learning for Point Cloud Geometry Compression.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fast Partition Mode Decision via a Plug-in Fully Connected Network for Video Coding.
Proceedings of the Data Compression Conference, 2022

High-Order Intra Prediction for Future Video Coding.
Proceedings of the Data Compression Conference, 2022

Rate Distortion Characteristic Modeling for Neural Image Compression.
Proceedings of the Data Compression Conference, 2022

STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Instance-Aware Dynamic Neural Network Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards End-to-End Image Compression and Analysis with Transformers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Learning to Fool the Speaker Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Handling Outliers by Robust M-Estimation in Blind Image Deblurring.
IEEE Trans. Multim., 2021

Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification.
IEEE Trans. Multim., 2021

Towards Fine-Grained Human Pose Transfer With Detail Replenishing Network.
IEEE Trans. Image Process., 2021

Sub-Sampled Cross-Component Prediction for Emerging Video Coding Standards.
IEEE Trans. Image Process., 2021

Shape-Preserving Stereo Object Remapping via Object-Consistent Grid Warping.
IEEE Trans. Image Process., 2021

Divisively Normalized Sparse Coding: Toward Perceptual Visual Signal Representation.
IEEE Trans. Cybern., 2021

Lighter but Efficient Bit-Depth Expansion Network.
IEEE Trans. Circuits Syst. Video Technol., 2021

Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds.
IEEE Trans. Circuits Syst. Video Technol., 2021

Ensemble Learning-Based Rate-Distortion Optimization for End-to-End Image Compression.
IEEE Trans. Circuits Syst. Video Technol., 2021

Deep Network-Based Frame Extrapolation With Reference Frame Alignment.
IEEE Trans. Circuits Syst. Video Technol., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Residual Spatial and Channel Attention Networks for Single Image Dehazing.
Sensors, 2021

Just Recognizable Distortion for Machine Vision Oriented Image and Video Coding.
Int. J. Comput. Vis., 2021

Deep Trajectory Post-Processing and Position Projection for Single & Multiple Camera Multiple Object Tracking.
Int. J. Comput. Vis., 2021

Deep Human-Interaction and Association by Graph-Based Learning for Multiple Object Tracking in the Wild.
Int. J. Comput. Vis., 2021

Prediction With Multicross Component for Future Video Coding.
IEEE Multim., 2021

Driver stress detection via multimodal fusion using attention-based CNN-LSTM.
Expert Syst. Appl., 2021

Post-Training Quantization for Vision Transformer.
CoRR, 2021

Real-Time 3D Object Detection From Point Cloud Through Foreground Segmentation.
IEEE Access, 2021

Haze Relevant Feature Attention Network for Single Image Dehazing.
IEEE Access, 2021

GazeTance Guidance: Gaze and Distance-Based Content Presentation for Virtual Museum.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2021

Enhanced Cross Component Sample Adaptive Offset for AVS3.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Instance Segmentation Based Background Reference Frame Generation for Surveillance Video Coding.
Proceedings of the Picture Coding Symposium, 2021

Post-Training Quantization for Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MAU: A Motion-Aware Unit for Video Prediction and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Intrinsic Temporal Regularization for High-resolution Human Video Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Cross Modal Compression: Towards Human-comprehensible Semantic Compression.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Video Coding for Machine.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Spatial-Temporal Correlation Learning for Real-Time Video Deinterlacing.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Multiple Task-Oriented Encoders for Unified Image Fusion.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

ASTM: An Attention based Spatiotemporal Model for Video Prediction Using 3D Convolutional Neural Networks.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

STAE: A Spatiotemporal Auto-Encoder for High-Resolution Video Prediction.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Implicit Seleted Transform Skip Method For Avs3.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Knowledge Distillation From End-To-End Image Compression To Vvc Intra Coding For Perceptual Quality Enhancement.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Iprnn: An Information-Preserving Model For Video Prediction Using Spatiotemporal Grus.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Evolutionary Quantization of Neural Networks with Mixed-Precision.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Adaptive Pyramid Single-View Depth Lookup Table Coding Method.
Proceedings of the IEEE International Conference on Acoustics, 2021

Flow-Grounded Dynamic Texture Synthesis for Video Compression.
Proceedings of the 31st Data Compression Conference, 2021

Intra Block Partition Structure Prediction via Convolutional Neural Network.
Proceedings of the 31st Data Compression Conference, 2021

Progressive Stage-Wise Learning for Unsupervised Feature Representation Enhancement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Pre-Trained Image Processing Transformer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

Segatron: Segment-Aware Transformer for Language Modeling and Understanding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Fast Graph Sampling Set Selection Using Gershgorin Disc Alignment.
IEEE Trans. Signal Process., 2020

Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm.
IEEE Trans. Multim., 2020

Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval.
IEEE Trans. Multim., 2020

Multimedia Intelligence: When Multimedia Meets Artificial Intelligence.
IEEE Trans. Multim., 2020

SACNN: Self-Attention Convolutional Neural Network for Low-Dose CT Denoising With Self-Supervised Perceptual Loss Network.
IEEE Trans. Medical Imaging, 2020

Compressed Image Restoration via Artifacts-Free PCA Basis Learning and Adaptive Sparse Modeling.
IEEE Trans. Image Process., 2020

Graph-Based Non-Convex Low-Rank Regularization for Image Compression Artifact Reduction.
IEEE Trans. Image Process., 2020

Unified Intra Mode Coding Based on Short and Long Range Correlations.
IEEE Trans. Image Process., 2020

Perceptual Temporal Incoherence-Guided Stereo Video Retargeting.
IEEE Trans. Image Process., 2020

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics.
IEEE Trans. Image Process., 2020

No-Reference Image Quality Assessment: An Attention Driven Approach.
IEEE Trans. Image Process., 2020

MUGGLE: MUlti-Stream Group Gaze Learning and Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2020

Fast and Accurate Action Detection in Videos With Motion-Centric Attention Model.
IEEE Trans. Circuits Syst. Video Technol., 2020

Multi-Scale Convolutional Neural Network-Based Intra Prediction for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2020

Contrast Enhancement via Dual Graph Total Variation-Based Image Decomposition.
IEEE Trans. Circuits Syst. Video Technol., 2020

Unsupervised Blind Image Quality Evaluation via Statistical Measurements of Structure, Naturalness, and Perception.
IEEE Trans. Circuits Syst. Video Technol., 2020

Single-Image Blind Deblurring Using Multi-Scale Latent Structure Prior.
IEEE Trans. Circuits Syst. Video Technol., 2020

Multiple Kernel $k$k-Means with Incomplete Kernels.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Direct Speech-to-Image Translation.
IEEE J. Sel. Top. Signal Process., 2020

Optimization-Inspired Compact Deep Compressive Sensing.
IEEE J. Sel. Top. Signal Process., 2020

FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare.
IEEE Intell. Syst., 2020

Intrinsic Temporal Regularization for High-resolution Human Video Synthesis.
CoRR, 2020

Deinterlacing Network for Early Interlaced Videos.
CoRR, 2020

Implicit Subspace Prior Learning for Dual-Blind Face Restoration.
CoRR, 2020

Assessing the Quality-of-Experience of Adaptive Bitrate Video Streaming.
CoRR, 2020

Progressive Multi-Scale Residual Network for Single Image Super-Resolution.
CoRR, 2020

SegaBERT: Pre-training of Segment-aware BERT for Language Understanding.
CoRR, 2020

Location-Aware Feature Selection for Scene Text Detection.
CoRR, 2020

Semantics of the Unwritten.
CoRR, 2020

Rectified Meta-Learning from Noisy Labels for Robust Image-based Plant Disease Diagnosis.
CoRR, 2020

Combining a gradient-based method and an evolution strategy for multi-objective reinforcement learning.
Appl. Intell., 2020

Optical Flow Estimation Between Images of Different Resolutions via Variational Method.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multimedia Intelligence: When Multimedia Meets Artificial Intelligence.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

SVT-AVS3: Scalable Video Technology of AVS3.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Abnormal Traffic Congestion Recognition Based on Video Analysis.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

GPU-Based Intra Decompression for 8K Real-Time AVS3 Decoder.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Evaluating Multi-Class Segmentation Errors with Anatomical Priors.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Implicit-Selected Transform in Video Coding.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Inheritability-Inspired Intra Coding Optimization for AVS3.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

CNN-Based Inter Prediction Refinement for AVS3.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Region-Adaptive Texture Enhancement For Detailed Person Image Synthesis.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

A Real-Time AVS3 8K-UHD Encoding and Decoding System.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Prediction with Multi-Cross Component.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Universal Adversarial Perturbations Generative Network For Speaker Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

GPU based Real-Time UHD Intra Decoding for AVS3.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Affine Direct/Skip Mode with Motion Vector Differences in Video Coding.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Performance and Computational Complexity Analysis of Coding Tools in AVS3.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Self-Supervised Learning of Depth and Pose Using Cycle Generative Adversarial Network.
Proceedings of the IEEE International Conference on Image Processing, 2020

Optimization Of Motion Compensation Based On GPU And CPU For VVC Decoding.
Proceedings of the IEEE International Conference on Image Processing, 2020

An Original Domain based Acceleration Algorithm of Logarithmic Binary Arithmetic Coding.
Proceedings of the ICDSP 2020: 4th International Conference on Digital Signal Processing, 2020

uAVS3d: Fast Decoder for the 3rd Generation Audio Video Coding Standard (AVS3).
Proceedings of the ICDSP 2020: 4th International Conference on Digital Signal Processing, 2020

Learning to Fool the Speaker Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Two-Stage Multi-Objective Deep Reinforcement Learning Framework.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Linear Model Based Geometry Coding for Lidar Acquired Point Clouds.
Proceedings of the Data Compression Conference, 2020

Implicit Geometry Partition for Point Cloud Compression.
Proceedings of the Data Compression Conference, 2020

Sub-Sampled Cross-Component Prediction for Chroma Component Coding.
Proceedings of the Data Compression Conference, 2020

Multi-View Spectral Clustering with Optimal Neighborhood Laplacian Matrix.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CGD: Multi-View Clustering via Cross-View Graph Diffusion.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
GLAD: Global-Local-Alignment Descriptor for Scalable Person Re-Identification.
IEEE Trans. Multim., 2019

GPU-Based Hierarchical Motion Estimation for High Efficiency Video Coding.
IEEE Trans. Multim., 2019

Blind Quality Assessment of Camera Images Based on Low-Level and High-Level Statistical Features.
IEEE Trans. Multim., 2019

Deep Progressive Hashing for Image Retrieval.
IEEE Trans. Multim., 2019

Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation.
IEEE Trans. Image Process., 2019

Fine-Grained Quality Assessment for Compressed Images.
IEEE Trans. Image Process., 2019

Depth Super-Resolution via Joint Color-Guided Internal and External Regularizations.
IEEE Trans. Image Process., 2019

Depth Restoration From RGB-D Data via Joint Adaptive Regularization and Thresholding on Manifolds.
IEEE Trans. Image Process., 2019

Graph-Based Joint Dequantization and Contrast Enhancement of Poorly Lit JPEG Images.
IEEE Trans. Image Process., 2019

Graph-Based Blind Image Deblurring From a Single Photograph.
IEEE Trans. Image Process., 2019

Deep Reconstruction of Least Significant Bits for Bit-Depth Expansion.
IEEE Trans. Image Process., 2019

Joint Gaze Correction and Face Beautification for Conference Video Using Dual Sparsity Prior.
IEEE Trans. Ind. Electron., 2019

Predicting Diverse Future Frames With Local Transformation-Guided Masking.
IEEE Trans. Circuits Syst. Video Technol., 2019

CG-Cast: Scalable Wireless Image SoftCast Using Compressive Gradient.
IEEE Trans. Circuits Syst. Video Technol., 2019

How to Assess the Quality of Compressed Surveillance Videos Using Face Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2019

Efficient Prediction Methods With Enhanced Spatial-Temporal Correlation for HEVC.
IEEE Trans. Circuits Syst. Video Technol., 2019

Attention driven person re-identification.
Pattern Recognit., 2019

Late Fusion Incomplete Multi-View Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Total-ionization-dose characterization of a radiation-hardened mixed-signal microcontroller SoC in 180 nm CMOS technology for nanosatellites.
Microelectron. J., 2019

Front-End Smart Visual Sensing and Back-End Intelligent Analysis: A Unified Infrastructure for Economizing the Visual System of City Brain.
IEEE J. Sel. Areas Commun., 2019

Toward Knowledge as a Service Over Networks: A Deep Learning Model Communication Paradigm.
IEEE J. Sel. Areas Commun., 2019

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond.
IEEE Multim., 2019

Compact Descriptors for Video Analysis: The Emerging MPEG Standard.
IEEE Multim., 2019

Surface Light Field Compression Using a Point Cloud Codec.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2019

Spherical Coordinates Transform-Based Motion Model for Panoramic Video Coding.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2019

A Knowledge-Driven Quality-of-Experience Model for Adaptive Streaming Videos.
CoRR, 2019

Masked Non-Autoregressive Image Captioning.
CoRR, 2019

MCF3D: Multi-Stage Complementary Fusion for Multi-Sensor 3D Object Detection.
IEEE Access, 2019

Signal-Independent Separable KLT by Offline Training for Video Coding.
IEEE Access, 2019

Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

No-Reference Image Quality Assessment: An Attention Driven Approach.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Toward Intelligent Visual Sensing and Low-cost Analysis: A Collaborative Computing Approach.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Recent Development of AVS Video Coding Standard: AVS3.
Proceedings of the Picture Coding Symposium, 2019

Residual in Residual Based Convolutional Neural Network In-loop Filter for AVS3.
Proceedings of the Picture Coding Symposium, 2019

Fine-granularity ownership identification of multimedia content components: a mechanism for smart media streaming.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Layered Image Compression Using Scalable Auto-Encoder.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Disentangled Human Action Video Generation via Decoupled Learning.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

History-Based Motion Vector Prediction for Future Video Coding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Scalable Facial Image Compression with Deep Feature Reconstruction.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Global-Local Temporal Representations for Video Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multiscale Directional Fusion for Depth Map Super Resolution with Denoising.
Proceedings of the IEEE International Conference on Acoustics, 2019

Reconstruction-cognizant Graph Sampling Using Gershgorin Disc Alignment.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adaptive Wavelet Domain Filter for Versatile Video Coding (VVC).
Proceedings of the Data Compression Conference, 2019

Separable KLT for Intra Coding in Versatile Video Coding (VVC).
Proceedings of the Data Compression Conference, 2019

Pyramid Convolutional Network for Single Image Deraining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Self-Critical N-Step Training for Image Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Efficient and Effective Incomplete Multi-View Clustering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Adaptive view synthesis optimization for low complexity 3D-HEVC encoding.
J. Vis. Lang. Comput., 2018

Building Invariances Into Sparse Subspace Clustering.
IEEE Trans. Signal Process., 2018

Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching.
IEEE Trans. Multim., 2018

Hybrid Intraprediction Based on Local and Nonlocal Correlations.
IEEE Trans. Multim., 2018

Supervised Distributed Hashing for Large-Scale Multimedia Retrieval.
IEEE Trans. Multim., 2018

Multiscale Deep Alternative Neural Network for Large-Scale Video Classification.
IEEE Trans. Multim., 2018

Reduced-Reference Image Quality Assessment in Free-Energy Principle and Sparse Representation.
IEEE Trans. Multim., 2018

Globally Variance-Constrained Sparse Representation and Its Application in Image Set Coding.
IEEE Trans. Image Process., 2018

Region-Aware Reflection Removal With Unified Content and Gradient Priors.
IEEE Trans. Image Process., 2018

Fast Image Super-Resolution via Local Adaptive Gradient Field Sharpening Transform.
IEEE Trans. Image Process., 2018

Prior-Based Quantization Bin Matching for Cloud Storage of JPEG Images.
IEEE Trans. Image Process., 2018

Fully Connected Network-Based Intra Prediction for Image Coding.
IEEE Trans. Image Process., 2018

Interacting Tracklets for Multi-Object Tracking.
IEEE Trans. Image Process., 2018

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.
IEEE Trans. Image Process., 2018

Fast MPEG-CDVS Encoder With GPU-CPU Hybrid Computing.
IEEE Trans. Image Process., 2018

Hybrid All Zero Soft Quantized Block Detection for HEVC.
IEEE Trans. Image Process., 2018

Learning Affective Features With a Hybrid Deep Model for Audio-Visual Emotion Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Rate-Distortion Optimized Sparse Coding With Ordered Dictionary for Image Set Compression.
IEEE Trans. Circuits Syst. Video Technol., 2018

MPEG Internet Video Coding Standard and Its Performance Evaluation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Reduced-Reference Quality Assessment of Screen Content Images.
IEEE Trans. Circuits Syst. Video Technol., 2018

Image Denoising via Low Rank Regularization Exploiting Intra and Inter Patch Correlation.
IEEE Trans. Circuits Syst. Video Technol., 2018

Parallel In-Loop Filtering in HEVC Encoder on GPU.
IEEE Trans. Consumer Electron., 2018

Parametric local multiview hamming distance metric learning.
Pattern Recognit., 2018

Multi-type attributes driven multi-camera person re-identification.
Pattern Recognit., 2018

Multi-Task Learning with Low Rank Attribute Embedding for Multi-Camera Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Local patch encoding-based method for single image super-resolution.
Inf. Sci., 2018

Learning a collaborative multiscale dictionary based on robust empirical mode decomposition.
Neurocomputing, 2018

Joint Rate Allocation with Both Look-ahead And Feedback Model For High Efficiency Video Coding.
CoRR, 2018

A nonlinear transform based analog video transmission framework.
CoRR, 2018

The future of artificial intelligence in China.
Commun. ACM, 2018

CREAM: CNN-REgularized ADMM Framework for Compressive-Sensed Image Reconstruction.
IEEE Access, 2018

Beyond Knowledge Distillation: Collaborative Learning for Bidirectional Model Assistance.
IEEE Access, 2018

Visual Information Evaluation With Entropy of Primitive.
IEEE Access, 2018

BoostNet: A Structured Deep Recursive Network to Boost Image Deblocking.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Perceptual Temporal Incoherence Aware Stereo Video Retargeting.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Computed Tomography Image Enhancement Using 3D Convolutional Neural Network.
Proceedings of the Deep Learning in Medical Image Analysis - and - Multimodal Learning for Clinical Decision Support, 2018

A Frame Level Rate Control Algorithm For Screen Content Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Localized Incomplete Multiple Kernel k-means.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Robust Contrast Enhancement via Graph-Based Cartoon-Texture Decomposition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Neural Network Based Inter Prediction for HEVC.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

RAM: A Region-Aware Deep Model for Vehicle Re-Identification.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Dense Relation Network: Learning Consistent and Context-Aware Representation for Semantic Image Segmentation.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Enhanced Ctu-Level Inter Prediction with Deep Frame Rate Up-Conversion for High Efficiency Video Coding.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Multiresolution Contourlet Transform Fusion Based Depth Map Super Resolution.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Framework for Surface Light Field Compression.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Cluster-Based Point Cloud Coding with Normal Weighted Graph Fourier Transform.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Blind Image Deblurring Via Reweighted Graph Total Variation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Fast and Robust Image Upsampling by Local Adaptive Gradient Field Sharpening Transform.
Proceedings of the 2018 Data Compression Conference, 2018

Compressed Image Restoration via External-Image Assisted Band Adaptive PCA Model Learning.
Proceedings of the 2018 Data Compression Conference, 2018

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Depth-Aware Stereo Video Retargeting.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Enhanced Motion Vector Prediction for Video Coding.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Fast Image Upsampling via Adaptive Gradient Sharpening and Transformed- Texture Constraint.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Joint Rate-Distortion Optimization for Simultaneous Texture and Deep Feature Compression of Facial Images.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

SAP: Self-Adaptive Proposal Model for Temporal Action Detection Based on Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Signal Dependent Transform Based on SVD for HEVC Intracoding.
IEEE Trans. Multim., 2017

Utility-Driven Adaptive Preprocessing for Screen Content Video Compression.
IEEE Trans. Multim., 2017

Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU.
IEEE Trans. Multim., 2017

HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval.
IEEE Trans. Multim., 2017

A Joint Compression Scheme of Video Feature Descriptors and Visual Content.
IEEE Trans. Image Process., 2017

Power Distortion Optimization for Uncoded Linear Transformed Transmission of Images and Videos.
IEEE Trans. Image Process., 2017

Hybrid Laplace Distribution-Based Low Complexity Rate-Distortion Optimized Quantization.
IEEE Trans. Image Process., 2017

Reducing Image Compression Artifacts by Structural Sparse Representation and Quantization Constraint Prior.
IEEE Trans. Circuits Syst. Video Technol., 2017

Video Compressive Sensing Reconstruction via Reweighted Residual Sparsity.
IEEE Trans. Circuits Syst. Video Technol., 2017

Low-Rank-Based Nonlocal Adaptive Loop Filter for High-Efficiency Video Compression.
IEEE Trans. Circuits Syst. Video Technol., 2017

Fast Intra-Mode and CU Size Decision for HEVC.
IEEE Trans. Circuits Syst. Video Technol., 2017

Merge Mode for Deformable Block Motion Information Derivation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Inter-View Dependency-Based Rate Control for 3D-HEVC.
IEEE Trans. Circuits Syst. Video Technol., 2017

Entropy of Primitive: From Sparse Representation to Visual Information Evaluation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Nonlocal Gradient Sparsity Regularization for Image Restoration.
IEEE Trans. Circuits Syst. Video Technol., 2017

Color Image-Guided Boundary-Inconsistent Region Refinement for Stereo Matching.
IEEE Trans. Circuits Syst. Video Technol., 2017

iAVS2: A Fast Intra-Encoding Platform for IEEE 1857.4.
IEEE Trans. Circuits Syst. Video Technol., 2017

A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC.
IEEE Trans. Circuits Syst. II Express Briefs, 2017

High-Efficiency Image Coding via Near-Optimal Filtering.
IEEE Signal Process. Lett., 2017

Just-Noticeable Difference-Based Perceptual Optimization for JPEG Compression.
IEEE Signal Process. Lett., 2017

uAVS2 - Fast encoder for the 2nd generation IEEE 1857 video coding standard.
Signal Process. Image Commun., 2017

Attributes driven tracklet-to-tracklet person re-identification using latent prototypes space mapping.
Pattern Recognit., 2017

Towards human-like and transhuman perception in AI 2.0: a review.
Frontiers Inf. Technol. Electron. Eng., 2017

Cross-media analysis and reasoning: advances and directions.
Frontiers Inf. Technol. Electron. Eng., 2017

Quality assessment for real out-of-focus blurred images.
J. Vis. Commun. Image Represent., 2017

Blind restoration for nonuniform aerial images using nonlocal Retinex model and shearlet-based higher-order regularization.
J. Electronic Imaging, 2017

Iterative projection reconstruction for fast and efficient image upsampling.
Neurocomputing, 2017

A Bio-Inspired Multi-Exposure Fusion Framework for Low-light Image Enhancement.
CoRR, 2017

Local Patch Classification Based Framework for Single Image Super-Resolution.
CoRR, 2017

GUN: Gradual Upsampling Network for single image super-resolution.
CoRR, 2017

Fast MPEG-CDVS Encoder with GPU-CPU Hybrid Computing.
CoRR, 2017

Wireless image and video soft transmission via perception-inspired power distortion optimization.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Rate-distortion optimized scan for point cloud color compression.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Internal generative mechanism inspired reduced reference image quality assessment with entropy of primitive.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Image super-resolution based on adaptive joint distribution modeling.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Low rank regularization exploiting intra and inter patch correlation for image denoising.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Compressive gradient based scalable image SoftCast.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Improved intra boundary filters for HEVC.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Data-Dependent Sparsity for Subspace Clustering.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Better and Faster, when ADMM Meets CNN: Compressive-Sensed Image Reconstruction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Content Adaptive Constraint Based Image Upsampling.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

ODD: An Algorithm of Online Directional Dictionary Learning for Sparse Representation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Parallax-Robust Hexahedral Panoramic Video Stitching.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Sparsity-Promoting Adaptive Coding with Robust Empirical Mode Decomposition for Image Restoration.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Panel: Cross-media Intelligence.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Fast intra coding unit size decision for HEVC with GPU based keypoint detection.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Intelligent analysis oriented surveillance video coding.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Enhanced intra prediction for inter pictures.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Asymmetric circular projection for dynamic virtual reality video stream switching.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A new motion model for panoramic video coding.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Correlation preserving on graphs for image denoising.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Pose-Driven Deep Convolutional Model for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Performance Guaranteed Network Acceleration via High-Order Residual Quantization.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A cache-based bandwidth optimized motion compensation architecture for video decoder.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Globally Variance-Constrained Sparse Representation for Rate-Distortion Optimized Image Representation.
Proceedings of the 2017 Data Compression Conference, 2017

Compact Deep Invariant Descriptors for Video Retrieval.
Proceedings of the 2017 Data Compression Conference, 2017

Wireless Image SoftCast Using Compressive Gradient.
Proceedings of the 2017 Data Compression Conference, 2017

Fast Rate Distortion Optimized Quantization Method Based on Early Detection of Zero Block for HEVC.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Beyond Monte Carlo Tree Search: Playing Go with Deep Alternative Neural Network and Long-Term Evaluation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Exploring Algorithmic Limits of Matrix Rank Minimization Under Affine Constraints.
IEEE Trans. Signal Process., 2016

CSPS: An Adaptive Pooling Method for Image Classification.
IEEE Trans. Multim., 2016

Guided Image Contrast Enhancement Based on Retrieved Images in Cloud.
IEEE Trans. Multim., 2016

Blind Quality Assessment of Tone-Mapped Images Via Analysis of Information, Naturalness, and Structure.
IEEE Trans. Multim., 2016

Hybrid Zero Block Detection for High Efficiency Video Coding.
IEEE Trans. Multim., 2016

Efficient Generalized Fused Lasso and Its Applications.
ACM Trans. Intell. Syst. Technol., 2016

CONCOLOR: Constrained Non-Convex Low-Rank Model for Image Deblocking.
IEEE Trans. Image Process., 2016

Low-Rank Decomposition-Based Restoration of Compressed Images via Adaptive Noise Estimation.
IEEE Trans. Image Process., 2016

Analysis of Decorrelation Transform Gain for Uncoded Wireless Image and Video Communication.
IEEE Trans. Image Process., 2016

Image Denoising via Bandwise Adaptive Modeling and Regularization Exploiting Nonlocal Similarity.
IEEE Trans. Image Process., 2016

Just Noticeable Difference Estimation for Screen Content Images.
IEEE Trans. Image Process., 2016

Compressive Sampling-Based Image Coding for Resource-Deficient Visual Communication.
IEEE Trans. Image Process., 2016

Overview of the MPEG-CDVS Standard.
IEEE Trans. Image Process., 2016

Multilevel Modified Finite Radon Transform Network for Image Upsampling.
IEEE Trans. Circuits Syst. Video Technol., 2016

Joint Chroma Downsampling and Upsampling for Screen Content Image.
IEEE Trans. Circuits Syst. Video Technol., 2016

The MPEG Internet Video-Coding Standard [Standards in a Nutshell].
IEEE Signal Process. Mag., 2016

An enhanced entropy coding scheme for HEVC.
Signal Process. Image Commun., 2016

Arithmetic coding using hierarchical dependency context model for H.264/AVC video coding.
Multim. Tools Appl., 2016

Spatially variant defocus blur map estimation and deblurring from a single image.
J. Vis. Commun. Image Represent., 2016

Low complexity encoder optimization for HEVC.
J. Vis. Commun. Image Represent., 2016

Fast algorithms and VLSI architecture design for HEVC intra-mode decision.
J. Real Time Image Process., 2016

Local Quantization Code histogram for texture classification.
Neurocomputing, 2016

Nonlocal In-Loop Filter: The Way Toward Next-Generation Video Coding?
IEEE Multim., 2016

Subjective and Objective Quality Assessment of Compressed Screen Content Images.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2016

Globally Variance-Constrained Sparse Representation for Image Set Compression.
CoRR, 2016

Maximal Sparsity with Deep Networks?
CoRR, 2016

Face Detection with End-to-End Integration of a ConvNet and a 3D Model.
CoRR, 2016

Recurrent Attentional Model for No-Reference Image Quality Assessment.
CoRR, 2016

Correlation Preserving Sparse Coding Over Multi-level Dictionaries for Image Denoising.
CoRR, 2016

Blind restoration for non-uniform aerial images using non-local Retinex model and shearlet-based higher-order regularization.
CoRR, 2016

Frame interpolation with pixel-level motion vector field and mesh based hole filling.
CAAI Trans. Intell. Technol., 2016

Improved entropy of primitive for visual information estimation.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Robust view interpolation with mesh cutting.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

A structure-preserving image restoration method with high-level ensemble constraints.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Maximal Sparsity with Deep Networks?
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Deep Alternative Neural Network: Exploring Contexts as Early as Possible for Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

GPU based sample adaptive offset parameter decision and perceptual optimization for HEVC.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

To Project More or to Quantize More: Minimize Reconstruction Bias for Learning Compact Binary Codes.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Mesh Denoising with Local Guided Normal Filtering and Non-local Similarity.
Proceedings of the Digital TV and Wireless Multimedia Communication, 2016

Smart query expansion scheme for CDVS based on illumination and key features.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Improving chroma intra prediction for HEVC.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Adaptive multi-dimension sparsity based coefficient estimation for compression artifact reduction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Mode-dependent pixel-wise motion refinement for HEVC.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Structure preserving single image super-resolution.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Mode dependent intra smoothing filter for HEVC.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

AnalogCast: Full linear coding and pseudo analog transmission for satellite remote-sensing images.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep Attributes Driven Multi-camera Person Re-identification.
Proceedings of the Computer Vision - ECCV 2016, 2016

Compressive-Sensed Image Coding via Stripe-based DPCM.
Proceedings of the 2016 Data Compression Conference, 2016

Nonconvex Lp Nuclear Norm based ADMM Framework for Compressed Sensing.
Proceedings of the 2016 Data Compression Conference, 2016

From Visual Search to Video Compression: A Compact Representation Framework for Video Feature Descriptors.
Proceedings of the 2016 Data Compression Conference, 2016

Structure-driven Adaptive Non-local Filter for High Efficiency Video Coding (HEVC).
Proceedings of the 2016 Data Compression Conference, 2016

A Fast and Lossless IDCT Design for AVS2 Codec.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Affinity Preserving Quantization for Hashing: A Vector Quantization Approach to Compact Learn Binary Codes.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Performance Evaluation, Overview.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Face Misalignment Problem.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Reduced Reference Stereoscopic Image Quality Assessment Based on Binocular Perceptual Information.
IEEE Trans. Multim., 2015

Weighted Component Hashing of Binary Aggregated Descriptors for Fast Visual Search.
IEEE Trans. Multim., 2015

High Resolution Local Structure-Constrained Image Upsampling.
IEEE Trans. Image Process., 2015

Video Compression Artifact Reduction via Spatio-Temporal Multi-Hypothesis Prediction.
IEEE Trans. Image Process., 2015

Depth-Preserving Warping for Stereo Image Retargeting.
IEEE Trans. Image Process., 2015

AVS2 ? Making Video Coding Smarter [Standards in a Nutshell].
IEEE Signal Process. Mag., 2015

Computation-constrained dynamic search range control for real-time video encoder.
Signal Process. Image Commun., 2015

Multi-order visual phrase for scalable partial-duplicate visual search.
Multim. Syst., 2015

Video super-resolution with registration-reliability regulation and adaptive total variation.
J. Vis. Commun. Image Represent., 2015

Dynamic macroblock wavefront parallelism for parallel video coding.
J. Vis. Commun. Image Represent., 2015

Fusing cross-media for topic detection by dense keyword groups.
Neurocomputing, 2015

Instance-specific canonical correlation analysis.
Neurocomputing, 2015

Model-based low bit-rate video coding for resource-deficient wireless visual communication.
Neurocomputing, 2015

Cross-pose Face Recognition by Canonical Correlation Analysis.
CoRR, 2015

Evolving defense mechanism for future network security.
IEEE Commun. Mag., 2015

Robust image/video super-resolution display.
Proceedings of the 21st ACM Symposium on Virtual Reality Software and Technology, 2015

A dual structured-sparsity model for compressive-sensed video reconstruction.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Rate-distortion based sparse coding for image set compression.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Enhanced inter prediction with localized weighted prediction in HEVC.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Hybrid angular intra/template matching prediction for HEVC intra coding.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Registration of under-sampled images via higher resolution spectrum restoration.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Registration-reliability based strategy to enhance multi-frame super-resolution algorithms.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

A HVS-guided approach for real-time image interpolation.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

A fast super-resolution method based on sparsity properties.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Sparse Structural Similarity for Objective Image Quality Assessment.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Demo abstract: WearableHUB: An open pervasive wearable data fusion platform for personal health management.
Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communication Workshops, 2015

A study on interest point guided visual saliency.
Proceedings of the 2015 Picture Coding Symposium, 2015

Macroblock level rate control for low delay H.264/AVC based video communication.
Proceedings of the 2015 Picture Coding Symposium, 2015

Synthesized Views Distortion Model Based Rate Control in 3D-HEVC.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Mobile Visual Search.
Proceedings of the 16th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2015

Thousand to one: An image compression system via cloud search.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Improving QoS in SDN with lossless multi-domain reconfigurations.
Proceedings of the 23rd IEEE International Symposium on Quality of Service, 2015

Non-Local Structure-Based Filter for Video Coding.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

An inter-image redundancy measure for image set compression.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Towards accurate visual information estimation with Entropy of Primitive.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

High accuracy sub-pixel image registration under noisy condition.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Image compressive sensing using overlapped block projection and reconstruction.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Multiple layer parallel motion estimation on GPU for High Efficiency Video Coding (HEVC).
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Quality-progressive coding for high bit-rate background frames on surveillance videos.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Fast intra mode decision algorithm based on refinement in HEVC.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Hamming Compatible Quantization for Hashing.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Feature-matching based motion prediction for high efficiency video coding in cloud.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Learning class-specific pooling shapes for image classification.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Image deblurring using robust sparsity priors.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image deblocking using group-based sparse representation and quantization constraint prior.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A compact shot representation for video semantic indexing.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image classification using RBM to encode local descriptors with group sparse learning.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A low-light image enhancement method for both denoising and contrast enlarging.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A novel integer-pixel motion estimation algorithm based on quadratic prediction.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Multi-Task Learning with Low Rank Attribute Embedding for Person Re-Identification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

High efficiency VLSI implementation of an edge-directed video up-scaler using high level synthesis.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Optimizing Binary Fisher Codes for Visual Search.
Proceedings of the 2015 Data Compression Conference, 2015

Overview of the MPEG CDVS Standard.
Proceedings of the 2015 Data Compression Conference, 2015

Background Subtraction via generalized fused lasso foreground modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Image denoising via adaptive soft-thresholding based on non-local samples.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

The second generation IEEE 1857 video coding standard.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Towards Accurate and Efficient Image Quality Assessment with Interest Points.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Cloud Based Image Contrast Enhancement.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Adaptive transform with HEVC intra coding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Stable Feature Selection from Brain sMRI.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Visual Saliency Computation - A Machine Learning Perspective
Lecture Notes in Computer Science 8408, Springer, ISBN: 978-3-319-05642-5, 2014

Low Complexity Adaptive View Synthesis Optimization in HEVC Based 3D Video Coding.
IEEE Trans. Multim., 2014

Towards Mobile Document Image Retrieval for Digital Library.
IEEE Trans. Multim., 2014

Electrified Vehicles and the Smart Grid: The ITS Perspective.
IEEE Trans. Intell. Transp. Syst., 2014

A Compact Representation for Compressing Converted Stereo Videos.
IEEE Trans. Image Process., 2014

Group-Based Sparse Representation for Image Restoration.
IEEE Trans. Image Process., 2014

USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval.
IEEE Trans. Image Process., 2014

Cascade Category-Aware Visual Search.
IEEE Trans. Image Process., 2014

Optimizing the Hierarchical Prediction and Coding in HEVC for Surveillance and Conference Videos With Background Modeling.
IEEE Trans. Image Process., 2014

Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding.
IEEE Trans. Image Process., 2014

Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A Unified Framework.
IEEE Trans. Image Process., 2014

Image Interpolation via Graph-Based Bayesian Label Propagation.
IEEE Trans. Image Process., 2014

Spatiotemporal Grid Flow for Video Retargeting.
IEEE Trans. Image Process., 2014

Mining Compact Bag-of-Patterns for Low Bit Rate Mobile Visual Search.
IEEE Trans. Image Process., 2014

Image Restoration Using Joint Statistical Modeling in a Space-Transform Domain.
IEEE Trans. Circuits Syst. Video Technol., 2014

Fast encoder decision for texture coding in 3D-HEVC.
Signal Process. Image Commun., 2014

Window-based rate control for video quality optimization with a novel INTER-dependent rate-distortion model.
Signal Process. Image Commun., 2014

Image compressive sensing recovery using adaptively learned sparsifying basis via L0 minimization.
Signal Process., 2014

Web video thumbnail recommendation with content-aware analysis and query-sensitive matching.
Multim. Tools Appl., 2014

Optimal entropy-constrained non-uniform scalar quantizer design for low bit-rate pixel domain DVC.
Multim. Tools Appl., 2014

Guest Editorial.
J. Multim., 2014

Indexing heterogeneous features with superimages.
Int. J. Multim. Inf. Retr., 2014

A Shape Reconstructability Measure of Object Part Importance with Applications to Object Detection and Localization.
Int. J. Comput. Vis., 2014

Local Stereo Matching with Improved Matching Cost and Disparity Refinement.
IEEE Multim., 2014

Compact Descriptors for Visual Search.
IEEE Multim., 2014

The IEEE 1857 Standard: Empowering Smart Video Surveillance Systems.
IEEE Intell. Syst., 2014

Guest Editorial Content-Aware Visual Systems: Analysis, Streaming, and Retargeting.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2014

ObjectPatchNet: Towards scalable and semantic image annotation and retrieval.
Comput. Vis. Image Underst., 2014

IEEE Standards for Advanced Audio and Video Coding in Emerging Applications.
Computer, 2014

Virtual network mapping for multi-domain data plane in Software-Defined Networks.
Proceedings of the 4th International Conference on Wireless Communications, 2014

An all-zero blocks early detection method for high-efficiency video coding.
Proceedings of the Visual Information Processing and Communication V, 2014

Low-cost multi-hypothesis motion compensation for video coding.
Proceedings of the Visual Information Processing and Communication V, 2014

Video compressive sensing via structured Laplacian modelling.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

A fast intra optimization algorithm for HEVC.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

A new frame interpolation method with pixel-level motion vector field.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Adaptive frame level rate control in 3D-HEVC.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Gradient based image/video softcast with grouped-patch collaborative reconstruction.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Layer-based image completion by poisson surface reconstruction.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

AdaFlow: Adaptive Control to Improve Availability of OpenFlow Forwarding for Burst Quantity of Flows.
Proceedings of the Testbeds and Research Infrastructure: Development of Networks and Communities, 2014

An Adaptive Perceptual Quantization Algorithm Based on Block-Level JND for Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

A Multi-exposure Fusion Method Based on Locality Properties.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Superimage: Packing Semantic-Relevant Images for Indexing and Retrieval.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Multi-level low-complexity coefficient discarding scheme for video encoder.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Transform domain energy modeling of natural images for wireless SoftCast optimization.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

A resolution-adaptive interpolation filter for video codec.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Non-local extension of total variation regularization for image restoration.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Lossless reconfiguration protocol for multi-domain data plane in software-defined networks.
Proceedings of the 2014 Proceedings IEEE INFOCOM Workshops, Toronto, ON, Canada, April 27, 2014

Image compressive-sensing recovery using structured laplacian sparsity in DCT domain and multi-hypothesis prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Layered image/video softcast with hybrid digital-analog transmission for robust wireless visual communication.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Overview of IEEE 1857.3: Systems of advanced audio and video coding.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Gradient based image transmission and reconstruction using non-local gradient sparsity regularization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

A low-complexity hardware-oriented mode decision scheme based on rate-distoration estimation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Inter-dependent rate-distortion modeling for video coding and its application to rate control.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Cost-volume filtering-based stereo matching with improved matching cost and secondary refinement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Artifact reduction of compressed video via three-dimensional adaptive estimation of transform coefficients.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

High quality image reconstruction via non-local collaborative estimation for wireless image/video softcast.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A fast intra coding algorithm for HEVC.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Component hashing of variable-length binary aggregated descriptors for fast image search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Joint optimization of JPEG quantization table and coefficient thresholding for low bitrate mobile visual search.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Hybridcast: A wireless image/video SoftCast scheme using layered representation and hybrid digital-analog modulation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Region-based depth-preserving stereoscopic image retargeting.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Hierarchical dependency context model based arithmetic coding for DCT video compression.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A low complexity and high performance interpolation filter for MPEG IVC.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Adaptive multi-resolution motion estimation using texture-based search strategies.
Proceedings of the IEEE International Conference on Consumer Electronics, 2014

Low-delay window-based rate control scheme for video quality optimization in video encoder.
Proceedings of the IEEE International Conference on Acoustics, 2014

HEVC decoder acceleration on multi-core X86 platform.
Proceedings of the IEEE International Conference on Acoustics, 2014

G-CAST: Gradient Based Image SoftCast for Perception-Friendly Wireless Visual Communication.
Proceedings of the Data Compression Conference, 2014

Robust Estimation of 3D Human Poses from a Single Image.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Lagrange multiplier based perceptual optimization for high efficiency video coding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Hybrid-Indexing Multi-type Features for Large-Scale Image Search.
Proceedings of the Computer Vision - ACCV 2014, 2014

Efficient Generalized Fused Lasso and its Application to the Diagnosis of Alzheimer's Disease.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Advanced Video Coding Systems
Springer, ISBN: 978-3-319-14243-2, 2014

2013
Content-based copy detection through multimodal feature representation and temporal pyramid matching.
ACM Trans. Multim. Comput. Commun. Appl., 2013

On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS.
IEEE Trans. Multim., 2013

Fast and Efficient Transcoding Based on Low-Complexity Background Modeling and Adaptive Block Classification.
IEEE Trans. Multim., 2013

Learning to Distribute Vocabulary Indexing for Scalable Visual Search.
IEEE Trans. Multim., 2013

Compression Artifact Reduction by Overlapped-Block Transform Coefficient Estimation With Block Similarity.
IEEE Trans. Image Process., 2013

Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search.
IEEE Trans. Image Process., 2013

Perceptual Video Coding Based on SSIM-Inspired Divisive Normalization.
IEEE Trans. Image Process., 2013

Interactive Stereoscopic Video Conversion.
IEEE Trans. Circuits Syst. Video Technol., 2013

Rate control for consistent visual quality of H.264/AVC encoding.
Signal Process. Image Commun., 2013

Learning from mobile contexts to minimize the mobile location search latency.
Signal Process. Image Commun., 2013

Learning discriminative features for fast frame-based action recognition.
Pattern Recognit., 2013

A comparative study on illumination preprocessing in face recognition.
Pattern Recognit., 2013

Rate-GOP Based Rate Control for High Efficiency Video Coding.
IEEE J. Sel. Top. Signal Process., 2013

Mode Dependent Coding Tools for Video Coding.
IEEE J. Sel. Top. Signal Process., 2013

Video Copy-Detection and Localization with a Scalable Cascading Framework.
IEEE Multim., 2013

IEEE 1857: Boosting Video Applications in CPSS.
IEEE Intell. Syst., 2013

Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search.
AI Mag., 2013

On lossless coding for HEVC.
Proceedings of the Visual Information Processing and Communication IV, 2013

Entropy of primitive: A top-down methodology for evaluating the perceptual visual information.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

An improved image compression scheme with an adaptive parameters set in encrypted domain.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Improved disparity vector derivation in 3D-HEVC.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Power-distortion optimization for wireless image/video SoftCast by transform coefficients energy modeling with adaptive chunk division.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Laplace distribution based CTU level rate control for HEVC.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Surveillance video coding with quadtree partition based ROI extraction.
Proceedings of the 30th Picture Coding Symposium, 2013

Reduced reference image quality assessment using entropy of primitives.
Proceedings of the 30th Picture Coding Symposium, 2013

Efficient bit allocation and CTU level rate control for High Efficiency Video Coding.
Proceedings of the 30th Picture Coding Symposium, 2013

Dynamic MB-level Scheduling for parallel video coding.
Proceedings of the 30th Picture Coding Symposium, 2013

A Novel Hardware-Based UHD Video Up-Scaler Based on Local Structure Estimation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Binary Adaptive Luminance Mapping for Motion Estimation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

An Efficient Zigzag Scanning and Entropy Coding Architecture Design.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Acceleration of HEVC transform and inverse transform on ARM NEON platform.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013

A highly effective error concealment method for whole frame loss.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Performance analysis of transform in uncoded wireless visual communication.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Single underwater image enhancement with a new optical model.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Multi layer based rate control algorithm for HEVC.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A high-throughput low-latency arithmetic encoder design for HDTV.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

A spatial inter-view auto-regressive super-resolution scheme for multi-view image via scene matching algorithm.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Mobile media communication, processing, and analysis: A review of recent advances.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Parametric Local Multimodal Hashing for Cross-View Similarity Search.
Proceedings of the IJCAI 2013, 2013

Spatial error concealment via model based coupled sparse representation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

A highly efficient external memory interface architecture for AVS HD video encoder.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

An error robust distortion model for depth map coding in error prone network.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Pair-wise event detection using cubic features and sequence discriminant learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Wavelet inpainting driven image compression via collaborative sparsity at low bit rates.
Proceedings of the IEEE International Conference on Image Processing, 2013

Overview of the IEEE 1857 surveillance groups.
Proceedings of the IEEE International Conference on Image Processing, 2013

Instance-specific canonical correlation analysis for pose alignment.
Proceedings of the IEEE International Conference on Image Processing, 2013

High definition IEEE AVS decoder on ARM NEON platform.
Proceedings of the IEEE International Conference on Image Processing, 2013

Fast multi reference frame motion estimation for high efficiency video coding.
Proceedings of the IEEE International Conference on Image Processing, 2013

Overview of IEEE 1857 video coding standard.
Proceedings of the IEEE International Conference on Image Processing, 2013

Estimation of end-to-end distortion of virtual view for error-resilient depth map coding.
Proceedings of the IEEE International Conference on Image Processing, 2013

Multi-order visual phrase for scalable image search.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Scalable mobile search with binary phrase.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

A Method of Perceptual-Based Shape Decomposition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Quadratic ρ-domain based rate control algorithm for HEVC.
Proceedings of the IEEE International Conference on Acoustics, 2013

Robust fisher codes for large scale image retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

Content adaptive in-loop depth map filter for HEVC based 3DV coding.
Proceedings of the IEEE International Conference on Acoustics, 2013

On the interoperability of local descriptors compression.
Proceedings of the IEEE International Conference on Acoustics, 2013

Structural Group Sparse Representation for Image Compressive Sensing Recovery.
Proceedings of the 2013 Data Compression Conference, 2013

Hierarchical-and-Adaptive Bit-Allocation with Selective Background Prediction for High Efficiency Video Coding (HEVC).
Proceedings of the 2013 Data Compression Conference, 2013

Progressive Image Restoration through Hybrid Graph Laplacian Regularization.
Proceedings of the 2013 Data Compression Conference, 2013

Low Complexity Rate Distortion Optimization for HEVC.
Proceedings of the 2013 Data Compression Conference, 2013

Image Super-Resolution via Hierarchical and Collaborative Sparse Representation.
Proceedings of the 2013 Data Compression Conference, 2013

A hybrid pixel-block based view synthesis for multiviewpoint 3D video.
Proceedings of the 3DTV-Conference 2013: The True Vision, 2013

2012
A Generic Approach for Systematic Analysis of Sports Videos.
ACM Trans. Intell. Syst. Technol., 2012

Multiview Metric Learning with Global Consistency and Local Smoothness.
ACM Trans. Intell. Syst. Technol., 2012

Group-Sensitive Multiple Kernel Learning for Object Recognition.
IEEE Trans. Image Process., 2012

Manifold-Manifold Distance and its Application to Face Recognition With Image Sets.
IEEE Trans. Image Process., 2012

Removing Label Ambiguity in Learning-Based Visual Saliency Estimation.
IEEE Trans. Image Process., 2012

Coupled Bias-Variance Tradeoff for Cross-Pose Face Recognition.
IEEE Trans. Image Process., 2012

HEVC Lossless Coding and Improvements.
IEEE Trans. Circuits Syst. Video Technol., 2012

Video Coding With Rate-Distortion Optimized Transform.
IEEE Trans. Circuits Syst. Video Technol., 2012

Packet Video Error Concealment With Auto Regressive Model.
IEEE Trans. Circuits Syst. Video Technol., 2012

A Single-Pass-Based Localized Adaptive Interpolation Filter for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2012

A Universal Rate Control Scheme for Video Transcoding.
IEEE Trans. Circuits Syst. Video Technol., 2012

SSIM-Motivated Rate-Distortion Optimization for Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2012

Novel Spatio-Temporal Structural Information Based Video Quality Metric.
IEEE Trans. Circuits Syst. Video Technol., 2012

Multiple Hypotheses Bayesian Frame Rate Up-Conversion by Adaptive Fusion of Motion-Compensated Interpolations.
IEEE Trans. Circuits Syst. Video Technol., 2012

Recovering Missing Contours for Occluded Object Detection.
IEEE Signal Process. Lett., 2012

Learning-based image restoration for compressed images.
Signal Process. Image Commun., 2012

A Concatenational Graph Evolution Aging Model.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Side information generation with auto regressive model for low-delay distributed video coding.
J. Vis. Commun. Image Represent., 2012

Annotating traditional Chinese paintings for immersive virtual exhibition.
ACM Journal on Computing and Cultural Heritage, 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.
Int. J. Comput. Vis., 2012

Image Compressive Sensing Recovery via Collaborative Sparsity.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2012

An efficient foreground-based surveillance video coding scheme in low bit-rate compression.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Low-complexity and high-efficiency background modeling for surveillance video coding.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

New distortion model for depth coding in 3DVC.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Viewpoint-independent hand gesture recognition system.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Spatio-temporal ssim index for video quality assessment.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

A fast multiview video transcoder for bitrate reduction.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Adaptive rate control for High Efficiency Video Coding.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Zero-synthesis view difference aware view synthesis optimization for HEVC based 3D video compression.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

A comparison of fractional-pel interpolation filters in HEVC and H.264/AVC.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Low bit-rate video coding via mode-dependent adaptive regression for wireless visual communications.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Optimizing JPEG quantization table for low bit rate mobile visual search.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Adaptive loop filter with temporal prediction.
Proceedings of the 2012 Picture Coding Symposium, 2012

A flexible and high-performance hardware video encoder architecture.
Proceedings of the 2012 Picture Coding Symposium, 2012

Image Primitive Coding and Visual Quality Assessment.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

An interactive system of stereoscopic video conversion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Content-aware layered compound video compression.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

A New Just-Noticeable-Distortion Model Combined with the Depth Information and its Application in Multi-view Video Coding.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

A Novel Framework for Web Video Thumbnail Generation.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Reducing Blocking Artifacts in Compressed Images via Transform-Domain Non-local Coefficients Estimation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Macro-Block-Level Selective Background Difference Coding for Surveillance Video.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

An Optimized Hardware Video Encoder for AVS with Level C+ Data Reuse Scheme for Motion Estimation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Depth Template Based 2D-to-3D Video Conversion and Coding System.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Social Image Tagging by Mining Sparse Tag Patterns from Auxiliary Data.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Motion vector derivation of deformable block.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Pedestrian detection via part-based topology model.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

PQ-WGLOH: A bit-rate scalable local feature descriptor.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Web image interpolation via weighted total least squares regression.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Learning multiple codebooks for low bit rate mobile visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Predicting the effectiveness of queries for visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Pruning tree-structured vector quantizer towards low bit rate mobile visual search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Separability Oriented Preprocessing for Illumination-Insensitive Face Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Compressed Sensing Recovery via Collaborative Sparsity.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

A Compact Stereoscopic Video Representation for 3D Video Generation and Coding.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

Multi-scale Spatial Error Concealment via Hybrid Bayesian Regression.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

Distributed Soft Video Broadcast (DCAST) with Explicit Motion.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

Towards compact topical descriptors.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

An efficient NEON-based quarter-pel interpolation method for HEVC.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
A Novel Macro-Block Group Based AVS Coding Scheme for Many-Core Processor.
J. Signal Process. Syst., 2011

High-Resolution Face Fusion for Gender Conversion.
IEEE Trans. Syst. Man Cybern. Part A, 2011

Classifiability-Based Discriminatory Projection Pursuit.
IEEE Trans. Neural Networks, 2011

Interpolation-Dependent Image Downsampling.
IEEE Trans. Image Process., 2011

Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications.
IEEE Trans. Image Process., 2011

Window-Level Rate Control for Smooth Picture Quality and Smooth Buffer Occupancy.
IEEE Trans. Image Process., 2011

Image Interpolation Via Regularized Local Linear Regression.
IEEE Trans. Image Process., 2011

A Progressive Quality Hiding Strategy Based on Equivalence Partitions of Hiding Units.
Trans. Data Hiding Multim. Secur., 2011

Multi-Task Rank Learning for Visual Saliency Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2011

Witsenhausen-Wyner Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2011

A Novel Rate Control Technique for Multiview Video Plus Depth Based 3D Video Coding.
IEEE Trans. Broadcast., 2011

Cross-pose face recognition based on partial least squares.
Pattern Recognit. Lett., 2011

Maximal Linear Embedding for Dimensionality Reduction.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Novel intra prediction via position-dependent filtering.
J. Vis. Commun. Image Represent., 2011

Mining Layered Grammar Rules for Action Recognition.
Int. J. Comput. Vis., 2011

Modeling spatial and semantic cues for large-scale near-duplicated image retrieval.
Comput. Vis. Image Underst., 2011

Binocular stereopsis of traditional Chinese paintings.
Proceedings of the 10th International Conference on Virtual Reality Continuum and its Applications in Industry, 2011

New image coding scheme with hierarchical representation and adaptive interpolation.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Fast intra mode selection for stereo video coding using epipolar constraint.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Improved spatial aided low delay Wyner-Ziv video coding by wavelet shrinkage.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Advanced spatial and Temporal Direct Mode for B picture coding.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

A parallel context model for level information in CABAC.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

PKU-IDM @TRECVID2011 CBCD: Content-Based Copy Detection with Cascade of Multimodal Features and Temporal Pyramid Matching.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

ObjectBook construction for large-scale semantic-aware image retrieval.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Grid-Based Retargeting with Transformation Consistency Smoothing.
Proceedings of the Advances in Multimedia Modeling, 2011

Towards low bit rate mobile visual search with multiple-channel coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Stereoscopic learning for disparity estimation.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Enhanced line-based intra prediction with fixed interpolation filtering.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Fast disparity estimation utilizing depth information for multiview video coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Side information extrapolation with temporal and spatial consistency.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Query-Adaptive Ranking with Support Vector Machines for Protein Homology Prediction.
Proceedings of the Bioinformatics Research and Applications - 7th International Symposium, 2011

Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.
Proceedings of the IJCAI 2011, 2011

Saliency Detection Based on Scale Selectivity of Human Visual System.
Proceedings of the Neural Information Processing - 18th International Conference, 2011

Fast retargeting with adaptive grid optimization.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Visual pertinent 2D-to-3D video conversion by multi-cue fusion.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

A new image deblurring algorithm with less ringing artifacts via error variance estimation and soft decision.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

SSIM-inspired divisive normalization for perceptual video coding.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Bayesian frame interpolation by fusing multiple motion-compensated prediction frames.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning the trip suggestion from landmark photos on the web.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

PKUBench: A context rich mobile visual search benchmark.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Visual attention based image quality assessment.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Generating vocabulary for global feature representation towards commerce image retrieval.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Computing importance of 2D contour parts by reconstructability.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Face recognition based on non-corresponding region matching.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Auto-regressive model based error concealment scheme for stereoscopic video coding.
Proceedings of the IEEE International Conference on Acoustics, 2011

Rate-SSIM optimization for video coding.
Proceedings of the IEEE International Conference on Acoustics, 2011

When codeword frequency meets geographical location.
Proceedings of the IEEE International Conference on Acoustics, 2011

A lowbit rate vocabulary coding scheme for mobile landmark search.
Proceedings of the IEEE International Conference on Acoustics, 2011

Sorting local descriptors for lowbit rate mobile visual search.
Proceedings of the IEEE International Conference on Acoustics, 2011

Adaptive discriminant analysis for face recognition from single sample per person.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

iULib: where UDL and Wikipedia could meet.
Proceedings of the Imaging and Printing in a Web 2.0 World II, 2011

Transductive Regression with Local and Global Consistency for Image Super-Resolution.
Proceedings of the 2011 Data Compression Conference (DCC 2011), 2011

Instantly telling what happens in a video sequence using simple features.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Topic level sampling towards optimized locality sensitive vocabulary coding.
Proceedings of the 8th International Conference on Information, 2011

2010
AVS Video Coding Standard.
Proceedings of the Intelligent Multimedia Communication: Techniques and Applications, 2010

Error-resistance and Low-complexity Integer Inverse Discrete Cosine Transform.
J. Signal Process. Syst., 2010

Affective Visualization and Retrieval for Music Video.
IEEE Trans. Multim., 2010

Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context.
IEEE Trans. Multim., 2010

A Motion-Aligned Auto-Regressive Model for Frame Rate Up Conversion.
IEEE Trans. Image Process., 2010

Novel Statistical Modeling, Analysis and Implementation of Rate-Distortion Estimation for H.264/AVC Coders.
IEEE Trans. Circuits Syst. Video Technol., 2010

Corrections to "A Spatio-Temporal Auto Regressive Model for Frame Rate Up-Conversion" [Sep 09 1289-1301].
IEEE Trans. Circuits Syst. Video Technol., 2010

A Hardware-Efficient Multi-Resolution Block Matching Algorithm and its VLSI Architecture for High Definition MPEG-Like Video Encoders.
IEEE Trans. Circuits Syst. Video Technol., 2010

Dual Frame Motion Compensation With Optimal Long-Term Reference Frame Selection and Bit Allocation.
IEEE Trans. Circuits Syst. Video Technol., 2010

Deinterlacing Using Hierarchical Motion Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2010

A Low-Cost Very Large Scale Integration Architecture for Multistandard Inverse Transform.
IEEE Trans. Circuits Syst. II Express Briefs, 2010

Top-Down Gaze Movement Control in Target Search Using Population Cell Coding of Visual Context.
IEEE Trans. Auton. Ment. Dev., 2010

Adaptive Sign Language Recognition With Exemplar Extraction and MAP/IVFS.
IEEE Signal Process. Lett., 2010

Cost-Sensitive Rank Learning From Positive and Unlabeled Data for Visual Saliency Estimation.
IEEE Signal Process. Lett., 2010

Sigma Set Based Implicit Online Learning for Object Tracking.
IEEE Signal Process. Lett., 2010

No-reference perceptual image quality metric using gradient profiles for JPEG2000.
Signal Process. Image Commun., 2010

WLD: A Robust Local Image Descriptor.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Salient object extraction for user-targeted video content association.
J. Zhejiang Univ. Sci. C, 2010

A joint encoder-decoder error control framework for stereoscopic video coding.
J. Vis. Commun. Image Represent., 2010

RD-optimized interactive streaming of multiview video with multiple encodings.
J. Vis. Commun. Image Represent., 2010

Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video.
Int. J. Comput. Vis., 2010

Per-Sample Multiple Kernel Approach for Visual Concept Learning.
EURASIP J. Image Video Process., 2010

MOCC: A Fast and Robust Correlation-Based Method for Interest Point Matching under Large Scale Changes.
EURASIP J. Adv. Signal Process., 2010

Vlogging: A survey of videoblogging technology on the web.
ACM Comput. Surv., 2010

Mediaprinting: Identifying Multimedia Content for Digital Rights Management.
Computer, 2010

Robust video super-resolution with registration efficiency adaptation.
Proceedings of the Visual Communications and Image Processing 2010, 2010

An efficient coding scheme for surveillance videos captured by stationary cameras.
Proceedings of the Visual Communications and Image Processing 2010, 2010

High throughput VLSI architecture for multiresolution integer motion estimation in high definition AVS video encoder.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Frame rate up conversion via Bayesian motion estimation.
Proceedings of the Visual Communications and Image Processing 2010, 2010

SSIM based perceptual distortion rate optimization coding.
Proceedings of the Visual Communications and Image Processing 2010, 2010

A fast intra 4×4 mode decision algorithm for H.264/AVC down rate transcoding.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Side information enhancement via texture and motion activity analysis in distributed video coding.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Template based illumination compensation algorithm for multiview video coding.
Proceedings of the Visual Communications and Image Processing 2010, 2010

PKU@TRECVID2010: Pair-Wise Event Detection in Surveillance Video.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Fast rate-distortion optimized transform for Intra coding.
Proceedings of the Picture Coding Symposium, 2010

A robust video super-resolution algorithm.
Proceedings of the Picture Coding Symposium, 2010

A background model based method for transcoding surveillance videos captured by stationary camera.
Proceedings of the Picture Coding Symposium, 2010

An epipolar resticted inter-mode selection for stereoscopic video encoding.
Proceedings of the Picture Coding Symposium, 2010

Improved autoregressive image model estimation for directional image interpolation.
Proceedings of the Picture Coding Symposium, 2010

Image quality assessment based on local orientation distributions.
Proceedings of the Picture Coding Symposium, 2010

Image interpolation via regularized local linear regression.
Proceedings of the Picture Coding Symposium, 2010

Background aided surveillance-oriented distributed video coding.
Proceedings of the Picture Coding Symposium, 2010

Low-Complexity and Sampling-Aided Multi-view Video Coding at Low Bitrate.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Fast Mode Decision Based on RDO for AVS High Definition Video Encoder.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Building contextual visual vocabulary for large-scale image applications.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Vicept: link visual features to concepts for large-scale image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video retargeting with multi-scale trajectory optimization.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Efficient macroblock pipeline structure in high definition AVS video encoder VLSI architecture.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Interactive Web Video Advertising with Context Analysis and Search.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Boosted Sigma Set for Pedestrian Detection.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Finding Multiple Object Instances with Occlusion.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Interactive viewpoint-space navigation for visual-audio exhibition of painting.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

A steganography strategy based on equivalence partitions of hiding units.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Fast weighted algorithms for bitstream extraction of SVC Medium-Grain scalable video coding.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Position dependent linear intra prediction for image coding.
Proceedings of the International Conference on Image Processing, 2010

Enhanced intra prediction and transform for video coding.
Proceedings of the International Conference on Image Processing, 2010

A single-pass based adaptive interpolation filtering algorithm for video coding.
Proceedings of the International Conference on Image Processing, 2010

A practical algorithm for tanner graph based image interpolation.
Proceedings of the International Conference on Image Processing, 2010

Maximizing intra-individual correlations for illumination-insensitive face recognition.
Proceedings of the International Conference on Image Processing, 2010

Gray-scale super-resolution for face recognition from low Gray-scale resolution face images.
Proceedings of the International Conference on Image Processing, 2010

An interactive method for curve extraction.
Proceedings of the International Conference on Image Processing, 2010

Rate-distortion optimized transform for intra-frame coding.
Proceedings of the IEEE International Conference on Acoustics, 2010

Context-adaptive pixel based prediction for intra frame encoding.
Proceedings of the IEEE International Conference on Acoustics, 2010

Building pair-wise visual word tree for efficent image re-ranking.
Proceedings of the IEEE International Conference on Acoustics, 2010

Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection.
Proceedings of the Computer Vision, 2010

Lighting Aware Preprocessing for Face Recognition across Varying Illumination.
Proceedings of the Computer Vision, 2010

Auto Regressive Model and Weighted Least Squares Based Packet Video Error Concealment.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Tanner Graph Based Image Interpolation.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Visual tracking via weakly supervised learning from multiple imperfect oracles.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Measuring visual saliency by Site Entropy Rate.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Adaptive generic learning for face recognition from a single sample per person.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Novel observation model for probabilistic object tracking.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Towards semantic embedding in visual vocabulary.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Automatic video genre categorization and event detection techniques on large-scale sports data.
Proceedings of the 2010 conference of the Centre for Advanced Studies on Collaborative Research, 2010

Manifold Alignment via Corresponding Projections.
Proceedings of the British Machine Vision Conference, 2010

Skin Color Weighted Disparity Competition for Hand Segmentation from Stereo Camera.
Proceedings of the British Machine Vision Conference, 2010

A visual search system based on multi-scale population cell coding.
Proceedings of the 9th IEEE International Conference on Cognitive Informatics, 2010

2009
Performance Evaluation, Overview.
Proceedings of the Encyclopedia of Biometrics, 2009

Face Misalignment Problem.
Proceedings of the Encyclopedia of Biometrics, 2009

Variable-Bin-Rate CABAC Engine for H.264/AVC High Definition Real-Time Decoding.
IEEE Trans. Very Large Scale Integr. Syst., 2009

Event Tactic Analysis Based on Broadcast Sports Video.
IEEE Trans. Multim., 2009

Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding.
IEEE Trans. Multim., 2009

Hierarchical Ensemble of Global and Local Classifiers for Face Recognition.
IEEE Trans. Image Process., 2009

A Spatio-Temporal Auto Regressive Model for Frame Rate Upconversion.
IEEE Trans. Circuits Syst. Video Technol., 2009

On Rate-Distortion Modeling and Extraction of H.264/SVC Fine-Granular Scalable Video.
IEEE Trans. Circuits Syst. Video Technol., 2009

Complexity-Constrained H.264 Video Encoding.
IEEE Trans. Circuits Syst. Video Technol., 2009

Distributed Video Coding Based on the Human Visual System.
IEEE Signal Process. Lett., 2009

An efficient VLSI architecture for CBAC of AVS HDTV decoder.
Signal Process. Image Commun., 2009

Context-based entropy coding in AVS video coding standard.
Signal Process. Image Commun., 2009

A secure media streaming mechanism combining encryption, authentication, and transcoding.
Signal Process. Image Commun., 2009

Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model.
Signal Process. Image Commun., 2009

Special issue on AVS and its applications: Guest editorial.
Signal Process. Image Commun., 2009

Reconfigurable video coding framework and decoder reconfiguration instantiation of AVS.
Signal Process. Image Commun., 2009

Learned local Gabor patterns for face representation and recognition.
Signal Process., 2009

Contour-motion feature (CMF): A space-time approach for robust pedestrian detection.
Pattern Recognit. Lett., 2009

Synthetic data generation technique in Signer-independent sign language recognition.
Pattern Recognit. Lett., 2009

Optimization of a training set for more robust face detection.
Pattern Recognit., 2009

Are Gabor phases really useless for face recognition?
Pattern Anal. Appl., 2009

Pornographic Image Detection Based on Multilevel Representation.
Int. J. Pattern Recognit. Artif. Intell., 2009

Spatial-Aided Low-Delay Wyner-Ziv Video Coding.
EURASIP J. Image Video Process., 2009

Distributed Video Coding: Trends and Perspectives.
EURASIP J. Image Video Process., 2009

A framework for flexible summarization of racquet sports video using multiple modalities.
Comput. Vis. Image Underst., 2009

Semi-Supervised Learning in Reconstructed Manifold Space for 3D Caricature Generation.
Comput. Graph. Forum, 2009

Efficient discovery of abundant post-translational modifications and spectral pairs using peptide mass and retention time differences.
BMC Bioinform., 2009

Code-matched interleaver design over surrogate channels.
Proceedings of the 2009 IEEE Wireless Communications and Networking Conference, 2009

PKU@TRECVID2009: Single-Actor and Pair-Activity Event Detection in surveillance Video.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

An auto-regressive model for checkboard splitting based Wyner-Ziv coding.
Proceedings of the 2009 Picture Coding Symposium, 2009

Window-level rate control for smooth picture quality and smooth buffer occupancy.
Proceedings of the 2009 Picture Coding Symposium, 2009

A high efficient error concealment scheme based on auto-regressive model for video coding.
Proceedings of the 2009 Picture Coding Symposium, 2009

Estimating the value of θ in the intra frame for ρ-domain rate control algorithms.
Proceedings of the 2009 Picture Coding Symposium, 2009

Improved low delay distributed video coding.
Proceedings of the 2009 Picture Coding Symposium, 2009

Two-pass reconstruction in distributed video coding.
Proceedings of the 2009 Picture Coding Symposium, 2009

Multi-hypothesis based multi-view distributed video coding.
Proceedings of the 2009 Picture Coding Symposium, 2009

Nonlocal Edge-Directed Interpolation.
Proceedings of the Advances in Multimedia Information Processing, 2009

A Novel Macro-Block Group Based AVS Coding Scheme for Many-Core Processor.
Proceedings of the Advances in Multimedia Information Processing, 2009

Block Adaptive Super Resolution Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2009

A New Multiple Kernel Approach for Visual Concept Learning.
Proceedings of the Advances in Multimedia Modeling, 2009

Semi-supervised Learning of Caricature Pattern from Manifold Regularization.
Proceedings of the Advances in Multimedia Modeling, 2009

Demultiplexer design for multi-edge type LDPC coded modulation.
Proceedings of the IEEE International Symposium on Information Theory, 2009

An Efficient Coding Method for Intra Prediction Mode Information.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Modeling Correlation Noise Statistics at Decoder for Multi-view Distributed Video Coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

A Proposed AVS Decoder Configuration in the Reconfigurable Video Coding Framework.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Shape-Based Depth Map Coding.
Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2009), 2009

VLSI friendly me search window buffer structure optimization and algorithm verification for high definition H.264/AVS video encoder.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Multiple kernel active learning for image classification.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Linking video ADS with product or service information by web search.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A hybrid text segmentation approach.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A dataset and evaluation methodology for visual saliency in video.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Utilizing affective analysis for efficient movie browsing.
Proceedings of the International Conference on Image Processing, 2009

Fea-Accu cascade for face detection.
Proceedings of the International Conference on Image Processing, 2009

Joint learning for side information and correlation model based on linear regression model in distributed video coding.
Proceedings of the International Conference on Image Processing, 2009

A no-reference perceptual blur metric using histogram of gradient profile sharpness.
Proceedings of the International Conference on Image Processing, 2009

A generic approach to classify sports video shots and its application in event detection.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Group-sensitive multiple kernel learning for object categorization.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Learning long term face aging patterns from partially dense aging databases.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Robust video fingerprinting based on visual attention regions.
Proceedings of the IEEE International Conference on Acoustics, 2009

Compression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression.
Proceedings of the 2009 Data Compression Conference (DCC 2009), 2009

Granularity-tunable gradients partition (GGP) descriptors for human detection.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Maximizing intra-individual correlations for face recognition across pose differences.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Sigma Set: A small second order statistical region descriptor.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Resource allocation for MIMO orthogonal relay channels with finite-rate feedback.
Proceedings of the 43rd Annual Conference on Information Sciences and Systems, 2009

Semi-Supervised Discriminant Analysis via Spectral Transduction.
Proceedings of the British Machine Vision Conference, 2009

Content-Based Video Semantic Analysis.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

Semantic Classification and Annotation of Images.
Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

2008
Head Yaw Estimation From Asymmetry of Facial Appearance.
IEEE Trans. Syst. Man Cybern. Part B, 2008

The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations.
IEEE Trans. Syst. Man Cybern. Part A, 2008

On guaranteed smooth switching for buffered crossbar switches.
IEEE/ACM Trans. Netw., 2008

Constructing visual phrases for effective and efficient object-based image retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Modeling Background and Segmenting Moving Objects from Compressed Video.
IEEE Trans. Circuits Syst. Video Technol., 2008

FGS Coding Using Cycle-Based Leaky Prediction Through Multiple Leaky Factors.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv-Based Multiview Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2008

Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
IEEE Trans. Circuits Syst. Video Technol., 2008

A novel VLSI architecture of motion compensation for multiple standards.
IEEE Trans. Consumer Electron., 2008

Hardware architecture for AVS entropy encoder.
IEEE Trans. Consumer Electron., 2008

Fast disparity and motion estimation based on correlations for multiview video coding.
IEEE Trans. Consumer Electron., 2008

B-picture coding in AVS video compression standard.
Signal Process. Image Commun., 2008

How many users should be turned on in a multi-antenna broadcast channel?
IEEE J. Sel. Areas Commun., 2008

Unsupervised texture classification: Automatically discover and classify texture patterns.
Image Vis. Comput., 2008

Spatial relationship representation for visual object searching.
Neurocomputing, 2008

Shape from silhouettes based on a centripetal pentahedron model.
Graph. Model., 2008

FlexStem: improving predictions of RNA secondary structures with pseudoknots by reducing the search space.
Bioinform., 2008

Personalized MTV Affective Analysis Using User Profile.
Proceedings of the Advances in Multimedia Information Processing, 2008

Theoretic Analysis of Inter Frame Dependency in Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2008

Learning-Based Image Restoration for Compressed Image through Neighboring Embedding.
Proceedings of the Advances in Multimedia Information Processing, 2008

AAML Based Avatar Animation with Personalized Expression for Online Chatting System.
Proceedings of the Advances in Multimedia Information Processing, 2008

Detecting Violent Scenes in Movies by Auditory and Visual Cues.
Proceedings of the Advances in Multimedia Information Processing, 2008

A No-Reference Blocking Artifacts Metric Using Selective Gradient and Plainness Measures.
Proceedings of the Advances in Multimedia Information Processing, 2008

i.MTV: an integrated system for mtv affective analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

An efficient VLSI architecture for rate disdortion optimization in AVS video encoder.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Wavelet based distributed video coding with spatial scalability.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Visual context representation using a combination of feature-driven and object-driven mechanisms.
Proceedings of the International Joint Conference on Neural Networks, 2008

A Real-Time Video Watermarking Using Adjacent Luminance Blocks Correlation Based on Compressed Domain.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

The Meta-Ontology Model of the Fishdisease Diagnostic Knowledge Based on OWL.
Proceedings of the Computer and Computing Technologies in Agriculture II, Volume 2, 2008

Hierarchical background subtraction using local pixel clustering.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

A covariance-based method for dynamic background subtraction.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Matching images more efficiently with local descriptors.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

V-LGBP: Volume based local Gabor binary patterns for face representation and recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Symmetric segment-based stereo matching of motion blurred images with illumination variations.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Effective scene matching with local feature representatives.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Hallucinating facial images and features.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Human reappearance detection based on on-line learning.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Parzen Discriminant Analysis.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Mahalanobis distance based Polynomial Segment Model for Chinese Sign Language Recogniton.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Affective MTV analysis based on arousal and valence features.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Coarse-to-fine video text detection.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Shot classification for action movies based on motion characteristics.
Proceedings of the International Conference on Image Processing, 2008

Pedestrian detection via logistic multiple instance boosting.
Proceedings of the International Conference on Image Processing, 2008

Fast and effective text detection.
Proceedings of the International Conference on Image Processing, 2008

Multi-polarity text segmentation using graph theory.
Proceedings of the International Conference on Image Processing, 2008

Object tracking using incremental 2D-LDA learning and Bayes inference.
Proceedings of the International Conference on Image Processing, 2008

People re-detection using Adaboost with sift and color correlogram.
Proceedings of the International Conference on Image Processing, 2008

A Backward-Compatible Solution for Next Generation DVB-C System.
Proceedings of IEEE International Conference on Communications, 2008

Visual-aural attention modeling for talk show video highlight detection.
Proceedings of the IEEE International Conference on Acoustics, 2008

2D to 3D convertion based on edge defocus and segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Fast Identification of Error-Prone Patterns for LDPC Codes under Message Passing Decoding.
Proceedings of the Global Communications Conference, 2008. GLOBECOM 2008, New Orleans, LA, USA, 30 November, 2008

Design sparse features for age estimation using hierarchical face model.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Discriminant analysis for perceptionally comparable classes.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Recovering 3D facial shape via coupled 2D/3D space learning.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Illumination transfer using homomorphic wavelet filtering and its application to light-insensitive face recognition.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Face reconstruction using fixation positions and foveated imaging.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Performance Analysis of Dual Frame Motion Compensation.
Proceedings of the 2008 Data Compression Conference (DCC 2008), 2008

A Biologically Inspired Energy-Aware Routing Algorithm for Communications in Cyberworlds.
Proceedings of the International Conference on Cyberworlds 2008, 2008

Locally Assembled Binary (LAB) feature with feature-centric cascade for fast and accurate face detection.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Manifold-Manifold Distance with application to face recognition based on image set.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Classifiability-based Optimal Discriminatory Projection Pursuit.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Unified Principal Component Analysis with generalized Covariance Matrix for face recognition.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

A robust descriptor based on Weber's Law.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

PKU at ImageCLEF 2008: Experiments with Query Extension Techniques for Text-Based and Content-Based Image Retrieval.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008) , 2008

Large-Scale Cross-Media Retrieval of WikipediaMM Images with Textual and Visual Query Expansion.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

2007
Large-Vocabulary Continuous Sign Language Recognition Based on Transition-Movement Models.
IEEE Trans. Syst. Man Cybern. Part A, 2007

Enhancing Human Face Detection by Resampling Examples Through Manifolds.
IEEE Trans. Syst. Man Cybern. Part A, 2007

Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video.
IEEE Trans. Multim., 2007

Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks.
IEEE Trans. Multim., 2007

Histogram of Gabor Phase Patterns (HGPP): A Novel Object Representation Approach for Face Recognition.
IEEE Trans. Image Process., 2007

Locally Linear Regression for Pose-Invariant Face Recognition.
IEEE Trans. Image Process., 2007

Block-Wise Adaptive Motion Accuracy Based B-Picture Coding With Low-Complexity Motion Compensation.
IEEE Trans. Circuits Syst. Video Technol., 2007

Reusable Architecture and Complexity-Controllable Algorithm for the Integer/Fractional Motion Estimation of H.264.
IEEE Trans. Consumer Electron., 2007

Video2Cartoon: A System for Converting Broadcast Soccer Video into 3D Cartoon Animation.
IEEE Trans. Consumer Electron., 2007

Network Adapted Selective Frame-Dropping Algorithm for Streaming Media.
IEEE Trans. Consumer Electron., 2007

A Real-Time Full Architecture for AVS Motion Estimation.
IEEE Trans. Consumer Electron., 2007

Local Gabor Binary Patterns Based on Kullback-Leibler Divergence for Partially Occluded Face Recognition.
IEEE Signal Process. Lett., 2007

Effective algorithms for fast transcoding of AVS to H.264/AVC in the spatial domain.
Multim. Tools Appl., 2007

Nonparametric background generation.
J. Vis. Commun. Image Represent., 2007

Elliptic Curve Cryptography Based Wireless Authentication Protocol.
Int. J. Netw. Secur., 2007

Novel Secure Communication Protocol for Conditional Access System.
Int. J. Netw. Secur., 2007

Local Gabor Binary Patterns Based on Mutual Information for Face Recognition.
Int. J. Image Graph., 2007

Viewpoint invariant sign language recognition.
Comput. Vis. Image Underst., 2007

Shape from silhouette outlines using an adaptive dandelion model.
Comput. Vis. Image Underst., 2007

An image fragile watermark scheme based on chaotic image pattern and pixel-pairs.
Appl. Math. Comput., 2007

Improvements of multiple FGS layers coding for low-delay applications in SVC.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Drift-compensated coding optimization for fast bit-rate reduction transcoding.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Real-time video coding under power constraint based on H.264 codec.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Mining Tandem Mass Spectral Data to Develop a More Accurate Mass Error Model for Peptide Identification.
Proceedings of the Biocomputing 2007, 2007

Signer Adaptation Based on Etyma for Large Vocabulary Chinese Sign Language Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2007

A Novel Pipeline Design for H.264 CABAC Decoding.
Proceedings of the Advances in Multimedia Information Processing, 2007

Laplacian Distortion Model (LDM) for Rate Control in Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2007

Text Segmentation in Complex Background Based on Color and Scale Information of Character Strokes.
Proceedings of the Advances in Multimedia Information Processing, 2007

An End-to-End Application System of AVS: AVS-IPTV in China.
Proceedings of the Advances in Multimedia Information Processing, 2007

Trajectory based event tactics analysis in broadcast sports video.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Region-based visual attention analysis with its application in image browsing on small displays.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

QoS Adaptive Data Organizing and Delivery Framework for P2P Media Streaming.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

An Optimized Topology Maintenance Framework for P2P Media Streaming.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Challenges on Peer-to-Peer Live Media Streaming.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

FBSA: A Self-adjustable Multi-source Data Scheduling Algorithm for P2P Media Streaming.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Searching Eye Centers Using a Context-Based Neural Network.
Proceedings of the Advances in Neural Networks, 2007

Context-based Arithmetic Coding Reexamined for DCT Video Compression.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Macroblock-Level Adaptive Scan Scheme for Discrete Cosine Transform Coefficients.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Rate Control for Hierarchical B-picture Coding with Scaling-factors.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

High-Accuracy and Low-Complexity Fixed-Point Inverse Discrete Cosine Transform Based on AAN's Fast Algortihm.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Low-delay View Random Access for Multi-view Video Coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Macroblock-level Reduced Resolution Video Coding Allowing Adaptive DCT Coefficients Selection.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Distributed Video Coding with Spatial Correlation Exploited Only at the Decoder.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Learning and Memory of Spatial Relationship by a Neural Network with Sparse Features.
Proceedings of the International Joint Conference on Neural Networks, 2007

Highlight Ranking for Racquet Sports Video in User Attention Subspaces Based on Relevance Feedback.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

An Effective Local Invariant Descriptor Combining Luminance and Color Information.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Adaptive Spatial and Transform Domain FGS Coding.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Statistical Analysis and Modeling of Rate-Distortion Function in SVC Fine-Granular SNR Scalable Videos.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

The Demo: A Real-Time Score Detection and Recognition Approach in Broadcast Basketball Sports Video.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Direct Mode Coding for B Pictures using Virtual Reference Picture.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

3D Haar-Like Features for Pedestrian Detection.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

DSP Implementation of Deblocking Filter for AVS.
Proceedings of the International Conference on Image Processing, 2007

Software Pipelines Design for Variable Block-Size Motion Estimation with Large Search Range.
Proceedings of the International Conference on Image Processing, 2007

A Novel Error Concealment Method for Stereoscopic Video Coding.
Proceedings of the International Conference on Image Processing, 2007

Monocular Tracking 3D People By Gaussian Process Spatio-Temporal Variable Model.
Proceedings of the International Conference on Image Processing, 2007

Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation.
Proceedings of the International Conference on Image Processing, 2007

A Fast Inter Frame Prediction Algorithm for Multi-View Video Coding.
Proceedings of the International Conference on Image Processing, 2007

Object Recognition Based on Dependent Pachinko Allocation Model.
Proceedings of the International Conference on Image Processing, 2007

Minimizing the Distortion Spatial Data Hiding Based on Equivalence Class.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007

Low-Complexity FGS Coding for Interlaced SVC in Low-Delay Applications.
Proceedings of the IEEE International Conference on Acoustics, 2007

Generating Data for Signer Adaptation.
Proceedings of the Gesture-Based Human-Computer Interaction and Simulation, 2007

Practical Algorithms of Bandwidth Regulation for Rate-Based Switching.
Proceedings of the Global Communications Conference, 2007

An Enhanced Robust Entropy Coder for Video Codecs Based on Context-Adaptive Reversible VLC.
Proceedings of the 2007 Data Compression Conference (DCC 2007), 2007

Matrix-Structural Learning (MSL) of Cascaded Classifier from Enormous Training Set.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A Flexible Stem-Based Local Search Algorithm for Predicting RNA Secondary Structures Including Pseudoknots.
Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering, 2007

2006
Learning Contextual Dependency Network Models for Link-Based Classification.
IEEE Trans. Knowl. Data Eng., 2006

Drift-free switching of compressed video bitstreams at predictive frames.
IEEE Trans. Circuits Syst. Video Technol., 2006

Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos.
IEEE Trans. Circuits Syst. Video Technol., 2006

H.264 video encryption scheme adaptive to DRM.
IEEE Trans. Consumer Electron., 2006

A flexible VLSI architecture of transport processor for an AVS HDTV decoder SoC.
IEEE Trans. Consumer Electron., 2006

Deeply pipelined DSP solution to deblocking filter for H.264/AVC.
IEEE Trans. Consumer Electron., 2006

An efficient VLSI architecture of VLD for AVS HDTV decoder.
IEEE Trans. Consumer Electron., 2006

Algorithmic and architectural co-design for integer motion estimation of AVS.
IEEE Trans. Consumer Electron., 2006

An AVS HDTV video decoder architecture employing efficient HW/SW partitioning.
IEEE Trans. Consumer Electron., 2006

An efficient and robust adaptive deinterlacing technique.
IEEE Trans. Consumer Electron., 2006

An effective method to detect and categorize digitized traditional Chinese paintings.
Pattern Recognit. Lett., 2006

Adaptive rate control for H.264.
J. Vis. Commun. Image Represent., 2006

Latent linkage semantic kernels for collective classification of link data.
J. Intell. Inf. Syst., 2006

Context-Based 2D-VLC Entropy Coder in AVS Video Coding Standard.
J. Comput. Sci. Technol., 2006

Low Complexity Integer Transform and Adaptive Quantization Optimization.
J. Comput. Sci. Technol., 2006

Security on Aydos et al's Elliptic Curve Cryptography Based Wireless Authentication Protocol.
J. Comput. Res. Dev., 2006

A High Efficient Architecture for Motion Estimation Based on AVC/AVS Coding Standard.
J. Comput. Res. Dev., 2006

Object detection using spatial histogram features.
Image Vis. Comput., 2006

Extracting 3D information from broadcast soccer video.
Image Vis. Comput., 2006

Shape-based Adult Image Detection.
Int. J. Image Graph., 2006

Throughput analysis of a buffered crossbar switch with multiple input queues under burst traffic.
IEEE Commun. Lett., 2006

IndexToolkit: an open source toolbox to index protein databases for high-throughput proteomics.
Bioinform., 2006

An SVM Scorer for More Sensitive and Reliable Peptide Identification via Tandem Mass Spectrometry.
Proceedings of the Biocomputing 2006, 2006

A Motion Vector Predictor Architecture for AVS and MPEG-2 HDTV Decoder.
Proceedings of the Advances in Multimedia Information Processing, 2006

Video Coding by Texture Analysis and Synthesis Using Graph Cut.
Proceedings of the Advances in Multimedia Information Processing, 2006

Adaptive Search Range Scaling for B Pictures Coding.
Proceedings of the Advances in Multimedia Information Processing, 2006

An Improved Motion Vector Prediction Scheme for Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2006

Multi-view Video Coding with Flexible View-Temporal Prediction Structure for Fast Random Access.
Proceedings of the Advances in Multimedia Information Processing, 2006

Online Selection of Discriminative Features Using Bayes Error Rate for Visual Tracking.
Proceedings of the Advances in Multimedia Information Processing, 2006

A Robust Approach for Object Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2006

A Broadcast Model for Web Image Annotation.
Proceedings of the Advances in Multimedia Information Processing, 2006

A Real-Time Video Deinterlacing Scheme for MPEG-2 to AVS Transcoding.
Proceedings of the Advances in Multimedia Information Processing, 2006

Joint Data Partition and Rate-Distortion Optimized Mode Selection for H.264 Error-Resilient Coding.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Sports video analysis.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Player action recognition in broadcast tennis video with applications to semantic analysis of sports game.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Effective and efficient object-based image retrieval using visual phrases.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Diversifying the image retrieval results.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Mapping learning in eigenspace for harmonious caricature generation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Accurately weighting subbands in temporal wavelet transform.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

An efficient SNR scalability coding framework hybrid open-close loop FGS coding.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Distributed video coding using wavelet.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

A proactive fault-detection mechanism in large-scale cluster systems.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A Visual Perceiving and Eyeball-Motion Controlling Neural Network for Object Searching and Locating.
Proceedings of the International Joint Conference on Neural Networks, 2006

Action Recognition in Broadcast Tennis Video.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Robust Split-and-Merge Text Segmentation Approach for Images.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Verification Method for Viewpoint Invariant Sign Language Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Enhancing Training Set for Face Detection.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Patch-Based Gabor Fisher Classifier for Face Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Ensemble of Piecewise FDA Based on Spatial Histograms of Local (Gabor) Binary Patterns for Face Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Face Recognition under Varying Lighting Based on the Probabilistic Model of Gabor Phase.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Unsupervised Texture Classification: Automatically Discover and Classify Texture Patterns.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

2D Cascaded AdaBoost for Eye Localization.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Local Visual Primitives (LVP) for Face Modelling and Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Robust Head Pose Estimation Using LGBP.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Novel Volumetric Shape from Silhouette Algorithm Based on a Centripetal Pentahedron Model.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Bagging Based Efficient Kernel Fisher Discriminant Analysis for Face Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Isomap Based on the Image Euclidean Distance.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Modification of the AdaBoost-based Detector for Partially Occluded Faces.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Automatic Multi-Player Detection and Tracking in Broadcast Sports Video using Support Vector Machine and Particle Filter.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Efficient Reference Frame Storage Scheme for H.264 HDTV Decoder.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Complexity-Distortion Optimized Motion Estimation Algorithm with Fine-Granular Scalable Complexity.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Performance-Complexity Analysis of High Resolution Video Encoder and its Memory Organization for DSP Implementation.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Optimal Non-Uniform Scalar Quantizer for Distributed Video Coding.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Fast Intra Mode Decision Algorithm for AVS to H.264 Transcoding.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Sign Language Recognition from Homography.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Effective Mode Decision Scheme in Macroblock-Based PFGS.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Low-Complexity Adaptive Block-Size Transform Based on Extended Transforms.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Motion Aligned Spatial Scalable Video Coding.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Edge-Based Median Filtering Algorithm with Consideration of Motion Vector Reliability for Adaptive Video Deinterlacing.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Adaptive De-Interlacing Algorithm Based on Texture and Motion Vector Analysis.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An Efficient FGS Coding Scheme for Interlaced Scalable Video Coding.
Proceedings of the International Conference on Image Processing, 2006

Adaptive Block-Size Transform Based on Extended Integer 8×8/4×4 Transforms for H.264/AVC.
Proceedings of the International Conference on Image Processing, 2006

Visual Hull Embossment by Graph Cuts.
Proceedings of the International Conference on Image Processing, 2006

Practical Wyner-Ziv Switching Scheme for Multiple Bit-Rate Video Streaming.
Proceedings of the International Conference on Image Processing, 2006

Wyner-Ziv Video Coding Based on Set Partitioning in Hierarchical Tree.
Proceedings of the International Conference on Image Processing, 2006

Switch Simulations Based on Workload Pattern Generation and Smoothed Periodic Input.
Proceedings of the 15th International Conference On Computer Communications and Networks, 2006

Performance Characterisation of Face Recognition Algorithms and Their Sensitivity to Severe Illumination Changes.
Proceedings of the Advances in Biometrics, International Conference, 2006

The Application of Extended Geodesic Distance in Head Poses Estimation.
Proceedings of the Advances in Biometrics, International Conference, 2006

Pose Estimation Based on Gaussian Error Models.
Proceedings of the Advances in Biometrics, International Conference, 2006

Image Matching by Normalized Cross-Correlation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Expanding Training Set for Chinese Sign Language Recognition.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Hierarchical Ensemble of Gabor Fisher Classifier for Face Recognition.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Local Linear Regression (LLR) for Pose Invariant Face Recognition.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine.
Proceedings of the Computer Vision in Human-Computer Interaction, 2006

Robust Multi-view Face Detection Using Error Correcting Output Codes.
Proceedings of the Computer Vision, 2006

Robust Collective Classification with Contextual Dependency Network Models.
Proceedings of the Advanced Data Mining and Applications, Second International Conference, 2006

Semantic Scoring Based on Small-World Phenomenon for Feature Selection in Text Mining.
Proceedings of the Advanced Data Mining and Applications, Second International Conference, 2006

Image Matching by Multiscale Oriented Corner Correlation.
Proceedings of the Computer Vision, 2006

Self-calibration Based 3D Information Extraction and Application in Broadcast Soccer Video.
Proceedings of the Computer Vision, 2006

2005
Rate-distortion analysis for H.264/AVC video coding and its application to rate control.
IEEE Trans. Circuits Syst. Video Technol., 2005

An efficient hardware implementation for motion estimation of AVC standard.
IEEE Trans. Consumer Electron., 2005

Predicting Molecular Formulas of Fragment Ions with Isotope Patterns in Tandem Mass Spectra.
IEEE ACM Trans. Comput. Biol. Bioinform., 2005

Adaptive relevance feedback based on Bayesian inference for image retrieval.
Signal Process., 2005

Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model.
Real Time Imaging, 2005

Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths.
Real Time Imaging, 2005

Thresholding technique with adaptive window selection for uneven lighting image.
Pattern Recognit. Lett., 2005

Efficient 3D reconstruction for face recognition.
Pattern Recognit., 2005

Real-time smoothing for network adaptive video streaming.
J. Vis. Commun. Image Represent., 2005

Visual Ontology Construction for Digitized Art Image Retrieval.
J. Comput. Sci. Technol., 2005

BLOSSOMS: Building Lightweight Optimized Sensor Systems on a Massive Scale.
J. Comput. Sci. Technol., 2005

Fast and robust text detection in images and video frames.
Image Vis. Comput., 2005

Face recognition under generic illumination based on harmonic relighting.
Int. J. Pattern Recognit. Artif. Intell., 2005

pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry.
Bioinform., 2005

Motion Retargeting for the Hand Gesture.
Proceedings of the 13-th International Conference in Central Europe on Computer Graphics, 2005

Smooth switching problem in buffered crossbar switches.
Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2005

Fast Adaptive Skin Detection in JPEG Images.
Proceedings of the Advances in Multimedia Information Processing, 2005

High Efficient Context-Based Variable Length Coding with Parallel Orientation.
Proceedings of the Advances in Multimedia Information Processing, 2005

Illumination Invariant Feature Selection for Face Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2005

Creative Cartoon Face Synthesis System for Mobile Entertainment.
Proceedings of the Advances in Multimedia Information Processing, 2005

A Scheme for Ball Detection and Tracking in Broadcast Soccer Video.
Proceedings of the Advances in Multimedia Information Processing, 2005

Adaptive Deinterlacing for Real-Time Applications.
Proceedings of the Advances in Multimedia Information Processing, 2005

Reducing Spatial Resolution for MPEG-2 to H.264/AVC Transcoding.
Proceedings of the Advances in Multimedia Information Processing, 2005

Hierarchical voting classification scheme for improving visual sign language recognition.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Exciting event detection in broadcast soccer video with mid-level description and incremental learning.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video2Cartoon: generating 3D cartoon from broadcast soccer video.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Using Score Normalization to Solve the Score Variation Problem in Face Authentication.
Proceedings of the Advances in Biometric Person Authentication, 2005

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling.
Proceedings of the Advances in Biometric Person Authentication, 2005

Watermark Detection Schemes with High Security.
Proceedings of the International Symposium on Information Technology: Coding and Computing (ITCC 2005), 2005

Research on the Discrimination of Pornographic and Bikini Images.
Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM 2005), 2005

A novel algorithm of spatial scalability for scrambled video.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Shot change detection on H.264/AVC compressed video.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Low complexity RDO mode decision based on a fast coding-bits estimation model for H.264/AVC.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Linear transform based motion compensated prediction for luminance intensity changes.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Viewpoint switching in multiview video streaming.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Parallelized scheduling algorithm for input queued switches using local search technique.
Proceedings of the 24th IEEE International Performance Computing and Communications Conference, 2005

Scheduling Algorithms for Input Queued Switches Using Local Search Technique.
Proceedings of the Networking, 2005

Spatial Video Watermarking Based on Stability of DC Coefficients.
Proceedings of the Advances in Machine Learning and Cybernetics, 2005

Recognition of sign language subwords based on boosted hidden Markov models.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

A System for Automatic Generation of Music Sports-Video.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Hybrid Algorithm with Adaptive Complexity for Integer Pel Motion Estimation of H.264.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Improving particle filter with support vector regression for efficient visual tracking.
Proceedings of the 2005 International Conference on Image Processing, 2005

Extended direct mode for hierarchical B picture coding.
Proceedings of the 2005 International Conference on Image Processing, 2005

An adaptive skin color detection algorithm with confusing backgrounds elimination.
Proceedings of the 2005 International Conference on Image Processing, 2005

Image matching based on a local invariant descriptor.
Proceedings of the 2005 International Conference on Image Processing, 2005

An active volumetric model for 3D reconstruction.
Proceedings of the 2005 International Conference on Image Processing, 2005

Motion vector prediction in multiview video coding.
Proceedings of the 2005 International Conference on Image Processing, 2005

Local Gabor Binary Pattern Histogram Sequence (LGBPHS): A Novel Non-Statistical Model for Face Representation and Recognition.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A dual round-robin algorithm for combined input-crosspoint-queued switches.
Proceedings of the 14th International Conference On Computer Communications and Networks, 2005

Local invariant descriptor for image matching.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Playfield Detection Using Adaptive GMM and Its Application.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Empirical comparisons of several preprocessing methods for illumination insensitive face recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Visual Sign Language Recognition Based on HMMs and Auto-regressive HMMs.
Proceedings of the Gesture in Human-Computer Interaction and Simulation, 2005

A Comparison Between Etymon- and Word-Based Chinese Sign Language Recognition Systems.
Proceedings of the Gesture in Human-Computer Interaction and Simulation, 2005

Re-sampling for Chinese Sign Language Recognition.
Proceedings of the Gesture in Human-Computer Interaction and Simulation, 2005

Bandwidth Adaptive Quality Smoothing for Unequal Error Protected Scalable Video Streaming.
Proceedings of the 2005 Data Compression Conference (DCC 2005), 2005

Nonlinear Face Recognition Based on Maximum Average Margin Criterion.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Online Selecting Discriminative Tracking Features Using Particle Filter.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Multiple Fisher Classifiers Combination for Face Recognition based on Grouping AdaBoosted Gabor Features.
Proceedings of the British Machine Vision Conference 2005, Oxford, UK, September 2005, 2005

Multi-resolution Histograms of Local Variation Patterns (MHLVP) for Robust Face Recognition.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

Discriminant Analysis Based on Kernelized Decision Boundary for Face Recognition.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

Face Detection Based on the Manifold.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

Pose Invariant Face Recognition Under Arbitrary Illumination Based on 3D Face Reconstruction.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

AdaBoost Gabor Fisher Classifier for Face Recognition.
Proceedings of the Analysis and Modelling of Faces and Gestures, 2005

How to Train a Classifier Based on the Huge Face Database?
Proceedings of the Analysis and Modelling of Faces and Gestures, 2005

Static Gesture Quantization and DCT Based Sign Language Generation.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

The Bunch-Active Shape Model.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Randomized Parallel Scheduling Algorithm for Input Queued Crossbar Switches.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

An Adaptive Dandelion Model for Reconstructing Spherical Terrain-Like Visual Hull Surfaces.
Proceedings of the Fifth International Conference on 3D Digital Imaging and Modeling (3DIM 2005), 2005

2004
Two-phase Web site classification based on Hidden Markov Tree models.
Web Intell. Agent Syst., 2004

Large vocabulary sign language recognition based on fuzzy decision trees.
IEEE Trans. Syst. Man Cybern. Part A, 2004

Seamless switching of scalable video bitstreams for efficient streaming.
IEEE Trans. Multim., 2004

A block-based support vector machine approach to the protein homology prediction task in KDD Cup 2004.
SIGKDD Explor., 2004

Real-time scheduling based on imprecise computation for scalable streaming media system over the Internet.
Real Time Imaging, 2004

A Chinese sign language recognition system based on SOFM/SRN/HMM.
Pattern Recognit., 2004

Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry.
Bioinform., 2004

An Ontology-based Approach to Retrieve Digitized Art Images.
Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2004), 2004

Online smoothing for scalable media stream delivery.
Proceedings of the Visual Communications and Image Processing 2004, 2004

An Overview of Streaming Video.
Proceedings of the 4th IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2004), 2004

Component-Based Cascade Linear Discriminant Analysis for Face Recognition.
Proceedings of the Advances in Biometric Person Authentication, 2004

Face Recognition Under Varying Lighting Based on Derivates of Log Image.
Proceedings of the Advances in Biometric Person Authentication, 2004

Novel Face Detection Method Based on Gabor Features.
Proceedings of the Advances in Biometric Person Authentication, 2004

Pose Normalization Using Generic 3D Face Model as a Priori for Pose-Insensitive Face Recognition.
Proceedings of the Advances in Biometric Person Authentication, 2004

Baseline Evaluations on the CAS-PEAL-R1 Face Database.
Proceedings of the Advances in Biometric Person Authentication, 2004

A Kernel-Based Case Retrieval Algorithm with Application to Bioinformatics.
Proceedings of the PRICAI 2004: Trends in Artificial Intelligence, 2004

Optimum End-to-End Distortion Estimation for Error Resilient Video Coding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Key Techniques of Bit Rate Reduction for H.264 Streams.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Vision-Based Sign Language Recognition Using Sign-Wise Tied Mixture HMM.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Approximating Inference on Complex Motion Models Using Multi-model Particle Filter.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Mapping Energy Video Watermarking Algorithm Based on Compressed Domain.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A Study on the Quantization Scheme in H.264/AVC and Its Application to Rate Control.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Correlation Learning Method Based on Image Internal Semantic Model for CBIR.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

An Efficient VLSI Implementation for MC Interpolation of AVS Standard.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Face Samples Re-lighting for Detection Based on the Harmonic Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

BLOSSOMS: A CAS/HKUST Joint Project to Build Lightweight Optimized Sensor Systems on a Massive Scale.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

Open Issues on Intelligent Sensor Networks.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

A new method to segment playfield and its applications in match analysis in sports video.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

An Efficient VLSI Implementation of MC Interpolation for MPEG-4.
Proceedings of the 4th IEEE International Workshop on System-on-Chip for Real-Time Applications (IWSOC'04), 2004

Steganalysis of Data Hiding Techniques in Wavelet Domain.
Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04), 2004

The fast close-loop video transcoder with limited drifting error.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

A texture-based tamper detection scheme by fragile watermark.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Enhanced direct mode coding for bi-predictive pictures.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

A robust watermarking method based on wavelet and Zernike transform.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Context-Based Classification for Link Data.
Proceedings of the Advances in Web-Based Learning, 2004

Information Fusion in Face Identification.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Semantic Object Segmentation by a Spatio-Temporal MRF Model.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Review the Strength of Gabor Features for Face Recognition from the Angle of Its Robustness to Mis-Alignment.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Novel Approach to Automatically Extracting Basic Units from Chinese Sign Language.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Support Vector Machines for Face Recognition with Two-layer Generated Virtual Data.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Resampling for Face Detection by Self-Adaptive Genetic Algorithm.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Multiple-threshold based scheduling algorithm for high performance input queued switches.
Proceedings of the 12th IEEE International Conference on Networks, 2004

A vision-based sign language recognition system using tied-mixture density HMM.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Multiple modes intra-prediction in intra coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Context-based 2D-VLC for video coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Improved error concealment algorithms based on H.264/AVC non-normative decoder.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Secure watermark verification scheme.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

New bi-prediction techniques for B pictures coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Mode mapping method for h.264/avc spatial downscaling transcoding.
Proceedings of the 2004 International Conference on Image Processing, 2004

Error resilience video coding in H.264 encoder with potential distortion tracking.
Proceedings of the 2004 International Conference on Image Processing, 2004

Automatic text segmentation from complex background.
Proceedings of the 2004 International Conference on Image Processing, 2004

A novel pupil localization method based on gaboreye model and radial symmetry operator.
Proceedings of the 2004 International Conference on Image Processing, 2004

An implemented architecture of deblocking filter for H.264/AVC.
Proceedings of the 2004 International Conference on Image Processing, 2004

A novel compressed domain shot segmentation algorithm on H.264/AVC.
Proceedings of the 2004 International Conference on Image Processing, 2004

New scaling technique for direct mode coding in B pictures.
Proceedings of the 2004 International Conference on Image Processing, 2004

Shape-based adult images detection.
Proceedings of the Third International Conference on Image and Graphics, 2004

Singular value decomposition based image matching.
Proceedings of the Third International Conference on Image and Graphics, 2004

Skin-color detection based on adaptive thresholds.
Proceedings of the Third International Conference on Image and Graphics, 2004

Interacting multiple model particle filter to adaptive visual tracking.
Proceedings of the Third International Conference on Image and Graphics, 2004

A novel rate control scheme for video streaming over wireless networks.
Proceedings of the Third International Conference on Image and Graphics, 2004

A novel FGS base-layer encoding model and weight-based rate adaptation for constant-quality streaming.
Proceedings of the Third International Conference on Image and Graphics, 2004

FEC-based multiple description coding for heterogeneous client bandwidths.
Proceedings of the Third International Conference on Image and Graphics, 2004

Inter-frame Differential Energy Video Watermarking Algorithm Based on Compressed Domain.
Proceedings of the Image Analysis and Recognition: International Conference, 2004

Fast Moving Region Detection Scheme in Ad Hoc Sensor Network.
Proceedings of the Image Analysis and Recognition: International Conference, 2004

Enhance the Alignment Accuracy of Active Shape Models Using Elastic Graph Matching.
Proceedings of the Biometric Authentication, First International Conference, 2004

Accurate moving object segmentation by a hierarchical region labeling approach.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Color space compatible coding framework for YUV422 video coding.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

HandTalker II: a Chinese sign language recognition and synthesis system.
Proceedings of the 8th International Conference on Control, 2004

Distributed load adaptive scheduling for high speed input queued switch.
Proceedings of the Global Telecommunications Conference, 2004. GLOBECOM '04, Dallas, Texas, USA, 29 November, 2004

Face Recognition Using Ada-Boosted Gabor Features.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Curse of Mis-alignment in Face Recognition: Problem and a Novel Mis-alignment Learning Solution.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Eigen-Harmonics Faces: Face Recognition under Generic Lighting.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Transition Movement Models for Large Vocabulary Continuous Sign Language Recognition.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Expand Training Set for Face Detection by GA Re-sampling.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

An Efficient VLSI Architecture for MC Interpolation in AVC Video Coding.
Proceedings of the International Conference on Embedded Systems and Applications, 2004

2003
Efficient background video coding with static sprite generation and arbitrary-shape spatial prediction techniques.
IEEE Trans. Circuits Syst. Video Technol., 2003

Learning and synthesizing MPEG-4 compatible 3-D face animation from video sequence.
IEEE Trans. Circuits Syst. Video Technol., 2003

Incorporating Linguistic Structure into Maximum Entropy Language Models.
J. Comput. Sci. Technol., 2003

An Admission Control Scheme for End-to-End Statistical QoS Provision in IP Networks.
J. Comput. Sci. Technol., 2003

Face recognition based on face-specific subspace.
Int. J. Imaging Syst. Technol., 2003

A System for Human Face and Facial Feature Location.
Int. J. Image Graph., 2003

Two-Phase Web Site Classification Based on Hidden Markov Tree Models.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

IISM: An Image Internal Semantic Model for Image Database Based on Relevance Feedback.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

Steganalysis of Images Created in Wavelet Domain Using Quantization Modulation.
Proceedings of the Wavelet Analysis and Its Applications, 2003

Real-time scheduling and online resource allocation on scalable streaming media server.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Multiscale load adaptive scheduling for energy efficient transmission over wireless networks.
Proceedings of the IEEE 14th International Symposium on Personal, 2003

Real-time scheduling supporting VCR functionality for scalable video streaming.
Proceedings of the IEEE 14th International Symposium on Personal, 2003

Enhanced Active Shape Models with Global Texture Constraints for Image Analysis.
Proceedings of the Foundations of Intelligent Systems, 14th International Symposium, 2003

Intelligent Protein 3D Structure Retrieval System.
Proceedings of the Foundations of Intelligent Systems, 14th International Symposium, 2003

Automatic moving object extraction in MPEG video.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

A regularized simultaneous autoregressive model for texture classification.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Rate control for advance video coding (AVC) standard.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Real-time scheduling on scalable media stream delivery.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

A new algorithm for remotely sensed image texture classification and segmentation.
Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, 2003

Objectionable Image Recognition System in Compression Domain.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2003

Illumination Invariant Shot Boundary Detection.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2003

Large vocabulary sign language recognition based on hierarchical decision trees.
Proceedings of the 5th International Conference on Multimodal Interfaces, 2003

Latest arrival time leaky bucket for HRD constrained video coding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Neural network based steganalysis in still images.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A novel coefficient scanning scheme for directional spatial prediction-based image compression.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

The improved SP frame coding technique for the JVT standard.
Proceedings of the 2003 International Conference on Image Processing, 2003

Rate control for JVT video coding scheme with HRD considerations.
Proceedings of the 2003 International Conference on Image Processing, 2003

A Lattice Based General Blind Watermark Scheme.
Proceedings of the Information and Communications Security, 5th International Conference, 2003

Reliability Theory Model and Expected Life Shortest Path in Stochastic and Time-Dependent Networks.
Proceedings of the Computational Science - ICCS 2003, 2003

Network-Tree Model and Shortest Path Algorithm.
Proceedings of the Computational Science - ICCS 2003, 2003

Color image segmentation using density-based clustering.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Facial feature tracking combining model-based and model-free method.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Rotated face detection in color images using radial template (RT).
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Virtual face image generation for illumination and pose insensitive face recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

SVMs for few examples-based face recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Novel example-based shape learning for fast face alignment.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

A Reconfigurable High Availability Infrastructure in Cluster for Grid.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

Constraint Shape Model Using Edge Constraint and Gabor Wavelet Based Search.
Proceedings of the Audio-and Video-Based Biometrie Person Authentication, 2003

Visual Features Extracting & Selecting for Lipreading.
Proceedings of the Audio-and Video-Based Biometrie Person Authentication, 2003

Illumination Normalization for Robust Face Recognition Against Varying Lighting Conditions.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

CSLDS: Chinese Sign Language Dialog System.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

2002
Morphological representation of DCT coefficients for image compression.
IEEE Trans. Circuits Syst. Video Technol., 2002

Learning Prosodic Patterns for Mandarin Speech Synthesis.
J. Intell. Inf. Syst., 2002

Automatic Segmentation of News Items Based on Video and Audio Features.
J. Comput. Sci. Technol., 2002

A Dynamic Recommendation System Based on Log Mining.
Int. J. Found. Comput. Sci., 2002

General Blind Watermark Schemes.
Proceedings of the Second International Conference on WEB Delivering of Music, 2002

NCPN: A Simulation Tool for Coloured Petri Nets.
Proceedings of the International Conference on Parallel and Distributed Computing Systems, 2002

Secure Watermark Verification Scheme.
Proceedings of the Advances in Multimedia Information Processing, 2002

An Index Model for MPEG-2 Streams.
Proceedings of the Advances in Multimedia Information Processing, 2002

Flexible and Efficient Switching Techniques between Scalable Video Bitstreams.
Proceedings of the Advances in Multimedia Information Processing, 2002

A Framework for Background Detection in Video.
Proceedings of the Advances in Multimedia Information Processing, 2002

System Identification for Nonlinear Control Using Lifted Wavelet.
Proceedings of the 6th Joint Conference on Information Science, 2002

Mouth-Shape Classification and Recognition for Lipreading.
Proceedings of the 6th Joint Conference on Information Science, 2002

A Novel Method to Compensate Variety of Illumination in Face Detection.
Proceedings of the 6th Joint Conference on Information Science, 2002

Quantitatively evaluating the influence of online social interactions in the community-assisted digital library.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Toward a distributed terabyte text retrieval system in China-US million book digital library.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Recognition of Strong and Weak Connection Models in Continuous Sign Language.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Robust Frontal Face Detection in Complex Environment.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

AppManager: A Powerful Service-Based Application Management System for Clusters.
Proceedings of the 31st International Conference on Parallel Processing Workshops (ICPP 2002 Workshops), 2002

An edge-to-edge congestion control scheme for assured forwarding.
Proceedings of the Proceedings 10th IEEE International Conference on Networks: Towards Network Superiority, 2002

An Improved Active Shape Model for Face Alignment.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Animating Arbitrary Topology 3D Facial Model Using the MPEG-4 FaceDefTables.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Blind watermarking method based on DWT middle frequency pair.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Efficient and flexible drift-free video bitstream switching at predictive frames.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Localizing the iris center by region growing search.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Video indexing by motion activity maps.
Proceedings of the 2002 International Conference on Image Processing, 2002

High efficient sprite coding with directional spatial prediction.
Proceedings of the 2002 International Conference on Image Processing, 2002

Face identification from a single example image based on Face-Specific Subspace (FSS).
Proceedings of the IEEE International Conference on Acoustics, 2002

A Cache-Based Distributed Terabyte Text Retrieval System in CADAL.
Proceedings of the Digital Libraries: People, 2002

An Approach Based on Phonemes to Large Vocabulary Chinese Sign Language Recognition.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

A SRN/HMM System for Signer-Independent Continuous Sign Language Recognition.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

Animating 3D Facial Models with MPEG-4 FaceDefTables.
Proceedings of the Proceedings 35th Annual Simulation Symposium (ANSS-35 2002), 2002

2001
Low-complexity and low-memory entropy coder for image compression.
IEEE Trans. Circuits Syst. Video Technol., 2001

Face detection and location based on skin chrominance and lip chrominance transformation from color images.
Pattern Recognit., 2001

Robust Speech Recognition Method Based on Discriminative Environment Feature Extraction.
J. Comput. Sci. Technol., 2001

A greedy clustering algorithm along the time axis for Chinese language recognition.
Proceedings of the IEEE International Conference on Systems, 2001

Image scale-smoothing, scale-differentiating and binarizing: a new frame for general-scale-edge (GSE) extraction.
Proceedings of the IEEE International Conference on Systems, 2001

Mining audio/visual database for speech driven face animation.
Proceedings of the IEEE International Conference on Systems, 2001

LLEC: An Image Coder with Low-Complexity and Low-Memory Requirement.
Proceedings of the Advances in Multimedia Information Processing, 2001

A Real-Time Large Vocabulary Continuous Recognition System for Chinese Sign Language.
Proceedings of the Advances in Multimedia Information Processing, 2001

A Fast Anchor Shot Detection Algorithm on Compressed Video.
Proceedings of the Advances in Multimedia Information Processing, 2001

Automatic Segmentation of News Items Based on Video and Audio Features.
Proceedings of the Advances in Multimedia Information Processing, 2001

A Face-Unlock Screen Saver by Using Face Verification Based on Identity-Specific Subspaces.
Proceedings of the Advances in Multimedia Information Processing, 2001

Fast and Robust Sprite Generation for MPEG-4 Video Coding.
Proceedings of the Advances in Multimedia Information Processing, 2001

Classification of Facial Images Using Gaussian Mixture Models.
Proceedings of the Advances in Multimedia Information Processing, 2001

Fusion of Biometrics Based on D-S Theory.
Proceedings of the Advances in Multimedia Information Processing, 2001

Speech Driven MPEG-4 Based Face Animation via Neural Network.
Proceedings of the Advances in Multimedia Information Processing, 2001

Different cultures meet: lessons learned in global digital library development (panel session).
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

A SOFM/HMM System for Person-Independent Isolated Sign Language Recognition.
Proceedings of the Human-Computer Interaction INTERACT '01: IFIP TC13 International Conference on Human-Computer Interaction, 2001

Macroblock-Based Progressive Fine Granularity Scalable Video Coding.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Macroblock-based progressive fine granularity scalable (PFGS) video coding with flexible temporal-SNR scalablilities.
Proceedings of the 2001 International Conference on Image Processing, 2001

Sprite generation for frame-based video coding.
Proceedings of the 2001 International Conference on Image Processing, 2001

FaceTracker: A Human Face Tracking and Facial Organ Localizing System.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

The Recognition of Finger-Spelling for Chinese Sign Language.
Proceedings of the Gesture and Sign Languages in Human-Computer Interaction, 2001

A Real-Time Large Vocabulary Recognition System for Chinese Sign Language.
Proceedings of the Gesture and Sign Languages in Human-Computer Interaction, 2001

Signer-Independent Continuous Sign Language Recognition Based on SRN/HMM.
Proceedings of the Gesture and Sign Languages in Human-Computer Interaction, 2001

Morphological Representation of DCT Data for Image Coding.
Proceedings of the Data Compression Conference, 2001

2000
Study on Translating Chinese into Chinese Sign Language.
J. Comput. Sci. Technol., 2000

Human-Computer Chinese Sign Language Interaction System.
Int. J. Virtual Real., 2000

Sign Language Recognition Based on HMM/ANN/DP.
Int. J. Pattern Recognit. Artif. Intell., 2000

Web Clustering and Association Rule Discovery for Web Broadcast.
Proceedings of the Web-Age Information Management, First International Conference, 2000

Adaptive Online Retail Web Site Based on Hidden Markov Model.
Proceedings of the Web-Age Information Management, First International Conference, 2000

A Fast Globally Supervised Learning Algorithm for Gaussian Mixture Models.
Proceedings of the Web-Age Information Management, First International Conference, 2000

A Rich Get Richer Strategy for Content-Based Image Retrieval.
Proceedings of the Advances in Visual Information Systems, 4th International Conference, 2000

Learning Prosodic Patterns for Mandarin Speech Synthesis.
Proceedings of the International Workshop on Multimedia Data Mining, 2000

A new approach for modeling OOV words.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A parallel multi-stream model for sign language recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust speaker recognition based on high order cumulant.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Towards robust lipreading.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multi-strategy data mining on Mandarin prosodic patterns.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

News Content Highlight via Fast Caption Text Detection on Compressed Video.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2000

Combining Skin Color Model and Neural Network for Rotation Invariant Face Detection.
Proceedings of the Advances in Multimodal Interfaces, 2000

A Fast Sign Word Recognition Method for Chinese Sign Language.
Proceedings of the Advances in Multimodal Interfaces, 2000

An Approach to Robust and Fast Locating Lip Motion.
Proceedings of the Advances in Multimodal Interfaces, 2000

Gravity-Center Template Based Human Face Feature Detection.
Proceedings of the Advances in Multimodal Interfaces, 2000

A Parallel Multistream Model for Integration of Sign Language Recognition and Lip Motion.
Proceedings of the Advances in Multimodal Interfaces, 2000

<i>HandTalker: </i> A Multimodal Dialog System Using Sign Language and 3-D Virtual Human.
Proceedings of the Advances in Multimodal Interfaces, 2000

Individual 3D Face Synthesis Based on Orthogonal Photos and Speech-Driven Facial Animation.
Proceedings of the 2000 International Conference on Image Processing, 2000

A Continuous Chinese Sign Language Recognition System.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

Discovering Sequence Association Rules with User Access Transaction Grammars.
Proceedings of the 11th International Workshop on Database and Expert Systems Applications (DEXA'00), 2000

1999
Robust telephone speech recognition based on channel compensation.
Pattern Recognit., 1999

The Discretization of Continuous Attributes Based on Compatibility Rough Set and Generic Algorithm.
Proceedings of the New Directions in Rough Sets, 1999

A Robust Method for Unknown Forms Analysis.
Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999

Prediction based on backward adaptive recognition of local texture orientation and Poisson statistical model for lossless/near-lossless image compression.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Extending DACLIC for Near Lossless Compression with Postprocessing of Greyscale Images.
Proceedings of the Data Compression Conference, 1999

1998
The supervised learning Gaussian mixture model.
J. Comput. Sci. Technol., 1998

Discriminative learning of additive noise and channel distortions for robust speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

FACOLA - Face Coder Based on Location and Attention.
Proceedings of the Data Compression Conference, 1998

On-Line Sprite Encoding with Large Global Motion Estimation.
Proceedings of the Data Compression Conference, 1998

1997
A stochastic approach for blurred image restoration and optical flow computation on field image sequence.
J. Comput. Sci. Technol., 1997

Radial basis function interpolation surface on space mesh.
Proceedings of the ACM SIGGRAPH 97 Visual Proceedings: The art and interdisciplinary programs of SIGGRAPH '97, 1997

Text-driven deaf-mute sign language synthesis system.
Proceedings of the ACM SIGGRAPH 97 Visual Proceedings: The art and interdisciplinary programs of SIGGRAPH '97, 1997

Relative mel-frequency cepstral coefficients compensation for robust telephone speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Recognizing components of handwritten characters by attributed relational graphs with stable features.
Proceedings of the 4th International Conference Document Analysis and Recognition (ICDAR '97), 1997

Compunication: From Concept to Practice.
Proceedings of the 21st International Computer Software and Applications Conference (COMPSAC '97), 1997

Text-independent Speaker Identification Based on Spectral Weighting Functions.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 1997

1996
A Human Face Detection and Tracking System for Complex Backgrounds.
Proceedings of the Multimedia Technology and Applications, 1996

1995
Independent Hand Gesture Recognition in HandTalker.
Proceedings of the Image Analysis Applications and Computer Graphics, 1995

A system for automatic Chinese seal imprint verification.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

Sequence Decomposition Method for Computing a Gröbner Basis and Its Application to Bivariate Spline.
Proceedings of the Computing and Combinatorics, First Annual International Conference, 1995


  Loading...