Shanghang Zhang

Orcid: 0000-0003-4047-3526

According to our database¹, Shanghang Zhang authored at least 289 papers between 2012 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2026

CoCoGesture: Towards coherent co-speech 3D gesture generation in the wild.

[BibT_eX]

[DOI]

Inf. Fusion, 2026

How EEG-based cross-subject driving emotion is recognized: A multi-source transfer manifold learning model.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2026

2025

XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations.

[BibT_eX]

[DOI]

CoRR, November, 2025

URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model.

[BibT_eX]

[DOI]

CoRR, November, 2025

RoboOS-NeXT: A Unified Memory-based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration.

[BibT_eX]

[DOI]

CoRR, October, 2025

Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain.

[BibT_eX]

[DOI]

CoRR, October, 2025

From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance.

[BibT_eX]

[DOI]

CoRR, October, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, October, 2025

OmniSAT: Compact Action Token, Faster Auto Regression.

[BibT_eX]

[DOI]

CoRR, October, 2025

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, October, 2025

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics.

[BibT_eX]

[DOI]

CoRR, October, 2025

Can World Models Benefit VLMs for World Dynamics?

[BibT_eX]

[DOI]

CoRR, October, 2025

MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles.

[BibT_eX]

[DOI]

CoRR, October, 2025

RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., September, 2025

EEG-Driven Classification of Driver Mental Workload in Diverse Environments: A Dual-Branch Network for Efficient In-Vehicle Applications.

[BibT_eX]

[DOI]

IEEE Internet Things J., September, 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, September, 2025

dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, September, 2025

WoW: Towards a World omniscient World model Through Embodied Interaction.

[BibT_eX]

[DOI]

CoRR, September, 2025

Orochi: Versatile Biomedical Image Processor.

[BibT_eX]

[DOI]

CoRR, September, 2025

MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies.

[BibT_eX]

[DOI]

CoRR, September, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

ManipDreamer3D : Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory.

[BibT_eX]

[DOI]

CoRR, September, 2025

MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs.

[BibT_eX]

[DOI]

CoRR, August, 2025

4D Visual Pre-training for Robot Learning.

[BibT_eX]

[DOI]

CoRR, August, 2025

HumanoidVerse: A Versatile Humanoid for Vision-Language Guided Multi-Object Rearrangement.

[BibT_eX]

[DOI]

CoRR, August, 2025

NavA3: Understanding Any Instruction, Navigating Anywhere, Finding Anything.

[BibT_eX]

[DOI]

CoRR, August, 2025

UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying.

[BibT_eX]

[DOI]

CoRR, August, 2025

FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning.

[BibT_eX]

[DOI]

CoRR, July, 2025

Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition.

[BibT_eX]

[DOI]

CoRR, July, 2025

RwoR: Generating Robot Demonstrations from Human Hand Collection for Policy Learning without Robot.

[BibT_eX]

[DOI]

CoRR, July, 2025

RoboBrain 2.0 Technical Report.

[BibT_eX]

[DOI]

CoRR, July, 2025

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation.

[BibT_eX]

[DOI]

CoRR, July, 2025

Implicit Neural Image Field for Biological Microscopy Image Compression.

[BibT_eX]

[DOI]

Shanghang Zhang

Jianxu Chen

Dataset, July, 2025

SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents.

[BibT_eX]

[DOI]

CoRR, June, 2025

MinD: Unified Visual Imagination and Control via Hierarchical World Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

FastInit: Fast Noise Initialization for Temporally Consistent Video Generation.

[BibT_eX]

[DOI]

CoRR, June, 2025

AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

[BibT_eX]

[DOI]

CoRR, June, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, June, 2025

SpikePingpong: High-Frequency Spike Vision-based Robot Learning for Precise Striking in Table Tennis Game.

[BibT_eX]

[DOI]

CoRR, June, 2025

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics.

[BibT_eX]

[DOI]

CoRR, June, 2025

Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2025

GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control.

[BibT_eX]

[DOI]

CoRR, May, 2025

OmniIndoor3D: Comprehensive Indoor 3D Reconstruction.

[BibT_eX]

[DOI]

CoRR, May, 2025

SpikeGen: Generative Framework for Visual Spike Stream Processing.

[BibT_eX]

[DOI]

CoRR, May, 2025

AFCL: Analytic Federated Continual Learning for Spatio-Temporal Invariance of Non-IID Data.

[BibT_eX]

[DOI]

CoRR, May, 2025

ACU: Analytic Continual Unlearning for Efficient and Exact Forgetting with Privacy Preservation.

[BibT_eX]

[DOI]

CoRR, May, 2025

H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from Videos.

[BibT_eX]

[DOI]

CoRR, May, 2025

RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration.

[BibT_eX]

[DOI]

CoRR, May, 2025

CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, May, 2025

Co3Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion.

[BibT_eX]

[DOI]

CoRR, May, 2025

ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance.

[BibT_eX]

[DOI]

CoRR, April, 2025

EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler.

[BibT_eX]

[DOI]

CoRR, April, 2025

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.

[BibT_eX]

[DOI]

CoRR, March, 2025

MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, March, 2025

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

[BibT_eX]

[DOI]

CoRR, March, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model.

[BibT_eX]

[DOI]

CoRR, March, 2025

AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter.

[BibT_eX]

[DOI]

CoRR, March, 2025

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., February, 2025

CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World.

[BibT_eX]

[DOI]

CoRR, February, 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation.

[BibT_eX]

[DOI]

CoRR, January, 2025

GaussianEnhancer++: A General GS-Agnostic Rendering Enhancer.

[BibT_eX]

[DOI]

Symmetry, 2025

Empowering Corner Case Detection in Autonomous Vehicles With Multimodal Large Language Models.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2025

A diffusion-based feature enhancement approach for driving behavior classification with EEG data.

[BibT_eX]

[DOI]

Adv. Eng. Informatics, 2025

FreqMoE: Dynamic Frequency Enhancement for Neural PDE Solvers.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

High-Quality 3D Creation From a Single Image Using Subject-Specific Knowledge Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

PINNsAgent: Automated PDE Surrogation with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SAN: Hypothesizing Long-Term Synaptic Development and Neural Engram Mechanism in Scalable Model's Parameter-Efficient Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Empowering World Models with Reflection for Embodied Video Prediction.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OmniArch: Building Foundation Model for Scientific Computing.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Adaptive Semantic Compression: Compatible Bitstream for Scalable Human-Machine Perception Sample Adaption.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Co3Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient Quality Controllable Neural Image Compression based on QD-Model.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

GaussianEnhancer: A General Rendering Enhancer for Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Three-Stage Progressive Pre-Analysis Framework for VMAF Controllable Image Coding.

[BibT_eX]

[DOI]

Proceedings of the Data Compression Conference, 2025

Decouple Distortion from Perception: Region Adaptive Diffusion for Extreme-low Bitrate Perception Image Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Segment Any Motion in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., September, 2024

Exploring Generalizable Distillation for Efficient Medical Image Segmentation.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, July, 2024

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Veh., January, 2024

DECOR: Dynamic Decoupling and Multiobjective Optimization for Long-Tailed Remote Sensing Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

A lightweight multi-layer perceptron for efficient multivariate time series forecasting.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2024

The Emerging Issues in Bioimaging AI Publications and Research (Dagstuhl Seminar 24042).

[BibT_eX]

[DOI]

Dagstuhl Reports, 2024

SCBench: A Sports Commentary Benchmark for Video LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

GaussianAD: Gaussian-Centric End-to-End Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2024

GPD-1: Generative Pre-training for Driving.

[BibT_eX]

[DOI]

CoRR, 2024

ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance.

[BibT_eX]

[DOI]

CoRR, 2024

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model.

[BibT_eX]

[DOI]

CoRR, 2024

[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.

[BibT_eX]

[DOI]

CoRR, 2024

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, 2024

MC-LLaVA: Multi-Concept Personalized Vision-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

Training-free Regional Prompting for Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

EVA: An Embodied World Model for Future Video Anticipation.

[BibT_eX]

[DOI]

CoRR, 2024

Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation.

[BibT_eX]

[DOI]

CoRR, 2024

Discovering Long-Term Effects on Parameter Efficient Fine-tuning.

[BibT_eX]

[DOI]

CoRR, 2024

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.

[BibT_eX]

[DOI]

CoRR, 2024

Multimodal Large Language Models for Bioimage Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

MAVIS: Mathematical Visual Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Fisher-aware Quantization for DETR Detectors with Critical-category Objectives.

[BibT_eX]

[DOI]

CoRR, 2024

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception.

[BibT_eX]

[DOI]

CoRR, 2024

RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

S3Gaussian: Self-Supervised Street Gaussians for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2024

Implicit Neural Image Field for Biological Microscopy Image Compression.

[BibT_eX]

[DOI]

CoRR, 2024

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention.

[BibT_eX]

[DOI]

CoRR, 2024

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning.

[BibT_eX]

[DOI]

CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera.

[BibT_eX]

[DOI]

CoRR, 2024

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing.

[BibT_eX]

[DOI]

CoRR, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track.

[BibT_eX]

[DOI]

CoRR, 2024

Building Flexible Machine Learning Models for Scientific Computing at Scale.

[BibT_eX]

[DOI]

CoRR, 2024

Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness.

[BibT_eX]

[DOI]

CoRR, 2024

RustNeRF: Robust Neural Radiance Field with Low-Quality Images.

[BibT_eX]

[DOI]

CoRR, 2024

TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unveiling the Tapestry of Consistency in Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Compositional Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Gradient-based Parameter Selection for Efficient Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeKD: Knowledge Distillation via Semantic Frequency Prompt.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Frame-Recurrent Video Crowd Counting.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Learning Deep Features for Robotic Inference From Physical Interactions.

[BibT_eX]

[DOI]

IEEE Trans. Cogn. Dev. Syst., September, 2023

Expanding the prediction capacity in long sequence time-series forecasting.

[BibT_eX]

[DOI]

Artif. Intell., May, 2023

P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification.

[BibT_eX]

[DOI]

Remote. Sens., April, 2023

Caching in Dynamic Environments: A Near-Optimal Online Learning Approach.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.

[BibT_eX]

[DOI]

CoRR, 2023

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Iterative Prompt Relabeling for diffusion model with RLDF.

[BibT_eX]

[DOI]

CoRR, 2023

FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection.

[BibT_eX]

[DOI]

CoRR, 2023

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior.

[BibT_eX]

[DOI]

CoRR, 2023

Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training.

[BibT_eX]

[DOI]

CoRR, 2023

MoEC: Mixture of Experts Implicit Neural Compression.

[BibT_eX]

[DOI]

CoRR, 2023

ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model.

[BibT_eX]

[DOI]

CoRR, 2023

COLE: A Hierarchical Generation Framework for Graphic Design.

[BibT_eX]

[DOI]

CoRR, 2023

Heterogenous Memory Augmented Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2023

Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.

[BibT_eX]

[DOI]

CoRR, 2023

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

CoRR, 2023

PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

CoRR, 2023

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering.

[BibT_eX]

[DOI]

CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Chain of Thought Prompt Tuning in Vision Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

MoWE: Mixture of Weather Experts for Multiple Adverse Weather Removal.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Sparse Visual Prompt for Cross-domain Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

When Visible Light (Backscatter) Communication Meets Neuromorphic Cameras in V2X.

[BibT_eX]

[DOI]

Proceedings of the 24th International Workshop on Mobile Computing Systems and Applications, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Electroencephalogram-Based Driver Emotional State Detection with Manifold Learning.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023

A Text Prompt-Based Approach for Zero-Shot Corner Case Object Detection in Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023

Uncertainty-Aware Dynamic Learning for Cross-Domain Few-Shot Scene Classification from Remote Sensing Imagery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Q-Diffusion: Quantizing Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BadRes: Reveal the Backdoors Through Residual Connection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Annealing-based Label-Transfer Learning for Open World Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Point-Cloud Object Detection without 3D Annotation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

A Review of Single-Source Deep Unsupervised Visual Domain Adaptation.

[BibT_eX]

[DOI]

Alberto L. Sangiovanni-Vincentelli

Sanjit A. Seshia

Kurt Keutzer

IEEE Trans. Neural Networks Learn. Syst., 2022

Active Gradual Domain Adaptation: Dataset and Approach.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Uncertainty Guided Depth Fusion for Spike Camera.

[BibT_eX]

[DOI]

CoRR, 2022

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

CoRR, 2022

Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.

[BibT_eX]

[DOI]

CoRR, 2022

UnrealNAS: Can We Search Neural Architectures with Unreal Data?

[BibT_eX]

[DOI]

CoRR, 2022

Cross-Domain Object Detection with Mean-Teacher Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Self-Supervised Pretraining Improves Self-Supervised Pretraining.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Margin-Based Few-Shot Class-Incremental Learning with Class-Level Overfitting Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Jump Self-attention: Capturing High-order Statistics in Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

DNA: Domain Generalization with Diversified Neural Averaging.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

MTTrans: Cross-domain Object Detection with Mean Teacher Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Online Continual Adaptation with Active Self-Training.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Learning graph attention-aware knowledge graph embedding.

[BibT_eX]

[DOI]

Neurocomputing, 2021

2nd Place Solution for VisDA 2021 Challenge - Universally Domain Adaptive Image Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

CoRR, 2021

Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Annotation-Efficient Untrimmed Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Triplet Attention: Rethinking the Similarity in Transformers.

[BibT_eX]

[DOI]

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Decoupling Global and Local Representations via Invertible Generative Flows.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

MERITS: Medication Recommendation for Chronic Disease with Irregular Time-Series.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Data Mining, 2021

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Contrastive Multimodal Fusion with TupleInfoNCE.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Alberto L. Sangiovanni-Vincentelli

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Modeling relation paths for knowledge base completion via joint adversarial training.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2020

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding.

[BibT_eX]

[DOI]

CoRR, 2020

Cross-Domain Sentiment Classification with In-Domain Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Revisiting Mid-Level Patterns for Distant-Domain Few-Shot Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms.

[BibT_eX]

[DOI]

CoRR, 2020

Rethinking Distributional Matching Based Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2020

Decoupling Global and Local Representations from/for Image Generation.

[BibT_eX]

[DOI]

CoRR, 2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Generalized Zero-Shot Text Classification for ICD Coding.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Instance Adaptive Self-training for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

TCGM: An Information-Theoretic Framework for Semi-supervised Multi-modality Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Multi-Source Distilling Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Generalized Zero-shot ICD Coding.

[BibT_eX]

[DOI]

CoRR, 2019

Feature Fusion for Image Retrieval With Adaptive Bitrate Allocation and Hard Negative Mining.

[BibT_eX]

[DOI]

Chuang Zhu

Huihui Dong

Shanghang Zhang

IEEE Access, 2019

Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning.

[BibT_eX]

[DOI]

Jian Ni

Shanghang Zhang

Haiyong Xie

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

MaCow: Masked Convolutional Generative Flow.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Deep Understanding of Urban Mobility from CityscapeWebcams.

[BibT_eX]

[DOI]

Shanghang Zhang

PhD thesis, 2018

Hierarchical Attention Networks for Knowledge Base Completion via Joint Adversarial Training.

[BibT_eX]

[DOI]

CoRR, 2018

Adversarial Multiple Source Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Multiple Source Domain Adaptation with Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

A Deep Learning Approach to IoT Authentication.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Communications, 2018

Learning to Understand Image Blur.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Topology adaptive graph convolutional networks.

[BibT_eX]

[DOI]

CoRR, 2017

Multiple Source Domain Adaptation with Adversarial Training of Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Understanding Traffic Density from Large-Scale Web Camera Data.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2015

Traffic flow from a low frame rate city camera.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

2014

Bayesian model fusion: Enabling test cost reduction of analog/RF circuits via wafer-level spatial variation modeling.

[BibT_eX]

[DOI]

Shanghang Zhang

Xin Li

Ronald D. Blanton

José Machado da Silva

John M. Carulli Jr.

Kenneth M. Butler

Proceedings of the 2014 International Test Conference, 2014

2013

On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

A high-throughput low-latency arithmetic encoder design for HDTV.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

2012

An efficient foreground-based surveillance video coding scheme in low bit-rate compression.

[BibT_eX]

[DOI]

Proceedings of the 2012 Visual Communications and Image Processing, 2012

A flexible and high-performance hardware video encoder architecture.

[BibT_eX]

[DOI]

Proceedings of the 2012 Picture Coding Symposium, 2012

An Optimized Hardware Video Encoder for AVS with Level C+ Data Reuse Scheme for Motion Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Shanghang Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...