Dahua Lin

CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K.

[BibT_eX]

[DOI]

CoRR, 2024

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.

[BibT_eX]

[DOI]

CoRR, 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.

[BibT_eX]

[DOI]

CoRR, 2024

Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora.

[BibT_eX]

[DOI]

CoRR, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2024

Characterization of Large Language Model Development in the Datacenter.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Streaming Long Video Understanding with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Are We on the Right Way for Evaluating Large Vision-Language Models?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CriticEval: Evaluating Large-scale Language Model as Critic.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Flames: Benchmarking Value Alignment of LLMs in Chinese.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

X-neuron: Interpreting, Locating and Editing of Neurons in Reinforcement Learning Policy.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Unified Human-Scene Interaction via Prompted Chain-of-Contacts.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Scaling Laws of RoPE-based Extrapolation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

PointLLM: Empowering Large Language Models to Understand Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Rethinking Image-to-Video Adaptation: An Object-Centric Perspective.

[BibT_eX]

[DOI]

Rui Qian

Shuangrui Ding

Proceedings of the Computer Vision - ECCV 2024, 2024

MMBench: Is Your Multi-modal Model an All-Around Player?

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ShareGPT4V: Improving Large Multi-modal Models with Better Captions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

Towards Text-guided 3D Scene Composition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cinematic Behavior Transfer via NeRF-based Differentiable Filming.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OneLLM: One Framework to Align All Modalities with Language.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoBooth: Diffusion-based Video Generation with Image Prompts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Alpha-CLIP: A CLIP Model Focusing on Wherever you Want.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

Learning H-Infinity Locomotion Control.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

SpotServe: Serving Generative Large Language Models on Preemptible Instances.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

A Knowledge-driven Self-healing Dual-loop and Validation for Autonomous Networks.

[BibT_eX]

[DOI]

Proceedings of the 8th Asia-Pacific Workshop on Networking, 2024

Uncertainty Aware Learning for Language Model Alignment.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Navigating the OverKill in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Balanced Data Sampling for Language Model Training with Clustering.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

ANAH: Analytical Annotation of Hallucinations in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

SPTS v2: Single-Point Scene Text Spotting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Parsing-Conditioned Anime Translation: A New Dataset and Method.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2023

A Coarse-to-Fine Framework for Automatic Video Unscreen.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases.

[BibT_eX]

[DOI]

CoRR, 2023

T-Eval: Evaluating the Tool Utilization Capability Step by Step.

[BibT_eX]

[DOI]

CoRR, 2023

SceneWiz3D: Towards Text-guided 3D Scene Composition.

[BibT_eX]

[DOI]

CoRR, 2023

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future.

[BibT_eX]

[DOI]

CoRR, 2023

InterControl: Generate Human Motion Interactions by Controlling Every Joint.

[BibT_eX]

[DOI]

CoRR, 2023

Flames: Benchmarking Value Alignment of Chinese Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Scaling Laws of RoPE-based Extrapolation.

[BibT_eX]

[DOI]

CoRR, 2023

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.

[BibT_eX]

[DOI]

CoRR, 2023

WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Referring Video Object Segmentation from Weak Annotation.

[BibT_eX]

[DOI]

CoRR, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.

[BibT_eX]

[DOI]

CoRR, 2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.

[BibT_eX]

[DOI]

CoRR, 2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars.

[BibT_eX]

[DOI]

CoRR, 2023

RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.

[BibT_eX]

[DOI]

CoRR, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.

[BibT_eX]

[DOI]

CoRR, 2023

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation.

[BibT_eX]

[DOI]

CoRR, 2023

VR-NeRF: High-Fidelity Virtualized Walkable Spaces.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGGRAPH 2023 Posters, 2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AssetField: Assets Mining and Reconfiguration in Ground Feature Plane Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

V3Det: Vast Vocabulary Visual Detection Dataset.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Human Dynamics in Autonomous Driving Scenarios.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scene as Occupancy.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Pixel-based MIM by Reducing Wasted Modeling Capability.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

E2EAI: End-to-End Deep Learning Framework for Active Investing.

[BibT_eX]

[DOI]

Zikai Wei

Proceedings of the 4th ACM International Conference on AI in Finance, 2023

Chimera: An Analytical Optimizing Framework for Effective Compute-intensive Operators Fusion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

CLEVA: Chinese Language Models EVAluation Platform.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Cali-NCE: Boosting Cross-modal Video Representation Learning with Calibrated Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Grid-guided Neural Radiance Fields for Large Urban Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Controllable Mesh Generation Through Sparse Latent Point Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OmniCity: Omnipotent City Understanding with Multi-Level and Multi-View Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-Level Logit Distillation.

[BibT_eX]

[DOI]

Ying Jin

Jiaqi Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

2022

Force-Aware Interface via Electromyography for Natural VR/AR Interaction.

[BibT_eX]

[DOI]

Seyed Farokh Atashzar

Qi Sun

ACM Trans. Graph., 2022

Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

CARAFE++: Unified Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Factor Investing with a Deep Multi-Factor Model.

[BibT_eX]

[DOI]

Zikai Wei

CoRR, 2022

Rethinking Trajectory Prediction via "Team Game".

[BibT_eX]

[DOI]

CoRR, 2022

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows.

[BibT_eX]

[DOI]

CoRR, 2022

DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.

[BibT_eX]

[DOI]

CoRR, 2022

Guided Diffusion Model for Adversarial Purification.

[BibT_eX]

[DOI]

CoRR, 2022

Accelerating Diffusion Models via Early Stop of the Diffusion Process.

[BibT_eX]

[DOI]

CoRR, 2022

MINI: Mining Implicit Novel Instances for Few-Shot Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Shoot360: Normal View Video Creation from City Panorama Footage.

[BibT_eX]

[DOI]

Anyi Rao

Linning Xu

Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

Audio-Driven Co-Speech Gesture Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Semi-Supervised Semantic Segmentation via Gentle Teaching Assistant.

[BibT_eX]

[DOI]

Ying Jin

Jiaqi Wang

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Transcript to Video: Efficient Clip Sequencing from Texts.

[BibT_eX]

[DOI]

Yu Xiong

Fabian Caba Heilbron

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cycle-Consistent Learning for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

SPTS: Single-Point Text Spotting.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PYSKL: Towards Good Practices for Skeleton Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

LongTail-Bench: A Benchmark Suite for Domain-Specific Operators in Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2022

EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers.

[BibT_eX]

[DOI]

Proceedings of the 51st International Conference on Parallel Processing, 2022

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Monocular 3D Object Detection with Depth from Motion.

[BibT_eX]

[DOI]

Tai Wang

Jiangmiao Pang

Proceedings of the Computer Vision - ECCV 2022, 2022

Static and Dynamic Concepts for Self-supervised Video Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

OCSampler: Compressing Videos to One Clip with Single-step Sampling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Revisiting Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Learning Diverse Fashion Collocation by Neural Graph Filtering.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Towards Statistically Provable Geometric 3D Human Pose Recovery.

[BibT_eX]

[DOI]

SIAM J. Imaging Sci., 2021

Distributions.jl: Definition and Modeling of Probability Distributions in the JuliaStats Ecosystem.

[BibT_eX]

[DOI]

J. Stat. Softw., 2021

Towards Balanced Learning for Instance Recognition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

SPTS: Single-Point Text Spotting.

[BibT_eX]

[DOI]

CoRR, 2021

CityNeRF: Building NeRF at City Scale.

[BibT_eX]

[DOI]

CoRR, 2021

Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion.

[BibT_eX]

[DOI]

CoRR, 2021

INTERN: A New Learning Paradigm Towards General Vision.

[BibT_eX]

[DOI]

CoRR, 2021

WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Revisiting Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Welcome back!

[BibT_eX]

[DOI]

Hai Jin

Yuanchun Shi

Commun. ACM, 2021

Generative Occupancy Fields for 3D Surface-Aware Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-Shot Object Detection via Association and DIscrimination.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Vision Transformer with Progressive Sampling.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

BlockPlanner: City Block Generation with Vectorized Graph Representation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

3D Building Reconstruction from Monocular Remote Sensing Images.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Visually Informed Binaural Audio Generation without Binaural Audios.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adversarial Robustness Under Long-Tailed Distribution.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Seesaw Loss for Long-Tailed Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scene-Aware Generative Network for Human Motion Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Evaluating and Training Verifiably Robust Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Probabilistic and Geometric Depth: Detecting Objects in Perspective.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Understanding the wiring evolution in differentiable neural architecture search.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Joint Semantic-geometric Learning for Polygonal Building Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Temporal ROI Align for Video Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Parallel Multi-Environment Shaping Algorithm for Complex Multi-step Task.

[BibT_eX]

[DOI]

Neurocomputing, 2020

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2020

Novel Policy Seeking with Constrained Optimization.

[BibT_eX]

[DOI]

CoRR, 2020

Evolutionary Stochastic Policy Distillation.

[BibT_eX]

[DOI]

CoRR, 2020

Feature Pyramid Grids.

[BibT_eX]

[DOI]

Christoph Feichtenhofer

CoRR, 2020

Regularizing Reasons for Outfit Evaluation with Gradient Penalty.

[BibT_eX]

[DOI]

CoRR, 2020

FLAVA: Find, Localize, Adjust and Verify to Annotate LiDAR-based Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the UIST '20 Adjunct: The 33rd Annual ACM Symposium on User Interface Software and Technology, 2020

Real or Not Real, that is the Question.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Learn to Propagate Reliably on Noisy Affinity Graphs.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Online Multi-modal Person Search in Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Distribution-Balanced Loss for Multi-label Classification in Long-Tailed Datasets.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Side-Aware Boundary Localization for More Precise Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Motion Guided 3D Pose Estimation from Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

A Unified Framework for Shot Type Classification Based on Subject Centric Lens.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Placepedia: Comprehensive Place Understanding with Multi-faceted Annotations.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model Without Manual Annotation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

MovieNet: A Holistic Dataset for Movie Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Omni-Sourced Webly-Supervised Learning for Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Self-Supervised Scene De-Occlusion.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning to Cluster Faces via Confidence and Connectivity Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FineGym: A Hierarchical Video Dataset for Fine-Grained Action Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Intra- and Inter-Action Understanding via Temporal Action Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Local-to-Global Approach to Multi-Modal Movie Scene Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

DSNAS: Direct Neural Architecture Search Without Parameter Retraining.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When NAS Meets Robustness: In Search of Robust Architectures Against Adversarial Attacks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Prime Sample Attention in Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Open Compound Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Reconfigurable Voxels: A New Representation for LiDAR-Based Point Clouds.

[BibT_eX]

[DOI]

Tai Wang

Xinge Zhu

Proceedings of the 4th Conference on Robot Learning, 2020

Learning a Decision Module by Imitating Driver's Control Behaviors.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Fastened CROWN: Tightened Neural Network Robustness Certificates.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Temporal Segment Networks for Action Recognition in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Learning Driving Decisions by Imitating Drivers' Control Behaviors.

[BibT_eX]

[DOI]

CoRR, 2019

Learning to Synthesize Fashion Textures.

[BibT_eX]

[DOI]

CoRR, 2019

Biased Estimates of Advantages over Path Ensembles.

[BibT_eX]

[DOI]

Lanxin Lei

Zhizhong Li

CoRR, 2019

Compound Domain Adaptation in an Open World.

[BibT_eX]

[DOI]

CoRR, 2019

MMDetection: Open MMLab Detection Toolbox and Benchmark.

[BibT_eX]

[DOI]

CoRR, 2019

POPQORN: Quantifying Robustness of Recurrent Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Online Hyper-parameter Learning for Auto-Augmentation Strategy.

[BibT_eX]

[DOI]

CoRR, 2019

WIDER Face and Pedestrian Challenge 2018: Methods and Results.

[BibT_eX]

[DOI]

CoRR, 2019

Policy Continuation with Hindsight Inverse Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

POPQORN: Quantifying Robustness of Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Convolutional Sequence Generation for Skeleton-Based Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Recursive Visual Sound Separation Using Minus-Plus Net.

[BibT_eX]

[DOI]

Xudong Xu

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Graph-Based Framework to Bridge Movies and Synopses.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CARAFE: Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Online Hyper-Parameter Learning for Auto-Augmentation Strategy.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Adapting Object Detectors via Selective Cross-Domain Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Self-Supervised Learning via Conditional Motion Propagation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Cluster Faces on an Affinity Graph.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Region Proposal by Guided Anchoring.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Libra R-CNN: Towards Balanced Learning for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning a Unified Classifier Incrementally via Rebalancing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

IRLAS: Inverse Reinforcement Learning for Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Hybrid Task Cascade for Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Monocular 3D Pose Recovery via Nonconvex Sparsity with Theoretical Analysis.

[BibT_eX]

[DOI]

CoRR, 2018

Improving On-policy Learning with Statistical Reward Accumulation.

[BibT_eX]

[DOI]

CoRR, 2018

From Trailers to Storylines: An Efficient Way to Learn from Movies.

[BibT_eX]

[DOI]

CoRR, 2018

Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination.

[BibT_eX]

[DOI]

CoRR, 2018

Trajectory Convolution for Action Recognition.

[BibT_eX]

[DOI]

Yue Zhao

Yuanjun Xiong

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Neural Compositional Paradigm for Image Captioning.

[BibT_eX]

[DOI]

Sanja Fidler

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

PSANet: Point-wise Spatial Attention Network for Scene Parsing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Pose Guided Human Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Move Forward and Tell: A Progressive Generator of Video Descriptions.

[BibT_eX]

[DOI]

Yilei Xiong

Proceedings of the Computer Vision - ECCV 2018, 2018

Find and Focus: Retrieve and Localize Video Events with Natural Language Queries.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Person Search in Videos with One Portrait Through Visual and Temporal Links.

[BibT_eX]

[DOI]

Qingqiu Huang

Wentao Liu

Proceedings of the Computer Vision - ECCV 2018, 2018

Lifelong Learning via Progressive Distillation and Retrospection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Rethinking the Form of Latent States in Image Captioning.

[BibT_eX]

[DOI]

Deming Ye

Proceedings of the Computer Vision - ECCV 2018, 2018

Recognize Actions by Disentangling Components of Dynamics.

[BibT_eX]

[DOI]

Yue Zhao

Yuanjun Xiong

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Unsupervised Feature Learning via Non-Parametric Instance Discrimination.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Globally Optimized Object Detector via Policy Gradient.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Low-Latency Video Semantic Segmentation.

[BibT_eX]

[DOI]

Yule Li

Jianping Shi

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Unifying Identification and Context Learning for Person Recognition.

[BibT_eX]

[DOI]

Qingqiu Huang

Yu Xiong

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Optimizing Video Object Detection via a Scale-Time Lattice.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Accelerated Training for Massive Classification via Dynamic Class Selection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

Sijie Yan

Yuanjun Xiong

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Probabilistic Ensemble of Collaborative Filters.

[BibT_eX]

[DOI]

Zhiyu Min

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Generative Adversarial Frontal View to Bird View Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Conference on 3D Vision, 2018

2017

Peephole: Predicting Network Performance Before Training.

[BibT_eX]

[DOI]

Boyang Deng

Junjie Yan

CoRR, 2017

Learning Sparse Visual Representations with Leaky Capped Norm Regularizers.

[BibT_eX]

[DOI]

Jianqiao Wangni

CoRR, 2017

A Pursuit of Temporal Accuracy in General Activity Detection.

[BibT_eX]

[DOI]

CoRR, 2017

Contrastive Learning for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Scalable Estimation of Dirichlet Process Mixture Models on Distributed Data.

[BibT_eX]

[DOI]

Ruohui Wang

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Integrating Specialized Classifiers Based on Continuous Time Markov Chain.

[BibT_eX]

[DOI]

Zhizhong Li

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Be Your Own Prada: Fashion Synthesis with Structural Coherence.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Temporal Action Detection with Structured Segment Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Towards Diverse and Natural Image Descriptions via a Conditional GAN.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

PolyNet: A Pursuit of Structural Diversity in Very Deep Networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

UntrimmedNets for Weakly Supervised Action Recognition and Detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Detecting Visual Relationships with Deep Relational Networks.

[BibT_eX]

[DOI]

Yuqi Zhang

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Discover and Learn New Objects from Documentaries.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Joint Inference of Objects and Scenes With Efficient Learning of Text-Object-Scene Relations.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016.

[BibT_eX]

[DOI]

CoRR, 2016

Deep Markov Random Field for Image Modeling.

[BibT_eX]

[DOI]

Zhirong Wu

Proceedings of the Computer Vision - ECCV 2016, 2016

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Adjustable Bounded Rectifiers: Towards Deep Binary Representations.

[BibT_eX]

[DOI]

Zhirong Wu

CoRR, 2015

Generating Multi-Sentence Lingual Descriptions of Indoor Scenes.

[BibT_eX]

[DOI]

CoRR, 2015

Recognize complex events from static images by fusing deep channels.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Generating Multi-sentence Natural Language Descriptions of Indoor Scenes.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2015, 2015

2014

Mining text snippets for images on the web.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Visual Semantic Search: Retrieving Videos via Complex Textual Queries.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

What Are You Talking About? Text-to-Image Coreference.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013

Online Learning of Nonparametric Mixture Models via Sequential Variational Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Characterizing Layouts of Outdoor Scenes Using Spatial Topic Processes.

[BibT_eX]

[DOI]

Jianxiong Xiao

Proceedings of the IEEE International Conference on Computer Vision, 2013

Holistic Scene Understanding for 3D Object Detection with RGBD Cameras.

[BibT_eX]

[DOI]

Sanja Fidler

Raquel Urtasun

Proceedings of the IEEE International Conference on Computer Vision, 2013

Hidden Factor Analysis for Age Invariant Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Generative modeling of dynamic visual scenes.

[BibT_eX]

[DOI]

PhD thesis, 2012

Efficient Sampling from Combinatorial Space via Bridging.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Coupling Nonparametric Mixtures via Latent Dirichlet Processes.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Learning Deformations with Parallel Transport.

[BibT_eX]

[DOI]

Donglai Wei

Proceedings of the Computer Vision - ECCV 2012, 2012

Low level vision via switchable Markov random fields.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Manifold guided composite of Markov random fields for image modeling.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2010

Construction of Dependent Dirichlet Processes based on Poisson Processes.

[BibT_eX]

[DOI]

Eric Grimson

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Joint People, Event, and Location Recognition in Personal Photo Collections Using Cross-Domain Context.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2010

Modeling and estimating persistent motion with geometric flows.

[BibT_eX]

[DOI]

Eric Grimson

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Nonparametric Discriminant Analysis for Face Recognition.

[BibT_eX]

[DOI]

Zhifeng Li

IEEE Trans. Pattern Anal. Mach. Intell., 2009

Learning visual flows: A Lie algebraic approach.

[BibT_eX]

[DOI]

W. Eric L. Grimson

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2007

Quality-Driven Face Occlusion Detection and Recovery.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Discriminant Mutual Subspace Learning for Indoor and Outdoor Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006

Inter-modality Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2006

Conditional Infomax Learning: An Integrated Framework for Feature Extraction and Fusion.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2006

Pursuing Informative Projection on Grassmann Manifold.

[BibT_eX]

[DOI]

Shuicheng Yan

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Recognize High Resolution Faces: From Macrocosm to Microcosm.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005

Neighbor combination and transformation for hallucinating faces.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Face hallucination through dual associative learning.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Image Processing, 2005

Comparative study: face recognition on unspecific persons using linear subspace methods.

[BibT_eX]

[DOI]

Shuicheng Yan

Proceedings of the 2005 International Conference on Image Processing, 2005

Feedback-based dynamic generalized LDA for face recognition.

[BibT_eX]

[DOI]

Shuicheng Yan

Proceedings of the 2005 International Conference on Image Processing, 2005

Tensor-based factor decomposition for relighting.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Image Processing, 2005

Layered local prediction network with dynamic learning for face super-resolution.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Image Processing, 2005

Coupled Space Learning for Image Style Transformation.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Hallucinating Faces: TensorPatch Super-Resolution and Coupled Residue Compensation.

[BibT_eX]

[DOI]