Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.

[BibT_eX]

[DOI]

Haodong Duan

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Differentiable Model Scaling using Differentiable Topk.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can AI Assistants Know What They Don't Know?

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LawBench: Benchmarking Legal Knowledge of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Task Is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

4D Contrastive Superflows are Dense 3D Representation Learners.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

MMBench: Is Your Multi-modal Model an All-Around Player?

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Language-Driven Video Inpainting via Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OMG-Seg: Is One Model Good Enough for all Segmentation?

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

ANAH: Analytical Annotation of Hallucinations in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

T-Eval: Evaluating the Tool Utilization Capability Step by Step.

[BibT_eX]

[DOI]

CoRR, 2023

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit.

[BibT_eX]

[DOI]

CoRR, 2023

Mixed Pseudo Labels for Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Evaluating Hallucinations in Chinese Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection.

[BibT_eX]

[DOI]

CoRR, 2023

LawBench: Benchmarking Legal Knowledge of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.

[BibT_eX]

[DOI]

CoRR, 2023

Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Referring Video Object Segmentation from Weak Annotation.

[BibT_eX]

[DOI]

CoRR, 2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest.

[BibT_eX]

[DOI]

CoRR, 2023

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.

[BibT_eX]

[DOI]

CoRR, 2023

RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions.

[BibT_eX]

[DOI]

CoRR, 2023

RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.

[BibT_eX]

[DOI]

CoRR, 2023

RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose.

[BibT_eX]

[DOI]

CoRR, 2023

Segment Any Point Cloud Sequences by Distilling Vision Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TG-VQA: Ternary Game of Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Improving Pixel-based MIM by Reducing Wasted Modeling Capability.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dense Distinct Query for End-to-End Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

CARAFE++: Unified Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

RTMDet: An Empirical Study of Designing Real-Time Object Detectors.

[BibT_eX]

[DOI]

CoRR, 2022

DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

What Are Expected Queries in End-to-End Object Detection?

[BibT_eX]

[DOI]

CoRR, 2022

Dense Siamese Network.

[BibT_eX]

[DOI]

CoRR, 2022

Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MMRotate: A Rotated Object Detection Benchmark using PyTorch.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PYSKL: Towards Good Practices for Skeleton Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Arithmetic optimization algorithm to optimize support vector machine for chip defect Identification.

[BibT_eX]

[DOI]

Kai Chen

Heng Yao

Zhenhua Han

Proceedings of the 28th International Conference on Mechatronics and Machine Vision in Practice, 2022

Dense Siamese Network for Dense Unsupervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Group R-CNN for Weakly Semi-supervised Object Detection with Points.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OCSampler: Compressing Videos to One Clip with Single-step Sampling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Revisiting Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Towards Balanced Learning for Instance Recognition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

STransGAN: An Empirical Study on Transformer in GANs.

[BibT_eX]

[DOI]

CoRR, 2021

WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Revisiting Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

K-Net: Towards Unified Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-Shot Object Detection via Association and DIscrimination.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Positional Encoding As Spatial Inductive Bias in GANs.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Seesaw Loss for Long-Tailed Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Temporal ROI Align for Video Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Feature Pyramid Grids.

[BibT_eX]

[DOI]

Christoph Feichtenhofer

CoRR, 2020

Side-Aware Boundary Localization for More Precise Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Prime Sample Attention in Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

MMDetection: Open MMLab Detection Toolbox and Benchmark.

[BibT_eX]

[DOI]

CoRR, 2019

CARAFE: Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Region Proposal by Guided Anchoring.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Libra R-CNN: Towards Balanced Learning for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Hybrid Task Cascade for Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Optimizing Video Object Detection via a Scale-Time Lattice.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Video Object Segmentation with Re-identification.

[BibT_eX]

[DOI]

CoRR, 2017

Discover and Learn New Objects from Documentaries.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Kai Chen

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...