Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts.

Zhenpeng Su

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Advancing Reliable Test-Time Adaptation of Vision-Language Models under Visual Variations.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Exploiting Position Information in Convolutional Kernels for Structural Re-parameterization.

Tianxiang Hao

Hui Chen

Guiguang Ding

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

YOLOE: Real-Time Seeing Anything.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LBPE: Long-token-first Tokenization to Improve Large Language Models.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Temporal Scaling Law for Large Language Models.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

AdaTP: Attention-Debiased Token Pruning for Video Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Fast Quiet-STaR: Thinking Without Thought Tokens.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LSNet: See Large, Focus Small.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Can Sequential Persuasion Strategies Referencing Specific Purposes Enhance the Persuasiveness of Online Requests? A Case Study.

Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

Extending LLM Context Window with Adaptive Grouped Positional Encoding: A Training-Free Method.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

YOLO-UniOW: Efficient Universal Open-World Object Detection.

CoRR, 2024

[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs.

CoRR, 2024

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation.

CoRR, 2024

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image.

CoRR, 2024

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts.

CoRR, 2024

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal.

CoRR, 2024

Temporal Scaling Law for Large Language Models.

CoRR, 2024

YOLOv10: Real-Time End-to-End Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Multi-Label Learning with Block Diagonal Labels.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

More is Better: Deep Domain Adaptation with Multiple Sources.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

TaD: A Plug-and-Play Task-Aware Decoding Method to Better Adapt LLMs on Downstream Tasks.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

A Simple Confidence-Supervised Model for High-Resolution Defect Recognition.

Proceedings of the 8th International Conference on Robotics, Control and Automation, 2024

PYRA: Parallel Yielding Re-activation for Training-Inference Efficient Task Adaptation.

Proceedings of the Computer Vision - ECCV 2024, 2024

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence.

Proceedings of the Computer Vision - ECCV 2024, 2024

Quantized Prompt for Efficient Generalization of Vision-Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection.

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

RepViT: Revisiting Mobile CNN From ViT Perspective.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Geometry-Guided Domain Generalization for Monocular 3D Object Detection.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

GPro3D: Deriving 3D BBox from ground plane in monocular 3D object detection.

Neurocomputing, December, 2023

Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters.

CoRR, 2023

RepViT-SAM: Towards Real-Time Segmenting Anything.

CoRR, 2023

InfoEntropy Loss to Mitigate Bias of Learning Difficulties for Generative Language Models.

CoRR, 2023

CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs.

CoRR, 2023

RepViT: Revisiting Mobile CNN From ViT Perspective.

CoRR, 2023

Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation.

CoRR, 2023

Hierarchical Prompt Learning Using CLIP for Multi-label Classification with Single Positive Labels.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Consolidator: Mergable Adapter with Group Connections for Visual Adaptation.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Box-Level Active Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Ground Plane Matters: Picking Up Ground Plane Prior in Monocular 3D Object Detection.

CoRR, 2022

2021

Image Captioning with Memorized Knowledge.

Cogn. Comput., 2021

2020

ACMNet: Adaptive Confidence Matching Network for Human Behavior Analysis via Cross-modal Retrieval.

ACM Trans. Multim. Comput. Commun. Appl., 2020

IMRAM: Iterative Matching With Recurrent Attention Memory for Cross-Modal Image-Text Retrieval.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Consistent Address Allocation Algorithm Mitigating Address Conflict for Large-Scale LoRa-Enabled IoT Networks.

Proceedings of the 23rd IEEE International Conference on Computational Science and Engineering, 2020

Enhanced Meta-Learning for Cross-Lingual Named Entity Recognition with Minimal Resources.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Neural Image Caption Generation with Weighted Training and Reference.

Cogn. Comput., 2019

PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Cross-Modal Image-Text Retrieval with Semantic Consistency.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

An Adaptive MAC Layer Energy-Saving Algorithm for ZigBee-Enabled IoT Networks.

Yaxuan Zhang

Kun Yang

Hui Chen

Proceedings of the Smart City and Informatization - 7th International Conference, 2019

GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Show, Observe and Tell: Attribute-driven Attention Model for Image Captioning.

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Attend to Knowledge: Memory-Enhanced Attention Network for Image Captioning.

Proceedings of the Advances in Brain Inspired Cognitive Systems, 2018

Temporal-Difference Learning With Sampling Baseline for Image Captioning.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Reference Based LSTM for Image Captioning.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Hui Chen
