Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks.

[BibT_eX]

[DOI]

Chonghua Wang

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LawBench: Benchmarking Legal Knowledge of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MMBench: Is Your Multi-modal Model an All-Around Player?

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

T-Eval: Evaluating the Tool Utilization Capability Step by Step.

[BibT_eX]

[DOI]

CoRR, 2023

LawBench: Benchmarking Legal Knowledge of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Referring Video Object Segmentation from Weak Annotation.

[BibT_eX]

[DOI]

CoRR, 2023

RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.

[BibT_eX]

[DOI]

CoRR, 2023

Temporal Segment Transformer for Action Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

TG-VQA: Ternary Game of Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Improving Pixel-based MIM by Reducing Wasted Modeling Capability.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Budget-aware Few-shot Learning via Graph Convolutional Network.

[BibT_eX]

[DOI]

Shipeng Yan

Songyang Zhang

Xuming He

CoRR, 2022

Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation.

[BibT_eX]

[DOI]

Qiuyue Wang

Songyang Zhang

Xuming He

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Semantic Correspondence with Sparse Annotations.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Action Quality Assessment with Temporal Parsing Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

SGTR: End-to-end Scene Graph Generation with Transformer.

[BibT_eX]

[DOI]

Rongjie Li

Songyang Zhang

Xuming He

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge.

[BibT_eX]

[DOI]

CoRR, 2021

Dynamic Grained Encoder for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An EM Framework for Online Incremental Learning of Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Implicit Temporal Alignment for Few-shot Video Classification.

[BibT_eX]

[DOI]

Songyang Zhang

Jiale Zhou

Xuming He

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Transformer with Bidirectional Decoder for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Part-Aware Prototype Network for Few-Shot Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

LatentGNN: Learning Efficient Non-local Relations for Visual Recognition.

[BibT_eX]

[DOI]

Songyang Zhang

Xuming He

Shipeng Yan

Proceedings of the 36th International Conference on Machine Learning, 2019

Dynamic Context Correspondence Network for Semantic Alignment.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Dual Attention Network with Semantic Embedding for Few-Shot Learning.

[BibT_eX]

[DOI]

Shipeng Yan

Songyang Zhang

Xuming He

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2017

Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning.

[BibT_eX]

[DOI]

CoRR, 2017

Predicting Salient Face in Multiple-Face Videos.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Songyang Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...