Shijie Geng

According to our database1, Shijie Geng authored at least 46 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CLIP-Adapter: Better Vision-Language Models with Feature Adapters.
Int. J. Comput. Vis., February, 2024

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models.
CoRR, 2024

2023
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model.
CoRR, 2023

Mono-STAR: Mono-Camera Scene-Level Tracking and Reconstruction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

VIP5: Towards Multimodal Foundation Models for Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs.
Proceedings of the Conference on Robot Learning, 2023

2022
Learning and Evaluating Graph Neural Network Explanations based on Counterfactual and Factual Reasoning.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Path Language Modeling over Knowledge Graphsfor Explainable Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Explainable Fairness in Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Recommendation as Language Processing (RLP): A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5).
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality.
Proceedings of the Computer Vision - ECCV 2022, 2022

Frozen CLIP Models are Efficient Video Learners.
Proceedings of the Computer Vision - ECCV 2022, 2022

Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Improving Personalized Explanation Generation through Visualization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
COMPOSER: Compositional Learning of Group Activity in Videos.
CoRR, 2021

A Simple Long-Tailed Recognition Baseline via Vision-Language Model.
CoRR, 2021

CLIP-Adapter: Better Vision-Language Models with Feature Adapters.
CoRR, 2021

Counterfactual Evaluation for Explainable AI.
CoRR, 2021

Scalable Transformers for Neural Machine Translation.
CoRR, 2021

RomeBERT: Robust Training of Multi-Exit BERT.
CoRR, 2021

Dense Contrastive Visual-Linguistic Pretraining.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Popcorn: Human-in-the-loop Popularity Debiasing in Conversational Recommender Systems.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Multi-Pass Transformer for Machine Translation.
CoRR, 2020

Contrastive Visual-Linguistic Pretraining.
CoRR, 2020

Spatio-Temporal Scene Graphs for Video Dialog.
CoRR, 2020

Character Matters: Video Story Understanding with Character-Aware Relations.
CoRR, 2020

Fairness-Aware Explainable Recommendation over Knowledge Graphs.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Combining CGAN and Mil for Hotspot Segmentation in Bone Scintigraphy.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

CAFE: Coarse-to-Fine Neural Symbolic Reasoning for Explainable Recommendation.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

ABSent: Cross-Lingual Sentence Representation Mapping with Bidirectional GANs.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
2nd Place Solution to the GQA Challenge 2019.
CoRR, 2019

Maximizing Marginal Utility per Dollar for Economic Recommendation.
Proceedings of the World Wide Web Conference, 2019

2018
Quantized Densely Connected U-Nets for Efficient Landmark Localization.
Proceedings of the Computer Vision - ECCV 2018, 2018

CU-Net: Coupled U-Nets.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Image Segmentation with Pyramid Dilated Convolution Based on ResNet and U-Net.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Robust Visual Tracking via Occlusion Detection Based on Depth-Layer Information.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Semantic segmentation with multi-path refinement and pyramid pooling dilated-resnet.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2016
A MIL-based interactive approach for hotspot segmentation from bone scintigraphy.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Combining CNN and MIL to Assist Hotspot Segmentation in Bone Scintigraphy.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

NSLIC: SLIC superpixels based on nonstationarity measure.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015


  Loading...