Tsu-Jui Fu

According to our database1, Tsu-Jui Fu authored at least 41 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Guiding Instruction-based Image Editing via Multimodal Large Language Models.
CoRR, 2023

Text-guided 3D Human Generation from 2D Collections.
CoRR, 2023

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation.
CoRR, 2023

Discriminative Diffusion Models as Few-shot Vision and Language Learners.
CoRR, 2023

PHOTOSWAP: Personalized Subject Swapping in Images.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

EDIS: Entity-Driven Image Search over Multimodal Web Content.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text-guided 3D Human Generation from 2D Collections.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CPL: Counterfactual Prompt Learning for Vision and Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ULN: Towards Underspecified Vision-and-Language Navigation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Language-Driven Artistic Style Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022

M<sup>3</sup>L: Language-based Video Editing via Multi-Modal Multi-Level Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling.
CoRR, 2021

Language-Driven Image Style Transfer.
CoRR, 2021

Language-based Video Editing via Multi-Modal Multi-Level Transformer.
CoRR, 2021

Semi-Supervised Policy Initialization for Playing Games with Language Hints.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

L2C: Describing Visual Differences Needs Semantic Understanding of Individuals.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler.
Proceedings of the Computer Vision - ECCV 2020, 2020

Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling.
CoRR, 2019

Remedying BiLSTM-CNN Deficiency in Modeling Cross-Context for NER.
CoRR, 2019

Attentive and Adversarial Learning for Video Summarization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Learning from Observation-Only Demonstration for Task-Oriented Language Grounding via Self-Examination.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

A Distributed Scheme for Accelerating Semantic Video Segmentation on An Embedded Cluster.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019

Adversarial Active Exploration for Inverse Dynamics Model Learning.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Adversarial Exploration Strategy for Self-Supervised Imitation Learning.
CoRR, 2018

Diversity-Driven Exploration Strategy for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Speed Reading: Learning to Read ForBackward via Shuttle.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Dynamic Video Segmentation Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Region-Semantics Preserving Image Synthesis.
Proceedings of the Computer Vision - ACCV 2018, 2018


  Loading...