Tsu-Jui Fu
Cited by
Cited by
GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction
TJ Fu, PH Li, WY Ma
ACL (Long), 2019
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv:2111.12681, 2021
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
W Feng, X He, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, XE Wang, ...
ICLR, 2023
Dynamic Video Segmentation Network
YS Xu, TJ Fu*, HK Yang*, CY Lee
CVPR, 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
ZW Hong, TY Shann, SY Su, YH Chang, TJ Fu, CY Lee
NeurIPS, 2018
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
TJ Fu, X Wang, M Peterson, S Grafton, M Eckstein, WY Wang
ECCV (Spotlight), 2020
Attentive and Adversarial Learning for Video Summarization
TJ Fu, SH Tai, HT Chen
WACV (Oral), 2019
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER
PH Li, TJ Fu, WY Ma
AAAI (Oral), 2020
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
W Feng*, W Zhu*, T Fu, V Jampani, A Akula, X He, S Basu, XE Wang, ...
NeurIPS, 2023
Language-Driven Artistic Style Transfer
TJ Fu, XE Wang, WY Wang
ECCV, 2022
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
TJ Fu*, L Li*, Z Gan, K Lin, WY Wang, L Wang, Z Liu
CVPR, 2023
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
TJ Fu, X Wang, S Grafton, M Eckstein, WY Wang
EMNLP (Oral), 2020
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
TJ Fu, WY Wang, D McDuff, Y Song
AAAI, 2022
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
W Zhu, X Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
EACL (Long), 2021
Guiding Instruction-based Image Editing via Multimodal Large Language Models
TJ Fu, W Hu, X Du, WY Wang, Y Yang, Z Gan
ICLR (Spotlight), 2024
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
TJ Fu, L Yu, N Zhang, CY Fu, JC Su, WY Wang, S Bell
CVPR, 2023
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
R Schumann, W Zhu, W Feng, TJ Fu, S Riezler, WY Wang
AAAI, 2024
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
TJ Fu, XE Wang, ST Grafton, MP Eckstein, WY Wang
CVPR, 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
X He, D Yang, W Feng, TJ Fu, A Akula, V Jampani, P Narayana, S Basu, ...
EMNLP (Long), 2022
Speed Reading: Learning to Read ForBackward via Shuttle
TJ Fu, WY Ma
EMNLP (Long), 2018
The system can't perform the operation now. Try again later.
Articles 1–20