Follow
Yonatan Bitton
Yonatan Bitton
Research Scientist, Google
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Openflamingo: An open-source framework for training large autoregressive vision-language models
A Awadalla, I Gao, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ...
arXiv preprint arXiv:2308.01390, 2023
2472023
Datacomp: In search of the next generation of multimodal datasets
SY Gadre, G Ilharco, A Fang, J Hayase, G Smyrnis, T Nguyen, R Marten, ...
Advances in Neural Information Processing Systems 36, 2024
1762024
Openflamingo
A Awadalla, I Gao, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ...
Zenodo, March, 2023
43*2023
What you see is what you read? improving text-image alignment evaluation
M Yarom, Y Bitton, S Changpinyo, R Aharoni, J Herzig, O Lang, E Ofek, ...
Advances in Neural Information Processing Systems 36, 2024
322024
Breaking common sense: Whoops! a vision-and-language benchmark of synthetic and compositional images
N Bitton-Guetta, Y Bitton, J Hessel, L Schmidt, Y Elovici, G Stanovsky, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
312023
Automatic generation of contrast sets from scene graphs: Probing the compositional consistency of GQA
Y Bitton, G Stanovsky, R Schwartz, M Elhadad
NAACL 2021, 2021
302021
Visit-bench: A benchmark for vision-language instruction following inspired by real-world use
Y Bitton, H Bansal, J Hessel, R Shao, W Zhu, A Awadalla, J Gardner, ...
arXiv preprint arXiv:2308.06595, 2023
292023
Data efficient masked language modeling for vision and language
Y Bitton, G Stanovsky, M Elhadad, R Schwartz
EMNLP 2021, Findings, 2021
222021
WinoGAViL: Gamified association benchmark to challenge vision-and-language models
Y Bitton, NB Guetta, R Yosef, Y Elovici, M Bansal, G Stanovsky, ...
NeurIPS 2022, Oral, Datasets and Benchmarks, 2022
182022
Irfl: Image recognition of figurative language
R Yosef, Y Bitton, D Shahaf
arXiv preprint arXiv:2303.15445, 2023
102023
VASR: Visual Analogies of Situation Recognition
Y Bitton, R Yosef, E Strugo, D Shahaf, R Schwartz, G Stanovsky
AAAI 2023 (Oral), 2022
102022
Cross-lingual Unified Medical Language System entity linking in online health communities
Y Bitton, R Cohen, T Schifter, E Bachmat, M Elhadad, N Elhadad
Journal of the American Medical Informatics Association 27 (10), 1585-1592, 2020
92020
VideoCon: Robust video-language alignment via contrast captions
H Bansal, Y Bitton, I Szpektor, KW Chang, A Grover
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
42024
q2d: Turning questions into dialogs to teach models how to search
Y Bitton, S Cohen-Ganor, I Hakimi, Y Lewenberg, R Aharoni, E Weinreb
arXiv preprint arXiv:2304.14318, 2023
42023
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
B Gordon, Y Bitton, Y Shafir, R Garg, X Chen, D Lischinski, D Cohen-Or, ...
arXiv preprint arXiv:2312.03766, 2023
32023
VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models
Y Bitton, H Bansal, J Hessel, R Shao, W Zhu, A Awadalla, J Gardner, ...
Advances in Neural Information Processing Systems 36, 2024
22024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
A Jacovi, Y Bitton, B Bohnet, J Herzig, O Honovich, M Tseng, M Collins, ...
arXiv preprint arXiv:2402.00559, 2024
22024
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks
J Bordalo, V Ramos, R Valério, D Glória-Silva, Y Bitton, M Yarom, ...
arXiv preprint arXiv:2405.10122, 2024
12024
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
H Bansal, Y Bitton, M Yarom, I Szpektor, A Grover, KW Chang
arXiv preprint arXiv:2405.04682, 2024
12024
ImageInWords: Unlocking Hyper-Detailed Image Descriptions
R Garg, A Burns, BK Ayan, Y Bitton, C Montgomery, Y Onoe, A Bunner, ...
arXiv preprint arXiv:2405.02793, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20