Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor O Honovich, T Scialom, O Levy, T Schick
arXiv preprint arXiv:2212.09689, 2022
286 2022 TRUE: Re-evaluating Factual Consistency Evaluation O Honovich, R Aharoni, J Herzig, H Taitelbaum, D Kukliansy, V Cohen, ...
NAACL 2022, 2022
204 2022 : Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question AnsweringO Honovich, L Choshen, R Aharoni, E Neeman, I Szpektor, O Abend
EMNLP 2021, 2021
183 2021 Instruction Induction: From Few Examples to Natural Language Task Descriptions O Honovich, U Shaham, SR Bowman, O Levy
arXiv preprint arXiv:2205.10782, 2022
126 2022 DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering E Neeman, R Aharoni, O Honovich, L Choshen, I Szpektor, O Abend
arXiv preprint arXiv:2211.05655, 2022
52 2022 LMentry: A Language Model Benchmark of Elementary Language Tasks A Efrat, O Honovich, O Levy
arXiv preprint arXiv:2211.02069, 2022
24 2022 Machine reading of historical events O Honovich, LT Hennigen, O Abend, SB Cohen
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
10 2020 A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains A Jacovi, Y Bitton, B Bohnet, J Herzig, O Honovich, M Tseng, M Collins, ...
arXiv preprint arXiv:2402.00559, 2024
8 2024 Surfacing Biases in Large Language Models using Contrastive Input Decoding G Yona, O Honovich, I Laish, R Aharoni
arXiv preprint arXiv:2305.07378, 2023
7 2023 Keep Guessing? When Considering Inference Scaling, Mind the Baselines G Yona, O Honovich, O Levy, R Aharoni
arXiv preprint arXiv:2410.15466, 2024
2024