Julian Michael
Title
Cited by
Year
GLUE: A multi-task benchmark and analysis platform for natural language understanding
A Wang, A Singh, J Michael, F Hill, O Levy, SR Bowman
Proceedings of the 7th annual International Conference on Learning …, 2019
Cited by: 7714; Year: 2019
SuperGLUE: A stickier benchmark for general-purpose language understanding systems
A Wang, Y Pruksachatkun, N Nangia, A Singh, J Michael, F Hill, O Levy, ...
Advances in neural information processing systems 32, 2019
Cited by: 2340; Year: 2019
Supervised open information extraction
G Stanovsky, J Michael, L Zettlemoyer, I Dagan
Proceedings of the 2018 Conference of the North American Chapter of the …, 2018
Cited by: 371; Year: 2018
Language models don't always say what they think: unfaithful explanations in chain-of-thought prompting
M Turpin, J Michael, E Perez, S Bowman
Advances in Neural Information Processing Systems 36, 2024
Cited by: 282; Year: 2024
AmbigQA: Answering ambiguous open-domain questions
S Min, J Michael, H Hajishirzi, L Zettlemoyer
arXiv preprint arXiv:2004.10645, 2020
Cited by: 250; Year: 2020
GPQA: A graduate-level Google-proof Q&A benchmark
D Rein, BL Hou, AC Stickland, J Petty, RY Pang, J Dirani, J Michael, ...
arXiv preprint arXiv:2311.12022, 2023
Cited by: 141; Year: 2023
Large-scale QA-SRL parsing
N FitzGerald, J Michael, L He, L Zettlemoyer
Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018
Cited by: 118; Year: 2018
Crowdsourcing question-answer meaning representations
J Michael, G Stanovsky, L He, I Dagan, L Zettlemoyer
Proceedings of the 16th Annual Conference of the North American Chapter of …, 2018
Cited by: 90; Year: 2018
Prompting contrastive explanations for commonsense reasoning tasks
B Paranjape, J Michael, M Ghazvininejad, L Zettlemoyer, H Hajishirzi
arXiv preprint arXiv:2106.06823, 2021
Cited by: 75; Year: 2021
We're afraid language models aren't modeling ambiguity
A Liu, Z Wu, J Michael, A Suhr, P West, A Koller, S Swayamdipta, ...
arXiv preprint arXiv:2304.14399, 2023
Cited by: 60; Year: 2023
The Winograd schema challenge and reasoning about correlation
D Bailey, AJ Harrison, Y Lierler, V Lifschitz, J Michael
2015 AAAI Spring Symposium Series, 2015
Cited by: 60; Year: 2015
Controlled crowdsourcing for high-quality QA-SRL annotation
P Roit, A Klein, D Stepanov, J Mamou, J Michael, G Stanovsky, ...
arXiv preprint arXiv:1911.03243, 2019
Cited by: 54; Year: 2019
kNN-Prompt: Nearest Neighbor Zero-Shot Inference
W Shi, J Michael, S Gururangan, L Zettlemoyer
arXiv preprint arXiv:2205.13792, 2022
Cited by: 48; Year: 2022
Asking without telling: Exploring latent ontologies in contextual representations
J Michael, JA Botha, I Tenney
arXiv preprint arXiv:2004.14513, 2020
Cited by: 44; Year: 2020
Human-in-the-loop parsing
L He, J Michael, M Lewis, L Zettlemoyer
Proceedings of the 2016 Conference on Empirical Methods in Natural Language …, 2016
Cited by: 39; Year: 2016
Asking it all: Generating contextualized questions for any semantic role
V Pyatkin, P Roit, J Michael, R Tsarfaty, Y Goldberg, I Dagan
arXiv preprint arXiv:2109.04832, 2021
Cited by: 38; Year: 2021
What do NLP researchers believe? Results of the NLP community metasurvey
J Michael, A Holtzman, A Parrish, A Mueller, A Wang, A Chen, D Madaan, ...
arXiv preprint arXiv:2208.12852, 2022
Cited by: 31; Year: 2022
Debate helps supervise unreliable experts
J Michael, S Mahdi, D Rein, J Petty, J Dirani, V Padmakumar, ...
arXiv preprint arXiv:2311.08702, 2023
Cited by: 26; Year: 2023
Eliciting language model behaviors using reverse language models
J Pfau, A Infanger, A Sheshadri, A Panda, J Michael, C Huebner
Socially Responsible Language Modelling Research, 2023
Cited by: 7; Year: 2023
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
J Chua, E Rees, H Batra, SR Bowman, J Michael, E Perez, M Turpin
arXiv preprint arXiv:2403.05518, 2024
Cited by: 6; Year: 2024