Follow
Edoardo Debenedetti
Edoardo Debenedetti
PhD student @ ETH Zürich
Verified email at inf.ethz.ch - Homepage
Title
Cited by
Cited by
Year
Robustbench: a standardized adversarial robustness benchmark
F Croce*, M Andriushchenko*, V Sehwag*, E Debenedetti*, N Flammarion, ...
NeurIPS 2021 Datasets and Benchmark Track, 2021
7552021
Jailbreakbench: An open robustness benchmark for jailbreaking large language models
P Chao*, E Debenedetti*, A Robey*, M Andriushchenko*, F Croce, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
762024
A light recipe to train robust vision transformers
E Debenedetti, V Sehwag, P Mittal
IEEE SaTML 2023, 225-253, 2023
632023
Privacy side channels in machine learning systems
E Debenedetti, G Severi, N Carlini, CA Choquette-Choo, M Jagielski, ...
33rd USENIX Security Symposium (USENIX Security 24), 6861-6848, 2024
302024
AI Risk Management Should Incorporate Both Safety and Security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
92024
Evading black-box classifiers without breaking eggs
E Debenedetti, N Carlini, F Tramèr
IEEE SaTML 2024 (Distinguished Paper Award Runner-up), 408-424, 2024
82024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
E Debenedetti*, J Rando*, D Paleka*, SF Florin, D Albastroiu, N Cohen, ...
NeurIPS 2024 Datasets and Benchmark Track (Spotlight), 2024
72024
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
E Debenedetti, J Zhang, M Balunovic, L Beurer-Kellner, M Fischer, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
7*2024
Scaling compute is not all you need for adversarial robustness
E Debenedetti, Z Wan, M Andriushchenko, V Sehwag, K Bhardwaj, ...
ICLR 2024 Workshop on Reliable and Responsible Foundation Models, 2023
72023
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
M Aerni, J Rando, E Debenedetti, N Carlini, D Ippolito, F Tramèr
arXiv preprint arXiv:2411.10242, 2024
2024
Adversarial Search Engine Optimization for Large Language Models
F Nestaas, E Debenedetti, F Tramèr
arXiv preprint arXiv:2406.18382, 2024
2024
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit
J Freeman, C Rippe, E Debenedetti, M Andriushchenko
NeurIPS 2024 Safe Generative AI Workshop, 0
The system can't perform the operation now. Try again later.
Articles 1–12