Comprehensive assessment of toxicity in ChatGPT B Zhang, X Shen, WM Si, Z Sha, Z Chen, A Salem, Y Shen, M Backes, ... arXiv preprint arXiv:2311.14685, 2023 | 4 | 2023 |
Breaking agents: Compromising autonomous llm agents through malfunction amplification B Zhang, Y Tan, Y Shen, A Salem, M Backes, S Zannettou, Y Zhang arXiv preprint arXiv:2407.20859, 2024 | 3 | 2024 |
{SecurityNet}: Assessing Machine Learning Vulnerabilities on Public Models B Zhang, Z Li, Z Yang, X He, M Backes, M Fritz, Y Zhang 33rd USENIX Security Symposium (USENIX Security 24), 3873-3890, 2024 | 3 | 2024 |
A plot is worth a thousand words: model information stealing attacks via scientific plots B Zhang, X He, Y Shen, T Wang, Y Zhang 32nd USENIX Security Symposium (USENIX Security 23), 5289-5306, 2023 | 2 | 2023 |
The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective Y Ma, X Shen, Y Wu, B Zhang, M Backes, Y Zhang Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | | 2024 |