📝 Publications

(*) denotes equal contribution

ICML 2025 MAS

The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets

Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei

MAS - ICML 2025 Workshop

Project | Code | Paper

ICML 2025 MAS

Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning

Junchi Yao, Jianhua Xu, Tianyu Xin, Ziyi Wang, Shenzhe Zhu, Shu Yang, Di Wang

MAS - ICML 2025 Workshop

Paper

arXiv 2025

JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model

Yi Nian*, Shenzhe Zhu*, Yuehan Qin, Li Li, Ziyi Wang, Chaowei Xiao, Yue Zhao

Code | Paper

ACL 2025 Findings

Fraud-R1: A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Shu Yang*, Shenzhe Zhu*, Zeyu Wu, Keyu Wang, Junchi Yao, Junchao Wu, Lijie Hu, Mengdi Li, Derek F. Wong, Di Wang

Findings of the Association for Computational Linguistics, 2025

Project | Code | Paper

arXiv 2024

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

Shuo Xing, Hongyuan Hua, Xiangbo Gao, Shenzhe Zhu, Renjie Li, Kexin Tian, Xiaopeng Li, Heng Huang, Tianbao Yang, Zhangyang Wang, Yang Zhou, Huaxiu Yao, Zhengzhong Tu

Project | Code | Paper

NeurIPS 2024 LanGame

Exploring the Personality Traits of LLMs through Latent Features Steering

Shu Yang*, Shenzhe Zhu*, Liang Liu, Lijie Hu, Mengdi Li, Di Wang

Language Gamification - NeurIPS 2024 Workshop

Code | Paper

Other Papers