Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2024

  1. dyval.jpg
    DyVal: Graph-informed Dynamic Evaluation of Large Language Models
    Kaijie Zhu, Jiaao Chen, Jindong Wang, and 3 more authors
    ICLR (Spotlight), 2024
  2. Emotionprompt: Leveraging psychology for large language models enhancement via emotional stimulus
    Cheng Li, Jindong Wang, Kaijie Zhu, and 4 more authors
    ICML, 2024
  3. CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents
    Qinlin Zhao, Jindong Wang, Yixuan Zhang, and 4 more authors
    ICML (Oral), 2024
  4. dyval2.jpg
    DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
    Kaijie Zhu, Jindong Wang, Qinlin Zhao, and 2 more authors
    In , 2024
  5. AgentReview: Exploring Peer Review Dynamics with LLM Agents
    Yiqiao Jin, Qinlin Zhao, Yiyang Wang, and 4 more authors
    In The 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

  1. promptbench.jpg
    PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts
    Kaijie Zhu, Jindong Wang, Jiaheng Zhou, and 8 more authors
    CCS LAMPS Workshop, 2023
  2. rift.jpg
    Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
    Kaijie Zhu, Xixu Hu, Jindong Wang, and 2 more authors
    In ICCV, 2023
  3. A survey on evaluation of large language models
    Yupeng Chang, Xu Wang, Jindong Wang, and 8 more authors
    ACM TIST, 2023