Publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2024

  1. DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
    Kaijie Zhu, Jindong Wang, Qinlin Zhao, and 2 more authors
    2024
  2. AgentReview: Exploring Peer Review Dynamics with LLM Agents
    Yiqiao Jin, Qinlin Zhao, Yiyang Wang, and 4 more authors
    In The 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

  1. DyVal: Graph-informed Dynamic Evaluation of Large Language Models
    Kaijie Zhu, Jiaao Chen, Jindong Wang, and 3 more authors
    ICLR 2024 (Spotlight, Top 5%), 2023
  2. PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts
    Kaijie Zhu, Jindong Wang, Jiaheng Zhou, and 8 more authors
    CCS 2024 LAMPS Workshop, 2023
  3. Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
    Kaijie Zhu, Xixu Hu, Jindong Wang, and 2 more authors
    In ICCV 2023, 2023
  4. A survey on evaluation of large language models
    Yupeng Chang, Xu Wang, Jindong Wang, and 8 more authors
    arXiv preprint arXiv:2307.03109, 2023
  5. EmotionPrompt: Leveraging psychology for large language models enhancement via emotional stimulus
    Cheng Li, Jindong Wang, Kaijie Zhu, and 4 more authors
    arXiv preprint arXiv:2307.11760, 2023
  6. CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents
    Qinlin Zhao, Jindong Wang, Yixuan Zhang, and 4 more authors
    arXiv preprint arXiv:2310.17512, 2023