Publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2024
- DyVal: Graph-informed Dynamic Evaluation of Large Language ModelsICLR (Spotlight), 2024
- Emotionprompt: Leveraging psychology for large language models enhancement via emotional stimulusICML, 2024
- CompeteAI: Understanding the Competition Behaviors in Large Language Model-based AgentsICML (Oral), 2024
- DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing AgentsIn , 2024
- AgentReview: Exploring Peer Review Dynamics with LLM AgentsIn The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
- PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial PromptsCCS LAMPS Workshop, 2023
- Improving Generalization of Adversarial Training via Robust Critical Fine-TuningIn ICCV, 2023
- A survey on evaluation of large language modelsACM TIST, 2023