Publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2024
- DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents. 2024
- AgentReview: Exploring Peer Review Dynamics with LLM Agents. In The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
- DyVal: Graph-informed Dynamic Evaluation of Large Language Models. ICLR 2024 (Spotlight, Top 5%), 2023
- PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts. CCS 2024 LAMPS Workshop, 2023
- Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning. In ICCV 2023, 2023
- A survey on evaluation of large language models. arXiv preprint arXiv:2307.03109, 2023
- Emotionprompt: Leveraging psychology for large language models enhancement via emotional stimulus. arXiv preprint arXiv:2307.11760, 2023
- CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents. arXiv preprint arXiv:2310.17512, 2023