TrustLLM: Trustworthiness in Large Language Models Y Huang, L Sun, H Wang, S Wu, Q Zhang, C Gao, Y Huang, W Lyu, ... ICML 2024, 2024 | 285* | 2024 |
A review on background, technology, limitations, and opportunities of large vision models Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen, Z Yuan, Y Huang, H Sun, ... arXiv preprint arXiv:2402.17177, 2024 | 227* | 2024 |
DeID-GPT: Zero-shot Medical Text De-identification by GPT-4 Z Liu, X Yu, L Zhang, Y Huang, Z Wu, C Cao, H Dai, L Zhao, W Liu, ... arXiv preprint, 2023 | 175* | 2023 |
TrustGPT: A Benchmark for Trustworthy and Responsible Large Language Models Y Huang, Q Zhang, PS Yu, L Sun arXiv preprint arXiv:2306.11507, 2023 | 69 | 2023 |
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use Y Huang, J Shi, Y Li, C Fan, S Wu, Q Zhang, Y Liu, P Zhou, Y Wan, ... ICLR 2024, 2024 | 63 | 2024 |
AlignBench: Benchmarking Chinese Alignment of Large Language Models X Liu, X Lei, S Wang, Y Huang, Z Feng, B Wen, J Cheng, P Ke, Y Xu, ... Main Conference of ACL 2024, 2024 | 44 | 2024 |
From Creation to Clarification: ChatGPT's Journey Through the Fake News Quagmire Y Huang, K Shu, PS Yu, L Sun WWW 2024, 2023 | 40* | 2023 |
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? C Gao, D Chen, Q Zhang, Y Huang, Y Wan, L Sun Findings of NAACL 2024, 2024 | 27* | 2024 |
GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents D Chen*, Y Huang*, S Wu, J Tang, L Chen, Y Bai, Z He, C Wang, H Zhou, ... arXiv preprint arXiv:2406.10819, 2024 | 20 | 2024 |
Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models Y Li, Y Huang, H Wang, X Zhang, J Zou, L Sun arXiv, 2024 | 18* | 2024 |
Optimization-based Prompt Injection Attack to LLM-as-a-Judge J Shi, Z Yuan, Y Liu, Y Huang, P Zhou, L Sun, NZ Gong ACM CCS 2024, 2024 | 18 | 2024 |
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge J Ye*, Y Wang*, Y Huang*, D Chen, Q Zhang, N Moniz, T Gao, W Geyer, ... arXiv preprint arXiv:2410.02736, 2024 | 12 | 2024 |
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models S Wu*, Y Huang*, C Gao, D Chen, Q Zhang, Y Wan, T Zhou, X Zhang, ... arXiv preprint arXiv:2406.18966, 2024 | 10 | 2024 |
ObscurePrompt: Jailbreaking Large Language Models via Obscure Input Y Huang, J Tang, D Chen, B Tang, Y Wan, L Sun, X Zhang arXiv preprint arXiv:2406.13662, 2024 | 9 | 2024 |
HonestLLM: Toward an Honest and Helpful Large Language Model C Gao*, S Wu*, Y Huang*, D Chen*, Q Zhang*, Z Fu, Y Wan, L Sun, ... NeurIPS 2024, 2024 | 6* | 2024 |
Can Large Language Models Automatically Jailbreak GPT-4V? Y Wu, Y Huang, Y Liu, X Li, P Zhou, L Sun TrustNLP@NAACL 2024, 2024 | 6 | 2024 |
Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations? Y Huang, Z Yuan, Y Zhou, K Guo, X Wang, H Zhuang, W Sun, L Sun, ... arXiv preprint arXiv:2410.23426, 2024 | 4 | 2024 |
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? H Bao*, Y Huang*, Y Wang*, J Ye*, X Wang, X Chen, M Elhoseiny, ... arXiv preprint arXiv:2410.21259, 2024 | 3 | 2024 |
1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? Y Huang, C Fan, Y Li, S Wu, T Zhou, X Zhang, L Sun Main Conference of EMNLP 2024, 2024 | 1 | 2024 |
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment D Chen, R Chen, S Pu, Z Liu, Y Wu, C Chen, B Liu, Y Huang, Y Wan, ... arXiv preprint arXiv:2411.17188, 2024 | | 2024 |