Weixin Chen
Title
Cited by
Year
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
Cited by: 118; Year: 2023
Effective Backdoor Defense by Exploiting Sensitivity of Poisoned Samples
W Chen, B Wu, H Wang
Advances in Neural Information Processing Systems (NeurIPS), 2022
Cited by: 43; Year: 2022
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
W Chen, D Song, B Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Cited by: 34; Year: 2023
GRATH: Gradual Self-Truthifying for Large Language Models
W Chen, D Song, B Li
International Conference on Machine Learning (ICML), 2024
Cited by: 1; Year: 2024
Articles 1–4