Entity-relation extraction as multi-turn question answering X Li, F Yin, Z Sun, X Li, A Yuan, D Chai, M Zhou, J Li arXiv preprint arXiv:1905.05529, 2019 | 467 | 2019 |
Glyce: Glyph-vectors for chinese character representations Y Meng, W Wu, F Wang, X Li, P Nie, F Yin, M Li, Q Han, X Sun, J Li Advances in Neural Information Processing Systems 32, 2019 | 252 | 2019 |
Prompt-driven llm safeguarding via directed representation optimization C Zheng, F Yin, H Zhou, F Meng, J Zhou, KW Chang, M Huang, N Peng arXiv e-prints, arXiv: 2401.18018, 2024 | 117* | 2024 |
Red teaming language model detectors with language models Z Shi*, Y Wang*, F Yin*, X Chen, KW Chang, CJ Hsieh Transactions of the Association for Computational Linguistic 12, 174-189, 2024 | 76 | 2024 |
Cleanclip: Mitigating data poisoning attacks in multimodal contrastive learning H Bansal, N Singhi, Y Yang, F Yin, A Grover, KW Chang Proceedings of the IEEE/CVF International Conference on Computer Vision, 112-123, 2023 | 55 | 2023 |
On the Sensitivity and Stability of Model Interpretations in NLP F Yin, Z Shi, CJ Hsieh, KW Chang ACL 2022, 2021 | 46* | 2021 |
Dynosaur: A dynamic growth paradigm for instruction-tuning data curation D Yin, X Liu, F Yin, M Zhong, H Bansal, J Han, KW Chang arXiv preprint arXiv:2305.14327, 2023 | 37 | 2023 |
On the robustness of language encoders against grammatical errors F Yin, Q Long, T Meng, KW Chang arXiv preprint arXiv:2005.05683, 2020 | 33 | 2020 |
Did you read the instructions? rethinking the effectiveness of task definitions in instruction learning F Yin, J Vig, P Laban, S Joty, C Xiong, CSJ Wu arXiv preprint arXiv:2306.01150, 2023 | 32 | 2023 |
Enhancing large vision language models with self-training on image comprehension Y Deng, P Lu, F Yin, Z Hu, S Shen, Q Gu, JY Zou, KW Chang, W Wang Advances in Neural Information Processing Systems 37, 131369-131397, 2024 | 31 | 2024 |
Active instruction tuning: Improving cross-task generalization by training on prompt sensitive tasks PN Kung, F Yin, D Wu, KW Chang, N Peng arXiv preprint arXiv:2311.00288, 2023 | 26 | 2023 |
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension F Yin, J Srinivasa, KW Chang ICML 2024, 2024 | 15 | 2024 |
Contrastive instruction tuning TL Yan, F Wang, JY Huang, W Zhou, F Yin, A Galstyan, W Yin, M Chen arXiv preprint arXiv:2402.11138, 2024 | 11 | 2024 |
Efficient shapley values estimation by amortization for text classification C Yang, F Yin, H He, KW Chang, X Ma, B Xiang arXiv preprint arXiv:2305.19998, 2023 | 5 | 2023 |
ADDMU: Detection of far-boundary adversarial examples with data and model uncertainty estimation F Yin, Y Li, CJ Hsieh, KW Chang arXiv preprint arXiv:2210.12396, 2022 | 5 | 2022 |
Synchronous faithfulness monitoring for trustworthy retrieval-augmented generation D Wu, JC Gu, F Yin, N Peng, KW Chang arXiv preprint arXiv:2406.13692, 2024 | 3 | 2024 |
Evaluating Human Alignment and Model Faithfulness of LLM Rationale M Fayyaz, F Yin, J Sun, N Peng arXiv preprint arXiv:2407.00219, 2024 | 1 | 2024 |
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller M Cai, Y Zhang, S Zhang, F Yin, D Zhang, D Zou, Y Yue, Z Hu arXiv preprint arXiv:2406.02721, 2024 | 1 | 2024 |
Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation F Yin, Z Wang, I Hsu, J Yan, K Jiang, Y Chen, J Gu, LT Le, KW Chang, ... arXiv preprint arXiv:2503.07826, 2025 | | 2025 |
BingoGuard: LLM Content Moderation Tools with Risk Levels F Yin, P Laban, X Peng, Y Zhou, Y Mao, V Vats, L Ross, D Agarwal, ... arXiv preprint arXiv:2503.06550, 2025 | | 2025 |