An Yan
CosRec: 2D convolutional neural networks for sequential recommendation
A Yan, S Cheng, WC Kang, M Wan, J McAuley
Proceedings of the 28th ACM international conference on information and …, 2019
PA3D: Pose-action 3D machine for video recognition
A Yan, Y Wang, Z Li, Y Qiao
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
RadBERT: Adapting transformer-based language models to radiology
A Yan, J McAuley, X Lu, J Du, EY Chang, A Gentili, CN Hsu
Radiology: Artificial Intelligence 4 (4), e210258, 2022
Weakly supervised contrastive learning for chest x-ray report generation
A Yan, Z He, X Lu, J Du, E Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2109.12242, 2021
Multimodal text style transfer for outdoor vision-and-language navigation
W Zhu, XE Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
arXiv preprint arXiv:2007.00229, 2020
Personalized complementary product recommendation
A Yan, C Dong, Y Gao, J Fu, T Zhao, Y Sun, J McAuley
The ACM Web Conference, 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
W Zhu, A Yan, Y Lu, W Xu, XE Wang, M Eckstein, WY Wang
arXiv preprint arXiv:2210.03765, 2022
Learning concise and descriptive attributes for visual recognition
A Yan, Y Wang, Y Zhong, C Dong, Z He, Y Lu, WY Wang, J Shang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cross-lingual vision-language navigation
A Yan, XE Wang, J Feng, L Li, WY Wang
arXiv preprint arXiv:1910.11301, 2019
GPT-4V in Wonderland: Large multimodal models for zero-shot smartphone GUI navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
GPT-4V(ision) as a generalist evaluator for vision-language tasks
X Zhang, Y Lu, W Wang, A Yan, J Yan, L Qin, H Wang, X Yan, WY Wang, ...
arXiv preprint arXiv:2311.01361, 2023
Personalized Showcases: Generating multi-modal explanations for recommendations
A Yan, Z He, J Li, T Zhang, J McAuley
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
L2C: Describing visual differences needs semantic understanding of individuals
A Yan, XE Wang, TJ Fu, WY Wang
arXiv preprint arXiv:2102.01860, 2021
Imagine: An imagination-based automatic evaluation metric for natural language generation
W Zhu, XE Wang, A Yan, M Eckstein, WY Wang
Robust and interpretable medical image classifiers via concept bottleneck models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
CLIP also Understands Text: Prompting CLIP for Phrase Understanding
A Yan, J Li, W Zhu, Y Lu, WY Wang, J McAuley
arXiv preprint arXiv:2210.05836, 2022
"Nothing Abnormal": Disambiguating Medical Reports via Contrastive Knowledge Infusion
Z He, A Yan, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2305.08300, 2023
Semi-supervised Multi-Label Classification with 3D CBAM Resnet for Tuberculosis Cavern Report
X Lu, A Yan, EY Chang, C Hsu, J McAuley, J Du, A Gentili
CLEF2022 Working Notes, CEUR Workshop Proceedings, CEUR-WS.org …, 2022
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
Z He, Y Wang, A Yan, Y Liu, EY Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2310.14088, 2023
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving
J Echterhoff, A Yan, K Han, A Abdelraouf, R Gupta, J McAuley
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024