Follow
Marzieh Fadaee
Marzieh Fadaee
Senior Research Scientist, Cohere For AI
Verified email at cohere.com - Homepage
Title
Cited by
Cited by
Year
Data Augmentation for Low-Resource Neural Machine Translation
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
6162017
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ...
arXiv preprint arXiv:2108.13897, 2021
922021
Back-translation sampling by targeting difficult words in neural machine translation
M Fadaee, C Monz
arXiv preprint arXiv:1808.09006, 2018
892018
Inpars: Data augmentation for information retrieval using large language models
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
arXiv preprint arXiv:2202.05144, 2022
862022
Inpars-v2: Large language models as efficient dataset generators for information retrieval
V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ...
arXiv preprint arXiv:2301.01820, 2023
792023
Aya model: An instruction finetuned open-access multilingual language model
A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ...
arXiv preprint arXiv:2402.07827, 2024
762024
Inpars: Unsupervised dataset generation for information retrieval
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
752022
When less is more: Investigating data pruning for pretraining llms at scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
552023
Aya dataset: An open-access collection for multilingual instruction tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
502024
Back to basics: Revisiting reinforce style optimization for learning from human feedback in llms
A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, O Pietquin, ...
arXiv preprint arXiv:2402.14740, 2024
452024
Examining the tip of the iceberg: A data set for idiom translation
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1802.04681, 2018
402018
Aya 23: Open weight releases to further multilingual progress
V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ...
arXiv preprint arXiv:2405.15032, 2024
332024
No parameter left behind: How distillation and model size affect zero-shot retrieval
GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2206.02873, 2022
312022
In defense of cross-encoders for zero-shot retrieval
G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2212.06121, 2022
202022
Learning Topic-Sensitive Word Representations
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
202017
Elo uncovered: Robustness and best practices in language model evaluation
M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee
arXiv preprint arXiv:2311.17295, 2023
192023
The unreasonable volatility of neural machine translation models
M Fadaee, C Monz
arXiv preprint arXiv:2005.12398, 2020
182020
Data augmentation for low-resource neural machine translation. arXiv 2017
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1705.00440, 0
14
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm
Aakanksha, A Ahmadian, B Ermis, S Goldfarb-Tarrant, J Kreutzer, ...
arXiv preprint arXiv:2406.18682, 2024
11*2024
A New Neural Search and Insights Platform for Navigating and Organizing AI Research
M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ...
arXiv preprint arXiv:2011.00061, 2020
92020
The system can't perform the operation now. Try again later.
Articles 1–20