FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions N Rotstein, D Bensaid, S Brody, R Ganz, R Kimmel Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 58 | 2023 |
Enhancing diffusion-based image synthesis with robust classifier guidance B Kawar, R Ganz, M Elad Transactions on Machine Learning Research, 2022 | 43 | 2022 |
Threat model-agnostic adversarial defense using diffusion models T Blau, R Ganz, B Kawar, A Bronstein, M Elad arXiv preprint arXiv:2207.08089, 2022 | 32 | 2022 |
Multimodal semi-supervised learning for text recognition A Aberdam, R Ganz, S Mazor, R Litman arXiv preprint arXiv:2205.03873, 2022 | 27 | 2022 |
Do Perceptually Aligned Gradients Imply Adversarial Robustness? R Ganz, B Kawar, M Elad Proceedings of the 40th International Conference on Machine Learning 202 …, 2022 | 22* | 2022 |
CLIPTER: Looking at the bigger picture in scene text recognition A Aberdam, D Bensaīd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 21 | 2023 |
Question aware vision transformer for multimodal reasoning R Ganz, Y Kittenplon, A Aberdam, E Ben Avraham, O Nuriel, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 19 | 2024 |
Towards models that can see and read R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 15 | 2023 |
CLIPAG: Towards generator-free text-to-image generation R Ganz, M Elad Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 11 | 2024 |
GRAM: Global reasoning for multi-page VQA T Blau, S Fogel, R Ronen, A Golts, R Ganz, E Ben Avraham, A Aberdam, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 10 | 2024 |
Paint by inpaint: Learning to add image objects by removing them first N Wasserman, N Rotstein, R Ganz, R Kimmel arXiv preprint arXiv:2404.18212, 2024 | 9 | 2024 |
Classifier robustness enhancement via test-time transformation T Blau, R Ganz, C Baskin, M Elad, A Bronstein arXiv preprint arXiv:2303.15409, 2023 | 9 | 2023 |
BIGRoC: Boosting Image Generation via a Robust Classifier R Ganz, M Elad Transactions on Machine Learning Research, 2021 | 8 | 2021 |
Improved Image Generation via Sparse Modeling R Ganz, M Elad ICLR Workshop on Deep Generative Models for Highly Structured Data, 2021 | 2* | 2021 |
Enhancing consistency-based image generation via adversarialy-trained classification and energy-based discrimination S Golan, R Ganz, M Elad arXiv preprint arXiv:2405.16260, 2024 | 1 | 2024 |
DocVLM: Make Your VLM an Efficient Reader MS Nacson, A Aberdam, R Ganz, EB Avraham, A Golts, Y Kittenplon, ... arXiv preprint arXiv:2412.08746, 2024 | | 2024 |
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models J Fhima, EB Avraham, O Nuriel, Y Kittenplon, R Ganz, A Aberdam, ... arXiv preprint arXiv:2411.04642, 2024 | | 2024 |
Text-to-Image Generation Via Energy-Based CLIP R Ganz, M Elad arXiv preprint arXiv:2408.17046, 2024 | | 2024 |
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness M Ehrenberg, R Ganz, N Rosenfeld arXiv preprint arXiv:2406.11458, 2024 | | 2024 |