PaliGemma2 for image to JSON data extraction
- used google/paligemma2-3b-pt-336 checkpoint; I tried to make it happen with 224, but 336 performed a lot better
- trained on A100 with 40GB VRAM
- trained with LoRA
colab with complete fine-tuning code: https://t.co/M1lbYXQUg6... See more