PaliGemma

Category: Computer Vision
Framework: PyTorch
Dataset: Flickr8k
Created: April 20, 2025

Overview

A from-scratch implementation of PaliGemma in PyTorch.

Technical Details

  • Framework: PyTorch
  • Dataset: Flickr8k
  • Category: Computer Vision

Implementation Details

PaliGemma architecture in PyTorch

I implemented PaliGemma in PyTorch, using the Flickr8k dataset.

PaliGemma: A versatile 3B VLM for transfer
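
As described in the paper above, PaliGemma feeds image patch tokens from a SigLIP vision encoder, followed by the text prompt, into a Gemma decoder using prefix-LM attention: the image and prompt tokens attend to each other in both directions, while the generated suffix is causal. The sketch below illustrates that token layout and mask in plain Python. It is not taken from this repository; the function names `num_image_tokens` and `prefix_lm_mask` are hypothetical, and a PyTorch implementation would more likely build the mask as a boolean tensor.

```python
from typing import List


def num_image_tokens(image_size: int, patch_size: int) -> int:
    """Number of patch tokens a ViT-style encoder emits for a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // patch_size
    return side * side


def prefix_lm_mask(n_image: int, n_prefix: int, n_suffix: int) -> List[List[bool]]:
    """Build a prefix-LM mask; mask[q][k] is True if query q may attend to key k.

    Sequence layout (an assumption for this sketch):
    [image tokens][prompt tokens][suffix tokens]
    """
    n_full = n_image + n_prefix  # image + prompt block, attended bidirectionally
    total = n_full + n_suffix
    mask = []
    for q in range(total):
        row = []
        for k in range(total):
            if k < n_full:
                # Every position can see the whole image + prompt block.
                row.append(True)
            else:
                # Suffix keys are visible only to the same or later positions.
                row.append(q >= k)
        mask.append(row)
    return mask


# A 224x224 image with 14x14 patches gives a 16x16 grid of patch tokens.
tokens_224 = num_image_tokens(224, 14)

# Tiny example: 2 image tokens, 1 prompt token, 2 suffix tokens.
tiny_mask = prefix_lm_mask(n_image=2, n_prefix=1, n_suffix=2)
```

In `tiny_mask`, the first three rows (image and prompt) cannot see the two suffix positions, while each suffix row sees the full prefix plus the suffix tokens up to and including itself.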

Datasets

Flickr8k: Link

Frameworks

PyTorch

Source Code

๐Ÿ“ GitHub Repository: PaliGemma

View the complete implementation, training scripts, and documentation on GitHub.