
[Usage] Llava-Gemma Pretraining + Fine tuning Usage issue and missing Fine tuned projector.bin #1548

Open
nlpkiddo-2001 opened this issue Jun 7, 2024 · 1 comment

Comments

@nlpkiddo-2001

Describe the issue

Issue:
I first pretrained the projector using the CLIP + Gemma model and then fine-tuned Gemma and the projector, but no matter what, it gives incorrect outputs. The loss hovers around 1-2 during projector pretraining and 0.4-0.7 during fine-tuning. I also tried without LoRA.

Screenshots:
[Screenshot, 2024-06-07 9:23 PM]

Kindly assist me. I have a similar setup for Gemma to the one in PR #1247.
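
For reference, here is a minimal sketch of how the projector weights could be pulled out of the fine-tuned checkpoint into a standalone file, assuming the checkpoint is a standard PyTorch state dict and the projector parameters are keyed with "mm_projector" (the paths and key pattern below are illustrative and may differ in your setup):

```python
# Hypothetical sketch: extract the multimodal projector weights from a
# fine-tuned checkpoint and save them as a standalone projector file.
# Assumes a plain PyTorch state dict and "mm_projector" in the key names;
# inspect ckpt.keys() to confirm for your checkpoint.
import torch

ckpt = torch.load(
    "checkpoints/llava-gemma-finetune/pytorch_model.bin",
    map_location="cpu",
)

# Keep only the projector parameters.
projector = {k: v for k, v in ckpt.items() if "mm_projector" in k}

if not projector:
    raise ValueError("No mm_projector keys found; check ckpt.keys()")

torch.save(projector, "checkpoints/llava-gemma-finetune/mm_projector.bin")
```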

Screenshot of fine-tuning from wandb:
[Screenshot, 2024-06-07 9:25 PM]

@nlpkiddo-2001 nlpkiddo-2001 changed the title [Usage] Llava-Gemma Pretraining + Fine tuning Usage issue [Usage] Llava-Gemma Pretraining + Fine tuning Usage issue and missing Fine tuned projector.bin Jun 8, 2024
@shan23chen

Is this first-gen Gemma or second?
