
Would you plan to adapt it to qwen2-7B? #13

Open · Nastu-Ho opened this issue Jul 10, 2024 · 3 comments

Comments

@Nastu-Ho

No description provided.

@xiaoachen98 (Owner)

Maybe you can contribute to this part. All you need to do is add a llava_qwen2.py, the corresponding conv_mode, and a preprocess_qwen2 function in the train.py file to handle the corresponding mask.
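For reference, a minimal sketch of what such a `preprocess_qwen2` could look like, following the label-masking pattern of the existing preprocess functions in train.py. This is not the repo's actual code: it assumes LLaVA's `{"from": "human"/"gpt"}` conversation format and Qwen2's ChatML template, uses a generic system prompt, and omits `<image>`-token handling (the real version would need LLaVA's image-token tokenization as well).

```python
from typing import Dict, List

import torch

IGNORE_INDEX = -100  # sentinel LLaVA uses to exclude positions from the loss


def preprocess_qwen2(sources: List[List[Dict]], tokenizer) -> Dict:
    """sources: list of conversations, each a list of
    {"from": "human"|"gpt", "value": str} turns (LLaVA's data format).
    Builds Qwen2 ChatML input_ids and masks everything except the
    assistant responses in the labels."""
    im_start, im_end = "<|im_start|>", "<|im_end|>"
    input_ids, targets = [], []
    for source in sources:
        ids, labels = [], []
        # System prompt (assumed wording): fully masked.
        sys_text = f"{im_start}system\nYou are a helpful assistant.{im_end}\n"
        sys_ids = tokenizer(sys_text, add_special_tokens=False).input_ids
        ids += sys_ids
        labels += [IGNORE_INDEX] * len(sys_ids)
        for turn in source:
            role = "user" if turn["from"] == "human" else "assistant"
            header = f"{im_start}{role}\n"
            body = f"{turn['value']}{im_end}\n"
            header_ids = tokenizer(header, add_special_tokens=False).input_ids
            body_ids = tokenizer(body, add_special_tokens=False).input_ids
            ids += header_ids + body_ids
            if role == "assistant":
                # Learn only the assistant's tokens; the role header stays masked.
                labels += [IGNORE_INDEX] * len(header_ids) + body_ids
            else:
                labels += [IGNORE_INDEX] * (len(header_ids) + len(body_ids))
        input_ids.append(torch.tensor(ids, dtype=torch.long))
        targets.append(torch.tensor(labels, dtype=torch.long))
    return dict(input_ids=input_ids, labels=targets)
```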

@Nastu-Ho (Author)

> Maybe you can contribute to this part. All you need to do is add a llava_qwen2.py, the corresponding conv_mode, and a preprocess_qwen2 function in the train.py file to handle the corresponding mask.

I have tried using Qwen2-7B-Instruct as the LLM, but found that the fine-tuned results were not as expected. It may be that the preprocessing was not done well. I am now trying to find a public solution.

@xiaoachen98 (Owner)

> > Maybe you can contribute to this part. All you need to do is add a llava_qwen2.py, the corresponding conv_mode, and a preprocess_qwen2 function in the train.py file to handle the corresponding mask.
>
> I have tried using Qwen2-7B-Instruct as the LLM, but found that the fine-tuned results were not as expected. It may be that the preprocessing was not done well. I am now trying to find a public solution.

I have conducted experiments on Qwen2-0.5B and everything is fine.
