Skip to content

The issue of training the projector (following llava-1.5) #142

@gaowei724

Description

@gaowei724

Hello, I conducted an experiment incorporating Vary-tiny (loading Vary-toy weights) into Internvl7B, but I used llava-558K to train the projector. The final model outputs a lot of irrelevant content, and I suspect the alignment stage failed. I'd like to ask if anyone have tried aligning Vary using only llava-558k, and whether it's necessary for me to use 4M samples from Laion-coco for alignment.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions