training a new model from nothing #4029
-
The train-text-from-scratch program looks like it should do what I'm looking for, but it needs one of the vocab models under models/. I remember reading somewhere in this repo that models are typically developed in something like PyTorch and then converted to GGUF. Is it possible to develop them using ggml or llama.cpp without building off of one of the vocab models, or even to create a new vocab model?
Replies: 2 comments
-
The `convert.py` script has a `--vocab-only` option, so you can convert for example a HF model to GGUF and only include the metadata. Pretty sure that's also how those vocab-only models were created.

So basically two options: find a model whose vocab/metadata you want to clone and just use that with `--vocab-only`, or build the vocab/metadata from scratch in a format `convert.py` can handle. (Or of course write your own script for a completely new format if you want.)
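For the first option, a vocab-only conversion is just a regular `convert.py` run with the extra flag. A minimal sketch, where the HF model path and output filename are placeholders and the exact flags may differ between llama.cpp versions:

```shell
# Clone only the vocab/metadata from an existing Hugging Face model
# into a GGUF file. path/to/hf-model is a placeholder for a local
# HF checkout; --outfile names the resulting vocab-only GGUF.
python convert.py path/to/hf-model --vocab-only --outfile models/my-vocab.gguf
```

The resulting GGUF can then be passed to train-text-from-scratch in place of one of the bundled vocab models.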