Table of Best Results #2755
SilvaRaulEnrique
started this conversation in Ideas
Replies: 2 comments 4 replies
Running on an M2, built from scratch, I get this error: Anyone know why this is happening?
"-gqa used to be needed when loading the Llama 2 70B model, but in the current code with the new GGUF format it is no longer used"
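To illustrate the reply above, a minimal sketch of the difference (the model file names here are placeholders, not from this thread; flag names are from llama.cpp's `main` example at the time):

```shell
# Old GGML workflow: the grouped-query-attention factor had to be supplied by hand
# for Llama 2 70B, e.g.:
#   ./main -m llama-2-70b.ggmlv3.q4_0.bin -gqa 8 -p "Hello"

# GGUF stores that metadata inside the model file, so the flag is simply dropped:
./main -m llama-2-70b.Q4_0.gguf -p "Hello" -n 64
```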
The idea of this post is to collaboratively build and maintain a complete, up-to-date ranking table of the best command lines to compile and run, covering the models that give the best results for very long contexts with fully coherent responses. For example:
For an NVIDIA 3090 (with CUDA and 24 GB of VRAM), the best command line so far (or is there a better one?) is:
```shell
./train-text-from-scratch \
  --vocab-model ServidorIA/models/s3nh/longchat-7b-v1.5-32k.ggmlv3.q8_0.bin \
  --ctx 64 \
  --embd 256 \
  --head 8 \
  --layer 16 \
  --checkpoint-out chk-Ultimas_Acordadas_y_Circulares-256x16.bin \
  --model-out ggml-Ultimas_Acordadas_y_Circulares-256x16-f32.bin \
  --train-data "Ultimas_Acordadas_y_Circulares.txt" \
  -t 4 \
  -b 8 \
  -n 32 \
  --seed 1 \
  --adam-iter 16 \
  --print-details-interval 0 \
  --predict 16 \
  --use-flash \
  --mem-compute 8
```
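As a rough sanity check on how small the `--embd 256 --layer 16` model above actually is, here is a back-of-the-envelope parameter count. The formula is an assumption, not from the post: ~12·d² parameters per LLaMA-style transformer block (attention plus SwiGLU MLP), and a 32000-token vocabulary is assumed for the embedding matrices.

```python
# Rough parameter-count estimate for the train-text-from-scratch settings above.
# Assumptions (not from the post): LLaMA-style blocks at ~12 * embd^2 params each,
# and a 32000-token vocabulary.
EMBD, LAYER, VOCAB = 256, 16, 32000

block_params = 12 * EMBD * EMBD   # per transformer block (attention + MLP)
blocks = LAYER * block_params     # all 16 blocks
embeddings = 2 * VOCAB * EMBD     # input + output embedding matrices
total = blocks + embeddings

print(f"blocks:     {blocks / 1e6:.1f}M")
print(f"embeddings: {embeddings / 1e6:.1f}M")
print(f"total:      {total / 1e6:.1f}M, roughly {total * 4 / 2**20:.0f} MB in f32")
```

At ~29M parameters this is a toy model, which is consistent with the tiny `--ctx 64` used for training it.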
Note: the TheBloke and s3nh models are available on https://huggingface.co/
Are these the best parameters, or is there a better set right now? And what about other hardware configurations?