ispyu85 / LLM-benchmark Public

forked from daixd5520/LLM-benchmark

Notifications You must be signed in to change notification settings
Fork 0
Star 0

test model inference benchmark（ChatGLM2-6B，LLaMA2-7b-chat，Baichuan2-7B-chat）

0 stars 2 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Baichuan2-7b-chat		Baichuan2-7b-chat
ChatGLM2-6B		ChatGLM2-6B
LLaMA2-7B-chat		LLaMA2-7B-chat
README.md		README.md

Repository files navigation

LLM-benchmark

test model inference benchmark（ChatGLM2-6B，LLaMA2-7b-chat，Baichuan2-7B-chat）

input token	throughput*	first token耗时（ms）	one token耗时（ms）｜
32
64
128
256
512
1024
2048

About

test model inference benchmark（ChatGLM2-6B，LLaMA2-7b-chat，Baichuan2-7B-chat）

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 99.7%
Shell 0.3%