Inference benchmarks for ChatGLM2-6B, LLaMA2-7b-chat, and Baichuan2-7B-chat.

ispyu85/LLM-benchmark

 
 


LLM-benchmark

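| Input tokens | Throughput* | First-token latency (ms) | Per-token latency (ms) |
|---|---|---|---|
| 32 | | | |
| 64 | | | |
| 128 | | | |
| 256 | | | |
| 512 | | | |
| 1024 | | | |
| 2048 | | | |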
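
The columns above can be read as timings taken over a streamed generation. The sketch below is one way such numbers might be collected; it is not taken from this repository. `measure_latency` and `fake_stream` are hypothetical names, the stream is a stand-in for a real model's streaming generate call, and throughput is assumed here to mean generated tokens per second over the whole request (the table's `throughput*` footnote is not given).

```python
import time
from typing import Callable, Dict, Iterable, Iterator

# Input sizes listed in the table above.
INPUT_LENGTHS = [32, 64, 128, 256, 512, 1024, 2048]


def measure_latency(
    stream: Iterable,
    clock: Callable[[], float] = time.perf_counter,
) -> Dict[str, float]:
    """Time a token stream and return latency figures.

    Assumption: `stream` yields one item per generated token.
    """
    start = clock()
    # Record one timestamp right after each token is produced.
    stamps = [clock() for _ in stream]
    if not stamps:
        raise ValueError("stream produced no tokens")

    # Time until the first token arrives (prefill plus first decode step).
    first_token_ms = (stamps[0] - start) * 1000.0

    # Mean gap between consecutive tokens after the first one.
    decode_steps = len(stamps) - 1
    per_token_ms = (
        (stamps[-1] - stamps[0]) * 1000.0 / decode_steps if decode_steps else 0.0
    )

    # Assumed definition: generated tokens per second for the whole request.
    total_s = stamps[-1] - start
    tokens_per_s = len(stamps) / total_s if total_s > 0 else float("inf")

    return {
        "first_token_ms": first_token_ms,
        "per_token_ms": per_token_ms,
        "tokens_per_s": tokens_per_s,
    }


def fake_stream(prompt_len: int, new_tokens: int = 4) -> Iterator[str]:
    """Hypothetical stand-in for a model's streaming generate call."""
    time.sleep(prompt_len * 1e-6)  # pretend prefill cost grows with prompt length
    for i in range(new_tokens):
        time.sleep(0.001)  # pretend decode step
        yield f"tok{i}"


if __name__ == "__main__":
    for n in INPUT_LENGTHS:
        result = measure_latency(fake_stream(n))
        print(n, {k: round(v, 2) for k, v in result.items()})
```

To benchmark a real model, `fake_stream` would be replaced by an iterator over the tokens the model emits for a prompt of the given length. The `clock` parameter exists so the timing logic can be checked with a deterministic clock.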


Languages

  • Python 99.7%
  • Shell 0.3%