A collection of different benchmarks
Allows benchmarking koboldcpp alongside llamacpp using the same GGUF model cache. Downloads models from HuggingFace via stdlib urllib if not already cached by llamacpp. Usage: --backends "llamacpp:auto koboldcpp:auto" |
||
|---|---|---|
| llm | ||
| yocto | ||
| .gitignore | ||