llama.cpp
Public

LLM inference in C/C++