Serving Llama model via FastAPI for AI-driven responses