An API-based application that communicates with a large language model over a network through HTTP requests
-
Run app
uvicorn main:app --reload
-
Test the API:
curl -X POST "http://localhost:8000/generate" -H "Content-Type: application/json" -d '{"prompt": "Once upon a time", "max_tokens": 50}'