Call 100+ LLMs using the OpenAI Input/Output Format
- Translate inputs to the provider's `completion`, `embedding`, and `image_generation` endpoints
- Consistent output: text responses will always be available at `['choices'][0]['message']['content']` (see the sketch after this list)
- Retry/fallback logic across multiple deployments (e.g. Azure/OpenAI) - Router (sketched below)
- Track spend & set budgets per project - LiteLLM Proxy Server
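
A minimal sketch of the unified call shape via the Python SDK, assuming the relevant provider API keys are already set in the environment; the model names are illustrative:

```python
from litellm import completion

# Assumes OPENAI_API_KEY / ANTHROPIC_API_KEY are set in the environment
messages = [{"role": "user", "content": "Hello, how are you?"}]

# OpenAI call
response = completion(model="gpt-4o", messages=messages)

# Anthropic call -- same input shape, same output shape
response = completion(model="claude-3-sonnet-20240229", messages=messages)

# Regardless of provider, the text lives at the same path
print(response["choices"][0]["message"]["content"])
```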
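
And a sketch of the Router's retry/fallback across deployments. The Azure deployment name, endpoint, and keys below are placeholder assumptions, not real values:

```python
from litellm import Router

# Two deployments registered under one model_name; the Router load-balances
# across them and retries/falls back if one fails (params are illustrative)
model_list = [
    {
        "model_name": "gpt-4o",  # alias the caller uses
        "litellm_params": {
            "model": "azure/my-gpt4o-deployment",  # hypothetical Azure deployment
            "api_key": "<azure-api-key>",
            "api_base": "https://my-endpoint.openai.azure.com/",
        },
    },
    {
        "model_name": "gpt-4o",
        "litellm_params": {"model": "gpt-4o", "api_key": "<openai-api-key>"},
    },
]

router = Router(model_list=model_list, num_retries=2)
response = router.completion(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response["choices"][0]["message"]["content"])
```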
How to use LiteLLM
You can use litellm through either:
- LiteLLM Proxy Server - a server (LLM Gateway) to call 100+ LLMs, with load balancing and cost tracking across projects (example below)
- LiteLLM Python SDK - a Python client to call 100+ LLMs, with load balancing and cost tracking
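
For the proxy route, one common pattern is to point the standard OpenAI client at the gateway. The base URL, port, and key below are assumptions for a locally running proxy (started e.g. with `litellm --model gpt-4o`):

```python
import openai

# Assumes a LiteLLM Proxy is running locally on its default port (4000);
# the api_key is whatever virtual key the proxy was configured with
client = openai.OpenAI(api_key="sk-1234", base_url="http://0.0.0.0:4000")

response = client.chat.completions.create(
    model="gpt-4o",  # any model the proxy is configured to serve
    messages=[{"role": "user", "content": "Hello, how are you?"}],
)
print(response.choices[0].message.content)
```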
https://github.com/BerriAI/litellm
