Phoenix is an open-source AI observability platform designed for experimentation, evaluation, and troubleshooting. It provides:
- Tracing - Trace your LLM application's runtime using OpenTelemetry-based instrumentation.
- Evaluation - Leverage LLMs to benchmark your application's performance using response and retrieval evals.
- Datasets - Create versioned datasets of examples for experimentation, evaluation, and fine-tuning.
- Experiments - Track and evaluate changes to prompts, LLMs, and retrieval.
Phoenix is vendor and language agnostic with out-of-the-box support for popular frameworks (🦙LlamaIndex, 🦜⛓LangChain, Haystack, 🧩DSPy) and LLM providers (OpenAI, Bedrock, MistralAI, VertexAI, LiteLLM, and more). For details on auto-instrumentation, check out the OpenInference project.
Phoenix runs practically anywhere, including your Jupyter notebook, local machine, containerized deployment, or in the cloud.
Install Phoenix via pip
or conda
pip install arize-phoenix
Phoenix container images are available via Docker Hub and can be deployed using Docker or Kubernetes.
Key Features | Availability |
---|---|
Tracing | ✅ |
Evaluation | ✅ |
Retrieval (RAG) Analysis | ✅ |
Datasets | ✅ |
Fine-Tuning Export | ✅ |
Annotations | ✅ |
Human Feedback | ✅ |
Experiments | ✅ |
Embeddings Analysis | ✅ |
Data Export | ✅ |
REST API | ✅ |
GraphQL API | ✅ |
Data Retention | Customizable |
Authentication | ✅ |
Social Login | ✅ |
RBAC | ✅ |
Projects | ✅ |
Self-Hosting | ✅ |
Jupyter Notebooks | ✅ |
Prompt Playground | ✅ |
Sessions | ✅ |
Prompt Management | Coming soon ⏱️ |
Phoenix is built on top of OpenTelemetry and is vendor, language, and framework agnostic.
Python
Integration | Package | Version Badge |
---|---|---|
OpenAI | openinference-instrumentation-openai |
|
LlamaIndex | openinference-instrumentation-llama-index |
|
DSPy | openinference-instrumentation-dspy |
|
AWS Bedrock | openinference-instrumentation-bedrock |
|
LangChain | openinference-instrumentation-langchain |
|
MistralAI | openinference-instrumentation-mistralai |
|
Guardrails | openinference-instrumentation-guardrails |
|
VertexAI | openinference-instrumentation-vertexai |
|
CrewAI | openinference-instrumentation-crewai |
|
Haystack | openinference-instrumentation-haystack |
|
LiteLLM | openinference-instrumentation-litellm |
|
Groq | openinference-instrumentation-groq |
|
Instructor | openinference-instrumentation-instructor |
|
Anthropic | openinference-instrumentation-anthropic |
Integration | Package | Version Badge |
---|---|---|
OpenAI | @arizeai/openinference-instrumentation-openai |
|
LangChain.js | @arizeai/openinference-instrumentation-langchain |
|
Vercel AI SDK | @arizeai/openinference-vercel |
For details about tracing integrations and example applications, see the OpenInference project.
Join our community to connect with thousands of AI builders.
- 🌍 Join our Slack community.
- 📚 Read our documentation.
- 💡 Ask questions and provide feedback in the #phoenix-support channel.
- 🌟 Leave a star on our GitHub.
- 🐞 Report bugs with GitHub Issues.
- 𝕏 Follow us on 𝕏.
- 🗺️ Check out our roadmap to see where we're heading next.
See the migration guide for a list of breaking changes.
Copyright 2024 Arize AI, Inc. All Rights Reserved.
Portions of this code are patent protected by one or more U.S. Patents. See IP_NOTICE.
This software is licensed under the terms of the Elastic License 2.0 (ELv2). See LICENSE.