Skip to content

livekit-examples/multimodal-agent-python

Repository files navigation

LiveKit logo

Python Multimodal Voice Agent

Deploy a sandbox appLiveKit Agents DocsLiveKit CloudBlog

A basic example of a multimodal voice agent using LiveKit and the Python Agents Framework.

Dev Setup

Clone the repository and install dependencies to a virtual environment:

cd multimodal-agent-python
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Set up the environment by copying .env.example to .env.local and filling in the required values:

  • LIVEKIT_URL
  • LIVEKIT_API_KEY
  • LIVEKIT_API_SECRET
  • OPENAI_API_KEY

You can also do this automatically using the LiveKit CLI:

lk app env

Run the agent:

python3 agent.py dev

This agent requires a frontend application to communicate with. You can use one of our example frontends in livekit-examples, create your own following one of our client quickstarts, or test instantly against one of our hosted Sandbox frontends.