Word Wrangler

Word Wrangler is a voice-based word guessing game powered by Pipecat and the Gemini Live API. The game is available in two versions: a web-based experience and a phone-based experience. Test your description skills in this AI-powered twist on classic word games!

Game Modes

Web-Based Game

In this version, you provide the words, and an AI player attempts to guess them based on your descriptions.

Try it now: https://word-wrangler.vercel.app

Phone-Based Game

In this three-way conversation, an AI host provides words, you describe them without saying the actual word, and an AI player tries to guess. The host tracks your score and manages game flow.

Try it now: Call +1-929-LLM-GAME (+1-929-556-4263)

Game Rules

Web-based Game

The web app provides words for you to describe
You describe the word WITHOUT saying any part of it
The AI player tries to guess based on your description
The app will automatically check the guesses and keep score
Click "Skip" to advance to the next word
You have 60 seconds to score as many points as possible

Phone Game

The AI host provides a word for you to describe
You describe the word WITHOUT saying any part of it
The AI player tries to guess based on your description
Score points for each correct guess
Use commands like "skip" to get a new word or "repeat" to hear the current word again
You have 120 seconds to score as many points as possible

Architecture

Web Game Architecture

The web game uses a simple linear flow:

Transport Input - Receives audio from the web browser via a Daily WebRTC transport.
RTVIProcessor - RTVI is a standard for client/server communication in a voice AI context. This processor collects server-side information and makes it available to the client. Additionally, the client can send events to the server, which are handled through this processor.
STTMuteFilter - Filters out speech during specific conditions. In this game, the user's initial speech is "muted", ensuring that the bot can deliver the entire initial message without being interrupted.
User Context Aggregator - Aggregates user messages as part of the conversation context.
LLM - The LLM powers the AI player's interactions.
Transport Output - Sends audio back to the browser using the Daily WebRTC transport.
Assistant Context Aggregator - Aggregates assistant messages as part of the conversation context.

Phone Game Architecture

The phone game implements a three-way conversation using Pipecat's parallel pipeline architecture. This design addresses the fundamental challenge of LLMs - they're built for turn-based interactions, while this game requires real-time, multi-participant conversation management.

Conversation Participants

Audio Flow Requirements:

User: Must hear both the Host and Player outputs; must be heard by both Host and Player
Host: Must hear the User and Player inputs; its output must be heard by User but NOT by Player
Player: Must hear only the User inputs; its output must be heard by both User and Host

Technical Implementation

The parallel pipeline pattern allows us to create two isolated processing branches, with controlled audio flow between them:

Transport Input - Receives audio from the phone call (Twilio)
Audio Branch Separation:
- Left Branch (Host Pipeline): ConsumerProcessor → Host LLM → Game State Tracker → TTS → Bot Stop Detector
- Right Branch (Player Pipeline): StartFrame Gate → Player LLM → ProducerProcessor

Host LLM Configuration:

The Host uses Gemini Live API, configured with specific response patterns to handle different input types:

- Correct guess: "Correct! That's [N] points. Your next word is [new word]"
- Incorrect guess: "NO" (filtered out by TTS filter)
- User descriptions: "IGNORE" (filtered out by TTS filter)
- Skip requests: "The new word is [new word]"
- Repeat requests: "Your word is [current word]"

Audio Flow Management:

By default, all input audio flows to both branches, so both LLMs hear the user. To implement the complex routing:

Producer/Consumer Pattern: Captures the Player's output audio and feeds it to the Host
- ProducerProcessor filters TTSAudioRawFrames from the Player
- Transforms them from 24kHz to 16kHz (required by Gemini Live)
- Passes them to the ConsumerProcessor at the top of the Host branch
Text Filtering: The HostResponseTextFilter intercepts the "NO" and "IGNORE" responses
- Prevents TTS vocalization of these responses
- Ensures that only meaningful Host responses are spoken
Host-Player Synchronization:
- BotStoppedSpeakingNotifier detects when the Host finishes speaking
- GameStateTracker parses the streamed text to detect new words and track score
- NewWordNotifier triggers the ResettablePlayerLLM to disconnect and reconnect when a new word is presented
- This reset ensures the Player has no context of previous words or guesses
StartFrameGate: The gate holds the Player's StartFrame until the Host has completed its introduction
- Ensures the Player doesn't start interacting until the game has been properly set up

All processed audio is collected at the end of the Parallel Pipeline and sent via the transport output back to Twilio.

Game State Management

The implementation tracks:

Current words being guessed
Running score (points for correct guesses)
Game duration with automatic timeout

This architecture enables complex interaction patterns that would be difficult to achieve with traditional turn-based conversation models, allowing each AI participant to function effectively in their specific game role.

Run Locally

Web Game

Run the Server

Switch to the server directory:
```
cd server
```

Set up and activate your virtual environment:

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Create an .env file and add your API keys:
```
cp env.example .env
```

Add environment variables for:

DAILY_API_KEY=
DAILY_SAMPLE_ROOM_URL=
GOOGLE_API_KEY=

Run the server:
```
LOCAL_RUN=1 python server.py
```

Run the Client

In a new terminal window, navigate to client:
```
cd client
```
Install dependencies:
```
npm install
```
Create an .env.local file:
```
cp env.example .env.local
```
In .env.local:
- NEXT_PUBLIC_API_BASE_URL=http://localhost:7860 is used for local development. For deployments, either remove this env var or replace with /api.
- AGENT_NAME should be set to the name of your deployed Pipecat agent (e.g., "word-wrangler").
- PIPECAT_CLOUD_API_KEY is used only for deployments to Pipecat Cloud.
Run the app:
```
npm run dev
```
Open http://localhost:3000 in your browser

Phone Game

There are two versions of the phone game:

Local Development (bot_phone_local.py):
- For testing locally before deployment
Deployment (bot_phone_twilio.py):
- Ready for deployment to Pipecat Cloud

Running Locally

Set up and activate your virtual environment:

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Create an .env file in the server directory with your API keys:
```
cd server
cp env.example .env
```

Configure Daily information in your .env:

DAILY_API_KEY=your_daily_api_key
DAILY_SAMPLE_ROOM_URL=your_daily_room_url
GOOGLE_API_KEY=your_google_api_key
GOOGLE_TEST_CREDENTIALS_FILE=path_to_credentials_file

Run the local bot:
```
LOCAL_RUN=1 python bot_phone_local.py
```

Deployment

Web Game

Deploy your Server

You can deploy your server code using Pipecat Cloud. For a full walkthrough, start with the Pipecat Cloud Quickstart.

Here are the steps you'll need to complete:

Build, tag, and push your Docker image to a registry.
Create Pipecat Cloud secrets using the CLI or dashboard. For this agent, you only need a GOOGLE_API_KEY. Your DAILY_API_KEY is automatically applied.
Deploy your agent image. You can use a pcc-deploy.toml file to make deploying easier. For example:

agent_name = "word-wrangler"
image = "your-dockerhub-name/word-wrangler:0.1"
secret_set = "word-wrangler-secrets"
enable_krisp = true

[scaling]
  min_instances = 1
  max_instances = 5

Then, you can deploy with the CLI using pcc deploy.

Finally, confirm that your agent is deployed. You'll get feedback in the terminal.

Deploy your Client

This project uses TypeScript, React, and Next.js, making it a perfect fit for Vercel.

In your client directory, install Vercel's CLI tool: npm install -g vercel
Verify it's installed using vercel --version
Log in your Vercel account using vercel login
Deploy your client to Vercel using vercel

Phone Game

Deploy your Server

Again, we'll use Pipecat Cloud. Follow the steps from above. The only difference will be the secrets required; in addition to a GOOGLE_API_KEY, you'll need GOOGLE_APPLICATION_CREDENTIALS in the format of a .json file with your Google Cloud service account information.

You'll need to modify the Dockerfile so that the credentials.json and word_list.py are accessible. This Dockerfile will work:

FROM dailyco/pipecat-base:latest

COPY ./requirements.txt requirements.txt

RUN pip install --no-cache-dir --upgrade -r requirements.txt

COPY ./word_list.py word_list.py
COPY ./credentials.json credentials.json
COPY ./bot_phone_twilio.py bot.py

Note: Your credentials.json file should have your Google service account credentials.

Buy and Configure a Twilio Number

Check out the Twilio Websocket Telephony guide for a step-by-step walkthrough on how to purchase a phone number, configure your TwiML, and make or receive calls.

Tech stack

Both games are built using:

Pipecat framework for real-time voice conversation
Google's Gemini Live API
Real-time communication (Web via Daily, Phone via Twilio)

The phone game features:

Parallel processing of host and player interactions
State tracking for game progress and scoring
Dynamic word selection from multiple categories
Automated game timing and scoring

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
client		client
images		images
server		server
.prettierrc		.prettierrc
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Word Wrangler

Game Modes

Web-Based Game

Phone-Based Game

Game Rules

Web-based Game

Phone Game

Architecture

Web Game Architecture

Phone Game Architecture

Conversation Participants

Technical Implementation

Game State Management

Run Locally

Web Game

Run the Server

Run the Client

Phone Game

Running Locally

Deployment

Web Game

Deploy your Server

Deploy your Client

Phone Game

Deploy your Server

Buy and Configure a Twilio Number

Tech stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

daily-co/word-wrangler-gemini-live

Folders and files

Latest commit

History

Repository files navigation

Word Wrangler

Game Modes

Web-Based Game

Phone-Based Game

Game Rules

Web-based Game

Phone Game

Architecture

Web Game Architecture

Phone Game Architecture

Conversation Participants

Technical Implementation

Game State Management

Run Locally

Web Game

Run the Server

Run the Client

Phone Game

Running Locally

Deployment

Web Game

Deploy your Server

Deploy your Client

Phone Game

Deploy your Server

Buy and Configure a Twilio Number

Tech stack

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages