
Personalization (Digital Twin)

A.I. Name
A.I. Name
A.I. Picture URL
A.I. Picture URL
- Click the avatar to launch the picture picker (file dialog or drag-and-drop)
- Hover the avatar to see the upload prompt overlay with the camera icon
- Remove the current picture using the small
xoverlay that appears on hover - A small helper line “Click the picture to upload a new one” appears beside the avatar
| Property | Recommended |
|---|---|
| Aspect ratio | Square (1:1) |
| Minimum size | 256 × 256 px |
| Optimal size | 512 × 512 px or 1024 × 1024 px for retina displays |
| Formats | PNG, JPEG, WebP |
| Max file size | 5 MB |
| Shape | Circular crop applied automatically; design with the centre in mind |
A.I. Animated Picture URL (GIF)
A.I. Animated Picture URL (GIF)
A.I. Background URL (Large Picture)
A.I. Background URL (Large Picture)
16:9 crop, so start from a landscape image at least 1920 px wide.A.I. Dark Mode Background URL (Large Picture)
A.I. Dark Mode Background URL (Large Picture)
16:9 crop). If left blank, the default dark mode background is used.About this Digital Twin
About this Digital Twin
Use About for Welcome Screen
Use About for Welcome Screen
Guest UI (Minimum Screen)
Guest UI (Minimum Screen)
Disable Audio Notes for Users
Disable Audio Notes for Users
Compact History
Compact History
Digital Twin Instructions
Digital Twin Instructions
UI CSS
UI CSS
Config Key (Branding)
Config Key (Branding)
AI Models
Select which AI models power each capability of your Digital Twin. Each dropdown lists the models available to your instance from the system-level model catalog.Conversation
Conversation
Image Analysis
Image Analysis
Image Generation
Image Generation
Embeddings Generation
Embeddings Generation
Audio Transcription
Audio Transcription
Text to Speech
Text to Speech
Documents Summarization
Documents Summarization
Convo (Real Time Speech-to-Speech)
Convo (Real Time Speech-to-Speech)
| Provider | Voice Catalog | Tool Calling | Avatar | Transport |
|---|---|---|---|---|
| OpenAI GPT-Realtime | 10 native voices | Yes (full Pria tools + MCP) | No | WebRTC |
| Google Gemini Live | 8 native voices | Yes (native function calling) | No | WebSocket |
| xAI Voice | 5 native voices | Yes | No | WebSocket |
| ElevenLabs | Custom voice clones from your library | Yes (Pria as Custom LLM) | No | WebRTC or WebSocket |
| Anam Avatar | Voice + photorealistic avatar | Yes (Pria as Custom LLM) | Yes (photorealistic) | WebRTC |
| LemonSlice | Voice + animated avatar | Yes | Yes (animated) | WebRTC |
- ElevenLabs — Agent ID and API Key
- Anam — Avatar ID, Voice ID, and API Key
- LemonSlice — Agent ID, API Key (placeholder image / loading video / intro message configured below)
- xAI — xAI API Key
- Gemini Live — Gemini API Key
- OpenAI GPT-Realtime — Uses the institution OpenAI API key (no separate setup)
Moderation
Moderation
Convo (Real-Time Speech-to-Speech)
Configure voice conversation settings for real-time speech-to-speech interactions. These settings are visible when Enable Convo is turned on in the Configuration tab.Speech-to-Speech Model
Speech-to-Speech Model
- OpenAI GPT-Realtime:
gpt-realtime-1.5(default),gpt-realtime,gpt-realtime-mini - Google Gemini Live:
gemini-3.1-flash-live-preview(default),gemini-2.5-flash-native-audio-preview-12-2025 - ElevenLabs: Uses ElevenLabs Conversational AI (requires Agent ID and API Key in Configuration tab)
Voice Selection
Voice Selection
Voice Activity Detection (VAD) Eagerness
Voice Activity Detection (VAD) Eagerness
- Low: Waits longer before considering a pause as end-of-turn. Best for thoughtful conversations.
- Medium: Balanced detection (default).
- High: Responds quickly to short pauses. Best for fast-paced interactions.
Noise Reduction
Noise Reduction
- Off: No noise reduction applied.
- Near Field: Optimized for close-range microphones (headsets, laptop mics).
- Far Field: Optimized for distant microphones (room setups, speakerphones).
Transcription Language
Transcription Language
en for English, es for Spanish, fr for French). This helps the transcription model produce more accurate results for non-English conversations.ElevenLabs Connection Method
ElevenLabs Connection Method
- WebRTC (default): Low-latency peer-to-peer connection. Best for most use cases.
- WebSocket: Server-relayed connection. Use when WebRTC is blocked by firewalls.
Voice Tuning Reference
A condensed view of how voice behaviour is shaped across the three universal knobs. Each provider mixes these signals with its own defaults — values below describe Pria’s mapping.| Knob | Effect | Recommended Default | When to Change |
|---|---|---|---|
| VAD Eagerness | How quickly the system decides the user finished speaking. Low waits for long pauses, high cuts in on micro-pauses. | Medium | Switch to Low when users tend to pause mid-thought (research, tutoring); High for fast back-and-forth (Q&A). |
| Noise Reduction | Server-side filter applied to inbound audio before transcription. | Near Field for headsets, Off for studio mics | Far Field is only useful for room-scale microphones — over-filters small mics. |
| Transcription Language | ISO 639-1 language hint passed to the STT model. | Empty (auto-detect) | Pin to a language code when the same user code-switches and the auto-detector is unreliable. |
Avatar Customization (Anam & LemonSlice)
When the voice provider has an avatar (Anam Avatar or LemonSlice), additional fields appear so you can brand the in-call experience.Placeholder Image
Placeholder Image
anam_placeholder_image_url, lemonslice_placeholder_image_urlThe static picture shown before the avatar connects and after the call ends. Keep this on-brand and matching the Digital Twin’s identity — most users see this longer than the live avatar.Recommended: a square (1:1) or 4:3 image, at least 512 × 512 px, hosted on a CDN or your media library.Loading Video
Loading Video
anam_loading_video_url, lemonslice_loading_video_urlShort, looping MP4 shown while the avatar provider warms up the WebRTC connection (typically 1–3 seconds). A subtle idle animation (gentle breathing, blinking) feels more alive than a frozen frame.Recommended: 3–8 second loop, square 1:1, ≤ 5 MB, H.264 / .mp4.Intro Message
Intro Message
anam_intro_message, lemonslice_intro_messageUp to 500 characters of text the avatar speaks first when the call connects, before the user says anything. Use this to set context (“Hi, I’m Pria — I can help you find course materials, run searches, and answer questions about the syllabus”).If left blank, the avatar waits silently for the user to speak.Allow Imagine Prompts (LemonSlice)
Allow Imagine Prompts (LemonSlice)
lemonslice_allow_imagine Default: OnLemonSlice supports user-generated “imagine” prompts during a call — the avatar can show generated imagery in real time. Turn this off for instances where you want the visual to stay strictly on-brand (no user-controlled content).Anam Conversation Model Override
Anam Conversation Model Override
anam_conversation_modelOptional per-institution conversation-model override applied only to Anam voice turns. Empty = fall back to the institution conversation model (assistant model still wins when set). Useful when the avatar should run on a faster, cheaper model than the in-browser experience.Picture Upload UX (Universal)
Every avatar in Pria — Digital Twin avatar, user profile picture, assistant avatar, institution logo — is clickable to upload. There is no separate “Upload” or “Change picture” button. The pattern:- Hover the avatar to see the camera-icon overlay and the prompt copy
- Click the avatar to launch the file picker (or drag-and-drop a file onto the avatar)
- Hover and click the small
xin the corner to remove the current picture without uploading a new one - A small helper line beside the avatar reads “Click the picture to upload a new one”
Inference Settings
These settings control how the AI models generate responses across all conversations in this instance.Conversation Completion Max Tokens
Conversation Completion Max Tokens
- Unspecified (LLM Default) — Let the model determine the optimal response length (recommended)
- Auto — System-managed token allocation
- OpenAI values — 1,024 to 65,536 tokens
- Anthropic values — 1,000 to 64,000 tokens
Reasoning Effort
Reasoning Effort
- None — Disable thinking (fastest, lowest cost)
- Low — Minimal reasoning
- Medium — Balanced reasoning
- High — Thorough reasoning
- Max — Maximum reasoning depth (highest latency and cost)
Extended Context (1M tokens)
Extended Context (1M tokens)
Related
- Crafting Digital Twin Instructions — Frameworks and best practices for writing effective system prompts
- AI Models — Provider catalog, reasoning effort, prompt caching, BYOM
- Realtime Voice & Avatars — Provider catalog (OpenAI, Gemini Live, xAI, ElevenLabs, Anam, LemonSlice), voice tuning, avatar setup
- Knowledge & RAG Configuration — Embeddings selection, chunking, KAG fusion, retrieval display
- UI Customization — Full CSS template reference for branding your Digital Twin
- Onboarding Questions — Use guided questions to help generate your Digital Twin instructions
- Configuration — Instance-level settings (credits, status, Convo, moderation)