Live
Script Runtime Estimator
Paste any script - straight voiceover narration, a film script with scene directions, or a mix of both - and Tony returns an estimated runtime. For VO-only scripts, Tony calculates at standard narration pace (~130 words/min) and flags if the pace should be adjusted (slow/dramatic vs. fast/energetic). For full film scripts, Tony strips stage directions and action lines, estimates dialogue delivery time separately, and adds scene transition buffers. Returns total runtime + a per-section breakdown so artists know exactly how long each animation segment needs to be before they start building.
Example A - VO script: "Here's the narration for the Guardian Decom explainer - how long will it run at normal pace?"
Tony returns: ~2 min 10 sec · 283 words · Recommend pacing: moderate. Breakdown by paragraph.
Example B - Film script: Paste a multi-scene script with action lines and dialogue.
Tony returns: Scene-by-scene runtime + total, with notes on which sections are dialogue-heavy vs. action-driven.
Live
Text-to-Speech / Voiceover Generation
Email Tony a script and describe the voice style you want. Tony generates an audio file using ElevenLabs and emails it back ready to use.
Subject: Voiceover request
Body: Please read the following in a calm, professional male voice: [your script]
Live
Sound Effects Generation
Describe any sound effect and Tony will generate and send back an audio file. Useful for video production, presentations, or demos.
Subject: Sound effect request
Body: Generate: cinematic whoosh transition, 2 seconds
Live
AI Music Generation
Describe the style, mood, or feel of music you need and Tony will generate an MP3 and email it back. Good for concept video background tracks, client presentations, or demos. Powered by ElevenLabs.
Subject: Generate music
Body: Upbeat cinematic corporate background music, motivational and modern, about 30 seconds
Or: Tense industrial underscore for an oil and gas safety video
Live
AI Video Generation (Veo)
Generate short AI video clips from a text description using Google Veo. Default model is Veo 2 (standard quality, lowest cost). To request a higher quality model, include a reason - it will be flagged to Matt for awareness. Video is emailed back to you as an attachment.
Subject: Generate video [Project Name]
Body: A cinematic aerial shot of a modern office building at sunrise, photorealistic
To upgrade: Add "Use Veo 3 because: client presentation next week"
Live
Kling Start/End Frame Video Workflow
Send Tony high-res start + end frames (or a multi-image reference set) plus the creative brief. Tony packages the shot for Kling, tracks the render, and drops the MP4 + thumbnails into artifacts/kling/[project] with ready-to-paste storyboard notes.
Subject: Kling video - AcmeWell Shot 05
Attachments / Links: start.png, end.png (or a Dropbox folder with numbered frames).
Body: "4s @ 24fps, 16:9. Camera push-in over drone footage. Prompt: sunrise flare over offshore rig, stylized energy bloom. Need 2 variations."
Tony replies: Dropbox link, MP4 download, thumbnails, and board-ready caption as soon as Kling finishes (usually 2-5 minutes).
Live
Runway ML — Text/Image to Video (Gen 4.5 / Seedance 2.0)
Generate high-quality AI video from a text prompt or reference image using Runway ML's Gen 4.5 (flagship quality) or Seedance 2.0 (multi-modal). Specify duration, ratio, and style. Video is returned as a download link.
Ask Tony: "Generate a 5-second Runway video of a medical device rotating in a clean studio environment, 16:9, Gen 4.5"
Tony replies: Download link to the generated MP4.
Live
Runway ML — Image Generation (Gen 4 Image / GPT Image 2)
Generate still images via Runway's image API. Supports Gen 4 Image, Gen 4 Image Turbo, GPT Image 2, Gemini Image 3 Pro, and Gemini 2.5 Flash. Use for concept art, storyboard frames, product mockups, and reference images.
Ask Tony: "Generate a storyboard frame using Runway Gen 4 Image: a surgeon reviewing a 3D hologram of a heart, cinematic lighting"
Tony replies: Image attachment or Dropbox link.
Live
Runway ML — Voice Dubbing, Isolation & Speech-to-Speech
Powered by ElevenLabs via Runway API. Clean background noise from audio, dub video into another language, or convert one voice to another style. Useful for client deliverables, localization, and audio post-production.
Ask Tony: "Clean the background noise from this voiceover recording" or "Dub this English video into Spanish"
Tony replies: Processed audio/video file.
Live
Runway ML — Image Upscaling (Magnific v2)
Upscale any image to higher resolution using Magnific Precision Upscaler v2 via Runway. Good for client deliverables, print-ready assets, and improving low-res reference images.
Ask Tony: "Upscale this image using Runway Magnific"
Tony replies: High-res version via Dropbox link.
Live
Video Watermark Grid
Email any MP4 to jennifer@austinvisuals.com with instructions like "Watermark this clip." Tony tiles "Austin Visuals Sample" (or any text you specify) in a 3x3 grid across the frame at ~18% opacity, keeps audio intact, and emails the watermarked file back for safe sharing.
Subject: Watermark sample clip
Body: "Use text: Austin Visuals Sample. 3x3 grid, 20% opacity."
Attachment: client_preview.mp4
Tony replies: client_preview_watermarked.mp4 + summary of the watermark settings.
Live
📞 Tony Can Call You
Say "call me" to Tony on Telegram and Tony will call you directly on your phone using a real two-way AI voice conversation. Tony speaks with a natural American voice and can hold a full conversation. Calls come from (775) 347-7887. SMS notifications from that number are also coming soon.
Trigger: Say "call me" in Telegram
Tony calls your registered number and says: "Hey, Tony here with Austin Visuals. What's on your mind?"
In Progress
💬 Tony Can Text You
A2P carrier vetting submitted April 22 - pending approval (24-48hr window). Once approved, Tony will send and receive SMS from (775) 347-7887. Use cases: project updates, reminders, quick confirmations, and client notifications. Say "text me [message]" and Tony sends it to your registered number.
Trigger: "Text me that the Vendry proposal was submitted"
Tony sends an SMS from (775) 347-7887 to your phone with the confirmation.
In Progress
Storyboard → Animation Workflow
Extracts ordered frames from a Word storyboard, pairs start/end images per beat, applies quick Ken Burns moves, stitches the beats, and layers optional music so clients can preview timing before production.
Input: storyboard.docx + beat sheet + optional MP3
Output: 1080p MP4 with each storyboard beat animated (crossfades/zooms) plus optional backing track.
Active
Animation Script Expansion (2D Production)
Takes a rough episode brief and expands it into a production-ready 2D animation script - not a generic story outline. Output is structured for animation: Cold Open, Act 1/2/3, scene headings, action lines, dialogue with comedic cadence, and a PRODUCTION NOTES section that auto-populates the Show Bible (character list, locations, hero objects). Understands shot types, comedy timing, cutaway structure, and setup/reaction/punchline pacing. Output feeds directly into the storyboard generation pipeline. This is different from what a general AI writes - it knows it\'s writing for 2D animation production, not a screenplay.
Input: “Bob gets a parking ticket and decides to fight it himself in court”
Output: Full episode script with scene headings, dialogue, comedy beats, and production notes ready for storyboard generation.
Active
Show Bible - Persistent Production Reference
Maintains a show-level reference system that gets injected into every storyboard panel generation prompt. Includes: global art style description + reference images, character model sheets + 6-expression ranges, recurring environment establishing shots, hero object references. Set once per show, applied automatically to every episode. Eliminates character drift across panels without requiring manual prompt engineering per frame.
Input: Upload character sheets, expression ranges, environment references, write style description
Output: Every generated panel inherits the show\'s visual identity automatically.