Search Phrase = sample
Write your First Program
Get started with Processing by clicking here to load the editor.
Copy and paste the code below into Processing to load exactly like I show in the video above.
function setup() { createCanvas(400, 400);}
function draw() { background(220); ellipse(200,200,100,100); }
sample" width="1690" height="805" />
Get started with Processing by clicking here to load the editor.
Check out the student example, the Moody Ninja
Also, check out the student example, Josh's Water Bottle.
Automated Video Production Pipeline
This video guides you through setting up an automated video production pipeline, from selecting and testing brand voices using Eleven Labs to pairing them with digital avatars in HeyGen. By following the steps, you'll learn how to catalog and integrate voices, match them with visual characters, and generate preview videos for evaluation. Once you complete the video, you'll be able to efficiently create, test, and organize multiple spokesperson options for your brand's automated content generation. This process empowers you to streamline video production and build a scalable library of branded video assets.
Following are the key things you will be able to do after you watch this demo:
Identify suitable brand voices using generative AI tools.
Catalog and organize voice and avatar options for efficient selection.
Integrate third-party voices into video production platforms.
Pair voices with digital avatars to create compelling spokesperson combinations.
Generate and preview automated video content for evaluation.
Document and track production assets for streamlined workflow.
Select and finalize top spokesperson options for automated content generation.
Introduction to Automated Video Production Pipeline (00:00:00 – 00:00:59)
Josh kicks off the demo by outlining the goal: selecting brand-aligned voices and digital doubles (either your own clone or hired actors), organizing those assets, and laying out the end-to-end steps needed to spin up a fully automated video production pipeline.
Content Sequencing Concept and Cloning (00:00:59 – 00:02:20)
He explains the core idea of building a repeatable sequence of content—cloning a finished production over and over—so you can continually generate new videos by plugging different scripts into the same automated workflow.
Defining Digital Doubles and Voice Types (00:02:20 – 00:03:11)
Josh clarifies terminology (digital twin vs. digital double), walks through the two main “buckets” of voice assets (personality-based clones vs. spokesperson avatars), and discusses how to mix and match them depending on your brand needs.
Selecting Platforms for Generative AI and Deployment (00:03:11 – 00:04:00)
He emphasizes the importance of vetting your generative-AI tools—voice engines and video avatars—and making sure they’re compatible with your target platforms before committing to any given solution.
Brand-Focused Workflow and SRT Utilization (00:04:00 – 00:05:25)
Josh decides to focus on one streamlined method for this demo, using a single SRT transcript file as the “source of truth” for automation—underscoring that a clean, well-formatted SRT is absolute gold when you’re architecting an automated pipeline.
Importing SRT and Leveraging Automation (00:05:25 – 00:07:40)
He shows how to import the SRT into the voice-generation platform, highlighting how the time-coded script drives every subsequent step—from audio rendering to scene assembly.
Setting Up Voice Design in ElevenLabs (00:07:40 – 00:11:49)
A step-by-step walkthrough of testing voice presets, tweaking text lengths, integrating third-party voices, and crafting voice-design prompts to nail down the exact tone and style you want.
Managing Credits and Reviewing Generated Audio (00:11:49 – 00:15:46)
Josh demonstrates how to monitor and conserve your generation credits, preview the rendered audio, swap out placeholder text, and ensure you’re only spending resources on polished clips.
Applying Voiceover and Text Overlays to Video (00:15:46 – 00:19:08)
He attaches the finalized voice track to the video timeline, adds and styles text overlays (centering, contrast adjustments), and assembles the basic video composition ready for export.
Enhancing Prompts with AI Tools for Voice Design (00:19:08 – 00:22:04)
Introduces additional AI utilities for brainstorming and refining your voice-design prompts—showing how to iterate until you get a sample that truly matches your brand voice.
API Key Handling and Asset Export Configuration (00:22:04 – 00:27:28)
A practical guide on securely copying your ElevenLabs API key, configuring export settings (e.g., 4K output), and organizing all generated files into branded folders for easy access.
Frame Rate Considerations and Quality Checks (00:27:28 – 00:31:42)
Notes the default 25 fps setting, explains how frame rate impacts perceived motion, and walks through checking your export quality to avoid any unexpected artifacts.
Avatar Adjustments, Project Naming, and Fallbacks (00:31:42 – 01:05:16)
Josh covers fine-tuning avatar scale and positioning, updating project names for consistency, and setting up fallback workflows if you need to swap voices or visuals mid-pipeline.
Avatar Replacement and Cataloging (00:31:42 – 00:34:06)
Pair your chosen voice with visuals by replacing the default avatar, browsing through the 21 “looks” in each category, using the snipping tool to capture promising thumbnails, and logging each candidate’s name and category in your tracking spreadsheet.
Avatar Testing and Video Formatting (00:34:07 – 00:36:24)
Brainstorm voice–visual combinations (e.g. “August”), select a portrait-mode avatar, preview the static image, upload any custom avatars into the pipeline, drag your source video beneath the avatar layer, and confirm the composition and framing.
Voice-Avatar Sync and Quality Comparison (00:36:24 – 00:37:39)
Generate audio samples to compare HeyGen vs. ElevenLabs quality, force-refresh the clip to confirm it’s using the intended voice (e.g. Ryan Kirk), and watch for the spinning indicator to verify successful render.
Preview Generation and File Labeling (00:38:10 – 00:39:11)
Render a 4K preview of the voice-avatar pairing, then label the export asset with your convention (e.g. 001_RyanKirk_CharlieAvatar) so each test remains organized and easily identifiable.
Pipeline Duplication for Variant Testing (00:39:11 – 00:41:15)
Duplicate the entire sequence to create “Test 002,” swap in a new avatar (such as Colton), explore lifestyle/UGC categories, and note how background removal and frame size affect the final look.
Background Removal and Frame Adjustments (00:41:15 – 00:42:32)
Apply the background-remover tool to avatars with built-in backgrounds, observe any cut-offs (like arms being cropped), tweak the canvas framing, and decide between static vs. transparent backgrounds based on brand needs.
Third-Party Voice Integration Workflow (00:42:32 – 00:44:03)
In the “My Voices” tab, toggle on integrated voices (e.g. Charlie), heart your favorites so they surface first, preview each sample, and ensure the API integration is active before proceeding.
Voice Audition Labeling and Mood Board Documentation (00:44:03 – 00:47:09)
Name each audition (e.g. 002_CharlieAvatar), update your mood board with snipped thumbnails, record which browser tab or category each came from, and keep this documentation up to date for reproducibility.
Frame Rate and Credit Management (00:47:09 – 00:48:06)
Note the default 25 fps setting—mismatches can cause audio sync issues—toggle off “Avatar 4” if you’re on an unlimited plan, and monitor your generation credits to avoid unexpected limits.
Styling and Folder Organization (00:48:06 – 00:49:29)
Adjust text overlay colors to maintain contrast (match your brand palette), create new folders for each batch, and standardize your output directory structure so you know exactly where each rendered clip lives.
Option Preview and Cataloging Workflow (00:49:30 – 00:55:51)
Refresh thumbnails, scroll through voice-avatar combos, assign option numbers, screenshot grids of candidates, and log each pairing’s status (“Yes,” “Maybe,” “No”) in your spreadsheet.
Iteration Process and Consistency Notes (00:55:51 – 00:57:23)
Always regenerate every variation (never reuse stale renders), note any limitations (e.g. animated text can cover on-screen elements), and keep your naming and documentation consistent so the pipeline remains bullet-proof.
Ranking Options and Visual Separators (00:57:24 – 01:02:40)
Introduce visual separators in your catalog (e.g. blank rows), rank the top voice-avatar combos, screenshot your “definite yes” list, and preserve those as templates for future batches.
Additional Voice Integration: Amelia (01:02:40 – 01:04:33)
Search for “Amelia” in your voice library, verify whether it’s built-in or needs third-party integration, add it to favorites, preview the sample, and record its ID for consistent reuse.
Final Voice Candidate Integration (01:04:33 – 01:05:16)
Confirm Amelia’s render, then search for any last candidates (e.g. “Analore”), heart and test them, catalog the results, and ensure each new voice is fully integrated into the pipeline.
Final Pipeline Recap and Scale Duplication (01:07:40 – 01:08:34)
Recap how you’ve selected your final set of voices and avatars, finalize your naming conventions, and highlight that you can now duplicate this entire automated workflow to churn out an endless library of on-brand social-media videos.
Keywords: audio,recording,microphone,quality,live,studio,interface,phantom,power,sample,rate,uncompressed,format,pop,filter,level,balancing,Camtasia,Studio,file,organization,voice,clone,AI,avatar,sound,absorption
This video provides a comprehensive guide to professional audio recording for content creators, focusing on essential equipment and techniques for high-quality sound production. Viewers will learn how to select the right microphone, set up a proper recording environment, and use audio interfaces and editing tools to capture clean, professional-grade audio. By following Josh Lomelino's expert advice, participants will be able to create polished audio recordings suitable for workshops, demos, podcasts, and even AI-generated video content. The tutorial equips creators with practical skills to improve their audio recording process and produce more engaging, professional-sounding content.
This video provides a comprehensive guide to professional audio recording for content creators, focusing on essential equipment and techniques for high-quality sound production. Viewers will learn how to select the right microphone, set up a proper recording environment, and use audio interfaces and editing tools to capture clean, professional-grade audio. By following Josh Lomelino's expert advice, participants will be able to create polished audio recordings suitable for workshops, demos, podcasts, and even AI-generated video content. The tutorial equips creators with practical skills to improve their audio recording process and produce more engaging, professional-sounding content.
Here are the key things you will be able to do after you watch this demo:
Select an appropriate high-quality microphone for professional audio recording
Set up a clean, noise-free recording environment
Configure audio interfaces and software for optimal sound capture
Choose the correct sample rate and recording format
Use a pop filter and mic positioning techniques to improve audio quality
Perform audio test recordings and evaluate sound levels
Utilize audio editing tools for recording and post-production
Implement file organization strategies for audio projects
Export audio files in various formats for different content needs
Create consistent, professional-grade audio recordings for workshops, demos, and presentations
Prepare audio recordings for potential AI avatar or voice clone generation
Troubleshoot common audio recording and equipment setup challenges
Basic Method of Production 0:09
Josh Lomelino explains the simplicity and power of recording thoughts and ideas using just a microphone.
Live recordings during workshops or demos are more engaging but harder to edit if mistakes are made.
Studio recordings allow for pauses and polished takes but require maintaining a natural and conversational tone.
The importance of a high-quality microphone and a quiet, clean recording space is emphasized.
Microphone Setup and Recording Quality 1:31
Josh recommends the AKG condenser mic for its clean, detailed sound, which requires phantom power.
The Shure 57 microphone is mentioned as a versatile option for various recording situations.
The Zoom H6 USB audio interface is preferred for its compatibility with various software like Camtasia.
Recording at 48,000 Hz instead of the default 44.1 Hz is suggested to preserve audio detail.
Audio Recording Practices 3:18
Josh advises recording in an uncompressed format like WAV until the final export to avoid audio degradation.
Ensuring the computer and audio interface are set to the same sample rate prevents speed mismatches.
The use of a pop filter and an adjustable mic arm helps maintain consistent audio quality.
Test recordings and listening on different devices help ensure balanced sound levels.
Audio Editing and Tools 4:53
Josh mentions various audio editing tools like Audacity, Adobe Audition, Pro Tools, and FL Studio.
Camtasia Studio is recommended for its convenience in recording and managing audio projects.
The Auto Normalize feature in Camtasia helps maintain consistent volume throughout recordings.
Exporting recordings as MP3s allows for generating on-camera videos using AI avatars.
File Organization and Studio Setup 5:55
A consistent naming system for recordings and exports is crucial for easy retrieval and updates.
Avoiding rooms with echo and using soft materials to absorb sound helps improve recording quality.
A good studio setup, including soundproofing and proper equipment, is essential for high-quality recordings.
Josh hints at a future demo on creating a voice clone, which requires clean and consistent audio recordings.
In this video, Josh Lomelino demonstrates how to create an AI-powered digital voice replica using 11 Labs, enabling content creators to rapidly generate high-quality audio and video content at scale. By training the system with a consistent audio sample, users can produce automated voice performances that sound like their own, allowing them to create lectures, demos, and other content quickly and efficiently. The method involves uploading 1-3 hours of controlled audio recordings, fine-tuning voice settings, and integrating with platforms like HeyGen to automate video production. After watching this tutorial, viewers will be able to develop their own AI voice clone, streamline content creation, and overcome time constraints by generating multiple scripts and videos with minimal manual effort.
In this video, Josh Lomelino demonstrates how to create an AI-powered digital voice replica using 11 Labs, enabling content creators to rapidly generate high-quality audio and video content at scale. By training the system with a consistent audio sample, users can produce automated voice performances that sound like their own, allowing them to create lectures, demos, and other content quickly and efficiently. The method involves uploading 1-3 hours of controlled audio recordings, fine-tuning voice settings, and integrating with platforms like HeyGen to automate video production. After watching this tutorial, viewers will be able to develop their own AI voice clone, streamline content creation, and overcome time constraints by generating multiple scripts and videos with minimal manual effort.
Here are the key things you will be able to do after you watch this demo:
Train an AI voice synthesis system using personal audio recordings
Generate consistent voice replicas with controlled audio samples
Optimize AI-generated voice settings for natural-sounding output
Integrate voice cloning technology with video production platforms
Create automated content at scale using text-to-speech technologies
Manage AI voice generation credits efficiently
Export and store audio files in multiple formats for different applications
Prototype and refine scripts using AI voice technology
Develop a workflow for rapid content creation across lectures, demos, and presentations
Leverage AI tools to overcome time constraints in content production
Creating a Voice Replica Using AI 0:09
Josh Lomelino discusses the use of AI-powered voice synthesis to create a voice replica, emphasizing the challenge of matching human recordings.
He highlights the effectiveness of using text prompts to quickly prototype, test, and revise scripts or generate finished audio files.
Josh mentions his preference for the 11 labs tool, which offers a studio mode for producing longer form audio tracks.
He shares his initial struggles with the tool and how contacting their support provided helpful suggestions.
Training the System for Consistent Output 1:24
Josh explains the importance of training the system with a consistent audio sample to avoid unnatural variations in volume and tone.
He describes his initial mistake of using diverse recordings from different sessions, which led to inconsistent results.
Josh emphasizes the need for a controlled environment with a single, consistent audio sample for better results.
He plans to demonstrate the settings that produce the best results for replicating his voice in the user interface.
Optimizing Generated Audio Files 2:56
Josh advises generating audio sparingly to avoid exhausting monthly credits and recommends starting with smaller sections of text.
He explains the process of refining the output and generating both wave and mp3 audio files for different applications.
Josh mentions the importance of storing both wave and mp3 files for secure storage and project organization.
He notes that it may take several attempts to develop a method that works well for the user.
Exporting and Integrating Audio Files 4:19
Josh describes two methods for uploading audio files to virtual avatars: exporting both wave and mp3 versions or integrating the 11 labs API directly with Hey Gen.
He prefers using the wave audio file for higher quality and to avoid double compression but acknowledges the need to export the mp3 format for larger tracks.
Josh explains the integration of the 11 labs API with Hey Gen, which allows for rapid development of prototypes and large volumes of content.
He mentions the need to break up scripts into manageable sections for efficient processing by the software.
Automating Video Production with AI 6:02
Josh discusses the ability to produce videos at scale by automating both audio and video avatars from text.
He highlights the productivity gains from using AI to generate video scripts and produce audio and video automatically.
Josh notes the cost of AI-generated voice and the strategy of using high-quality audio only when necessary.
He explains the use of draft versions of scripts with Hey Gen's voice replica to refine the script without incurring additional costs.
Finalizing and Exporting Scripts 8:04
Josh describes the process of finalizing scripts and either reading and recording them manually or using the 11 labs integration within Hey Gen.
He mentions the use of a side-by-side display setup with a Google document and video avatar performance for quick edits.
Josh emphasizes the usefulness of this method for high-end projects that require detailed polishing and iteration.
He concludes the demo by encouraging the use of digital voice replicas to scale beyond time constraints and improve productivity.
There are no Main Site search results.