Search Phrase = multilingual
Keywords: Automated, performance, text, video, Otter, AI, voice, clone, Eleven Labs, HeyGen, audio, multilingual
In this video, Josh demonstrates how to create fully automated video performances directly from text using tools like Otter AI, 11 Labs, and HeyGen. Viewers will learn how to generate high-quality voice clones, prototype video scripts, and produce professional-looking content with minimal effort by leveraging AI-powered voice and video generation technologies. The workflow allows content creators to transform written or spoken text into polished video presentations quickly and efficiently. By following Josh's method, users can generate multiple video iterations, edit audio precisely, and create digital avatars that replicate their voice and performance with remarkable accuracy.
In this video, Josh demonstrates how to create fully automated video performances directly from text using tools like Otter AI, 11 Labs, and HeyGen. Viewers will learn how to generate high-quality voice clones, prototype video scripts, and produce professional-looking content with minimal effort by leveraging AI-powered voice and video generation technologies. The workflow allows content creators to transform written or spoken text into polished video presentations quickly and efficiently. By following Josh's method, users can generate multiple video iterations, edit audio precisely, and create digital avatars that replicate their voice and performance with remarkable accuracy.
Following are the key things you will be able to do after you watch this demo:
Generate video scripts from transcribed audio using AI tools
Create high-quality voice clones with consistent audio recordings
Prototype video content using free and paid AI platforms
Optimize voice training for digital avatars
Manage content production across multiple AI environments
Edit audio tracks with minimal credit consumption
Develop a systematic workflow for automated video creation
Replicate personal performance using digital voice technology
Transform text-based content into professional video presentations
Implement cost-effective strategies for video and audio generation
Creating a Fully Automated Performance from Text 0:08
Josh Lomelino explains the process of creating a fully automated performance directly from text, including generating audio prompts using Otter AI.
He describes how he brainstorms ideas while walking and exports the subtitle transcript file, SRT, to process it with AI tools like Claude or ChatGPT.
Josh mentions breaking up long scripts into manageable blocks of 1800 characters and generating a year's worth of content for various platforms.
He emphasizes the use of text, whether written manually or spoken and transcribed, to craft a video script using two primary methods.
Generating High-Quality Voice Clones 1:51
Josh discusses creating a high-quality voice clone using 11 Labs, initially finding the results artificial but later perfecting the settings.
He highlights the importance of using a consistent audio clip for training the voice digital double, ideally around three hours of spoken audio.
Josh explains the challenges of recording consistently for three hours and how he stitches together previous demo recordings to create a large audio clip.
He stresses the need for meticulous tracking of audio settings to ensure uniformity and avoid sudden changes in volume or tonal quality.
Optimizing Audio Recording for Consistency 3:36
Josh shares his experience of recording multiple live sessions with an audience, which infused the audio with personality and energy.
He explains the importance of having consistently dialed-in audio for generating a high-quality performance, as the AI listens to everything in the audio track.
Josh mentions the time and cost involved in using 11 Labs, which can take up to six to eight hours to analyze a voice and build a model.
He advises against using cheaper models, such as the multilingual version one model or turbo 2.5, and recommends upgrading to the multilingual version two model for better results.
Using Hey Gen for Cost-Effective Prototyping 5:35
Josh introduces Hey Gen as an alternative for creating generative content when 11 Labs burns through credits too quickly.
He explains how he trains Hey Gen on his voice by uploading a 10 to 15-minute audio clip and generates unlimited videos for free, depending on the subscription plan.
Josh describes the process of creating prototypes, making real-time adjustments to the script, and rendering multiple takes.
He mentions using his phone in split screen mode while walking to make adjustments on the fly and then copying and pasting the revised script into Hey Gen.
Switching Between Hey Gen and 11 Labs 7:44
Josh explains how he can switch the voice in Hey Gen to the high-quality production voice in 11 Labs with a click of a button.
He highlights the downside of using Hey Gen, which is the risk of losing all credits if there are issues with the audio track in the final video.
Josh prefers using the Studio tool in 11 Labs for targeted editing, which allows regenerating just portions of the audio without redoing the entire clip.
He mentions the benefit of being able to download the WAV file and MP3 file from the Studio tool in 11 Labs as a fail-safe.
Organizing Video Production Phases 9:21
Josh describes his workflow of treating production as two phases: the cheap, free voice phase and the final phase.
He explains the process of pasting the text directly into the Hey Gen editor, listening to the prototype, and resolving issues before creating a new file in Hey Gen.
Josh organizes his videos into two folders: a prototype folder and a final folder, for easy organization of his methods.
He mentions using the multilingual version two model for cost-effective throwaway tests and training his voice with Hey Gen for free prototyping.
Leveraging Digital Doubles for High-Quality Videos 10:34
Josh shares how he uses his digital doubles to replicate a performance of his voice and generate a corresponding video composite.
He explains how he creates a script using Otter AI during a walk, copies and pastes it into his automated workflow, and produces a high-end video with minimal effort.
Josh highlights the benefits of this workflow, which allows him to deliver excellence without skipping a beat, even when small inconsistencies would have derailed the process before.
He concludes by mentioning the next steps in the following videos, which will cover adding automated visual elements on screen behind the virtual avatar.
AI Tools Overview and Links
Otter AI
Otter AI is a powerful transcription and collaboration tool that solves one of the biggest bottlenecks for membership owners and content creators: turning raw ideas and recordings into publish-ready content quickly. Instead of spending hours manually transcribing podcasts, coaching calls, or brainstorming sessions, Otter automatically converts audio into accurate, searchable text that can be repurposed into blog posts, course modules, captions, or marketing emails. For creators juggling multiple platforms and constant content demands, Otter removes the friction of documentation and frees up time to focus on engaging their audience, scaling their community, and generating revenue.
Otter AI Affiliate Link Signup (use this link)
HeyGen
HeyGen is an AI video creation platform that eliminates the need for expensive equipment, on-camera talent, and complex editing—solving a major pain point for membership owners and content creators who need consistent, professional-looking videos to engage their audiences. With HeyGen, you can instantly turn scripts into high-quality talking-head videos using realistic AI avatars, complete with voiceovers and multilingual capabilities. This allows creators to scale their content output, personalize training or marketing messages, and maintain a polished brand presence without the cost or time traditionally required for video production.
HeyGen Affiliate Link Signup (use this link)
ElevenLabs
ElevenLabs is an advanced AI voice generation platform that solves the challenge of producing high-quality, natural-sounding audio for membership owners and content creators without the ongoing need for sitting in a chair and recording your voice over and over. It allows creators to instantly convert written content—like course modules, podcasts, or marketing scripts—into realistic human-like narrations in multiple voices and languages. This not only speeds up content production but also ensures a consistent, professional sound across all audio materials, helping creators deliver a polished experience that builds trust, increases engagement, and scales their content library effortlessly.
ElevenLabs Affiliate Link Signup (use this link)
Video Manager Component
After completing this video, viewers will be able to confidently upload and organize videos using the AMP Video Manager Component. They will learn how to tag and categorize content for easy searching, modify video details, and utilize advanced features like custom thumbnails and player button settings. Additionally, viewers will understand how to manage video metadata, optimize playback quality, and access analytics to track video performance. This empowers users to efficiently manage and enhance their video content within the platform.
Following are the key things you will be able to do after you watch this demo:
Video Manager Component Overview 0:08
Josh Lomelino introduces the video manager component, explaining its accessibility from both the end user's perspective and the backend.
He highlights the interactive chapters, x-ray search functionality, and closed captions capabilities.
The video manager supports various video resolutions, including 4K, 8K, and 360-degree videos, and offers a picture-in-picture feature.
Josh explains the ease of uploading videos through drag-and-drop, mentioning the automatic handling of transcripts and video resolutions on the backend.
Tagging and Metadata Management 2:23
Josh demonstrates the tagging system, which allows organizing videos into categories for easier management.
He explains the process of adding tags to videos, emphasizing the importance of tagging for advanced searches.
The metadata management includes naming, describing, and tagging videos before uploading the MP4 file.
Josh highlights the importance of uploading the highest resolution video, which will be transcoded into multiple versions for adaptive playback.
Transcoding and Video Quality Adaptation 5:49
Josh describes the transcoding process, where the highest resolution video is converted into multiple versions for different connection speeds.
He explains how the player automatically selects the best quality based on the user's connection speed.
The transcoding process ensures that the video adapts to the user's playback capabilities, enhancing the viewing experience.
Josh demonstrates the successful upload of a video and the subsequent changes in the user interface.
Advanced Features and multilingual Support 9:21
Josh mentions future demos that will cover advanced features like multiple language support for transcripts and videos.
He explains the ability to switch out videos by modifying content and using the select video feature.
The advanced search functionality allows filtering videos by tags and specific words, making it easier to find content.
Josh emphasizes the importance of categorization and organization for managing large video libraries.
Customization and Player Settings 12:00
Josh discusses the customization options for thumbnails, player buttons, and embedding restrictions.
He explains how to upload custom thumbnails and the availability of templates for creating professional-looking thumbnails.
The player settings allow customizing social media engagement features and restricting where the video can be embedded.
Josh highlights the flexibility in setting video visibility, from public to private, and the impact of these settings on the video's accessibility.
Full Screen Video Manager 12:14
Josh introduces the Full Screen Video Manager, which provides a comprehensive view of video management.
The Full Screen Video Manager allows uploading videos, managing metadata, and adding tags directly from the full-screen interface.
He explains the process of creating content again to ensure the new video appears in the search process.
The manager also allows modifying tags and thumbnails for existing videos, enhancing the flexibility of video management.
Analytics and View Tracking 17:13
Josh demonstrates the ability to track the number of views for each video, providing valuable analytics data.
He explains how the analytics data can be used to monitor the performance of embedded content on other platforms.
The tracking feature ensures that all views are accounted for, even when the video is embedded on external sites.
Josh emphasizes the importance of using this data to optimize the video manager component and improve the user experience.
Final Thoughts and Summary 21:05
Josh summarizes the key features and functionalities of the video manager component.
He reiterates the ease of uploading and modifying videos, as well as the automatic handling of metadata and video resolutions.
The advanced search and tagging features are highlighted as powerful tools for managing large video libraries.
Josh concludes by emphasizing the flexibility and scalability of the video manager component, making it a versatile tool for various content management needs.
Multiple Languages Demo in AMP
multilingual-amp-demo-english">Watch in English | multilingual-amp-demo-spanish">Watch in Spanish
Description
Discover how cutting-edge AI technology enables you to create multilingual videos that deliver natural, native-language experiences for global audiences. This demo will guide you through three methods for translating audio and video content, with a focus on the most advanced approach—generating fully synchronized video, audio, and subtitles in up to 36 languages. Learn how to spot-check translations using AI services, validate quality even without full review teams, and seamlessly integrate multi-language playback within Anomaly Amp. By the end of this session, you’ll see how learners can effortlessly switch languages during playback—while maintaining perfect lip sync and subtitle accuracy—empowering your content to reach diverse viewers with an authentic, in-language experience.
Multiple Languages Demo in AMP (Spanish)
multilingual-amp-demo-english">Watch in English | multilingual-amp-demo-spanish">Watch in Spanish
Descubra cómo la tecnología de IA de vanguardia le permite crear videos multilingües que ofrecen experiencias naturales y en el idioma nativo para audiencias globales. Esta demostración le guiará a través de tres métodos para traducir contenido de audio y video, centrándose en el enfoque más avanzado: la generación de video, audio y subtítulos completamente sincronizados en hasta 36 idiomas. Aprenda a verificar las traducciones con servicios de IA, validar la calidad incluso sin equipos de revisión completos e integrar sin problemas la reproducción multilingüe en Anomaly Amp. Al final de esta sesión, verá cómo los usuarios pueden cambiar de idioma fácilmente durante la reproducción, manteniendo una sincronización labial y una precisión de subtítulos perfectas, lo que permite que su contenido llegue a audiencias diversas con una experiencia auténtica y en su propio idioma.
Creating multilingual Videos
After completing this video, viewers will know how to create and translate audio and video content into multiple languages using advanced AI-powered workflows. They will be able to generate synchronized lip-sync performances, dubbed audio, and accurate subtitles for up to 36 languages, ensuring a seamless user experience for international audiences. Users will also be able to integrate these multilingual assets into platforms like Amp, allowing viewers to easily switch between languages. This process empowers content creators to efficiently mass produce and manage localized video content for diverse learning environments.
Following are the key things you will be able to do after you watch this demo:
Click here to see and get each tool used in this demo.
Examples Shown in this Demo
Overview of multilingual Video Creation 0:08
Josh Lomelino introduces the demo overview for creating multilingual videos and integrating them into Anomaly Amp.
Discusses the three methods for delivering multilingual content: audio-only, translation services, and advanced methods.
Highlights the advanced method's ability to generate performances with audio and video in sync in multiple languages.
Mentions the demo will focus on the advanced method, which offers the best user experience.
Preparing the Source Material 3:55
Josh emphasizes the importance of using a high-quality WAV file for the best translation and quality.
Demonstrates the process of preparing the source material, whether it's live or generated.
Explains the steps involved in exporting the audio file as a WAV or MP3.
Discusses the benefits of using a WAV file for better translation and quality.
Translation Process Using 11 Labs 7:08
Josh explains the translation process using 11 Labs, which provides the best translation and vocal performance.
Details the steps for creating a dubbing project in 11 Labs, including specifying the source and target languages.
Discusses the benefits of using multiple speakers and disabling voice cloning for better performance.
Demonstrates the process of uploading and translating an audio file using 11 Labs.
Spot Checking Translations 13:29
Josh shows how to spot check translations using AI translation services if a full translation team is not available.
Explains the process of exporting the translated audio file and re-translating it back to English for validation.
Highlights the importance of having a review team to ensure accuracy.
Discusses the steps for implementing multiple languages into Anomaly Amp.
Advanced Method Demonstration 21:05
Josh demonstrates the advanced method, which generates performances with audio and video in sync in multiple languages.
Explains the sequential process of preparing the source material and translating it using 11 Labs.
Discusses the benefits of using a digital double for creating multilingual videos.
Demonstrates the process of uploading and generating the translated video file.
Integrating multilingual Videos into Anomaly Amp 28:08
Josh explains the process of integrating multilingual videos into Anomaly Amp.
Discusses the options for switching between languages on the fly.
Demonstrates the steps for creating a new page in Anomaly Amp and uploading the multilingual video.
Highlights the benefits of using Vimeo's advanced tools for managing multilingual videos.
Handling Subtitles and Closed Captions 35:00
Josh discusses the options for handling subtitles and closed captions in multilingual videos.
Demonstrates the process of adding subtitles and closed captions in Vimeo.
Explains the benefits of using AI translation services for generating subtitles.
Highlights the importance of ensuring the subtitles and closed captions are accurate and synchronized.
Implementing Multiple Language Pages 58:30
Josh explains the process of creating multiple language pages in Anomaly Amp.
Discusses the benefits of having a separate page for each language.
Demonstrates the steps for creating and linking the multiple language pages.
Highlights the importance of organizing the content based on the target audience's language preferences.
Text Translation and Localization 59:19
Josh discusses the importance of text translation and localization for multilingual content.
Demonstrates the process of translating text using Google Translate.
Explains the benefits of having a review team to ensure the accuracy of the translated text.
Highlights the importance of localizing the entire site for a seamless user experience.
Architecting the multilingual Experience 1:04:46
Josh discusses the different ways to architect the multilingual experience in Anomaly Amp.
Explains the benefits of having a separate class for each language.
Demonstrates the steps for organizing the content based on the target audience's language preferences.
Highlights the importance of choosing the best method for delivering multilingual content.
There are no Main Site search results.