FUTURES EXPO SERIES
Sregen . ai
An AI-based , talkinghead generation and voice-cloning system for automatic contentmedia creation
Research Project
Sregen . ai is an innovative , online platform that helps with content augmentation , such as generation of personalised talking-head videos . It empowers users and content creators to easily deliver multimedia content , such as presentations or marketing videos , in multiple languages , including sign language .
Editing and modification of voice is complex and time-consuming . This platform uses AI to create realistic talking-head videos with a person ' s voice . Users provide speech , video , or text , and the system generates a video of that person speaking . It can clone voices from short audio samples and works in multiple languages .
This technology has potential in education , advertising , and social media .
Compared to competitors , this system is faster , more customisable , more ethical ( it uses hidden watermarks to prevent misuse ), more natural , and more secure .
Key capabilities
> Online platform that helps with content augmentation , such as generation of talking heads
> Expertise in machine learning and generative AI
> Faster : automatic creation vs one-week manual processing
Differentiators
> The AI model trains on local hardware for user data privacy and sovereign capability
> Papers in top international journals : evaluations showing superior performance . Patent Pending Voice Cloning Technology ( No . PCT / AU2023 / 050900 )
Key customers
> Online content creators , platforms , and marketers > Educational technology providers > Graphic designers > Multimedia companies > Film Production / Editing Software companies
Key partnerships
> Collaborating with longtailai . org to develop a product line for multilingual sign-language ( hearing-impaired people )
unsw . to / sanjay-jha
20 •