Home News Voice.ai raises $6M as its real-time voice changer approaches 500K users

Voice.ai raises $6M as its real-time voice changer approaches 500K users

by WeeklyAINews
0 comment

Companies like Midjourney and ChatGPT have pushed the boundaries of how AI can create pictures and textual content out of fundamental textual content prompts. Now, audio seems to be the inevitable subsequent frontier. Music era based mostly on phrase prompts, AI tutors for language studying and voice simulators have all seen developments in latest months. Voice.ai hopes to be part of that dialog (heh) with expertise that lets customers change (and disguise) their voices in actual time, and now it has raised its first exterior funding on the heels of early development.

With greater than 480,000 customers and a library of greater than 50,000 voice filters, Voice.ai has picked up $6 million, funding that it plans to make use of to take its voice altering tech into new locations.

Mucker Capital and M13 are main the spherical. Prior to now, Voice.ai has grown by phrase of mouth — the startup has a Discord channel with greater than 120,000 folks — on the again of $3 million in self-funding.

At the moment the corporate’s instruments — obtainable as apps for Mac, PC, Android and iOS — are getting adopted by players, content material creators, Vtubers and others on TikTok, Zoom, Discord, Minecraft, GTA5, Fortnite, Valorant, League of Legends, Amongst Us, Skype, WhatsApp and other platforms. The Voice.ai interface lets them create a brand new voice, or choose from some 50,000 completely different pre-created voices (created and shared by customers like themselves), which can be utilized as-is or modified, to make use of dwell in supported platforms, or for recordings.

The plan is to make use of the funding to rent extra technical expertise and to construct new SDKs and APIs to work with additional platforms like Meta, Unreal and Unity; carry on multi-language help; and add in new purposes like singing the place voice is heart stage.

The startup doesn’t single it out, however it will likely be attention-grabbing to see if it makes use of a number of the funding additionally to extend server capability.

That’s no small burden. Anecdotally, we’ve heard that GPU ache is without doubt one of the greatest gating components in how numerous AI apps are capable of scale for the time being. (It’s partly why you’re seeing huge offers being made that embrace strategics offering processing and server capability.)

See also  HiddenLayer raises $50M to defend enterprise AI models

For Voice.ai particularly, your voice is processed domestically and channeled into wherever it will likely be used via what founder and CEO Heath Ahrens described to me as a “digital audio cable.” However whenever you have a look at evaluations of its apps, a standard lament is that whenever you enroll you’re placed on a waitlist as a result of “overwhelming demand has our servers at max capability” with a promise that you simply’ll learn when the service will increase that capability.

There are dozens of speech-to-voice and voice-to-speech companies available in the market at present, and already numerous exercise amongst them: Final yr Spotify acquired Sonantic and Snap purchased an AI voice assistant even sooner than that; one other startup, Sanas, is engaged on altering your accent and there are the voice simulators Murf and Acapela, amongst many others. Voice.ai counts itself in the identical basic class as Respeecher and ElevenLabs, two voice-to-voice AI startups, letting customers apply masks to tweak or fully remodel their voices — in some instances creating fully artificial voices instead of the true factor.

Respeecher, based and based mostly in Ukraine, made a reputation for itself by serving to construct a brand new Darth Vader voice for brand spanking new Star Wars installments, based mostly on how James Earl Jones sounded 45 years in the past when he originated the function. (In step with a personality hell-bent on destroying worlds, Darth’s voice was delivered to the Hollywood consumer from its workplaces in Ukraine as Russia marched into the nation.)

ElevenLabs — famously (or infamously as the case may be) — has constructed a platform that’s frighteningly good at cloning voices, and earlier this month it picked up its most up-to-date funding spherical of $19 million from a bunch of big-name buyers.

Voice.ai is attempting, in that blend, to place itself because the AI voice modifying app for Everyman.

“There are many firms which might be attempting to supply a special taste of voice tech to companies,” Ahrens advised TechCrunch in an e-mail (satirically, it wasn’t attainable to rearrange a dwell interview with him). Ahrens has some expertise with the constructing of B2B AI tech: his two earlier firms — iSpeech for text-to-speech and Haystack for face recognition — are constructed round API choices.

See also  OpenAI wants to work with organizations to build new AI training data sets

“What units Voice.ai aside is that we’re centered on bringing tech that was beforehand reserved for enterprise firms immediately into the fingers of customers in an inexpensive style.” Many customers, he famous, “come to us from classical DSP voice changers and voice modulators which they’d been utilizing prior to now and that are nonetheless standard amongst many players and streamers.”

“Reasonably priced” is available in two tiers, with most customers now on a free service that requires them to choose in to offering computational energy to coach Voice.ai’s fashions, with its service constructed by itself personal information set comprised of “tens of millions of distinctive customers.” No pricing is supplied on the location: we’re asking for these particulars.

“We consider in making expertise accessible and plan on working along with the open supply neighborhood to democratize Voice AI expertise,” added Ahrens.

Voice.ai additionally claims it takes what’s a basically completely different method to the problem of adjusting a voice, tapping into a number of the ethos that has constructed up round the usage of avatars by Vtubers, players and others on-line.

“Most voice AI firms which might be coming into the house attempt to construct scalable enterprise centered text-to-speech options or costly voice-to-voice companies for manufacturing studios,” Ahrens mentioned. “We begin from the alternative spectrum and attempt to ship worth to people who wish to develop how they sound on-line. The core worth proposition of our speech-to-speech AI isn’t that it may completely replicate any given individual. It’s that it retains the core components of a person’s speech: their emotion, pacing and emphasis whereas changing the sound of the voice, with a view to create a unique new finish outcome, in real-time.”

It could be due to how the demographics in interactive platforms like gaming skew, however for now Voice.ai’s viewers is 70% male versus 30% feminine with new classes opening not simply round who’s utilizing the tech, however why.

That features not simply these utilizing avatars and constructing voices to match them, or these on the lookout for extra privateness safety, but in addition, he mentioned, “transgender customers who can characterize themselves with voices that match their id, in addition to customers exploring fully new on-line personas for themselves.”

See also  Weights and biases raises $50M to advance LLMOps efforts for generative AI

There may be already a base of customers tapping into Voice.ai’s direct-to-consumer choices, however one of many explanation why Mucker is investing within the startup is as a result of it believes that there’s a chance to construct out a community of builders utilizing and integrating its tech.

“Voice.ai is poised to revolutionize the AI developer neighborhood in a way akin to AdMob’s impression on the cellular app developer neighborhood,” mentioned Omar Hamoui, a companion at lead investor Mucker Capital. (Hamoui beforehand based the cellular advert startup AdMob, finally acquired by Google, so he has some direct expertise constructing cellular developer instruments.) “By providing user-friendly options that have been as soon as unique to giant enterprises, Voice.ai goals to democratize entry for builders worldwide.”

Karl Alomar, the previous COO of Digital Ocean, who led the funding for M13, mentioned buyers might be taking an energetic function within the subsequent stage of improvement. “At Digital Ocean too we noticed the worth of constructing a neighborhood of builders by builders,” he mentioned. “We’re excited for creators and builders to construct on the Voice.ai platform.”

Source link

You may also like

logo

Welcome to our weekly AI News site, where we bring you the latest updates on artificial intelligence and its never-ending quest to take over the world! Yes, you heard it right – we’re not here to sugarcoat anything. Our tagline says it all: “because robots are taking over the world.”

Subscribe

Subscribe my Newsletter for new blog posts, tips & new photos. Let's stay updated!

© 2023 – All Right Reserved.