Reka launches Yasa-1, a multimodal AI assistant to take on ChatGPT

by WeeklyAINews

Reka, the AI startup founded by researchers from DeepMind, Google, Baidu and Meta, has announced Yasa-1, a multimodal AI assistant that goes beyond text to understand images, short videos and audio snippets.

Available in private preview, Yasa-1 can be customized on private datasets of any modality, allowing enterprises to build new experiences for a myriad of use cases. The assistant supports 20 different languages and can also provide answers with context from the internet, process long-context documents and execute code.

It arrives as a direct competitor to OpenAI’s ChatGPT, which recently got its own multimodal upgrade with support for visual and audio prompts.

“I’m proud of what the team has achieved, going from an empty canvas to an actual full-fledged product in under 6 months,” Yi Tay, the chief scientist and co-founder of the company, wrote on X (formerly Twitter).

This, Reka said, included everything from pretraining the base models and aligning them for multimodality to optimizing the training and serving infrastructure and setting up an internal evaluation framework.

However, the company also emphasized that the assistant is still very new and has some limitations, which will be ironed out over the coming months.

Yasa-1 and its multimodal capabilities

Available via APIs and as Docker containers for on-premise or VPC deployment, Yasa-1 leverages a single unified model trained by Reka to deliver multimodal understanding: it understands not only words and phrases but also images, audio and short video clips.

This capability allows users to combine traditional text-based prompts with multimedia files to get more specific answers.

For instance, Yasa-1 can be prompted with the image of a product to generate a social media post promoting it, or it could be used to detect a particular sound and its source.

Reka says the assistant can even tell what’s going on in a video, complete with the topics being discussed, and predict what the subject may do next. This kind of comprehension can come in handy for video analytics, but it seems there are still some kinks in the technology.

“For multimodal tasks, Yasa excels at providing high-level descriptions of images, videos, or audio content,” the company wrote in a blog post. “However, without further customization, its ability to discern intricate details in multimodal media is limited. For the current version, we recommend audio or video clips be no longer than one minute for the best experience.”

It also said that the model, like most LLMs out there, can hallucinate and should not be solely relied upon for critical advice.

Additional features

Beyond multimodality, Yasa-1 also brings additional features such as support for 20 different languages, long-context document processing and the ability to actively execute code (exclusive to on-premise deployments) to perform arithmetic operations, analyze spreadsheets or create visualizations for specific data points.

“The latter is enabled via a simple flag. When active, Yasa automatically identifies the code block within its response, executes the code, and appends the result at the end of the block,” the company wrote.
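Reka has not published how this works internally, so the following is only a rough sketch of the behavior the quote describes: find a code block in a response, run it, and append the output after the block. The names `append_results`, `run_snippet` and the `execute_code` flag are hypothetical and are not part of any Reka API; a real deployment would run the code in an isolated sandbox rather than with a bare `exec`.

```python
import contextlib
import io
import re

# Built this way so the example can itself sit inside a fenced block.
FENCE = "`" * 3
CODE_BLOCK = re.compile(FENCE + r"python\n(.*?)" + FENCE, re.DOTALL)


def run_snippet(source: str) -> str:
    """Run the snippet and return whatever it printed (no sandboxing here)."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        exec(source, {})  # assumption: stand-in for a real, isolated executor
    return buffer.getvalue().strip()


def append_results(response: str, execute_code: bool = False) -> str:
    """When the flag is set, append each code block's output after the block."""
    if not execute_code:
        return response

    def with_result(match: re.Match) -> str:
        return f"{match.group(0)}\nResult: {run_snippet(match.group(1))}"

    return CODE_BLOCK.sub(with_result, response)


# A made-up assistant reply containing one Python code block.
reply = f"The total is computed below.\n{FENCE}python\nprint(17 * 3)\n{FENCE}\nDone."
print(append_results(reply, execute_code=True))
```

With the flag off, the reply passes through unchanged; with it on, a `Result:` line follows the code block.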

Moreover, users will also get the option to have the latest content from the web incorporated into Yasa-1’s answers. This will be done through another flag, which will connect the assistant to various commercial search engines in real time, allowing it to use up-to-date information without any cut-off date restriction.

Notably, ChatGPT was also recently updated with the same capability using a new foundation model, GPT-4V. However, for Yasa-1, Reka notes that there’s no guarantee that the assistant will fetch the most relevant documents as citations for a particular query.

Plan ahead

In the coming weeks, Reka plans to give more enterprises access to Yasa-1 and work towards improving the capabilities of the assistant while ironing out its limitations.

“We’re proud to have one of the best models in its compute class, but we’re only getting started. Yasa is a generative agent with multimodal capabilities. It’s a first step towards our long-term mission to build a future where superintelligent AI is a force for good, working alongside humans to solve our major challenges,” the company noted.

While having a core team with researchers from companies like Meta and Google can give Reka an advantage, it is important to note that the company is still very new in the AI race. It came out of stealth just three months ago with $58 million in funding from DST Global Partners, Radical Ventures and multiple other angels, and it is competing against deep-pocketed players, including Microsoft-backed OpenAI and Amazon-backed Anthropic.

Other notable competitors of the company are Inflection AI, which has raised nearly $1.5 billion, and Adept, with $415 million in the bag.


