Home News Arthur releases open source tool to help companies find the best LLM for a job

Arthur releases open source tool to help companies find the best LLM for a job

by WeeklyAINews
0 comment

Arthur, a machine studying monitoring startup, has benefited from the curiosity in generative AI this yr, and it has been creating instruments to assist corporations work with LLMs extra successfully. Right this moment it’s releasing Arthur Bench, an open supply device to assist customers discover the very best LLM for a specific set of information.

Adam Wenchel, CEO and co-founder at Arthur, says the corporate has seen quite a lot of curiosity in generative AI and LLMs, and they also have been placing quite a lot of effort into creating merchandise.

He says that in the present day, and granted we’re lower than a yr because the launch of ChatGPT, that corporations don’t have an organized technique to measure the effectiveness of 1 device towards one other, and that’s why they created Arthur Bench.

“Arthur Bench solves one of many important issues that we simply hear with each buyer which is [with all of the model choices], which one is greatest to your specific software,” Wenchel informed TechCrunch.

It comes with a collection of instruments you should use to methodically take a look at the efficiency, however the actual worth is that it lets you take a look at and measure how the sorts of prompts your customers would use to your specific software will carry out towards completely different LLMs.

Arthur Bench LLM comparison test suite hedging test.

Picture Credit: Arthur

“You might probably take a look at 100 completely different prompts, after which see how two completely different LLMs — like how Anthropic compares to OpenAI — on the sorts of prompts that your customers are seemingly to make use of,” Wenchel mentioned. What’s extra, he says that you are able to do that at scale and make a greater determination on which mannequin is greatest to your specific use case.

See also  Sysdig Sage brings power of generative AI to cloud security with an LLM controller approach

Arthur Bench is being launched in the present day as an open supply device. There will even be a SaaS model for patrons who don’t wish to cope with complexity of managing the open supply model, or who’ve bigger take a look at necessities, and are keen to pay for that. However for now, Wenchel mentioned they’re concentrating on the open supply mission.

The brand new device comes on the heels of the release of Arthur Shield in Could, a form of LLM firewall that’s designed to detect hallucinations in fashions, whereas defending towards poisonous info and personal information leaks.

Source link

You may also like

logo

Welcome to our weekly AI News site, where we bring you the latest updates on artificial intelligence and its never-ending quest to take over the world! Yes, you heard it right – we’re not here to sugarcoat anything. Our tagline says it all: “because robots are taking over the world.”

Subscribe

Subscribe my Newsletter for new blog posts, tips & new photos. Let's stay updated!

© 2023 – All Right Reserved.