Home Artificial Intelligence Sakana walks again claims that its AI can dramatically velocity up mannequin coaching

Sakana walks again claims that its AI can dramatically velocity up mannequin coaching

by admin
Unhappy and disappointed customer giving low rating and negative feedback in survey, poll or questionnaire. Sad and dissatisfied man giving review about service quality. Bad user experience.

This week, Sakana AI, an Nvidia-backed startup that’s raised lots of of thousands and thousands of {dollars} from VC corporations, made a outstanding declare. The corporate stated it had created an AI system, the AI CUDA Engineer, that might successfully velocity up the coaching of sure AI fashions by an element of as much as 100x.

The one drawback is, the system didn’t work.

Users on X quickly discovered that Sakana’s system really resulted in worse-than-average mannequin coaching efficiency. According to one user, Sakana’s AI resulted in a 3x slowdown — not a speedup.

What went unsuitable? A bug within the code, based on a post by Lucas Beyer, a member of the technical workers at OpenAI.

“Their orig code is unsuitable in [a] refined method,” Beyer wrote on X. “The actual fact they run benchmarking TWICE with wildly totally different outcomes ought to make them cease and assume.”

In a postmortem published Friday, Sakana admitted that the system has discovered a option to “cheat” (as Sakana described it) and blamed the system’s tendency to “reward hack” — i.e. determine flaws to attain excessive metrics with out conducting the specified objective (dashing up mannequin coaching). Comparable phenomena has been noticed in AI that’s trained to play games of chess.

In response to Sakana, the system discovered exploits within the analysis code that the corporate was utilizing that allowed it to bypass validations for accuracy, amongst different checks. Sakana says it has addressed the difficulty, and that it intends to revise its claims in up to date supplies.

“We’ve since made the analysis and runtime profiling harness extra strong to get rid of a lot of such [sic] loopholes,” the corporate wrote within the X publish. “We’re within the technique of revising our paper, and our outcomes, to mirror and focus on the consequences […] We deeply apologize for our oversight to our readers. We are going to present a revision of this work quickly, and focus on our learnings.”

Props to Sakana for proudly owning as much as the error. However the episode is an efficient reminder that if a declare sounds too good to be true, especially in AI, it most likely is.

Source Link

Related Posts

Leave a Comment