banner https://www.profitablecpmrate.com/nsirjwzb79?key=c706907e420c1171a8852e02ab2e6ea4

The US is reviewing Benchmark’s investment into Chinese AI startup Manus 

Manus AI is one of the hottest AI agent startups around, recently raising $75 million at a half-billion-dollar valuation in a round led by Benchmark. But two unnamed sources told Semafor that the investment is now under review by the U.S. Treasury Department over its compliance with 2023 restrictions on investing in Chinese companies. Benchmark’s … Read more

Sarah Tavel, Benchmark’s first woman GP, transitions to venture partner

Eight years after joining Benchmark as the firm’s first woman general partner, Sarah Tavel announced on X that she is transitioning to a more limited role at the storied venture firm. In her new position as a venture partner, Tavel will continue to make investments and serve on existing company boards, but she will have … Read more

Crowdsourced AI benchmarks have serious flaws, some experts say

AI labs are increasingly relying on crowdsourced benchmarking platforms such as Chatbot Arena to probe the strengths and weaknesses of their latest models. But some experts say that there are serious problems with this approach from an ethical and academic perspective. Over the past few years, labs including OpenAI, Google, and Meta have turned to … Read more

OpenAI launches program to design new ‘domain-specific’ AI benchmarks

OpenAI, like many AI labs, thinks AI benchmarks are broken. It says it wants to fix them through a new program. Called the OpenAI Pioneers Program, the program will focus on creating evaluations for AI models that “set the bar for what good looks like,” as OpenAI phrased it in a blog post. “As the … Read more

Meta’s benchmarks for its new AI models are a bit misleading

One of the new flagship AI models Meta released on Saturday, Maverick, ranks second on LM Arena, a test that has human raters compare the outputs of models and choose which they prefer. But it seems the version of Maverick that Meta deployed to LM Arena differs from the version that’s widely available to developers. … Read more

This Week in AI: Maybe we should ignore AI benchmarks for now

Welcome to TechCrunch’s regular AI newsletter! We’re going on hiatus for a bit, but you can find all our AI coverage, including my columns, our daily analysis, and breaking news stories, at TechCrunch. If you want those stories and much more in your inbox every day, sign up for our daily newsletters here. This week, billionaire … Read more

banner banner https://www.profitablecpmrate.com/nsirjwzb79?key=c706907e420c1171a8852e02ab2e6ea4