An international research team has developed a new benchmark that reveals the current limitations of LLMs. Even the most advanced models fail at 90 percent of the tasks - for now. The test, called ...
Inside sources told the Financial Times that the Stargate AI infrastructure project will provide computing power exclusively to OpenAI. The project, announced earlier this week by OpenAI, SoftBank, ...
A new study by OpenAI shows that AI models become more robust against manipulation attempts if they are given more time to "think". The researchers also discovered new methods of attack. A recent ...
OpenAI has just launched Operator, an AI assistant that can navigate the web on its own. The tool, currently only available to US ChatGPT Pro subscribers, represents a step toward AI assistants that ...
Perplexity is stepping into Google's territory with a new AI assistant for Android that can control apps and handle tasks on its own. The move puts the startup in direct competition with Google's ...
A team of researchers from NYU, MIT, and Google has found a way to improve AI-generated images by borrowing ideas from recent AI reasoning models like OpenAI's o1. Their approach enhances image ...
A new company called "The Stargate Project" is bringing together some of tech's biggest names to build what could become the largest AI infrastructure network in history. The joint venture between ...
According to a report from The Information, OpenAI plans to launch "Operator" as a new ChatGPT feature for browser control later this week. The feature will offer several task categories, including ...
Google's experimental AI model Gemini 2.0 Flash Thinking has jumped ahead of its competitors, scoring impressive results in math, science, and general performance tests. According to testing platform ...
Google is rolling out several updates to its Gemini AI assistant for Android, focusing on how it handles multimedia, works with other apps, and becomes more accessible. The biggest addition is Gemini ...
ElevenLabs recently shared how well its AI support agent is performing. While the system handles most documentation-related questions successfully, it starts to struggle when dealing with more complex ...
Perplexity has upgraded its Sonar API, letting developers integrate AI-powered search capabilities - complete with source citations - into their applications. The company now offers two versions of ...