ai benchmark - Search News

Testing The Limits: Three Ways AI Benchmarks Are Evolving

When it comes to real-world evaluation, appropriate benchmarks need to be carefully selected to match the context of AI ...

This new AI benchmark measures how much models lie

Researchers behind the MASK benchmark found that more knowledge doesn't mean more 'moral virtue.' See which model lies the ...

MIT Technology Review3d

These new AI benchmarks could help make models less biased

New AI benchmarks could help developers reduce bias in AI models, potentially making them fairer and less likely to cause harm. The research, from a team based at Stanford, was posted to the arXiv ...

MIT Technology Review4d

These new AI benchmarks could help make models less biased

They could offer a more nuanced way to measure AI’s bias and its understanding of the world. New AI benchmarks could help developers reduce bias in AI models, potentially making them fairer and ...

Analytics Insight1d

Gemma 3: Google’s New AI Beats OpenAI’s o3-mini and DeepSeek-V3

Google has launched Gemma 3, the third generation of its open-source AI models. The model is better than rivals like DeepSeek ...

11hon MSN

DeepSeek: Everything you need to know about the AI chatbot app

DeepSeek has gone viral. Chinese AI lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose ...

NextBigFuture3d

AI Agent Competitive Landscape and Manus AI Innovations

Manus AI, developed by the Chinese startup Monica.im is making a lot of splash as the world’s first fully autonomous AI agent ...

6don MSN

What is Compare AI Models? Everything we know about the really useful AI model comparison tool

Compare AI Models is a web-based tool designed to help you evaluate and compare different AI models based on key performance ...

OpenAI’s newest developer AI brings search capabilities to AI agents

When using Responses API to create an AI agent, developers can choose from two models: GPT-4o search and GPT-4o mini search.

2don MSN

After DeepSeek, China's ManusAI is here to challenge America's AI supremacy

China's startup Monica introduces ManusAI, the world's first autonomous AI agent. Capable of planning, analyzing, and ...

ET BrandEquity2d

Centre for AI Governance Board to Oversee Approvals

The government recommends establishing an AI Governance Board to ensure AI applications comply with legal standards.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results