LAMBDA CALCULUS BENCHMARK FOR AI LAUNCHES
■ AI-SUMMARIZED FROM 1 SOURCE BELOW
A new benchmark for evaluating AI systems based on lambda calculus has been released. The tool aims to provide a standardized measure of reasoning capabilities across different AI models.
■ MORE FROM THE AI DESK
Andon Market, a San Francisco retail boutique, operates as the first store managed entirely by an AI agent. The experiment uses Anthropic's Claude Sonnet 4.6 model to handle store operations.
A benchmark test of 500 investment bankers found that top AI models including GPT-5.4 and Claude Opus 4.6 produced no outputs suitable for client delivery. Despite the failures, over half the bankers said they would use AI results as a starting point.
A new survey reveals Claude's weekly active users in the US have significantly higher incomes than users of competing AI assistants like ChatGPT and Gemini.
The inaugural World AI Film Festival (WAIFF) premiered in Cannes this week, showcasing AI-generated cinema while the prestigious Palme d'Or competition bans emerging technology from entries.