Honor Robot Smashes Half-Marathon Record.

Share

TL;DR

  • Performance Milestone: An Honor humanoid robot set a half-marathon record, completing the Beijing E-Town course in 50:26 [link]—a decisive seven-minute lead over the current human best of 57:20 and a substantial improvement from last year's 2:40 win.
  • Regulatory Landscape & Market Reactions: Anthropic's White House engagement saw President Trump dismiss Dario Amodei as "Who?" [link], while Barclays CEO Venkatakrishnan designated Mythos "a serious threat" [link]. Concurrently, global central banks executed a "war game" [link] simulating an AI-induced banking crisis.
  • Product vs. Perception: Opus 4.7 faces mounting user complaints regarding a 35% cost inflation due to a new tokenizer [link], despite Artificial Analysis ranking it in a three-way tie for intelligence leadership [link] with Google and OpenAI.
  • OpenAI's Strategic Reshuffle: The GPT-5.5 "Spud" release window opens Monday [link], with Polymarket odds at 72% for an April 30 launch. Friday's triple executive departure at OpenAI is largely interpreted as a streamlining effort ahead of this critical launch.
  • Compliance Crunch: The EU AI Act's hiring bias audit mandate is 105 days from enforcement [link], with certified third-party auditors already operating at capacity.

Lead Story: The Beijing Half-Marathon and the Acceleration of Embodied AI

The metrics are unambiguous. At the second annual Beijing E-Town Humanoid Robot Half-Marathon, an Honor autonomous robot navigated the 21-kilometer course in 50 minutes and 26 seconds. This result definitively surpassed Jacob Kiplimo's human world record of 57:20 by almost seven minutes. CNBC characterized the event as a "non-human race" that unequivocally "shows how far robots have come." CNN's coverage adopted a more direct assessment: "A Chinese android just ran a half-marathon faster than any human ever."

The rate of advancement warrants closer examination. The inaugural race last year saw a winning time of 2 hours, 40 minutes, and 42 seconds. This year's performance represents a 68% improvement within a twelve-month span. Honor—a Huawei spinoff—secured the top three positions, all relying on fully autonomous navigation. A remotely controlled Honor robot, "Lightning," technically finished first at 48:19, but under the event's weighted scoring (autonomous systems receive a 1.0 coefficient versus 1.2 for remote-controlled), the autonomous entry secured the championship.

The event attracted 112 teams, including five international contenders from Germany, France, and Brazil, as reported by CGTN. Participants included Alibaba, Unitree, and AGIBOT. AGIBOT independently declared 2026 "Deployment Year One" at its recent Partner Conference. The South China Morning Post noted the winner "smashed" the human record by six minutes.

This is not merely an exhibition. It is a public, verifiable benchmark for real-world locomotion, balance, and high-speed autonomous navigation over distance—the physical AI equivalent of a SWE-bench evaluation. Beijing has provided a data point that renders the trajectory unmistakable.


In Other News

Opus 4.7 Backlash Solidifies Around Cost Metrics. Three days post-launch, community sentiment on Opus 4.7 is bifurcated, with critics gaining traction. Startup Fortune published an analysis arguing the backlash "may be entirely justified." The core issue centers on a new tokenizer that generates up to 35% more tokens for identical inputs, effectively a hidden price increase despite stable per-token rates. Subscribers across all tiers are reporting premature exhaustion of 5-hour and weekly caps. Concurrently, Simon Willison published an exacting system prompt diff on April 18, detailing 4.7's expanded tool surface: "Claude in Chrome," "Claude in Excel," and "Claude in PowerPoint" now appear in the system prompt, affirming Anthropic's pivot toward a comprehensive agent platform. Benchmark performance remains robust—87.6% SWE-bench, 70% CursorBench—and Artificial Analysis now assigns Opus 4.7 a score of 57 on its Intelligence Index (+4 over 4.6), resulting in "the greatest tie in Artificial Analysis history" with Anthropic, Google, and OpenAI sharing the top rank. The underlying performance is not disputed. The prevailing community narrative, however, has shifted from "upgrade" to "trade-off."

GPT-5.5 "Spud" Window Nears: OpenAI Executive Departures Signal Preparations. OpenAI's most probable release window for its next model commences April 21, aligned with Polymarket odds and pretraining completion timelines. "Spud" concluded training around March 24 at Stargate Abilene. Greg Brockman characterized it as "a significant change in the way we think about model development." Friday's triple executive exit—Kevin Weil, Bill Peebles, and Srinivas Narayanan—coupled with the Sora shutdown, is widely interpreted as OpenAI divesting non-core initiatives to concentrate on the imminent launch. Whether the model ships as GPT-5.5 or GPT-6 remains undeclared, reportedly contingent on the performance delta over GPT-5.4. Separately, Gizmodo reported increasing investor concern regarding Altman's external ventures—particularly Worldcoin—ahead of the projected IPO.

EU AI Act Hiring Audit: 105 Days Out, Auditor Capacity Strained. The EU AI Act's high-risk classification for hiring AI takes effect August 2. Post this date, any AI system deployed in employment decisions necessitates annual third-party bias audits, comprehensive technical documentation, human oversight frameworks, and transparent candidate disclosures. Non-compliance carries penalties of up to 15 million euros or 3% of global annual turnover. The immediate operational challenge, per Fisher Phillips, is the limited pool of certified auditors qualified under the EU's conformity assessment framework, with schedules rapidly filling. The regulation applies to any tool evaluating EU-based candidates, irrespective of company headquarters, encompassing ATS resume rankers, AI interview scoring systems, and ad targeting platforms.


X / Social Pulse

Simon Willison's system prompt forensics captured considerable developer attention over the weekend. His April 18 analysis leveraged Claude Code to reconstruct a Git history of Claude system prompts, subsequently diffing versions 4.6 against 4.7. Key findings—the explicit mention of Chrome, Excel, and PowerPoint agents within the prompt—substantiate Anthropic's trajectory toward a comprehensive autonomous agent platform. The former "developer platform" is now formally termed the "Claude Platform."

President Trump's disavowal undermined his administration's diplomatic outreach. When questioned about the Amodei-Wiles meeting, the president's response was a curt "Who?", which Gizmodo observed placed the White House "in a tricky position." QuantoSei described it as a "blind spot." Meanwhile, Barclays CEO CS Venkatakrishnan issued a warning to the banking sector, labeling Mythos "a serious threat." Concurrently, central bank officials from the UK, US, and EU conducted a Washington "war game" simulating a major bank collapse amidst AI and geopolitical volatilities. The Ninth Circuit May 19 oral arguments loom as a critical legal inflection point if political consensus remains elusive.

Global judiciaries are increasingly scrutinizing AI-generated legal documentation. A Nebraska attorney received a suspension after 57 of 63 citations in his appellate brief were deemed defective, including 20 AI hallucinations; the state Supreme Court ruled his denials "lack credibility." Separately, Kenya's High Court ruled AI-drafted legal pleadings illegal, citing violations of civil procedure and the creation of an "unfair advantage." India's Chief Justice Surya Kant advised judicial officers to view AI as "an aid, not a substitute." Three distinct jurisdictions, one week, converging conclusions.


One to Watch

Nature's recent publication provides a necessary counterbalance to prevailing agentic AI narratives. A new analysis concludes that "human scientists trounce the best AI agents on complex tasks," with even top-tier agents operating at approximately half the proficiency of PhD-level experts in open-ended scientific workflows. The Stanford AI Index 2026, released concurrently, situates AI mentions in natural-science publications at 6-9% of all papers—a significant figure, yet not the overwhelming surge some had projected. The implication is not that AI agents are without utility, but that the chasm between "automate a well-defined coding task" and "generate novel scientific discovery" remains substantial. SWE-bench scores do not translate directly to unconstrained research. Labs advancing the "AI researcher" premise should factor in this discount rate.


Quick Hits

  • Meta confirmed May 20 for initial layoffs, impacting approximately 8,000 employees, with further reductions planned later in 2026—India Today anticipates up to 16,000 total across two tranches, reflecting the company's $135B AI capex re-prioritization.
  • Cursor's $2B funding round at a $50B valuation is oversubscribed, per Bloomberg, attracting participation from a16z, Thrive, and Nvidia; reported revenue has surpassed $2B ARR.
  • Apple nearly enforced a Grok App Store ban due to deepfake policy infringements, per NBC News—xAI submitted content moderation updates, initially rejected by Apple, before eventual approval.
  • Grok 4.3's full 1T-parameter version training concludes around April 22, as stated by Musk on X—Grok Build and Grok CLI are anticipated next week.
  • Google and Avid announced a strategic partnership to integrate agentic AI into media production workflows, with live demonstrations slated for NAB Show Las Vegas (April 19-22).

A robot that completed a half-marathon in two and a half hours last year just eclipsed the human world record by seven minutes. This represents a 68% performance gain in twelve months, achieved through fully autonomous navigation on public roads. While the AI industry engaged in debates concerning tokenizer costs, executive departures, and political accommodations—all valid points of contention—the true vector of advancement emerges in operational environments. If one seeks to understand the actual trajectory when software meets physical reality, observe the race footage from Beijing. The machines are not deliberating their pricing strategy. They are executing.


Sources

Lock in. M. mazen@thorterminal.com

Read more