Agent Platform Scorecard

Who is building the agent operating system?

Twelve dimensions. Seven providers. Re-graded as announcements land — with the market's reaction alongside.

← all providers
5

xAI

#7 · 4.8/10 overall

Fast-moving, real-time, wired into X.

Closed the capability gap fast and owns a real-time data firehose via X. Distinctive personality and speed, but thin on enterprise, cross-app actions, the device edge, and a developer platform.

Assistant
Grok
Flagship model
Grok 4 family
Public ticker
Private (merged with X; Tesla TSLA is a related Musk-entity proxy)

Dimension breakdown

  • Assistant Intelligence
    8/10

    Grok 4 closed most of the gap to the frontier at remarkable speed.

  • Agent Capability
    5/10

    Agentic features are emerging but unproven relative to the leaders.

  • Conversational UX
    7/10

    Grok's voice mode and distinct personality make for engaging, fast interaction.

  • Cross-App Actions
    3/10

    Almost no cross-app orchestration today.

  • Personal Context
    4/10

    Real-time X context is unique, but personal memory is thin.

  • Third-Party Integrations
    4/10

    A narrow integration surface so far, centered on X.

  • Developer APIs
    6/10

    The Grok API is live and improving, but the platform is young and shallow.

  • Model Strategy
    7/10

    Owns its models and the Colossus compute to train them; the model ladder is still maturing.

  • Agent Platform Potential
    5/10

    X integration is intriguing distribution, but the agent-platform thesis is unproven.

  • On-Device AI
    2/10

    No on-device or edge story.

  • Enterprise Readiness
    3/10

    Minimal enterprise presence, governance, or compliance footprint.

  • MyAGI Alignment
    4/10

    A capable model without the platform, context, or governance of a control plane.

Trajectory

Racing up the capability curve with real-time data leverage; platform, enterprise, and edge remain unbuilt.

  • Now
    Fast model, thin platform

    Grok closed the capability gap; integrations, enterprise, and edge are minimal.

  • H2 2026
    X as a distribution + data moat

    Tighter X integration could turn real-time data into a distinctive agent surface — if a developer platform follows.

Compare all roadmaps →