Introducing Native mini 1
By Native Team
Today we're introducing Native mini 1, the smallest and most efficient in our database reasoning family.
Enterprises are betting on autonomous agents to run 24/7 across their data. That only works if they can trust the reasoning underneath. Native mini 1 was built for exactly this.
Data Reasoning
BI tells you what happened. Data reasoning tells you why it happened, and what to do next, across ten different silos that don't talk to each other.
Most enterprises have spent months building agents that demo well but can't scale past a handful of medium-complexity tables. The underlying problem is the same. Their systems don't understand the data. Native mini 1 does.
We treat databases as a distinct modality. Building a deep understanding of raw data requires learning specific behaviours, relationships, and structures. Native mini 1 is the smallest in our family, at around 1/20th the cost of our larger configurations. It is the most accurate database reasoning system in the world.
Spider2 Results
To evaluate our infrastructure, we found the benchmark with the hardest questions, across the largest, lowest quality data. In other words, what actually exists inside enterprises. This was Spider2.
Spider2 tests a system's ability to reason over complex, messy, real-world database environments. Not clean academic datasets. The kind of sprawling, contradictory, multi-system data estates that enterprises actually operate on. Hundreds of tables, inconsistent schemas, missing labels, and questions that require multi-step reasoning across siloed sources.
Native mini 1 achieved 95.80% accuracy, placing #1 on the leaderboard and becoming the first team to break the 95% barrier. The benchmark has been attempted by research teams from ByteDance, Snowflake, Alibaba, Tencent, and others. Our larger reasoners solves every error free question in the benchmark but is reserved for enterprise deployments.
| Rank | Team | Accuracy |
|---|---|---|
| 1 | Native mini 1 | 95.80% |
| 2 | Genloop v2 Pro | 95.06% |
| 3 | DAQUV + Gemini 3 Pro | 94.15% |
| 4 | Tencent with Contextual Scaling Engine | 93.97% |
| 5 | Paytm - Prism Swarm + Claude Sonnet 4.5 | 90.49% |
| 6 | AT&T & RelationalAI | 86.28% |
| 7 | ByteDance - ByteBrain Agent | 84.10% |
| 8 | Alibaba - AI Cheng Agent | 82.81% |
| 9 | Ant Group - LingXi Agent + Claude Sonnet 4.5 | 79.89% |
| 10 | Snowflake - Arctic-FLEX | 75.14% |
We are now moving onto more challenging benchmarks. A technical report on Spider2 is coming soon.
Contact Sales
Native mini 1 is a research prototype. Our enterprise model is significantly larger, built to work across every data system your organization runs on. If you'd like to learn more, contact us at contact@usenative.ai