This AI agent startup ditched Anthropic for DeepSeek — and says it’s saving millions

Lindy moved 100% of its AI agent traffic from Anthropic to DeepSeek v4 to save millions on inference, and CEO Flo Crivello explains why the migration cost far more effort than expected.

Jun 9th, 2026 12:17pm by Paul Sawers

Featued image for: This AI agent startup ditched Anthropic for DeepSeek — and says it’s saving millions

Space Stock for Unsplash+

The biggest blocker to sustainable AI deployment has emerged as inference cost. GitHub recently abandoned its flat-rate Copilot subscription in favor of usage-based billing, after agentic coding sessions drove costs beyond what a fixed monthly fee could absorb — some subscribers woke up to bills several times higher than they’d been paying. Uber, meanwhile, burned through its entire 2026 AI budget in just four months, largely on Claude Code, leaving its COO questioning whether the returns justified the outlay.

In response to this broader reckoning, the Linux Foundation launched the Tokenomics Foundation — backed by Google, Microsoft, IBM, Salesforce, among others — to build open standards around AI token costs, an acknowledgment that enterprises currently have no consistent way to measure or control what they owe.

Flipping the switch

For companies running AI agents at volume, the economics of frontier models have become almost an existential question.

Flo Crivello, a former engineer and product lead at Uber, is the founder and CEO of Lindy, a no-code AI agent platform that automates everyday work tasks — from email triage and meeting scheduling to CRM management. Crivello founded Lindy in 2023 as a pivot from Teamflow, a virtual office startup for which he had previously raised $52 million in capital — capital that now backs Lindy’s development.

Crivello took to social media last week to announce that Lindy had switched its entire model infrastructure from Anthropic to DeepSeek.

“Saves us millions of $ and we’re actually seeing an increase in performance on many core use cases.”

“Pulled the trigger today and switched 100% of Lindy traffic to DeepSeek v4, churning from Anthropic models,” Crivello wrote on X. “Saves us millions of $ and we’re actually seeing an *increase* in performance on many core use cases. Transformative for the business.”

In truth, Crivello had signaled his intentions some months earlier, writing on X in April that inference was Lindy’s single biggest cost — exceeding payroll — and that open-source models had gone from “not even close” to “at the frontier, for most use cases” in the space of a year. At the time, he said Lindy had come close to making Kimi K2.5 — a model from Chinese AI company Moonshot AI — its default, before pivoting toward GLM-5.1, developed by Beijing-based lab Zhipu AI.

As it turned out, the company settled on DeepSeek v4, a flagship open-source model from the Chinese AI research company DeepSeek.

Of course, switching from one model provider to another at full production scale is no trivial task. Crivello tells The New Stack that the timeline to completion depends on when you start counting — but either way, it was a significant undertaking.

“We’ve been looking to make this switch and evaluating new OSS models for 6-9 months.”

“We’ve been looking to make this switch and evaluating new OSS models for 6-9 months, and DeepSeek since it was released, about 2 months ago,” Crivello explains.

Notably, the migration proved far more demanding than Crivello initially anticipated — “100x more work than we thought,” as he put it. Evaluations — that is, systematically testing the new model across real-world tasks to verify it could match or exceed what Anthropic’s models had been delivering — were a major part of it.

“Lots of work to eval the models, on online evals, offline evals, and tons of ‘vibe evals’,” Crivello says. “[We then did] a gradual rollout for both online evals, and to see the impact on retention; and [then] adapt our prompts to this new model.”

The effort would have been hard to justify on cost savings alone — but the performance results gave Crivello added confidence, particularly in its core use cases, which include email inbox triaging and pre-drafting replies based on the user’s voice.

“And that’s exactly where we’ve seen surprising performance gains with DeepSeek,” Crivello explains, adding that DeepSeek still trails Anthropic on some complex automation tasks.

“It’s still less good than Sonnet at ‘workflow automation,’ which is more secondary for us,” he says.

DeepSeek moment

To understand why Lindy’s switch matters, it helps to understand what DeepSeek has come to represent in the AI industry.

DeepSeek sent shockwaves through Silicon Valley in January 2 025, when its R1 model matched the performance of leading US frontier models at a fraction of the cost — prompting a brief but dramatic selloff in Nvidia’s stock as investors questioned assumptions about AI’s compute requirements. What followed was a steady stream of releases that kept closing the gap with the frontier realm.

DeepSeek V4, released in preview in April 2026, marked a further step change — and not just on price. Marcel Salathe, professor at EPFL and co-director of the EPFL AI Center in Switzerland, noted on LinkedIn that v4 represented something more significant from a geopolitical standpoint: For the first time, a frontier-class AI stack exists that is fully Chinese from chip to framework to model. DeepSeek, it seems, spent months rewriting v4 to run on CANN — Huawei’s equivalent of Nvidia’s CUDA — reducing its reliance on US chip infrastructure.

That geopolitical shift has a direct commercial consequence. As The New Stack previously reported, the arrival of cheaper open-weight models from predominantly Chinese AI labs has split the AI model market into two clusters — ultra-premium frontier models from the likes of OpenAI and Anthropic, and dramatically cheaper open-weight alternatives — with the comfortable middle thinning out. The numbers bear this out: Vercel’s AI Gateway, which routes traffic between apps and AI providers, recorded DeepSeek’s share of token volume jumping from under 1% to 17% in a single month in May — while its share of actual spend remained near 1%, a reflection of just how cheaply those tokens are being served.

For companies running agents at volume, such as Lindy, that polarisation has forced a reckoning with which economics to route to. For Lindy’s founder, whose inference bill had grown to exceed payroll, the question really was just a matter of when.

Lindy settled on Atlas Cloud, a US-based inference provider that hosts DeepSeek v4 on American soil — an important detail given that questions around data sovereignty tend to follow Chinese-developed models. Crivello addressed this directly in response to at least one commenter on X, noting that the model is hosted in the US by an American provider — and that Atlas came out ahead after evaluating “all the major players.” Self-hosting, for what it’s worth, was never on the table.

“We did not seriously consider [self-hosting], no — it would seem like a massive distraction,” he says.

“We did not seriously consider self-hosting…it would seem like a massive distraction.”

Runway and future plans

While Crivello says the switch will ultimately save Lindy millions, the runway implications for a venture-backed company are significant.

But how much, exactly? “A ton,” is all Crivello will say.

As for whether the move is permanent, Crivello is non-committal. “Nothing in life is permanent,” he says. “I wouldn’t be surprised if Anthropic’s next release earned [them] our business back, but they would need to significantly cut prices.”

“I wouldn’t be surprised if Anthropic’s next release earned them our business back, but they would need to significantly cut prices.”

It’s also worth noting that Lindy remains an Anthropic customer — just not for its core product. The company still uses Claude internally, because the economics of the subscription make it viable.

“Our internal use is about the Max plan — if it wasn’t for it, and if we had to pay full token price, we’d switch to something else,” Crivello says.

Responding to a question from Amp CEO and founder Quinn Slack about whether Lindy might eventually be forced back to Anthropic’s models for its external product, Crivello suggested the door isn’t entirely closed. “We’ll probably still escalate to Opus when we detect Lindy is failing at a task,” he wrote, “but that’ll be marginal.”

Crivello’s view is that companies in Lindy’s position — large token consumers — have little choice but to act. “Companies like us who spend a lot on tokens, 100% — you’d be irresponsible not to,” he says. “Other companies, it will depend, but I think a lot of folks just stick to the brand name.”

Paul is an experienced technology journalist covering some of the biggest stories from Europe and beyond, most recently at TechCrunch where he covered startups, enterprise, Big Tech, infrastructure, open source, AI, regulation, and more. Based in London, these days Paul...