We Were More Confident a Year Ago

There is a specific kind of corporate statement that signals something went wrong. It does not say what went wrong. It does not name who made the decision that went wrong. It speaks in the past tense, with a kind of institutional regret that has been carefully drained of any accountability. Salesforce executives delivered one of these statements recently: “We were more confident about AI a year ago.”

That sentence is doing a lot of work. Here is what it is compressing.

What Happened a Year Ago

In early 2024, Salesforce CEO Marc Benioff announced layoffs affecting approximately 4,000 employees, roughly 10 percent of the company’s workforce. The stated rationale was, in part, that AI was going to handle work that humans previously did. Benioff was publicly enthusiastic about Agentforce, Salesforce’s AI agent platform, describing it as a genuine transformation in how businesses could handle sales, service, and customer operations. The pitch was consistent and loud: AI agents would do the work that currently required hiring.

Salesforce was not alone in this framing. In 2024 and early 2025, a cluster of large companies reduced headcount while simultaneously increasing investment in AI tools. The logic was presented as straightforward: the technology is here, it is good enough, and the math favors automation over headcount.

The executives making those decisions were, as one commenter summarized, “driven by CEOs who often don’t even know how to use their email, much less spend any significant amount of time working with these technologies they think will replace their workforce. They only hear how good it is from their connections. All of whom are also in a positivity bubble about this exciting new technology of which few of them have actually personally used in any significant amount for a real daily task.”

The Survey That Would Not Send

The specific incident that crystallized the reliability problem came from Vivint, a home security company that deployed Agentforce for customer support across 2.5 million customers.

The task that Agentforce failed at was not complex by any reasonable measure. After each customer interaction, the system was supposed to send a satisfaction survey. That is it. Send a survey when the call ends. A simple, repeated, low-stakes action with no judgment required.

Agentforce sometimes failed to send the surveys. No one could explain why. The failure was not consistent or predictable. It just occasionally did not happen.

Vivint worked with Salesforce engineers to fix this. The solution they implemented was called a “deterministic trigger.”

A deterministic trigger is not an AI innovation. It is an if/then statement. When condition A occurs, action B fires. No language model involved. No context window, no inference, no probability distribution over possible next tokens. Just a rule that executes when its precondition is met.

They had replaced human customer service agents with an AI system, the AI system could not reliably send a survey at the end of a call, and the fix was to bolt on a piece of logic that a junior developer could have written in twenty minutes in 1995.

What Institutional Knowledge Costs

One of the most-upvoted comments on the thread: “I bet you the people they lost with RTO and these layoffs could have told them that and saved them billions from lost institutional knowledge.”

This is the part of the story that does not appear in quarterly earnings calls.

When a company lays off 4,000 people, it does not just lose 4,000 units of labor. It loses accumulated judgment about why things work the way they work. It loses the person who knows why a particular customer workflow was built with that specific edge case handling. It loses the team that understands which customer types tend to escalate and how to route them. It loses the history of every time someone tried to simplify a process and discovered why it could not be simplified.

AI agents do not carry that knowledge. They carry the knowledge in their training data and in whatever context they are given at runtime. What they cannot carry is the institutional memory of a specific company, built over years by people who no longer work there.

This is one of the underexamined costs of aggressive AI replacement: the AI does not inherit the judgment of the people it replaced. It starts from zero. And starting from zero in a complex customer operation that has been running for a decade turns out to be more expensive than the headcount savings suggested.

The Hype Bubble and How It Works

A Darwin quote surfaced in the thread comments: “Ignorance more frequently begets confidence than does knowledge.”

The path from ignorance to a confident AI strategy in a large enterprise typically looks like this. An executive sees a compelling demonstration of an AI product. The demonstration is curated to show the product at its best: clean inputs, well-defined tasks, no edge cases, no institutional complexity. The executive is impressed. They ask their team to evaluate it. The team, which is not using the product daily for real work, returns an evaluation based on a similar demonstration context. Everyone concludes the technology is ready.

Meanwhile, the engineers and customer service staff who would actually deploy and maintain the system are not in the room. Their objections, when raised, are filtered through a management layer that has already decided the technology is viable. The concerns are categorized as resistance to change rather than as signals about real limitations.

The result is a deployment that looks like success on the slide deck and looks like the Vivint survey problem in production.

The Reckoning Has a Structure

What makes the Salesforce admission worth paying attention to is not that one company made a mistake. Companies make mistakes. What makes it interesting is the structure of the reckoning.

The mistakes that are emerging from aggressive AI agent deployment follow a consistent pattern. Simple tasks that require consistent, reliable execution in high volume are not reliably handled by language models operating autonomously. The models that work beautifully in demonstration fail in ways that are not predictable or debuggable in production. Fixing those failures often requires adding back rules and deterministic logic, which raises the question of why the agent was used for those tasks in the first place.

This is not a crisis for AI agents as a category. There are genuine tasks where agents outperform human teams: first-pass document review, research synthesis, drafting workflows that get human review before sending. The pattern of reliability failure is specific. It is in high-volume, repetitive, consequential tasks where consistency matters more than intelligence and where failure is invisible until someone notices the surveys are not going out.

The companies that are figuring this out are rebalancing. They are keeping agents for tasks where the flexibility and synthesis capabilities are genuinely useful. They are adding deterministic triggers, human review steps, and explicit scope limits for tasks where reliability is the requirement. They are treating agents as powerful tools with specific failure modes rather than as general-purpose employee replacements.

That rebalancing is what “we were more confident a year ago” looks like from the inside. The 4,000 people who no longer work there are watching from somewhere else.

Sources: Times of India, “After laying off 4,000 employees and automating with AI agents, Salesforce executives admit: We were more confident about AI a year ago” (2025); The Information reporting on Vivint/Agentforce reliability issues; Reddit r/technology (7,450 upvotes).