Ingka Group acquires Locus! Built for the real world, backed for the long run. Read here>Read the full story>
Ingka Group acquires Locus! Built for the real world, backed for the long run. Read the full story

Why Rule-Based TMS Logic Breaks at Modern Retail Scale An Architectural Diagnosis

Cyber Monday. 12:06am. A flash promo just dropped on a stockout-prone SKU. Your East Coast 3PL hub posts a temporary capacity alert. A returns truck routing for that exact SKU is in flight. The TMS has to compose a decision in the next three minutes that resolves all three signals together.

It composes none of them. It runs them as three separate rules, in the order they arrived, against three different decision contexts. By 12:11am, two have fired correctly, one has fired incorrectly, and the fourth decision — the one that needed all three inputs in conversation — never gets composed.

Let's go under the hood.

What rule-based architecture got right

Rule-based architecture had a job and did it well. In the late 1990s and early 2000s, when most enterprise TMS platforms were architected, retail freight was predictable. Order volume came in batches. Carrier capacity was contracted in advance. Lanes were stable. A planner ran the day in the morning, and the system executed against rules that encoded routing guides and exception thresholds.

The architecture worked because the problem was largely linear. The architectural assumption underneath it — that retail logistics is a sequence of largely independent decisions executed against stable parameters — was true at the time.

It is not true now. Where retail signal volume now arrives in continuous streams, and decisions span cost, service, capacity, and carbon objectives at once, a rule-based architecture is structurally limited to processing the past. The gap is where margin leaks.

What changed: the decision shape

Three shifts have stretched the rule-based model past its envelope. Each one moves the architectural floor faster than rule-set maintenance can follow.

Fig. 01 Static plan-and-execute vs continuous replanning, conceptual

Rule-based architectures lock a plan against a snapshot of state and execute against it. When state changes mid-execution, recovery is exception-handled by humans. Composable architectures treat planning as a continuous process — re-optimizing as state evolves.

RULE-BASED · STATIC PLAN-AND-EXECUTE One plan, locked at T=08:00. Reality keeps moving. PLAN LOCKED 08:00 10:00 12:00 14:00 18:00 09:14 capacity alert 11:42 truck rerouted 14:18 hub backlog HUMAN EXCEPTION QUEUE +1.4 h +2.1 h +1.8 h Each deviation exits the planning loop. Recovery latency is human reaction time. COMPOSABLE · CONTINUOUS REPLANNING Plan recomputes at every state change. Recovery is the operating loop. PLAN LIVE 08:00 10:00 12:00 14:00 18:00 09:14 capacity alert 11:42 truck rerouted 14:18 hub backlog CONTINUOUS RE-PLAN LOOP + 0.8 s AUTO RE-PLAN + 1.2 s AUTO RE-PLAN + 0.6 s AUTO RE-PLAN Every state change recomposes the plan. Recovery latency is machine reaction time.

It uses cost, service, capacity, and SLA constraints to build the plan. The architectural failure isn’t that it ignores those constraints — it’s that the plan is locked against a snapshot. When carriers change capacity, when a regional hub posts an alert, when a returns truck reroutes mid-flight, the rule-based architecture has no native mechanism to recompose the plan against the new state. Recovery exits the planning loop and enters the human exception queue.

Fig. 02 Autonomous resolution rate, agentic vs rule-based exception management

Up to 80% of common incidents can be resolved autonomously. Time-to-resolution drops by 60–90% versus dashboard-and-human exception handling.

AGENTIC vs RULE-BASED EXCEPTION HANDLING AUTONOMOUS RESOLUTION RATE 80% of common incidents resolved autonomously 0% 100% (McKINSEY · EXHIBIT 5) TIME-TO-RESOLUTION DASHBOARD + HUMAN 100% AGENTIC 10–40% 60–90% reduction in time to resolve (McKINSEY · EXHIBIT 5)

McKinsey QuantumBlack, “Seizing the Agentic AI Advantage,” 2025-06-13

When a single rule fires incorrectly, downstream effects cascade through every decision that depended on it. By the time a planner is alerted that a 9am cut was missed, the rerouting options that existed at 8:55am are gone.

Learn how the world's first agentic TMS can act in real-time governed by the policies your team defines

Fig. 03 Static rule-set maintenance vs continuous adaptation, conceptual

A rule-based system is exactly as smart as the day it was last patched. Agentic systems collapse cycle time through parallel execution and adapt continuously to changing conditions.

RULE-BASED · STATIC RULE-SET MAINTENANCE Logic patched once every few months. Often obsolete by go-live. LOGIC FROZEN CYCLE 1 ~3 MONTHS RECAL CYCLE 2 ~3 MONTHS RECAL CYCLE 3 ONGOING The world keeps moving. The logic does not, till your planner or their account manager recalibrates. COMPOSABLE · CONTINUOUS ADAPTATION Logic patched on every signal. Always current with the network. LOGIC LIVE CYCLE 1 · · · CYCLE n ~SEC ~SEC Every real-world signal updates the logic. Rules that stay current with your network reality.

McKinsey QuantumBlack, “Seizing the Agentic AI Advantage,” 2025-06-13

A rule-based system suffers from logic degradation: as the physical network evolves, the coded logic in the TMS becomes increasingly detached from physical reality. Maintaining it requires manual recalibration projects whose parameters are obsolete by the time they reach production. Agentic systems close that loop in production. Rules still encode policy and intent. What changes is the optimization layer, the learning loop, and the simulation capability that sits around them.

These three shifts describe a fundamentally different problem shape.

Where this surfaces in your operations

  1. Peak season chaos is data velocity hitting a system that batches.
  2. Store-fulfillment cost spikes are decision interdependency the system cannot evaluate, so it picks the closest store and hopes.
  3. Slot adherence drift is the cascading effect of one delayed offload that the system cannot recompose around.
  4. Planner firefighting is what happens when the system cannot close a learning loop, so it offloads the loop to humans.
  5. Returns routing leakage is decision interdependency at the network level, expressed as a static default that ships every parcel home.

The dashboard cannot show any of this because the dashboard reports against the parameters the system was given.

When the architecture broke in public

Two retail cases show what happens — and what doesn’t — when an architecture meets a state change it cannot replan against. The first illustrates a system halting at a partial fault. The second is a side-by-side: two retailers, the same data shock, two architectural reflexes. Neither is an outlier. They’re the same pattern that plays out every peak season inside retail TMS deployments at smaller scale — it just rarely makes the press.

ASOS, 2019.

ASOS rolled out a new automated storage-and-retrieval system at its Berlin Eurohub. The retrieval side worked. The put-away side could not keep pace, generating an inbound backlog the automation could not recompose around. Picking faults compounded — items left out of orders, customer cancellations, profit warning issued. Supply Chain Dive documented the architectural shape: the system treated the fault as an exception to halt rather than as a state to recompose around. Profit impact: £20–25M ($25–31M) immediate; warehouse-transition costs rose from a £35M budget to £47M; full-year profit slumped 68%.

The architecture had no native mechanism to recompose against partial failure. It halted, and the human queue absorbed the cost.

Walmart vs Target, 2022.

Both retailers received pandemic-era holiday inventory in early 2022. Both faced a sudden demand collapse as consumer spending shifted from goods to services. The architectural responses diverged. Walmart took an item-by-item, category-by-category approach — canceled billions of dollars in orders, repriced aggressively on shorter-lead items, and trimmed about one-third of its U.S. excess inventory between Q2 and Q3 in a single quarter. Target, by contrast, took two large inventory charges within weeks, absorbed heavy markdowns through year-end, and saw a margin compression that lasted into 2023. Both retailers had access to similar information. The architectural difference was in the speed and granularity of dynamic re-composition.

Same shock, same data — different architectural reflexes. Walmart ran a recomposition loop. Target absorbed in a queue.

Both are static architecture in production. ASOS halted at the exception. Target absorbed in the queue. Walmart, with the same shock, kept the recomposition loop running. The architecture is the difference.

Notes & sources

A note on the limits of this argument

Composable architectures bring their own challenges. Explainability gets harder. Governance gets harder. They do not eliminate the need for human planners; they shift what planners do, from manual exception triage to setting policy and reviewing the decisions the system flags. The work changes. It does not disappear.

The transition is also imperfect. Gartner finds 56% of chief supply chain officers cite integrating AI with legacy systems and processes as a major challenge, and separately predicts over 40% of agentic AI projects will be canceled by the end of 2027 due to escalating costs, unclear business value, and inadequate risk controls. Layering an agentic wrapper on a fragmented, siloed data architecture compounds the problem rather than fixing it. The architectural shift is real. So is the cost of doing it badly.

Sources

  • Fig. 02 (Autonomous resolution rate): McKinsey QuantumBlack, “Seizing the Agentic AI Advantage,” 2025-06-13 — “Up to 80 percent of common incidents could be resolved autonomously, with a reduction in time to resolution of 60 to 90 percent (Exhibit 5).”
  • Fig. 03 (Continuous adaptation): McKinsey QuantumBlack, “Seizing the Agentic AI Advantage,” 2025-06-13 — “parallel execution that collapses cycle time, real-time adaptability that reacts to changing conditions, deep personalization at scale, and elastic capacity that flexes instantly with demand.”
  • Case 01 (ASOS 2019): Supply Chain Dive, “Warehouse tech glitches cause $25M disruption for Asos,” 2019. supplychaindive.com
  • Case 02 (Walmart vs Target 2022): Supply Chain Dive, “Walmart’s inventory glut recedes,” 2022 (Q3 earnings analysis). supplychaindive.com. Retail Dive, “After prompt action on inventory, Target is poised for a comeback,” 2022. retaildive.com
  • Limits of the argument: Gartner press release, “Survey Finds Technology Integration and Talent Perceived as Key Roadblocks to Scaling AI in Supply Chain,” 2026-04-29 (56% CSCO finding). Gartner press release, “Predicts Over 40% of Agentic AI Projects Will Be Canceled by End of 2027,” 2025-06-25.

Nachiket Murthy

Nachiket Murthy

Product Marketing Manager

Nachiket leads Product Marketing at Locus, bringing over seven years of experience across financial analysis, corporate strategy, governance, and investor relations. With a multidisciplinary lens and strong analytical rigor, he shapes sharp narratives that connect business priorities with market perspectives.