Why Rule-Based TMS Logic Breaks at Modern Retail Scale An Architectural Diagnosis
May 20265 min read
Cyber Monday. 12:06am. A flash promo just dropped on a stockout-prone SKU. Your East Coast 3PL hub posts a temporary capacity alert. A returns truck routing for that exact SKU is in flight. The TMS has to compose a decision in the next three minutes that resolves all three signals together.
It composes none of them. It runs them as three separate rules, in the order they arrived, against three different decision contexts. By 12:11am, two have fired correctly, one has fired incorrectly, and the fourth decision — the one that needed all three inputs in conversation — never gets composed.
Let's go under the hood.
What rule-based architecture got right
Rule-based architecture had a job and did it well. In the late 1990s and early 2000s, when most enterprise TMS platforms were architected, retail freight was predictable. Order volume came in batches. Carrier capacity was contracted in advance. Lanes were stable. A planner ran the day in the morning, and the system executed against rules that encoded routing guides and exception thresholds.
The architecture worked because the problem was largely linear. The architectural assumption underneath it — that retail logistics is a sequence of largely independent decisions executed against stable parameters — was true at the time.
It is not true now. Where retail signal volume now arrives in continuous streams, and decisions span cost, service, capacity, and carbon objectives at once, a rule-based architecture is structurally limited to processing the past. The gap is where margin leaks.
What changed: the decision shape
Three shifts have stretched the rule-based model past its envelope. Each one moves the architectural floor faster than rule-set maintenance can follow.
Fig. 01 Static plan-and-execute vs continuous replanning, conceptual
Rule-based architectures lock a plan against a snapshot of state and execute against it. When state changes mid-execution, recovery is exception-handled by humans. Composable architectures treat planning as a continuous process — re-optimizing as state evolves.
Scroll to explore →
It uses cost, service, capacity, and SLA constraints to build the plan. The architectural failure isn’t that it ignores those constraints — it’s that the plan is locked against a snapshot. When carriers change capacity, when a regional hub posts an alert, when a returns truck reroutes mid-flight, the rule-based architecture has no native mechanism to recompose the plan against the new state. Recovery exits the planning loop and enters the human exception queue.
Fig. 02 Autonomous resolution rate, agentic vs rule-based exception management
Up to 80% of common incidents can be resolved autonomously. Time-to-resolution drops by 60–90% versus dashboard-and-human exception handling.
When a single rule fires incorrectly, downstream effects cascade through every decision that depended on it. By the time a planner is alerted that a 9am cut was missed, the rerouting options that existed at 8:55am are gone.
Learn how the world's first agentic TMS can act in real-time governed by the policies your team defines
Fig. 03 Static rule-set maintenance vs continuous adaptation, conceptual
A rule-based system is exactly as smart as the day it was last patched. Agentic systems collapse cycle time through parallel execution and adapt continuously to changing conditions.
A rule-based system suffers from logic degradation: as the physical network evolves, the coded logic in the TMS becomes increasingly detached from physical reality. Maintaining it requires manual recalibration projects whose parameters are obsolete by the time they reach production. Agentic systems close that loop in production. Rules still encode policy and intent. What changes is the optimization layer, the learning loop, and the simulation capability that sits around them.
These three shifts describe a fundamentally different problem shape.
Where this surfaces in your operations
Peak season chaos is data velocity hitting a system that batches.
Store-fulfillment cost spikes are decision interdependency the system cannot evaluate, so it picks the closest store and hopes.
Slot adherence drift is the cascading effect of one delayed offload that the system cannot recompose around.
Planner firefighting is what happens when the system cannot close a learning loop, so it offloads the loop to humans.
Returns routing leakage is decision interdependency at the network level, expressed as a static default that ships every parcel home.
The dashboard cannot show any of this because the dashboard reports against the parameters the system was given.
When the architecture broke in public
Two retail cases show what happens — and what doesn’t — when an architecture meets a state change it cannot replan against. The first illustrates a system halting at a partial fault. The second is a side-by-side: two retailers, the same data shock, two architectural reflexes. Neither is an outlier. They’re the same pattern that plays out every peak season inside retail TMS deployments at smaller scale — it just rarely makes the press.
ASOS, 2019.
ASOS rolled out a new automated storage-and-retrieval system at its Berlin Eurohub. The retrieval side worked. The put-away side could not keep pace, generating an inbound backlog the automation could not recompose around. Picking faults compounded — items left out of orders, customer cancellations, profit warning issued. Supply Chain Dive documented the architectural shape: the system treated the fault as an exception to halt rather than as a state to recompose around. Profit impact: £20–25M ($25–31M) immediate; warehouse-transition costs rose from a £35M budget to £47M; full-year profit slumped 68%.
The architecture had no native mechanism to recompose against partial failure. It halted, and the human queue absorbed the cost.
Walmart vs Target, 2022.
Both retailers received pandemic-era holiday inventory in early 2022. Both faced a sudden demand collapse as consumer spending shifted from goods to services. The architectural responses diverged. Walmart took an item-by-item, category-by-category approach — canceled billions of dollars in orders, repriced aggressively on shorter-lead items, and trimmed about one-third of its U.S. excess inventory between Q2 and Q3 in a single quarter. Target, by contrast, took two large inventory charges within weeks, absorbed heavy markdowns through year-end, and saw a margin compression that lasted into 2023. Both retailers had access to similar information. The architectural difference was in the speed and granularity of dynamic re-composition.
Same shock, same data — different architectural reflexes. Walmart ran a recomposition loop. Target absorbed in a queue.
Both are static architecture in production. ASOS halted at the exception. Target absorbed in the queue. Walmart, with the same shock, kept the recomposition loop running. The architecture is the difference.
Nachiket Murthy
Product Marketing Manager
Nachiket leads Product Marketing at Locus, bringing over seven years of experience across financial analysis, corporate strategy, governance, and investor relations. With a multidisciplinary lens and strong analytical rigor, he shapes sharp narratives that connect business priorities with market perspectives.