Evaluating what the word agentic does and doesn't commit a vendor to.
May 20267 min read
The rubric.
Seven specific, falsifiable properties an agentic TMS either has or doesn't, each verifiable in the product without taking a vendor's word for it. Use them in the RFI. Use them as questions to ask your incumbent.
01Tenet 01
Decisions, not insights.
A legacy TMS surfaces information. An agentic TMS surfaces decisions.
The legacy promise was that visibility would make planners faster. A decade in, the data says otherwise. When the control tower flags a carrier rejection spike or a regional disruption, the dashboard demands a planner who can model alternatives, assess downstream impact across inventory echelons, and execute a rebooking in the TMS. The window for effective mitigation usually closes before the planner finishes interpreting the alert.
An agentic system reverses the flow. The carrier rejection is not an alert; it is the trigger. The agent evaluates SLA risk, queries spot capacity, calculates cost-to-serve, tenders to the new carrier, updates the ledger, and notifies the customer. McKinsey's March 2025 work with COOs frames the shift as one from AI-as-recommendation-engine to AI-as-execution-engine.
Same disruption. One asks a planner to fix it. The other has already fixed it.
02Tenet 02
Learning, not configuration.
A legacy TMS improves when solution engineers update it. An agentic TMS improves from execution behaviour and outcomes, without re-engineering.
Every vendor claims continuous learning. Configuration looks like learning when a planner only sees the output. The difference shows up when conditions change. A carrier starts rejecting tenders on a specific lane because of an unannounced capacity reallocation. A configured system keeps offering that carrier until someone codifies a new rule. A learning system observes the rejection pattern, deprioritises the carrier on that lane, and shifts volume, all without an admin altering the routing guide.
Crucially, the system is not changing itself on its own. It is absorbing signal from your operators' overrides and from the outcomes your network produces, then refining its decision policies within the boundaries you set. Deloitte's Tech Trends 2026 makes the point directly: organisations that still improve sequentially cannot keep pace with those running continuous learning loops.
The fake-proof test: ask the vendor to walk you through three specific behaviour changes the system made in the last 90 days, each with the data signal that triggered it and the audit-trail entry that recorded it.
Configured systems improve when someone tells them to. Learning systems improve as the network teaches them.
03Tenet 03
Events, not batches.
A batch decision is made before the day begins, against the data the day already had. An event-driven decision is made when the data arrives.
The risk in batch is not slowness. It is silent algorithmic failure: a mathematically optimal decision against a reality that no longer exists. A 30-minute-old picture of inventory is enough to make a confident, wrong allocation. A 24-hour-old picture is enough to do it at scale.
Event-driven architecture treats every state change in the physical world (a geofence crossing, a reefer temperature deviation, a carrier rejection, a customer reschedule) as a discrete signal the decisioning layer consumes in milliseconds. McKinsey's February 2025 supply-chain work notes that real-time, connected ecosystems remove silos and produce "lower costs, smaller inventories, and fewer lost sales." SCMR's June 2024 coverage of the CSCMP State of Logistics frames the broader context: logisticians are adapting to "permanent volatility." A system that integrates that volatility as it arrives produces different decisions than one that absorbs it overnight.
Same reschedule. Batch waits until morning. Event-driven rebuilds the day in seconds.
04Tenet 04
Composable, not monolithic.
A monolithic TMS ships as one product whose modules entangle. A composable TMS ships as components you can swap, configure, or extend independently, whether through APIs, configs, or natural-language policy.
Components can be replaceable through APIs, but they can also be replaceable through config layers or through natural-language policy interfaces that operators write into the system directly. What matters is whether you can pull the carrier-management module without re-implementing the pricing engine or the optimisation layer. The seam between components and modules.
Gartner's January 2024 "Predicts: Composable Modularity Shapes the New Digital Foundation" calls this the foundational architecture for "continuous access to adaptive change." In practice this looks like Packaged Business Capabilities, MACH-style decoupling, and agent-builder interfaces that let an enterprise hot-swap a routing optimiser or a settlement workflow without triggering a regression test across the entire suite.
Same need: swap one module. One requires a re-platform. The other takes a config change.
05Tenet 05
Governance, not goodwill.
A system that can tender freight, sign digital contracts, and reroute intermodal shipments has direct control over financial liability. Operating that system on goodwill, the hope a probabilistic model will respect compliance and cost constraints, invites gaps at machine speed.
The agentic shift demands a layered governance regime built into the architecture, not bolted on after a deployment review. Forrester's March 2025 analysis frames the requirement plainly: scaling agent workflows depends on "effective data and communications governance," not assumed system benevolence.
In practice this means every autonomous decision must carry its own receipt: why it was made, what data triggered it, what the outcome was, and whether a human needs to intervene. The system should let operators dial autonomy up or down by decision type, run new policies in shadow mode before they touch live freight, and score its own choices against what a planner would have done.
A decision without governance is a verdict. With governance, it is a contract you can audit.
06Tenet 06
Ground-truth, not happy-path.
A TMS is only as smart as the data that feeds into it. The plan diverges from reality the moment a feed lags, and from there, every "exception" the planner reviews is either spurious or already late.
Most legacy TMS were designed for a happy-path operation: EDI arrives clean, dock times hold, inventory updates synchronise. Real logistics is none of those things. Carrier APIs fail. GPS pings drop. POD scans come in late. Weather closes a corridor at 03:00 and the system finds out at 09:00. Naive systems treat these as edge cases. Agentic systems treat them as the default operating condition, and they architect for it.
The cost of getting this wrong is silent. A traditional application that hits bad data throws an error. A naive AI agent that hits stale data confidently executes a wrong decision and nobody knows for hours. McKinsey's April 2025 supply-chain research frames the operating reality: companies are "striving for end-to-end visibility, efficiency, and agility... yet outdated infrastructure, fragmented data, and supply chain disruptions make these goals difficult to achieve." Ground-truth orchestration is what closes the gap. Exception handling is downstream of it; without ground-truth, exception handling is just reconciliation labour.
Same day. One TMS believes its own plan. The other knows reality has moved.
07Tenet 07
Network-aware, not lane-by-lane.
A lane-by-lane TMS optimises the next deviation. A network-aware TMS solves for cost-to-serve across the whole network.
Local or node optima are routinely network-pessimal. Saving $200 on a cheaper truckload can delay a delivery, trigger stockouts at the destination, and force $10,000 in expedited primary freight. The TMS that picked the cheap leg "won" on its own metric; the enterprise lost. Network-aware decisioning is what the industry calls Multi-Echelon Inventory and Transportation Optimization (MEIO). The system weighs every transport decision against inventory holding, real-time demand, facility throughput, and working-capital constraints in real time.
The empirical case is increasingly hard to argue with. BCG's October 2025 work on AI-first companies documents a medtech firm that embedded AI forecasting across its end-to-end planning process and optimised multi-echelon inventory: 60% reduction in backorders, more than $125M in inventory reduction in under 12 months. Those numbers don't come from local optimisation. They come from a system whose decisions are conditioned on the whole network.
The cheapest leg picks one edge. The cheapest route reasons over the whole graph.
A counterpoint.
Ascertaining a vendor's score against these tenets is the easy part. Reading their trajectory is the work.
Most TMS in market clear few of the seven, perhaps even partially. A composable, network-aware, ground-truth system that learns continuously and allows your teams to govern every decision is the end goal.
What separates the field is what is sitting behind the score. A small group of vendors are genuinely building toward the rubric, with the architecture, multi-agent systems, and policy-as-code infrastructure to back it.
A larger group are selling an impossible roadmap. The plan is credible, but the underlying system is still mostly legacy now with a language-model layer. The veneer is recent and the engineering required to support it has not started.
Agentic architecture takes years to build, and the vendors who have been building for years are visibly ahead. Read their trajectory, not the slide.
On schematic visualisations. All seven figures are schematic. Resolution times, drift curves, and network graphs illustrate the structural distinction.
Nachiket Murthy
Product Marketing Manager
Nachiket leads Product Marketing at Locus, bringing over seven years of experience across financial analysis, corporate strategy, governance, and investor relations. With a multidisciplinary lens and strong analytical rigor, he shapes sharp narratives that connect business priorities with market perspectives.