Martin Szerment
AuthorPublished on April 6, 2026
Did you like this article? Share it!
In 2024, factories began experimenting with generative AI — copilots for documentation, SOP assistants, shift-report generators. In 2025, Gartner named Agentic AI the #1 strategic technology trend for 2026. The difference is fundamental: generative AI answers; agentic AI acts.
An agent is a system that receives a goal — not a step-by-step instruction — plans the sequence of actions itself, calls tools (MES API, ERP, CMMS, planning system), observes the result, and iterates until the goal is met or an exception is escalated. In the factory, this means that instead of a planner manually sequencing orders in an APS system, the goal ("fulfill orders 4512–4518 by Friday with minimum changeovers") is handed to an agent that draws up the schedule, picks the lines, checks component availability in ERP, and publishes the plan.
Agent, not chatbot — what really makes the difference
A classical factory chatbot answers questions and generates text. An agent:
- Has memory (contextual and long-term) — remembers what it did five minutes and five days ago.
- Uses tools (tool use) — calls the APIs of factory systems, reads databases, queries the MES.
- Plans — decomposes a goal into steps, executes them in order, reacts to errors.
- Observes the world — consumes telemetry, alarms, machine status.
- Acts autonomously — makes decisions without human confirmation, within a defined policy.
The typical 2026 architecture is a multi-agent system: one agent handles planning, another maintenance, a third quality, a fourth energy. Agents communicate through a shared memory layer and an orchestrator (LangGraph, CrewAI, AutoGen, Microsoft Semantic Kernel), each with access to a specific subset of industrial systems.
Market data — how large is this really
Gartner forecasts that by 2028, 33% of enterprise applications will include an Agentic AI component (versus less than 1% in 2024). The same report estimates that 15% of daily operational decisions in enterprises will be made autonomously by agents.
McKinsey's early-2026 analysis sizes the potential of Agentic AI in manufacturing at USD 0.5–1.5 trillion of annual added value globally — driven mostly by reductions in planning cost, maintenance cost, and supply-chain management.
Deloitte's State of AI in Manufacturing 2026 reports:
- 42% of manufacturers have at least one AI agent pilot running in production.
- 18% have deployed an agent at line or functional-area level (maintenance, quality, planning).
- Only 3% use agents in decisions that directly affect machine control without human sign-off.
Concrete deployments — 2025–2026
Siemens Industrial Copilot and PLC agents
In mid-2025, Siemens extended Industrial Copilot with an agent layer that generates and modifies PLC programs (Siemens TIA Portal) based on a business goal. The agent analyzes existing code, proposes a change, generates a test, and deploys it to a test cell. BSH Hausgeräte reports a 30% reduction in engineering time for new line processes.
Schneider Electric and autonomous energy planning
Schneider deployed a multi-agent system at plants in India and France. An energy agent optimizes, in real time, load distribution between the grid, on-site generation (PV) and storage, communicating with a production agent. Early data shows 8–12% reduction in energy cost compared to a classical EMS.
Hitachi and autonomous maintenance
Hitachi deployed a CMMS agent in semiconductor plants. The agent reads alarms, vibration and operating parameters, opens service orders, orders parts in SAP and proposes a maintenance window aligned with the production plan. The mean time from anomaly detection to work order dropped from roughly 4 hours to roughly 8 minutes.
BMW and multi-agent quality control
BMW is piloting, in Regensburg, a quality agent that consumes machine-vision images and MES data, correlates defects with process parameters (mold temperature, pressure), and proposes recipe adjustments — applied to production after engineer approval. Defect reduction in the pilot: 17%.
OmniMES and operational agents
In the MES segment for mid-sized plants, we are seeing broader deployment of operational agents — shift planning, OEE analysis, shift-report drafting. The same trend shows up at all major MES vendors (AVEVA, Siemens Opcenter, Rockwell Plex).
Technology stack — what this actually runs on
The foundation layer:
- Language models: Claude 4.6, GPT-5, Gemini 2.5 — all offer dedicated agent APIs with tool-calling, streaming and long contexts (1M tokens).
- Orchestration: LangGraph, CrewAI, AutoGen (Microsoft), Semantic Kernel, Anthropic Agents SDK.
- Memory: vector stores (Pinecone, Weaviate, pgvector) plus episodic memory.
- Observability: LangSmith, Arize, Helicone — tools dedicated to agent decision logging and audit.
The integration layer has converged on MCP (Model Context Protocol) — the Anthropic standard from 2024 that lets an agent talk to tools (MES, ERP, CMMS) without writing a dedicated integration per model. In 2026 MCP is adopted by OpenAI and Google as well, removing lock-in on any single model vendor.
Barriers — where agents don't work yet in 2026
1. Control and auditability
An agent makes decisions based on a probabilistic model. For ISO 9001, IATF 16949, FDA 21 CFR Part 11, every product-affecting decision requires an audit trail that stands up in front of an auditor. Logging prompts is not enough — you need a structured representation of the decision, versioned against the model and policy. Mature tooling is only now emerging.
2. Hallucinations in tool calls
An agent can "invent" non-existent API parameters, wrong order IDs, or malformed data structures. Error rates in tool calling on production-grade benchmarks (τ-bench, WebArena) sit at 3–10% depending on model and domain in 2026. In a production environment, that is too high to let an agent change a recipe without a second pair of eyes.
3. No standard data model for industry
An agent needs to understand what a line, order, product, operation, or resource is. In most plants, this vocabulary is inconsistent between MES, ERP and PLM. Without a Unified Namespace and a coherent data model (ISA-95, B2MML), the agent gets data it either doesn't understand or understands incorrectly.
4. Inference cost at scale
An agent that queries the model at every iteration of a planning loop generates real cost. For a plant running hundreds of agents around the clock, API cost can hit USD 10k–50k per month. Mitigations: smaller local models (Llama 4, Qwen 3) for routine decisions and prompt caching (up to 90% cost reduction for repeated context).
5. Cybersecurity and prompt injection
An agent with access to MES/ERP is a new attack surface. Indirect prompt injection — injecting instructions via an external document, email or CMMS entry — is a real risk. In 2026, OWASP published the first "Top 10 for LLM Applications" guidelines dedicated to agents, but adoption in factories is uneven.
6. Organizational readiness
An agent changes the decision process. The question of who is responsible for an agent's decision — the process engineer, the system vendor, the model vendor — is open. The EU AI Act enters active enforcement in 2026 and imposes concrete obligations on high-risk systems, including agents managing critical production infrastructure.
What this means for your factory — practical takeaways
Four recommendations for manufacturers in 2026:
-
Start with high-volume, low-decision-risk processes — shift reports, incident summaries, service-order drafting. The ROI lands in months and errors are reversible.
-
Deploy agents in a "human-in-the-loop" mode before "human-on-the-loop". First mode: the agent proposes, the human approves. Second mode: the agent acts autonomously, the human supervises and can stop it. The transition between modes is a matter of metric maturity, not executive enthusiasm.
-
Fix the data foundation. MES, Unified Namespace, a coherent ISA-95 model, clean masters in ERP. An agent built on chaotic data produces chaotic decisions — only faster.
-
Invest in observability and policy guardrails. Every agent action must be logged in an auditable way. Define the list of operations an agent cannot perform without human approval (recipe change, purchase orders above a threshold, line stop).
Bottom line
Agentic AI is not chatbot v2. It is a change in the software operating model of a factory — from a tool the operator launches to an actor that makes decisions within a delegated policy. In 2026, the technology is ready for deployment in low-risk, high-volume areas. Companies that build the data and observability foundation this year will, in 18–24 months, be ready to move agents into critical areas — production planning, maintenance, energy control.
Companies that don't do this today will, two years from now, be buying ready-made agents from competitors — and paying for integration far more than it would have cost to build it themselves.
Sources
- Gartner, Top Strategic Technology Trends for 2026: Agentic AI, 2025.
- McKinsey & Company, The Economic Potential of Agentic AI in Manufacturing, 2026.
- Deloitte, State of AI in Manufacturing 2026, 2026.
- IBM Institute for Business Value, Agentic AI: From Experiment to Enterprise, 2025.
- Anthropic, Model Context Protocol Specification, 2024–2025.
- Siemens, Industrial Copilot — Customer Case Studies (BSH), 2025.
- Schneider Electric, Multi-Agent Energy Management Pilot Results, 2025.
- OWASP, Top 10 for LLM Applications, 2026.
- European Commission, AI Act — Guidelines for High-Risk Industrial Systems, 2026.
