Skip to main content

Pump by Exception: The Definitive Guide for Oil & Gas Operators

Pump by exception replaced fixed routes with SCADA-driven alerts, cutting unnecessary site visits by 20-40%. But as operations scale, alarm fatigue and missing economic context expose its limits. Here is everything you need to know about where it started, where it breaks down, and what comes next.

Why Pump by Exception Exists

For decades, oil and gas field operations ran on fixed routes. A pumper would visit every well on a set schedule, whether anything was wrong or not. The model had deep problems:

Wasteful

Up to 60% of scheduled visits found nothing wrong. Trucks rolled, fuel burned, and hours passed with zero productive outcome.

Dangerous

More windshield time means more exposure to road hazards, especially on rural lease roads in poor conditions.

Slow to Respond

A well could go down Monday morning and not be checked until Wednesday afternoon if that was the next scheduled visit.

Does Not Scale

When operators consolidated through M&A, well counts doubled but headcounts did not. Fixed routes became physically impossible.

Pump by exception emerged as the answer: use SCADA sensors to monitor wells continuously and only dispatch operators when something actually needs attention. It was a genuine breakthrough that transformed field operations across the industry.

How Pump by Exception Works

A pump by exception system has three fundamental layers, each building on the one below it.

Layer 1: SCADA & Sensors

Pressure transducers, flow meters, tank-level sensors, and RTUs continuously monitor every well. Data flows via cellular, radio, or satellite to a central historian at intervals ranging from seconds to minutes.

Layer 2: Rules Engine & Alarm Management

Configurable thresholds define "normal" operating windows for each parameter. When a reading breaches its threshold, the system generates an exception. More sophisticated engines allow compound rules (e.g., high casing pressure AND low production rate).

Layer 3: Mobile Delivery & Dispatch

Exceptions are pushed to field operators via mobile apps, text messages, or email. The operator reviews the alert, decides whether to visit the site, and logs the outcome. In basic implementations, dispatching is manual. In more advanced systems, alerts are routed to the nearest qualified crew member.

The Competitive Landscape

Several vendors have built products around exception-based field operations, each with different strengths and target markets.

IFS Merrick has long served mid-size operators with production management and field data capture. Their exception workflows are tightly integrated with production accounting, making them popular with operators who want a single vendor for back-office and field systems. The trade-off is limited flexibility in alarm configuration and a user experience that reflects the system's legacy architecture.

W Energy Software provides enterprise resource management with exception-based workflows layered on top. Their strength is the breadth of their platform, covering everything from accounting to regulatory. The exception management module is functional but not the core focus, and operators looking for advanced alarm logic or mobile-first experiences often find it lacking.

PakEnergy (formerly SherWare) offers a cloud-based production management platform with growing exception capabilities via Scout FDC. Strong in the Permian and Mid-Continent, they have invested in mobile field capture and basic alarm workflows. Their platform is evolving, but the progression to ranked-priority work execution is not on their published roadmap.

Seven Lakes (JOYN 2) launched in 2019 as "the first pump-by-exception platform built for oilfields." Strong configurable rules engine and dynamic routing; cloud + on-premise architecture; mobile app for field workers. The published platform brief is feature-led with no quantified customer outcomes and predates the 2025–2026 agentic-AI shift. Operators who started on JOYN 2 for pump-by-exception are now evaluating the next-generation ranked-priority loop.

EZ Ops is a Canadian-born platform focused on field data capture and exception management. Their mobile-first approach resonates with operators who want a clean, modern user experience. Strong in Western Canada, they have been expanding into the U.S. market. Exception handling is well-implemented but stops at the alarm level without economic scoring.

Ambyint focuses on rod pump optimization using AI to detect inefficiencies and recommend setpoint changes. Their approach is narrower than full exception management, concentrating on specific equipment types rather than whole-field operations. Effective for rod pump fleets but does not address the broader operational workflow.

ChampionX (formerly Apergy) offers production optimization services including XSPOC for rod pump analysis and SMARTEN for ESP optimization. Their strength is deep equipment-level intelligence, but the platform operates as a point solution rather than an integrated field operations system.

Emerson / Zedi combines SCADA infrastructure with cloud-based production monitoring and basic exception alerting. Their advantage is vertical integration from the sensor to the dashboard. Operators using Emerson SCADA hardware get a seamless data pipeline, but the exception management layer is relatively basic compared to purpose-built operations platforms.

Where Does WorkSync Fit?

The solutions above focus on identifying exceptions. WorkSync OPS goes further, using ML anomaly detection, economic scoring, and constraint-based route optimization to turn those exceptions into complete, ranked, crew-specific daily work plans. It layers on top of any existing SCADA or exception-based system without replacing it. See how we compare →

Where Pump by Exception Breaks Down

Pump by exception was a genuine leap forward. But as operators scale and margins tighten, five fundamental limitations emerge.

1. Exception Overload

As well counts grow, the number of active exceptions can overwhelm operators. A pumper responsible for 150 wells may see 30-50 exceptions on a busy day with no way to triage them effectively.

2. All Exceptions Look Equal

A low-pressure alarm on a 5 BOPD stripper well generates the same alert as a pump failure on a 200 BOPD producer. Without economic context, operators cannot distinguish between a $50/day problem and a $15,000/day problem.

3. No Economic Context

Exception systems operate on engineering parameters alone. They know nothing about commodity prices, working interest, lifting costs, or net revenue. The alarm does not know what a barrel of oil is worth.

4. One-Dimensional Route Optimization

Most exception systems dispatch based on proximity or simple geographic clustering. They do not account for crew qualifications, equipment requirements, safety constraints, or the economic value of competing tasks.

5. No Integrated Work Planning

Exception-based operations handle reactive work but exist in a silo. Scheduled maintenance, regulatory compliance tasks, and optimization work live in separate systems, making it impossible to build a truly unified daily plan.

What Comes Next: From Exception to Intelligence

The evolution from exception-based operations follows a clear path. First, you move from fixed routes to exception-driven dispatch. Then, you layer economic scoring on top to create pump by priority. Finally, you close the loop with AI that learns from every outcome.

WorkSync OPS is purpose-built for this full evolution. It does not replace your SCADA or exception system; it sits on top of them and transforms raw alarms into economically-ranked, route-optimized, AI-validated work plans.

How AI Solves Each Limitation

False alarms waste time

AI Data QA

Machine learning models validate incoming SCADA data against expected patterns, flagging sensor drift and communication errors before they generate false exceptions.

All exceptions look equal

ML Anomaly Detection

Instead of static thresholds, WorkSync's decision agents and ML models learn each well's unique operating signature and identify meaningful deviations from normal behavior, dramatically reducing noise.

React only after things break

Predictive Models

Pattern recognition across failure histories enables the system to predict equipment failures days or weeks before they occur, shifting from reactive to preventive.

Which exception first?

Economic Scoring

Every issue is scored by estimated financial impact: production at risk, commodity price, working interest, and lifting cost. The highest-dollar problems always surface first.

Operators plan their own day

Route Optimization

Constraint-aware algorithms build optimized daily plans considering geography, crew skills, equipment needs, urgency, and the economic value of each task.

Conditions changed mid-shift

Real-Time Re-Optimization

When new issues emerge during the day, the system re-ranks and re-routes in real time, ensuring crews are always working on the most current highest-value tasks.

The system never gets smarter

Continuous Learning

Every field outcome feeds back into the models. Scoring accuracy improves, false alarm rates decrease, and routing gets more efficient with every cycle.

WellOPS · the positioning, plainly stated

WellOPS is the best pump-by-exception system on the market.

The only end-to-end closed-loop automated workflow that balances cash flow, risk, and maintenance to maximize value while efficiently managing asset and personnel safety risks.

Other PBE systems detect exceptions. They send alerts. They might dispatch. They do not close the loop on the work that the exception triggers, and they do not balance cash flow against risk against maintenance against safety on the same ranked queue. WellOPS does. Below is what that actually means in operations.

01 · Cash flow at the top of the ranking

Every issue scored by dollar impact, not by who calls loudest.

The Work Engine ranks every alarm, deferment event, scheduled task, and inspection finding by estimated cash-flow impact. Production at risk, commodity price, working interest, lifting cost, value density. The highest-dollar problems surface first, every shift. Static-threshold PBE systems treat all exceptions equally. WellOPS does not.

02 · Risk balanced against the dollars

Asset risk and operational risk are constraints, not weights.

Equipment-failure-mode risk, regulatory windows (OOOOb, AQCC Reg 7, NDIC capture targets), basin rules (the 2,000-ft setback, OCC seismicity protocol), and asset-class limits all live in the optimizer as hard constraints. The ranking respects them. The pumper never gets dispatched into a job the system knows would breach a constraint.

03 · Maintenance integrated into the same queue

Predictive maintenance is on the ranked plan, not in a separate system.

Anomaly detection on rotating equipment 48 to 72 hours ahead of failure. Workover priority scored on cost-of-failure × probability × intervention payback. Same data layer, same queue, same crew-day allocation as the production work. Other PBE systems run maintenance in a separate CMMS that does not talk to the dispatch loop.

04 · Personnel safety enforced at dispatch

The unqualified worker cannot be dispatched to the unsafe job.

Field Work Management enforces operator qualification (OQ), contractor compliance (ISN, Avetta, Veriforce), hot-work permits, gas-test records, lone-worker check-ins, and JSA completion before a job is released. Hard constraints, not weights. The published TRIR move at the reference deployment was 1.8 to 0.3.

05 · Operator-editable workflows and escalations

No IT project to change an escalation rule.

The superintendent who wants the high-water-cut alarm escalated to the night-shift foreman after 30 minutes configures it themselves in minutes. Workflow logic, escalation timing, dispatch rules, notification routing, all operator-editable in WellOPS. Most enterprise CMMS and PBE systems require a multi-week IT cycle for the same change.

06 · Integrates with the systems you already run

Sits on top of your stack via the Data Hub. No migration required.

Ignition, AVEVA PI, Cygnet, eLynX, Emerson Zedi. Enertia, Quorum, Pak, IFS Merrick, W Energy, WolfePak. Maximo, MaintainX, Peloton. ArcGIS and any historian. Read-only Data Hub reconciles them into one operating data layer. No system of record gets moved. The closed loop runs on top. Read the full architecture.

Cash flow + risk + maintenance + safety, balanced on one ranked queue, end to end. There is no better product on the market for managing your day-to-day oil and gas operations.

Proven Results

15%+
Free Cash Flow Uplift
35%
Fewer Site Visits
83%
Safety Improvement (TRIR)
40%
Lower Liquid Inventories

Making the Transition

The move from pump by exception to intelligent operations is additive, not a replacement. Your SCADA investment, your exception rules, your field workflows: they all remain. WorkSync OPS layers on top, connecting to 40+ data sources and systems you already use.

Integration is pre-built for the platforms most operators rely on: eLynx, Emerson Zedi, ABB, IFS Merrick, W Energy, Enbase, Enverus, and dozens more. Data ingestion typically begins within the first week, with initial priority scoring live within 30 days.

The question is not whether to keep pump by exception. It is whether you are ready to add the economic and AI layers that make it actually work at scale.

Frequently Asked Questions

Pump by exception is an operational strategy that replaces fixed-route well checks with event-driven visits. SCADA systems monitor well parameters continuously and alert operators only when readings fall outside normal thresholds, eliminating unnecessary trips to wells that are running fine.
Pump by exception flags problems when sensor thresholds are breached. Pump by priority goes further by ranking every flagged issue by its estimated economic impact, so operators address the highest-value work first rather than treating all alarms equally.
Most modern SCADA platforms support exception-based workflows, including systems from Emerson (Zedi), ABB, Honeywell, Schneider Electric, and eLynx. The key requirement is the ability to set configurable alarm thresholds and deliver alerts to mobile devices.
Operators typically see a 20-40% reduction in routine site visits after implementing pump by exception, depending on well density, failure rates, and how well alarm thresholds are calibrated. However, alarm fatigue can erode these gains over time without additional prioritization.
AI transforms pump by exception by replacing static alarm thresholds with an agent pipeline, specialized agents that clean data, detect anomalies, and score issues using machine learning models trained on each well's normal behavior. This reduces false alarms by up to 80%, predicts failures before they occur, and layers economic scoring on top so operators always know what matters most.
For pump-by-exception specifically: Seven Lakes JOYN 2 was the first dedicated pump-by-exception platform (2019) and has a strong configurable rules engine; PakEnergy Scout FDC currently ranks #1 on the term; IFS Merrick, W Energy, and EZ Ops also serve the space. For operators ready to move beyond exceptions into prioritized, agentic operations, WorkSync OPS provides a full closed-loop ranked work execution system with economic scoring, constraint-based route optimization, and continuous learning, sitting on top of any existing exception-based system without replacing it.

WellOPS: cash flow, risk, maintenance, and safety on one ranked queue.

The only end-to-end closed-loop automated workflow that balances cash flow, risk, and maintenance to maximize value while efficiently managing asset and personnel safety risks. Ranked daily plan in the truck cab by 6 AM. Operator-editable workflows and escalations. Integration with any SCADA, accounting, or CMMS you already run.

Request Your Free Trial