Skip to main content
A digital visualization of upstream pumpjack operations, representing the AI substrate now disclosed on supermajor earnings calls
Back to Insights
The VisionAI Strategy

AI Is Now a Line Item on the Earnings Call

Five operators cleared the bar in public ($300M at APA, $300M+ at Devon, $9.7B booked at ExxonMobil, 10% of Permian production at Chevron, 80-90% maintenance reduction at ConocoPhillips Norway). Roughly 70 percent of operators are still stuck in pilot phase. The structural divide, the four use cases that travel from supermajor to independent, and the path that runs in weeks instead of years.

Michael Atkin, P.EngMay 22, 202612 min read
$300M+
Devon Energy 2025 AI-led optimization target, raised mid-year from $200M
$300M
APA Corporation 2025 optimization target on the Palantir AIP partnership, up from $200M
$9.7B
ExxonMobil booked against a $15B-by-2027 efficiency target, with AI named as a core lever
10%
Chevron Permian production increase attributed to the data-driven approach
80-90%
ConocoPhillips Norway BU select maintenance task time reduction, Global Digital Twin program
30%
Gartner: generative AI projects abandoned after POC by end of 2025
~70%
McKinsey: oil and gas companies still stuck in the pilot phase
<15%
BCG: oil and gas respondents who say digital has meaningfully created value

$300M at APA. $300M+ at Devon. $9.7B booked at ExxonMobil. 10% of Permian production attributed to AI at Chevron. AI is no longer a slide in the strategic plan. It is a number the CFO is being asked to defend on the quarterly call. The question for every other operator is whether the bar gets cleared on the operator's stack, or whether the company joins the roughly 70 percent still stuck in pilot purgatory.


The Bar Got Quantified in Public

For most of the last decade, "AI in oil and gas" was a tradeshow conversation. The proof points sat behind NDAs. The internal pilots were rolled into broader digital programs and never disclosed at the line-item level. The CFO had nothing to defend, because nothing was being measured at a level the investor could ask about.

That changed in the 2024 and 2025 reporting cycle. The five operators below now publish AI outcomes at a granularity the investor can underwrite. Each number is sourced to a primary disclosure. None of these are vendor claims.

Devon Energy. ChatDVN 3.0 is used daily by roughly half the company. Engineer and geoscientist productivity lift of 15 to 30 percent. 850-plus wells on autonomous artificial-lift optimization. The 2025 optimization target was raised from $200M to $300M-plus mid-year, with AI named as a core lever. Source: the Hart Energy CTO interview and the Q4 2024 earnings call summary.

ConocoPhillips. Global Digital Twin program rolled out across business units, with the Norway BU posting 80 to 90 percent reductions in select maintenance task time and 90 percent less time to locate equipment for preventive work. Source: the JPT and CIO coverage of the deployment.

APA Corporation. Palantir AIP deployed across planning, maintenance, production optimization, and contracts. The 2025 optimization target was raised from $200M to $300M, with the AI partnership extended in September 2024. Source: BusinessWire and World Oil coverage of the Palantir partnership.

Chevron. A 10 percent production increase in the Permian basin attributed to the data-driven approach. The ApEX generative AI tool was deployed for Gulf of America subsurface workflows. Source: Chevron's newsroom and the VentureBeat coverage.

ExxonMobil. $9.7B booked against a $15B-by-2027 efficiency target, with AI workflows named as a core lever. Source: the Constellation Research coverage of the company's process automation and AI program.

The pattern across all five is the same. Each operator is running a vertical AI deployment against the SCADA, EAM, production accounting, and historian data the operator already owned. None of them paused for a multi-year data lake. None of them paid for a horizontal copilot license and waited for it to figure out the operating problem on its own. The work was vertical, the data was the operator's own, and the outcomes were measured at the level the CFO needed to disclose.

What the Other 70 Percent Are Doing

The same year these five disclosures rolled into the earnings transcripts, three of the most-quoted research firms in oil and gas boardrooms published the counter-statistic. The bar exists. Most operators are not clearing it. The structural reasons keep showing up across every industry conversation.

Gartner. 30 percent of generative AI projects will be abandoned after proof of concept by the end of 2025. The agentic-AI version is worse: 40 percent of agentic-AI projects will be canceled by 2027. The model is not the problem. The data the model was asked to learn from is. Source: Gartner press releases July 2024 and June 2025.

McKinsey. Roughly 70 percent of oil and gas companies have not moved past the pilot phase. Only 17 percent of firms realize more than 75 percent of expected savings from digital programs. Source: McKinsey's industry insights on oil and gas digital transformation.

BCG. Fewer than 15 percent of oil and gas respondents say digital has meaningfully created value. The dashboard nobody opens is the most expensive code in the operator's stack. Source: BCG, "Creating Value Through Digital in Oil and Gas."

The generic-LLM trap. ChatGPT, Microsoft Copilot, and the other horizontal copilots cannot see the operator's SCADA, EAM, AFE history, reserves forecast, or historian. Asking a horizontal copilot to optimize the operation is asking it to guess. There is no published US operator failure case that names this as the proximate cause, because the failures get rolled into "the pilot was inconclusive" and shelved. The structural problem is consistent: a model that is not connected to the operator's data cannot produce field-grade decisions.

Three of those four anti-patterns are research-firm statistics. The fourth is a structural truth that every junior engineer who has pasted production data into ChatGPT will recognize. The operators clearing the bar in the section above are clearing it because they avoided all four.

Get the WorkSync Field Ops Brief

Monthly read for upstream + midstream operations leaders. Case studies, benchmarks, and what's changing in the field. Unsubscribe anytime.

What the Independent Sees on the Other Side of the Disclosure

Every operator below the supermajor tier is being asked the same question by the board right now: what is the operator's number against the AI bar? The honest answer for most independents is some version of "we are running a pilot." That answer was acceptable in 2023. It is not acceptable in 2026, because the supermajors have moved the bar from "is AI ready for our operation" to "what is the booked value, on what asset population, against what baseline." The independent that walks into a board meeting in late 2026 with a pilot-phase update will be benchmarked against ExxonMobil's $9.7B booked figure, fairly or not.

The path forward is not to copy the supermajor playbook. The supermajors paid six to eight years of tuition to learn what worked. They built data lakes that they no longer use as the primary AI substrate. They hired chief data officers who manage organizations that are too large for an independent to fund. The lesson the independent should extract is not the architecture. It is the result. The supermajor proof points (ExxonMobil and SLB gas-lift optimization on 1,300-plus wells, ConocoPhillips PLOT on 4,500-plus wells, Devon's autonomous lift on 850-plus wells) all ran against the SCADA history the operator already owned. The independent that has the same SCADA history can run the same class of decision against it, without paying the supermajor tuition. The detailed argument lives in The Data Lake Is a 2017 Idea.

Four Use Cases That Travel from the Supermajor to the Independent

The five operator disclosures above all reduce to a small set of use cases that operate at any scale, against the data the operator already has. Each one ties back to a WorkSync product line. Each one has either a public supermajor proof point or a deployed reference at a top-25 private producer running on 5,000+ wells.

Pump-by-priority. Predictive maintenance plus autonomous artificial lift, scored by economic impact rather than fixed-interval routing. Devon's autonomous-lift program on 850-plus wells is the public proof. The same loop runs at the top-25 private producer reference deployment of WellOPS, scoring every active well overnight by revenue variance and risk, publishing the ranked list before 6 AM. The four-week pilot version of this is laid out in The 4-Week Pump-by-Priority Pilot.

Variance commentary that writes itself. The agent reads the historian, the production database, and the ledger, drafts the weekly variance commentary, and surfaces the deviation worth investigating. Devon's ChatDVN 3.0 is the public proof of the daily-use case. The independent version is the WorkSync Data Hub plus the variance-commentary agent, running on the existing production accounting feed.

Engineering-hour compression. Hydraulic and facilities modeling from a multi-hundred-hour study cycle to a sub-hour iteration. FlowSync runs the engineer's own model against the calibrated network, drafts the MOC language, and cites every change back to the source document. The detailed gas utility case is in The Million-Dollar Model.

Closed-loop work management. Voice-first field capture into the system of record, every observation tagged to the well, the asset, and the cash-flow impact. The pumper's 90 minutes of post-route data entry compresses to zero, and the next day's plan reflects yesterday's actual conditions. The structural argument is in Exception-Based Surveillance.

Every one of those four use cases runs against the operator's existing data. None of them requires a multi-year data lake. None of them is a horizontal-copilot deployment. Each one is the operator's own SCADA, EAM, production accounting, and well file, surfaced to the lease operator, the dispatcher, the planning engineer, and the operations manager, in the form each one already uses.

What Clearing the Bar Looks Like on a Small-to-Mid Independent

The supermajor case studies above ran on asset bases of 850 to 4,500-plus wells. The deployed reference at the top-25 private producer on WorkSync runs across 5,000+ wells in the Western Anadarko, Permian, and Wyoming basins. The minimum viable footprint is smaller than that.

An operator with a single SCADA system, a production accounting feed, and an EAM tool has enough operating data to clear the bar on at least one of the four use cases above. The constraint at small scale is not data volume. It is the maturity of the field workflow that consumes the ranked decisions. The Data Hub integration time on a typical independent stack is under one week, in read-only mode, with no rip-and-replace. The first ranked daily plan publishes inside 30 days. The first closed-loop deployment runs inside 90.

The Impact Guarantee is in writing. Pick the metric in week zero (production uplift, deferment reduction, route-time recovery, study turnaround, whichever metric the CFO will sign for). Run the loop for four weeks. If the metric moves, the operator signs the annual subscription. If it does not, the operator walks away. No license fee. No kill fee. Modules start at $15K.

The supermajors paid six to eight years of tuition to learn what works. The vertical-AI substrate that produced their disclosures is accessible to the 500-well independent at a fraction of the supermajor cost, without paying tuition the supermajors already paid. The question is whether the operator answers the board's AI question with a pilot update in late 2026, or with a booked outcome.

Five names cleared the bar. The math says most operators are still stuck behind it. The bar is not theoretical, the proof points are not vendor claims, and the path is no longer 18 months.

You are one week away. Request your free trial.

Frequently Asked

What does "AI on the earnings call" actually mean?

It means the AI program is disclosed at a line-item level the CFO is asked to defend on the quarterly call. ExxonMobil now reports against a $15B-by-2027 efficiency target with AI named as a lever ($9.7B booked). Devon Energy publishes a raised $300M+ optimization target with ChatDVN and autonomous artificial lift named. APA discloses a $300M optimization target on the Palantir AIP partnership. The threshold is no longer "are we doing AI." It is "what booked value, on what asset population, against what baseline."

Which operators are publicly disclosing AI outcomes in oil and gas?

Five operators with primary-source disclosures in the 2024 to 2025 cycle. Devon Energy (ChatDVN 3.0 daily use, 15-30% engineer productivity, 850-plus wells on autonomous lift, $300M+ target). ConocoPhillips (Global Digital Twin program, Norway BU 80-90% maintenance reduction). APA (Palantir AIP across planning, maintenance, production optimization, contracts, $300M target). Chevron (10% Permian production attributed to the data approach, ApEX in Gulf of America). ExxonMobil ($9.7B booked of the $15B-by-2027 efficiency target with AI named as a lever). Each is sourced to the operator newsroom, an analyst note, or the JPT/CIO/Hart Energy coverage.

Why are 30 percent of generative AI projects abandoned after POC?

The Gartner number is structural. The model is not the problem. The data the model was asked to learn from is. Most POCs are run against a horizontal copilot that has no grounding in the operator SCADA, EAM, AFE history, reserves forecast, or historian. The output cannot be cited back to a source document and cannot score a field decision against cash-flow impact. The operators who cleared the bar in public (Devon, ConocoPhillips, APA, Chevron, ExxonMobil) all ran vertical AI deployments against their own data, not horizontal copilots.

Can a 500-well independent replicate the supermajor pattern?

Yes. The supermajor proof points all ran against SCADA history the operator already owned. ExxonMobil and SLB on 1,300-plus wells, ConocoPhillips PLOT on 4,500-plus wells, Devon autonomous lift on 850-plus wells. The constraint at small scale is not data volume, it is the maturity of the field workflow that consumes the ranked decisions. An operator with a single SCADA system, a production accounting feed, and an EAM tool has enough operating data to clear the bar on at least one of the four use cases (pump-by-priority, variance commentary, engineering-hour compression, closed-loop work management).

What is the difference between a horizontal copilot and a vertical AI for oil and gas?

A horizontal copilot (Microsoft Copilot, Google Antigravity, Anthropic Claude Code, Cursor) is a general productivity tool. It has no grounding in the operator SCADA, well file, procedures, or basin. It cannot cite the source document for an answer. It cannot score a pumper visit by cash-flow impact. A vertical AI is domain-bound and pre-integrated to the operator data. It reads production accounting, SCADA, EAM, GIS, HSE, and engineering drawings in their native homes, in read-only mode, and produces field-grade output the lease operator, dispatcher, planning engineer, and operations manager already use. The supermajors run vertical models. The horizontal copilot is, at best, a back-office tool for meeting summaries.

How fast does WorkSync stand up an AI pilot on a typical independent?

Initial Data Hub integration is typically under one week, read-only on the operator existing systems. Ranked daily plans go live within 30 days. Full closed-loop deployment with optimized routing, exception-based dispatch, and nightly retraining completes within 90 days. The Impact Guarantee is in writing: pick the metric in week zero (production uplift, deferment reduction, route-time recovery, study turnaround, whichever metric the CFO will sign for), run the loop for four weeks, sign the annual only if the metric moves. No license fee and no kill fee if it does not. Modules start at $15K.

What are the four AI use cases that travel from the supermajor to the independent?

Pump-by-priority (predictive maintenance plus autonomous artificial lift, scored by economic impact, Devon proof on 850-plus wells). Variance commentary that writes itself (agent reads the historian, production database, and ledger, drafts the weekly commentary, Devon ChatDVN proof of daily use). Engineering-hour compression (hydraulic and facilities modeling from a multi-hundred-hour study cycle to sub-hour iteration, WorkSync FlowSync proof on a gas utility deployment). Closed-loop work management (voice-first field capture into the system of record, every observation tagged to the well and cash-flow impact, 90 minutes per route reclaimed at the top-25 private producer reference deployment).

Why is the data lake not on the critical path for AI in oil and gas anymore?

Three things broke the lake economics. Inference cost collapsed (Stanford 2025 AI Index, GPT-3.5-class call roughly 280x lower than launch price). Vertical AI vendors emerged with pre-integrated workflows that compete on the operator data and the domain integration, not on a horizontal model. Modern integration patterns (event streams, read-only API gateways, ABAC layers) made it cheaper to connect to twelve systems in their native homes than to extract them into a thirteenth. The detailed argument is in The Data Lake Is a 2017 Idea.

Request Your Free Trial. 4-week Impact Guarantee on a metric your CFO will sign for.

See how WorkSync can transform your operations.

Related Insights

An upstream drilling rig representing the independent operator who can leapfrog the supermajor data-lake sequence with vertical AI on existing SCADA
The Approach

The Data Lake Is a 2017 Idea. Independents Don't Need One to Run AI in Production.

Every independent has heard the same three sentences from a consultant or a hyperscaler partner. We aren't ready for AI. Our data isn't ready. We are still working on governance. That sequence is a 2017 idea. The supermajors stopped paying for it four years ago, and the operating proof points they now cite (ExxonMobil/SLB 2.2% on 1,300+ wells, ConocoPhillips PLOT 30% on 4,500+ wells) ran against the SCADA history the operators already had. The independent that builds a lake in 2026 is paying for a workaround to a problem that no longer exists.

Digital pumpjack hologram representing AI-enabled operations
The Approach

AI Without Infrastructure Is Just Expensive Noise

Most oil and gas AI projects fail for the same reason: the AI has nowhere to live. You need an operational foundation BEFORE agents can do anything useful. Here are the four layers that matter.

Digital hologram visualization of agentic pumpjack operations
The Vision

AI Is Redefining the Oilfield. Are You Ready?

AI is no longer hype in oil and gas. Operators using agentic prioritization are capturing 15%+ more cash flow while reducing overhead. The question is whether you move first or last.

Wellhead operations crew in the field, the deployment surface for a pump-by-priority pilot
The Approach

The 4-Week Pump-by-Priority Pilot: What Actually Happens, Week by Week

Most operators expect a multi-quarter build. The shape of an actual pump-by-priority pilot is one week to integrate read-only onto the existing stack, two weeks to put a ranked plan in every truck cab, and one week to measure against a metric the controller signed on Day Zero.

Aerial view of a producing oilfield, the asset surface where exception-based surveillance reorders every visit by quantitative score
The Approach

Exception-Based Surveillance: The 30-Year-Old Operating Model the Supermajors Productionized and Independents Still Don't Run

Exception-based surveillance is the upstream operating framework that ranks every field action by a quantitative score derived from the data already in the historian, the SCADA, the accounting system, and the EAM. A&M defined it in 2015. ExxonMobil, ConocoPhillips, and Chevron productionized it. Most independents still run the fixed-route default. Here is the framework, the three operating levers, and the four-week adoption path.

Pipeline and process piping at a gas utility facility, the network a hydraulic study has to model accurately
The Problem

The Million-Dollar Model: Why Gas Utility Planning Teams Still Burn Six Weeks on a Single Hydraulic Study

The average hydraulic study at a North American gas distribution utility costs $15K-plus in loaded engineering time and takes more than two weeks to build. A five-person planning team running the standard loop burns $750K to $1M a year on model maintenance. The math has not changed in 30 years. Closing the loop has.

Aerial view of an oilfield at dawn, the asset surface where the 24-hour AI operations diagnostic publishes its first ranked work list by 5:30 AM the next morning
The Approach

Give Us One Day: The 24-Hour AI Operations Diagnostic That Replaces the Six-Month Discovery Phase

The discovery-then-pilot sequence the consulting industry sells is producing decks, not deployments. McKinsey reports 70% of operators are still stuck in pilot phase. Gartner reports 30% of GenAI projects are abandoned after POC. The bar moved while the workshops ran. The 24-hour AI operations diagnostic ingests the operator's SCADA, lease accounting, historian, GIS, and EAM in read-only mode and returns a ranked work list against the operator's own wells by 5:30 AM the next morning. Same vertical-AI substrate that runs the 5,000+ well deployed reference. No license fee, no kill fee, no decks.