Deep-Dive: Reliability Centered Maintenance (RCM) Strategy Guide

Back to Strategy Hub

Reliability Centered Maintenance (RCM) is not just a "task list"; it is a systematic process used to determine the maintenance requirements of any physical asset in its operating context. Developed by John Moubray and based on the revolutionary Nowlan & Heap studies for the aviation industry, RCM is the scientific foundation of modern industrial uptime.

1. The 7 Essential Questions of RCM

According to the SAE JA1011 standard, a process cannot be called "RCM" unless it answers these seven questions in the following order:

Question 1

Function

What does the equipment do? (Primary & Secondary functions).

Question 2

Functional Failure

In what ways can it FAIL to perform its function?

Question 3

Failure Mode

WHAT CAUSES each functional failure? (The Root Cause).

Question 4

Failure Effect

What HAPPENS when the failure occurs?

Question 5

Failure Consequence

WHY DOES IT MATTER? (Safety, Operations, Environment).

Question 6

Proactive Task

Can we do something to PREVENT or PREDICT the failure?

Question 7

Default Action

What if no proactive task is effective or economical?

7-Step RCM Process

Phase 01

System Selection

“Identifying critical assets and defining boundaries for study.”

The RCM methodology focuses on preserving functions rather than just preserving equipment. This ensures maintenance resources are allocated to what truly matters for system performance.

2. The FMEA: Failure Modes and Effects Analysis

The core engine of RCM is the **FMEA**. Unlike a generic maintenance checklist, an FMEA forces you to look at the component level. If a pump fails, it doesn't just "Stop". Why did it stop?

Failure Mode A: Bearing seizure due to lack of lubrication.
Failure Mode B: Seal rupture due to pressure spikes.
Failure Mode C: Electrical stator burnout due to phase imbalance.

Each failure mode requires a **different maintenance strategy**. You cannot solve a phase imbalance with a grease gun.

Function	Fail Mode	Consequence	Strategy
Deliver 500L/min at 10 Bar	Impeller Erosion	High (Efficiency Loss)	Vibration Analysis
Deliver 500L/min at 10 Bar	Bearing Heat-up	Catastrophic	Thermography

3. The Asset Criticality Matrix

Not all machines are created equal. A $10 cooling fan on a server is more "critical" than a $50,000 backup generator that sits idle. We calculate criticality using the formula:

Criticality = Probability \times Consequence

Consequence Categories:

Safety: Can someone be hurt?
Environment: Will it cause a spill or violation?
Operations: Does it stop the production line?
Cost: How much is the secondary damage?

Strategic Alignment

High Criticality assets MUST have Predictive Maintenance (PdM) or high-frequency PMs. Low Criticality assets are often candidates for "Run to Failure" to save resources. Stop wasting gold on copper problems.

4. The P-F Interval: Time to Detection

The P-F Interval is the time between the point (P) when we can first detect a failure "Potential," and the point (F) when it actually fails functionally.

The Success Selection Logic

Condition-Based

If P-F is detectable & economical.

ΓåÆ

Time-Based

If wear-out is consistent.

ΓåÆ

Run-to-Failure

If consequence is low & cheaper.

5. RCM Implementation Steps

1

System Selection

Don't RCM the whole plant at once. Use a "Pilot" on the most troublesome or critical system first.
2

Functional Definition

Define exactly what SUCCESS looks like. "Pump water" is vague. "Deliver 50 Gallons per Minute at 60 PSI" is a standard.
3

Conduct FMEA

Assemble a cross-functional team (Operators, Technicians, Engineers) to catalog failure modes.
4

Task Selection

Select tasks that are technically feasible and worth doing. If it costs $1M to prevent a $100 failure, the RCM logic says "Run to Failure."

Next in Pillar 10:

Learn how to translate RCM results into a digital Computerized Maintenance Management System for real-time tracking.

CMMS Implementation Guide ΓåÆ

Measuring Success:

How do you know RCM is working? Dive into the world of OEE and Reliability Analytics.

OEE Optimization Guide ΓåÆ

Reliability Centered Maintenance (RCM) Methodology

1. The 7 Essential Questions of RCM

Function

Functional Failure

Failure Mode

Failure Effect

Failure Consequence

Proactive Task

Default Action

7-Step RCM Process

System Selection

2. The FMEA: Failure Modes and Effects Analysis

3. The Asset Criticality Matrix

Consequence Categories:

Strategic Alignment

4. The P-F Interval: Time to Detection

The Success Selection Logic

Condition-Based

Time-Based

Run-to-Failure

5. RCM Implementation Steps

System Selection

Functional Definition

Conduct FMEA

Task Selection

Next in Pillar 10:

Measuring Success:

Technical Standards & References