ERP System Rescue

When the implementation has gone sideways,

The first move is stabilisation, not a rewrite.

We audit, triage, and harden the system. Then the rebuild decision is yours to make — on evidence, not on panic.

For COO and CTO teams in trouble Stabilise first Decide second
Triage of system modules A system schematic with modules tagged critical, at-risk, and stable. A scope-line highlights the at-risk and critical modules as the stabilisation scope. CRITICAL · stabilise this week PAYMENTS BACKUPS AUDIT LOG AT-RISK · stabilise this month REPORTING INTEGRATIONS DEPLOY STABLE · leave alone for now CRM INVENTORY HR Stabilisation scope = critical + at-risk. The rest waits for the decision-support assessment.

We don't touch the whole system. We triage, then we stabilise.

What you'll get out of this page
For COOs & Heads of Operations

The business stops bleeding

Critical workflows are hardened. Manual workarounds get retired or formalised. Operations stop running on hope.

For CTOs & CIOs

The codebase becomes knowable

Forensic audit, written documentation, observability, deploy hygiene. The system stops being a black box.

For CFOs

Sunk cost is honoured, not doubled

The previous build is not thrown out reflexively. The audit tells you what is salvageable; the rebuild conversation comes later, on evidence.

For CEO & Board

A path forward that is calm and defensible

Stabilisation in weeks, decision-grade assessment in writing. No "give us another year" pitch.

The Reality

Why The Default Response Is The Wrong One

When a project has gone wrong, the loudest options in the room are usually the worst ones.

Operations are limping. The previous vendor has gone quiet or defensive. Internally somebody is pushing "rebuild it from scratch." Externally another consultancy is pitching the same. Both ignore the fact that whatever is running, however badly, is the thing the operation now depends on — and that ripping it out without first understanding what it does will create a second outage on top of the first. Meanwhile every week of delay deepens the operational risk: workarounds entrench, the staff who remembered the original intent leave, and the parts of the system that are quietly degrading get closer to a public failure.

01 "Rebuild" as a sales pitch

A contract-size decision dressed up as a technical one.

02 Documentation that lies

The previous vendor's documentation is rarely accurate.

03 Workarounds nobody sees

The business is running on undocumented spreadsheet bridges and Slack approvals.

04 Cost rising on delay

Every week the diagnostic is delayed, the price of the eventual decision rises.

The first job is to stop the bleeding so the decision can be made calmly.

The first job isn't to decide what to do.

What Leaders See

Six Signals That A Rescue Is Overdue

Each appears in leadership meetings long before anyone says "we need to bring someone else in."

The previous vendor has gone quiet

Tickets go unanswered. Calls don't get returned. The relationship has effectively ended without being formally closed.

Nobody can describe the whole system

The original PM left. Documentation is partial. Current operators know their part; nobody knows the whole.

Manual workarounds have multiplied

What was supposed to be automated is being done in spreadsheets, in chat, in someone's head. Operations risk is invisible to the dashboard.

Outages are slow and silent

When something fails it isn't always obvious; sometimes it's discovered weeks later when a report doesn't reconcile.

The board is asking

Status updates have stopped being technical and started being political. The answer being asked for is "what do we do now?"

A rebuild quote has arrived

Another consultancy has pitched a 12-to-18-month rewrite at multiples of the original cost. The number landed and nobody knows whether it is right.

These aren't six independent problems. They are six symptoms of one situation: the implementation has stopped being managed, and the cost of every option is rising while the diagnostic hasn't been done.

Pattern Recognition

What Breaks Failed Implementations Further

Failed implementations don't usually fail in a single moment. They get worse in patterns. The patterns are not technical surprises. They are operational ones.

  1. 01

    Reflexive rebuild

    Throwing out a system that still contains real business logic because the code looks ugly. The business logic survives the rewrite as scope creep anyway, but the timeline doesn't.

  2. 02

    No forensic audit

    Decisions made on the previous vendor's documentation, on the loudest internal voice, or on a competitor's sales pitch. None of those are the actual state of the system.

  3. 03

    No stabilisation phase

    Going directly from "it's broken" to "let's redesign" without first hardening the parts that the business is currently running on.

  4. 04

    Knowledge lost in the transition

    Onboarding the new team without first extracting what the previous one knew. The new team relearns the same domain at the same cost.

  5. 05

    Sunk-cost denial

    Either treating the previous spend as throw-away (and over-rebuilding) or treating it as untouchable (and refusing to retire genuinely bad parts).

  6. 06

    Decision under pressure

    Making the rebuild-or-keep call while the system is actively on fire, instead of after the fire has been contained.

  7. 07

    Vendor selecting the scope

    The rescue vendor proposes a scope that maximises their contract, not one that minimises the client's risk.

Failed implementations get cheaper to recover from with each week of calm. They get more expensive with each week of panic.

Every rescue engagement that turned out well had the same starting move.

The first 30 days were spent stabilising and diagnosing — not rebuilding.

The temptation, on both sides of the table, is to start "fixing" inside the first week. That impulse is what turned the original implementation into a rescue case in the first place. Stabilisation buys the calm in which the right decision becomes obvious.

Failed implementations rarely need to be rebuilt. They need to be stabilised first.

What Good Looks Like

What Good System Rescue Looks Like

Four principles separate rescue work that lands from rescue work that just becomes the next problem.

  1. Audit

    Audit before action

    A written forensic audit before any structural change. The audit answers four questions: what does the system actually do, what is at immediate risk, what is salvageable, and what is genuinely broken beyond repair.

  2. Stabilise

    Stabilise before deciding

    Targeted hardening of the at-risk parts first. Backups, observability, error handling, secrets, deploy hygiene. The long-term direction decision comes after the operation is no longer running on the edge.

  3. Document

    Document before transitioning

    Knowledge reconstruction is part of the engagement, not an afterthought. The written record of what the system does is a deliverable, not a side effect.

  4. Decide

    Decide on evidence, not panic

    The rebuild-or-keep decision is made after the audit, in writing, with options costed. It belongs to the client, not to the next vendor pitching for the bigger contract.

How Rescue Looks in Practice

What This Engagement Actually Looks Like

Four honest blocks: how we work, the phased cadence, the triage architecture, and what we will not do.

Engagement cadence

The six things every rescue engagement covers

01

Days 0–14 — Forensic audit

Architecture, data model, integration points, deployment, observability, security, debt. Written report at end of week 2.

02

Days 14–28 — Triage and plan

Risk-ranked list of what to stabilise, with cost and timeline. Client approves scope before any structural work.

03

Days 28–90 — Stabilisation

Targeted hardening on the approved scope. Backups, observability, error handling, secrets, deploy hygiene. Operations stop bleeding.

04

Throughout — Knowledge reconstruction

Documentation of what the system actually does, derived from the code and operation. Delivered as a written record, not a slide deck.

05

Day 90 — Decision-support assessment

Written assessment of what to keep, refactor, retire, or rebuild. Options costed. The decision belongs to the client.

Triage architecture

We don't touch the whole system. We triage.

Modules tagged critical (stabilise this week), at-risk (stabilise this month), or stable (leave alone for now). Stabilisation scope is the critical and at-risk tiers. The stable tier waits for the decision-support assessment, which is where its future is actually decided.

CRITICAL stabilise this week
  • Payments
  • Backups
  • Audit log
AT-RISK stabilise this month
  • Reporting
  • Integrations
  • Deploy pipeline
STABLE leave alone for now
  • CRM
  • Inventory
  • HR
Outcomes

What Changes When Rescue Is Done Right

Four outcomes plus the six numbers we measure before declaring a rescue engagement complete.

01

The operation stops bleeding

Critical workflows are hardened, observability exists, manual workarounds are documented and retired or formalised. Operations risk drops in weeks, not quarters.

02

The system becomes knowable

Written documentation of what the system actually does. The codebase is no longer a black box; the next decision can be made on evidence.

03

The rebuild-or-keep decision is calm

A structural decision is made after the operation is stable, on a written assessment with costed options, not under emergency pressure.

04

Sunk cost is honoured, not doubled

Where the previous build contains real business value, it is preserved. Where it doesn't, it is retired explicitly. Either way, the decision is intentional.

ROI we measure
  • Time to first stabilised critical workflow
  • Manual workarounds retired or formalised
  • Reduction in unplanned operational incidents
  • Mean time to detect anomalies
  • Coverage of written system documentation
  • Time to decision-support assessment

Each measurable. Each tied to the operating reality, not to a status slide. The mean-time-to-detect metric typically moves from weeks to hours over the stabilisation phase — that single number tells leadership whether the operation is being managed or just observed.

Fit Assessment

When System Rescue Makes Sense

Ready if

A rescue engagement is the right move when an implementation has gone sideways, leadership is willing to spend 30 days on diagnostic before structural commitment, the business has operational continuity needs that make "throw it out and rebuild" genuinely risky, or a previous vendor relationship has ended or stalled and somebody needs to take over the codebase responsibly.

Too early if

It's too early when the implementation is still in active build with the original vendor — talk to them first. Or when leadership has already committed publicly to a full rebuild and wants only an execution vendor — that's an Implementation engagement, not a rescue. Or when the "failure" is a single bug or single bad release — that's a support engagement, not a rescue.

A rescue engagement converts to a follow-on roughly one in three times — the rest hand back to the client's internal team or to a vendor of the client's choice. That ratio is on purpose; rescues that always convert to follow-ons aren't rescues, they're sales pipelines. This ratio is currently pending leadership confirmation before publishing externally.

Engagement

How A Rescue Engagement Runs

A typical rescue engagement runs 12 to 16 weeks end-to-end. Four phases. Each phase ends with a written deliverable, not a status meeting.

01

Forensic Audit · 2 weeks

Architecture, data model, integration points, deployment, observability, security, debt. Output: written audit report.

02

Triage & Plan · 2 weeks

Risk-ranked stabilisation scope, cost and timeline. Client approves before any structural work begins.

03

Stabilisation · 6 to 10 weeks

Targeted hardening on the approved scope. Documentation in parallel. Operations stop bleeding.

04

Decision-Support Assessment · End of stabilisation

Written, costed options memo. Keep, refactor, retire, or rebuild. The decision is yours.

Recovery Discipline

The Recovery Discipline We Apply To Every Rescue

Recovery isn't heroics. It's the discipline that turns a project in crisis back into a project that can be decided on calmly.

Six engagement and engineering non-negotiables. Each is the floor, not the ceiling.

01

Forensic audit before structural change

No rewrite, no redesign, no major refactor before the written audit is delivered and the client has read it.

02

Triage before stabilisation

A risk-ranked list with cost and timeline. Stabilisation scope is approved in writing, not assumed.

03

Targeted stabilisation, not blanket rewrites

Only the at-risk parts get touched in the stabilisation phase. The stable parts wait for the decision-support assessment.

04

Knowledge reconstruction as a deliverable

Written documentation of system behaviour is part of every engagement, not a side effect.

05

Decision-support assessment in writing

At end of stabilisation, a costed options memo. Keep, refactor, retire, rebuild — each option with a number. The decision belongs to the client.

06

No forced follow-on

The rescue engagement is priced and delivered as itself. Any follow-on is quoted separately and is the client's free choice.

Common Questions

Frequently Asked Questions

Direct answers for leaders whose implementation has gone sideways.

Our previous vendor disappeared. Can you take over the codebase?

Yes. That is one of the most common starting points for a rescue engagement. The first two weeks are a forensic audit of the codebase as it currently exists; we don't need the previous vendor's cooperation to do that, though we will request whatever documentation, credentials, and access you legitimately hold.

Do we have to rebuild from scratch?

Full rebuilds are less common than buyers expect. More often the audit recommends a mix: keep the parts that work, refactor a few, retire one or two, harden the rest. The decision is yours; the audit gives you the evidence.

How long does the audit take?

2 to 4 weeks depending on system size and complexity. The written report is delivered at the end of that period. Stabilisation work, if approved, begins immediately after.

What does a rescue cost?

The audit is typically a fixed-price engagement scoped to system complexity. Stabilisation is scoped after the audit and depends on what the triage reveals; typical stabilisation engagements run 6 to 10 weeks. Concrete pricing ranges are confirmed in the consultation conversation; we don't publish numbers on this page until they have leadership sign-off.

What if the audit recommends rebuilding?

Then we say so, in writing, with costed options. The rebuild engagement is quoted separately under Implementation or Custom Applications. You are free to use us, another vendor, or your internal team for that next phase. The audit is the audit; it is not a sales document.

What if the audit recommends not rebuilding?

Also happens. Sometimes the system is fundamentally sound and what looked like a vendor problem was an operations problem, a training gap, or an integration issue. The audit will say so. The honesty is part of how the engagement actually works.

Can you take over from a vendor that is still technically engaged?

Yes, with care. The handover needs to be clean — credentials, code, documentation, customer data — and the previous vendor's contract needs to be wound down properly. We will work with you on the transition; we will not work around an active contract you haven't formally ended.

What if it's not a full failure — just one part of the system that's gone bad?

Often the better engagement type. A targeted audit plus stabilisation of one subsystem can run in 4 to 6 weeks total. Talk to us; we'll be honest about whether you need a full rescue or just a focused intervention.

Do you sign confidentiality and IP agreements before the audit?

Yes. Standard. The audit deliverable includes sensitive operational information about your business; we treat it accordingly.

What's the success rate?

We don't quote a "success rate" because the success metric for a rescue varies (stabilised and kept, stabilised and rebuilt, stabilised and handed back). What we'll say honestly: approximately one in three rescue engagements converts into a follow-on engagement with us. The rest hand back stable to the client or to another vendor. That ratio is on purpose; rescues that always convert to follow-ons aren't rescues, they're sales pipelines.

System Rescue starts with a 2-to-4-week forensic audit, not a rewrite. The audit produces a written report on what the system actually does, what is at immediate risk, what is salvageable, and what is genuinely broken beyond repair.

Most failed implementations do not need a full rebuild. The previous build typically contains six to twelve months of business logic, integration mappings, and customer-specific configuration that the operation now depends on. Stabilisation honours that sunk cost; reflexive rebuild ignores it.

A typical rescue engagement runs 12 to 16 weeks end-to-end: 2 weeks audit, 2 weeks triage, 6 to 10 weeks stabilisation, decision-support assessment at the end.

Start with a Forensic Audit

Request Consultation

Audit first. No rebuild pitch in week one.