← Back to CalculatorMethodology v1.0.2

Calculator Methodology

Redux ROT Impact Score — transparent, citable, versioned.

This document explains every number the calculator produces: the formulas, the benchmarks behind the defaults, the assumptions we make, and what the model deliberately excludes. It is designed to be shared with analysts, executives, and procurement teams evaluating the results.

01How Every Number Is Computed

Step 1 — Total Data Estate

The calculator sums storage volumes across three environments: on-premises (primary SAN/NAS + archive/tape, excluding backup infrastructure), public cloud (hot, cool, and archive tiers across AWS, Azure, GCP, and OCI), and Microsoft 365 / SaaS (mailbox, OneDrive, SharePoint, Teams, other SaaS). M365 volumes are derived from user count × per-user averages you supply.

totalEstate = totalOnPrem + totalCloud + totalM365

Step 2 — Annual Storage Cost

Each environment uses tiered cost-per-TB rates:

onPremCost = (primaryTB × $3,000) + (backupTB / dedupRatio × $600) + (archiveTB × $80)
cloudCost = Σ provider (hotTB × hotRate + coolTB × coolRate + archiveTB × archiveRate)
m365Cost = max(mailboxStorageCost + nonMailStorageCost, totalLicenseCost × 0.40)

Step 3 — ROT Percentage Estimation

Tier names follow the percentile convention introduced in Methodology v1.0.1 (May 2026).

We compute three estimate tiers — Industry P25 (29%), Industry P50 (45%), and Industry P75 (58%) — then adjust each based on two factors:

adjustedROT = max(basePct + hygieneAdj − (retentionFloor × 100), 10)
rotPct = clamp(adjustedROT, 10%, 80%)

Within the total ROT, we split into Redundant (30–40%), Obsolete (35–40%), and Trivial (25–30%) using tier-specific ratios derived from industry literature.

Step 4 — ROT Cost Attribution

Each environment's ROT cost is proportional to its volume share of the total estate. This prevents small environments from absorbing disproportionate cost.

envROTCost = envAnnualCost × rotPct × (envTB / totalEstate)

Step 5 — Backup Amplification

On-premises backup infrastructure amplifies ROT cost: every redundant file that lives on primary also exists across your backup copies, reduced only by deduplication.

onPremROTSlice = (totalOnPrem / totalEstate) × totalROTTB
backupROTTB = onPremROTSlice × backupCopies / dedupRatio
backupAmplificationCost = backupROTTB × $600/TB/yr

Cloud and M365 manage their own backup/versioning and are not amplified through on-prem backup infrastructure.

Step 6 — FinOps Cloud Cost Modeling

For each active cloud provider, we compute a detailed FinOps breakdown:

Step 7 — Multi-Year Projection

The total annual waste is compounded by your stated annual data growth rate to produce 3-year and 5-year projections.

projection(years) = Σ (year=0 to years−1) annualWaste × (1 + growthRate)^year

Quick Wins

The calculator identifies up to 7 non-overlapping remediation opportunities. Each quick win claims an explicit TB slice from the total ROT pool. A running total ensures no TB is double-counted and total savings never exceed annual waste.

02Benchmark Sources

Every external number used in the calculator is cited below with publication year and link. We refresh these annually.

BenchmarkValue UsedSourceYear
Average breach cost$4.44 M (global avg)IBM Cost of a Data Breach Report 20252025
Enterprise ROT %33% of stored data is ROTKomprise 2025 State of Unstructured Data Management Report (corroborated by Veritas Databerg)2025
Dark / ROT data %58% of unstructured data is dark or ROTExonar Dark Data Research2024
Fully-loaded storage cost$3,300/TB/yr (enterprise average)Gartner I&O Cost Optimization Survey 20252025
Productivity waste from ROT$5.7 M/yr (large enterprise, searching through ROT)IDC Data Intelligence Report2024
Enterprise data spend on ROT$34 M/yr (large enterprise)Securiti Data Intelligence Report2024
ROT midpoint (Valora)40–50% of enterprise data is ROTValora Data Waste Research2024
Unstructured data share80% of enterprise data is unstructuredKomprise 2025 / IDC2025
Cloud storage pricingPer-region, per-tier (hot/cool/archive)Published rate cards: AWS S3, Azure Blob, GCP, OCI2025–26
M365 storage attribution40% of per-user license costGartner "Microsoft 365 License Optimization" (2025) + Microsoft 365 E3/E5 SKU pricing analysis2025
Exchange Online add-on storage$0.20/GB/month ($2,458/TB/yr)Microsoft 365 Admin Center — Exchange Online Plan 2 additional storage pricing2025–26

03Assumption Table

Every default value the calculator ships with is listed below, along with the rationale and its sensitivity — how much the final annual-waste figure moves if you change the default.

ParameterDefaultWhy This DefaultSensitivity
Data growth rate25%/yrIDC Global DataSphere midpoint for enterprise data growthHigh — 5-year projection scales exponentially; ±5 pp shifts 5-yr cost ~18%
On-prem primary $/TB/yr$3,0002026 fully-loaded SAN/NAS (controllers, networking, power, floor space, admin FTE)High — directly scales on-prem ROT cost; ±$500 shifts annual waste ~8–12%
On-prem backup $/TB/yr$600Purpose-built backup appliance (e.g., Dell PowerProtect, Cohesity)Medium — affects backup amplification cost
On-prem archive $/TB/yr$80LTO-9 tape or cold object storage (media + library + admin)Low — archive is already cheap; moving ±$40 has minimal overall impact
Backup copies33-2-1 backup rule (3 copies, 2 media types, 1 offsite)High — each additional copy linearly increases backup amplification
Dedup ratio2:1Conservative default; many orgs without inline dedup achieve 1:1–2:1Medium — higher ratios reduce backup amplification cost proportionally
M365 license cost/user/mo$22Blended E3/E5 midpoint ($36 E3 list, weighted by common E3-heavy deployments)Medium — scales M365 storage attribution (40% of total license)
M365 storage attribution ratio40%Gartner 2025 analysis: storage infrastructure = 35–45% of M365 per-user costMedium — ±10 pp shifts M365 ROT cost ~25%
Avg mailbox size5 GBTypical enterprise with basic retention; ranges 2–25 GBLow — small per-user; matters at scale (10k+ users)
Avg OneDrive/user15 GBMidpoint of observed enterprise usage (quota often 1 TB, actual 5–50 GB)Low — similar scale effect as mailbox
% data untouched 12+ months60%Komprise 2025: 74% of orgs manage 5+ PB; majority untouched. Veritas Data Genomics: 40%+ untouched 3+ yearsHigh — primary driver of hygiene adjustment (±3 pp ROT)
Known duplication rate30%Enterprise average for unmanaged file shares (range 20–60%)Medium — above 40% adds +3 pp to hygiene adjustment
Temp/personal files12%Veritas Databerg: avg 26.5% store personal files; 12% is the mid-range for enterprises with BYOD policiesLow — above 20% adds +2 pp to hygiene adjustment
Savings Plan coverage30%Typical enterprise starting FinOps maturity; range 0–80%Low — only affects FinOps breakdown, not headline ROT cost
Savings Plan discount30%AWS/Azure typically 20–40% for 1-year commitmentsLow — only affects FinOps breakdown
Archive data age180 daysBeyond most early-deletion minimums (90–180 days), so penalties are usually zeroLow — only triggers penalty if below provider's minimum

Industry Retention Floors

These floors represent the fraction of data that is legally non-deletable in each industry. They reduce the recoverable ROT ceiling.

IndustryFloorRegulatory Basis
Government35%NARA, FOIA, state sunshine laws
Financial Services30%SEC Rule 17a-4, SOX, MiFID II
Healthcare25%HIPAA (7-year medical records)
Energy20%NERC CIP, EPA record-keeping
Manufacturing15%ISO 9001 quality records, OSHA
Education15%FERPA student records
Retail10%PCI DSS (limited retention)
Technology10%Minimal regulatory burden
Media & Entertainment10%Minimal regulatory burden
Other / Unregulated10%General business record-keeping

04What We Don't Model

Honesty about model boundaries is essential for credibility. The following costs and effects are deliberately excluded from the calculator.

Egress Costs on ROT Deletion

The FinOps panel shows your current egress spend, but we do not model the one-time egress cost of migrating or deleting ROT across regions. The actual cost depends on whether data is deleted in-place (zero egress) or migrated before deletion.

Reserved Instances & Committed Use Discounts (Compute)

Our savings-plan modeling covers storage commitments only. Compute-attached storage (e.g., EBS volumes on reserved EC2 instances) is not broken out separately. If your ROT lives on compute-attached volumes, the true savings from deletion may be lower than shown.

Scope 3 Carbon Emissions

Storing ROT data consumes energy — powering disks, cooling data centers, manufacturing replacement drives. We do not quantify the CO₂ impact. Estimates range from 2–7 kg CO₂/TB/year for cloud and 10–30 kg CO₂/TB/year for on-prem, but methodology varies widely.

Indirect Productivity Loss

IDC estimates $5.7M/year in large enterprises lost to searching through ROT data. We show this as a benchmark but do not incorporate it into the annual waste figure because the productivity impact varies dramatically by organisation and is difficult to attribute directly to storage costs.

Compliance Fine Exposure

The dashboard shows potential GDPR, CCPA, and HIPAA fine ranges as context, but these are not summed into the headline cost. Fine risk depends on breach probability, data sensitivity, and regulatory jurisdiction — factors outside this model's scope.

Data-in-Transit & Network Costs

Replication traffic, cross-region sync, and VPN/Direct Connect costs associated with ROT data are not modeled. These are highly architecture-dependent.

Software Licensing for Data Management Tools

The cost of classification, DLP, backup, and archival software licenses is not included. We model infrastructure cost, not the tools used to manage it.

Human Cost of Remediation

Cleaning up ROT requires project management, change management, and engineering time. This implementation cost is not deducted from the projected savings shown in Quick Wins.

05Changelog

We use semantic versioning. Major = new calculation model, Minor = new data source or input field, Patch = bug fix or cosmetic.

v1.0.2— May 2026
  • Corrected the v1.0.1 rename: “ROT Impact Score” is now used for an individual user's score (the 0-100 hero metric). “ROT Impact Benchmark” is reserved for the quarterly published aggregate report. This split makes the brand match what the methodology actually computes per user vs. what gets published quarterly.
  • Removed “Conservative / Moderate / Aggressive” tier labels in favor of “Industry P25 / P50 / P75” as the sole tier framing. The percentile labels are honest to the methodology; the qualitative labels implied a risk-judgment the math doesn't make.
  • Updated homepage and OG metadata to lead with the “Redux — ROT Impact Score” brand.
v1.0.1— May 2026
  • Renamed tier labels: Conservative/Moderate/Aggressive → Industry P25/P50/P75.
  • Renamed "Data Trust Index" → "ROT Impact Benchmark" across all user-facing text.
  • Synced all 9 non-English locale files with 33 missing translation keys.
v1.0.0— May 2026
  • Initial public release of the methodology document.
  • 7-step calculation pipeline: estate inventory → cost → ROT % → attribution → backup amplification → FinOps → projection.
  • 3 estimate tiers: Industry P25 (29%), Industry P50 (45%), Industry P75 (58%).
  • Industry-aware retention floors for 10 industries (Healthcare through Other).
  • Tiered on-prem costing: $3,000 / $600 / $80 per TB/yr (primary / backup / archive).
  • FinOps cloud modeling: savings plans, egress, early-deletion penalties, retrieval costs.
  • M365 cost model: Exchange add-on pricing ($0.20/GB/mo) + 40% storage license attribution (Gartner 2025).
  • Benchmarks updated: IBM 2025 ($4.44M breach), Komprise 2025 (unstructured data), Gartner 2025 ($3,300/TB), IDC 2024, Securiti 2024.
  • Replaced 2015 Veritas Databerg figure with 2024/2025 multi-source corroboration (Komprise, Exonar, Valora).
  • Non-overlapping Quick Wins engine with explicit TB-slice claiming and savings cap.

Questions about the methodology? [email protected]

Last updated May 2026 · Methodology v1.0.2