Affiliate disclosure: As an Amazon associate, we'll earn a commission for every successful order through our affiliate links in this article. However, you won't be charged anything extra for this.

If you’ve ever glanced at your Apple Watch and wondered, “Is this VO2 max number actually accurate?” — you’re not alone. The Apple Watch estimates your cardiovascular fitness using a metric it calls “Cardio Fitness,” which is essentially a submaximal prediction of VO2 max — the maximum amount of oxygen your body can use during intense exercise. It’s meant to reflect your aerobic endurance and long-term heart health.

But here’s the catch: unlike clinical VO2 max testing, which uses a face mask and metabolic cart to measure oxygen consumption in real time, the Apple Watch relies solely on heart rate, GPS speed, and basic user data. No breath-by-breath analysis. No lab. Just algorithms crunching numbers from optical sensors and satellite signals.

Research shows the Apple Watch consistently underestimates true VO2 max by about 6 mL/kg/min, with average errors exceeding 13%. While it may track trends over time for some users, it should not be trusted for precise values, medical assessments, or performance benchmarking. For athletes, fitness enthusiasts, or anyone serious about tracking progress, understanding the limitations of this feature is essential.

This article dives into what peer-reviewed science says, why discrepancies occur, how real users are affected, and how to get the most reliable readings — all based on clinical studies, user reports, and Apple’s own data.

Scientific Accuracy: What Peer-Reviewed Studies Reveal

Apple Watch Underestimates VO2 Max by 6 mL/kg/min

A 2024 study published in PLOS One rigorously tested Apple Watch Series 9 and Ultra 2 against indirect calorimetry — the clinical gold standard for measuring VO2 max. The results were clear:

  • Mean underestimation: 6.07 mL/kg/min
  • Mean Absolute Error (MAE): 6.92 mL/kg/min
  • Mean Absolute Percentage Error (MAPE): 13.31%
  • Limits of agreement: Ranged from –6.11 to +18.26 mL/kg/min, indicating high variability between individuals

These error margins far exceed acceptable thresholds for clinical use. While there was a moderate correlation between Apple Watch estimates and lab values, the systematic underestimation makes absolute numbers unreliable. A reading of 38 mL/kg/min doesn’t mean you’re truly at that level — it could easily be closer to 44.

Still, the study noted that the Apple Watch might serve as a scalable alternative to traditional field tests like the Cooper 12-minute run, especially for population-level fitness tracking. But for diagnosing individual fitness or health risks? It falls short.

Apple’s Own Study Shows Better Results — But With Caveats

Apple’s internal 2023 validation study, involving 755 participants, reported much more optimistic outcomes:

  • VO2 max estimates within ±1.2 to 1.4 mL/kg/min of true values
  • Accuracy within 4% of actual VO2 max
  • Reliability (ICC) between 0.86 and 0.89, suggesting strong consistency

However, this data has not been independently replicated. Differences in participant selection, testing protocols, or proprietary algorithm tuning may explain the gap. Independent research consistently finds higher error rates, suggesting Apple’s results may reflect ideal conditions not seen in everyday use.

How It Compares to Other VO2 Max Estimation Methods

Method Typical Error (MAE)
Lab VO2 max (indirect calorimetry) ~1.3 mL/kg/min
ACSM submaximal tests 3.7–4.5 mL/kg/min
Garmin/Polar wearables 3.0–5.0 mL/kg/min
Apple Watch 6.92 mL/kg/min

The Apple Watch’s error rate is worse than lab standards and comparable to older field tests. Despite using modern machine learning, its accuracy lags behind even basic prediction models. The takeaway? It’s better at showing trends than delivering truth.

How Apple Watch Calculates VO2 Max

Apple Watch VO2 max calculation algorithm diagram

Inputs That Drive the Algorithm

The Apple Watch estimates cardio fitness using four key inputs:

  • Heart rate: From green LED and infrared PPG sensors
  • Speed and distance: GPS and accelerometer data
  • User profile: Age, sex, height, weight (must be manually updated)
  • Exercise intensity: Requires ≥30% increase in heart rate above resting levels

This data comes only from outdoor walks, runs, or hikes where GPS is active and movement is continuous.

What It Doesn’t Use — And Why That’s a Problem

The Apple Watch ignores critical factors that influence cardiovascular load:

  • Respiratory gas exchange (O₂/CO₂)
  • Elevation gain
  • Power output (e.g., cycling watts)
  • Indoor workouts (unless GPS simulated)
  • Perceived exertion or effort

This creates real-world issues:
Hill sprints with high effort but slow pace may be undervalued
Cycling, even with a chest strap, does not contribute
Treadmill runs without GPS won’t generate a reading
Cool-down walks with elevated HR can drag scores down

The algorithm assumes a linear relationship between heart rate and oxygen use, but this varies widely between individuals. That oversimplification is a major source of inaccuracy.

Algorithm Evolution: From Linear Models to AI

Apple Watch watchOS VO2 max algorithm evolution timeline

watchOS 4 (2017): Basic Regression

The original model used simple linear equations based on heart rate and pace during outdoor runs. It assumed max HR = 220 – age and fixed HR-VO2 ratios — outdated assumptions that led to large individual errors.

watchOS 7 (2021): Machine Learning Upgrade

Apple introduced deep neural networks trained on real-world exercise data. The model combined:
– Physiological modeling (oxygen kinetics via ODEs)
– Data-driven learning from diverse populations

This improved accuracy across age, sex, and fitness levels — but systematic underestimation remained.

watchOS 10+: Hybrid AI Refinement

Current models use a physics-informed AI approach, blending equations with deep learning. Apple claims better personalization, but the algorithm remains closed-source, preventing external validation.

Despite upgrades, core limitations persist — especially due to reliance on imperfect optical HR and lack of metabolic data.

Why Your Apple Watch VO2 Max Might Be Too Low

Apple Watch VO2 max factors affecting accuracy infographic

Only Outdoor Workouts Count

VO2 max is only estimated during outdoor runs, walks, or hikes with GPS. Indoor treadmill workouts do not contribute unless third-party apps simulate GPS.

User report: “I run 3x/week on a treadmill. My VO2 max never updates.” – FitUser2023

Cycling Doesn’t Improve Your Score

Even if you’re training hard on a bike with a Polar H10 chest strap and power meter, Apple Watch does not use cycling data for VO2 max.

User comparison: Garmin estimates 50 mL/kg/min from cycling; Apple Watch shows 38 — same user, same fitness. – mrmarbury

Elevation Gain Is Ignored

Running uphill increases cardiovascular strain — but Apple Watch only sees slower pace. High-effort trail runs may be misinterpreted as low fitness.

User frustration: “I train on hills weekly. Watch penalizes me for slow pace.” – TrailRunnerPro

Cool-Down Walks Drag Down Your Score

Post-workout walks with elevated heart rate (e.g., after HIIT) can be logged as low-intensity activity, pulling down your average efficiency.

User insight: “My 20-minute cool-down drags my cardio fitness down.” – Original Poster

Poor Fit or Cold Weather Skews Readings

  • Loose band: Causes PPG signal dropouts → inaccurate HR
  • Cold weather: Reduces optical sensor reliability
  • Tattoos or dark skin: Can interfere with LED light absorption
  • Arm swing or weights: Introduces motion artifacts

Fix reported: “Tightened band from 5th to 3rd hole — VO2 max jumped 4 points.” – HRFixer

Real-World User Experiences: Inconsistencies Are Common

Lab vs. Apple Watch: Gaps Up to 20 Points

Multiple users have compared Apple Watch to clinical tests:

  • Clesc: Lab = 58, Apple Watch = 40 (–18)
  • Repulsive-Rub-6989: Lab = 45.7, Apple Watch ≈ 34 (–11.7)
  • CashSuccessful8699: Lab = 47, Apple Watch = 34, Polar = 46

These gaps confirm Apple Watch readings are not interchangeable with medical results.

Device Comparisons Show Wide Variation

  • Garmin (with HRM-Run): Reports 46–55
  • Polar H10 + app: Estimates 46–50
  • Apple Watch (same user): 34–38

Even with the same chest strap, Apple Watch delivers lower estimates — pointing to algorithmic bias, not sensor failure.

Fitness Improves, But VO2 Max Drops

Many users report declining scores despite real progress:

  • Faster mile times
  • Lower resting heart rate
  • Weight loss
  • More workouts

Yet Apple Watch shows dropping scores — a major pain point.

User quote: “Faster, leaner, stronger — but my cardio fitness is falling.” – credit_to_reddit

Some See Plausible Trends

Not all experiences are negative:

  • adamr_z: Stable 51–53 as a fit 45-year-old doing HIIT
  • [deleted]: Rose from 42 to 51 over months of running
  • FlyBoyMcCall: Believes it works well for long-term trend tracking

This suggests the Apple Watch is more reliable for direction than precision.

How to Improve Apple Watch VO2 Max Accuracy

Wear the Watch Snugly

A loose fit causes poor PPG signal. Wear it tight enough to limit movement, especially during runs.

Pro tip: Adjust band regularly. Bands stretch over time — reposition to earlier holes.

Use Outdoor Workouts Only

For VO2 max estimation:
– Select “Outdoor Run” or “Outdoor Walk”
– Avoid indoor treadmill unless GPS is active
– Don’t mix workout types in one session

Run on Flat Terrain for Calibration

To reduce GPS and elevation errors:
– Do a calibration run on a flat, open route
– Avoid trails, forests, or urban canyons with poor satellite signal

Follow Apple’s official guide: HT204516

Update Personal Data Regularly

Ensure these are current in the Health app:
– Weight
– Height
– Age
– Sex

Outdated info skews metabolic calculations.

Avoid Post-Workout Misclassification

  • End or pause workouts immediately after finishing
  • Don’t let cool-down walks auto-log as separate “outdoor walks”
  • Use “End Workout” instead of letting it run

Warm Up and Avoid Confounders

  • Warm up for 5–10 minutes to stabilize HR
  • Avoid caffeine, alcohol, or stress before exercise
  • Exercise in moderate temperatures

Alternative Ways to Estimate VO2 Max

Take the Cooper 12-Minute Run Test

Run as far as possible in 12 minutes. Estimate VO2 max with:

VO2 max ≈ (Distance in meters / 1000) × 6.65 – 5.0

Example: 2,700 meters → (2.7 × 6.65) – 5.0 = ~49 mL/kg/min

User result: “Ran 2.7 km. Cooper test says 49. Apple Watch says 34.” – No_Target_5218

Use a Simple HR-Based Formula

Estimate VO2 max using resting and max HR:

  1. Predicted Max HR = 208 – (0.7 × age)
  2. VO2 max ≈ 15.3 × (Max HR / Resting HR)

Example: 30-year-old, resting HR 55
→ Max HR = 187 → VO2 max ≈ 51.7

User comment: “This matches my lab test better than Apple Watch.” – Cvtwelcome

Get a Clinical CPET Test

For true accuracy, take a cardiopulmonary exercise test (CPET):

  • Uses a metabolic cart (e.g., COSMED Quark)
  • Measures O₂/CO₂, RER, HR, ECG
  • Required to meet at least two of:
  • HR near 220 – age
  • RER ≥ 1.15
  • RPE ≥ 17
  • VO2 plateau

This is the only clinically valid method.

Expert advice: “If you want real VO2 max, get a stress test. It’s often insurance-covered.” – Dark_Elf_DEX


Final Verdict: Is Apple Watch VO2 Max Trustworthy?

Short Answer: No — For Absolute Values

The Apple Watch systematically underestimates VO2 max by ~6 mL/kg/min, with 13–15% average error. It fails clinical accuracy standards and should not be used to assess fitness level, diagnose low cardiovascular health, or guide medical decisions.

Long-Term Trends? Possibly Useful

Despite flaws, it can show relative changes over time:
– Rising trend → likely improving fitness
– Sudden drop → possible overtraining, illness, or measurement error

If you maintain consistent habits, the direction matters more than the number.

Best Use Case: Motivation, Not Measurement

Apple markets VO2 max as an estimation — not a medical tool. It’s designed to:
– Encourage movement
– Highlight fitness changes
– Support heart health awareness

It succeeds as a nudge, not a diagnostic.


Expert consensus: “View it as a direction, not a destination.” – TechnicalAtlas

“Take this feature with a grain of salt.” – Throwkage

Final Recommendation

  • For general users: Track trends — ignore exact numbers
  • For athletes: Use lab tests or field protocols (e.g., Cooper test)
  • For health concerns: Consult a doctor and request a CPET
  • For Apple Watch users: Optimize setup, use outdoors, and compare changes — not values

The Apple Watch VO2 max isn’t broken — it’s limited by design. Use it wisely, and it can still help you stay motivated. But when accuracy matters, there’s no substitute for science.