Loading...
Loading...
Wearable VO₂ Max estimates overestimate by 9–20%. Here's why the algorithms fall short, when the error is largest, and how one lab test can recalibrate everything.
You go for a brisk walk. Your Apple Watch buzzes. A little number appears on the screen: your VO₂ Max is 42. Or maybe 38. Or maybe it jumped three points since last month for no obvious reason.
You screenshot it, maybe text it to a friend. Then you start Googling what the number means and whether you should be worried or proud. And now you're down the rabbit hole.
Here's the thing. That number on your wrist is not your VO₂ Max. It's an estimate of your VO₂ Max. And the difference between those two things matters more than most people realize.
We're not here to tell you your Apple Watch or Garmin is garbage. We use wearables. Our clients use wearables. They're genuinely useful tools. But we've also sat next to hundreds of people during actual lab-measured VO₂ Max tests, and we've compared those results to what their watches said walking in the door. The gap is real, it's consistent, and once you understand it, your wearable actually becomes a better tool, not a worse one.
Your smartwatch doesn't measure oxygen. It doesn't analyze your blood. It doesn't know what's happening inside your mitochondria. What it does is track two things well: your heart rate and your movement speed.
The algorithm behind most wearable VO₂ Max estimates is based on a concept called the heart rate-to-workload relationship. The idea is straightforward. If two people walk the same speed up the same hill, the person whose heart rate stays lower is probably more aerobically fit. Fitter people do more work at a lower cardiac cost.
Apple uses outdoor walking and running data paired with optical heart rate. Garmin uses a version of the Firstbeat algorithm, which layers in pace, elevation, and heart rate variability. COROS, Polar, and others use similar approaches with their own tweaks. They're all working from the same fundamental model: infer oxygen consumption from heart rate response to a known workload.
That model isn't wrong. It's built on real physiology. The problem is that it's built on population averages, and you are not a population average.
Let's look at what independent studies have found, because this is where it gets interesting.
A 2022 study published in the British Journal of Sports Medicine compared Apple Watch VO₂ Max estimates to laboratory cardiopulmonary exercise testing in over 100 participants. The watch overestimated VO₂ Max by an average of 13%, with individual errors ranging from 9% to more than 20%. A separate validation study of Garmin's Firstbeat-derived estimates found a mean absolute error of about 3.5 mL/kg/min, which in practical terms can mean the difference between "average" and "good" on any standard fitness chart.
Researchers at Stanford and Johns Hopkins have both noted that wearable estimates tend to be least accurate for the people who need accuracy most: those at the lower and higher ends of the fitness spectrum. If you're quite deconditioned, your watch tends to be more generous than reality. If you're highly trained, the algorithm often undersells you. The models are calibrated to the middle of the bell curve, and they compress the extremes.
None of this makes your watch useless. But it means the absolute number it gives you should be treated as a rough neighborhood, not a street address.
The baseline inaccuracy is one thing. But there are specific conditions where wearable VO₂ Max estimates go from "roughly directional" to "actively misleading."
If you live or train at elevation, your watch is likely overestimating more aggressively than it would at sea level. The algorithm sees your heart rate running higher than expected for a given pace and interprets that as lower fitness. But some watches then overcorrect with altitude adjustments that swing too far the other direction. We see this constantly with clients who travel between Santa Cruz and Tahoe or the Sierra. Their estimated VO₂ Max can swing four to six points in a week based purely on geography.
Heart rate drifts upward in hot conditions as your body shunts blood to the skin for cooling. Your watch doesn't know the difference between cardiac drift from heat and cardiac drift from fatigue. A run in 90-degree heat will often produce a lower VO₂ Max estimate than the same effort on a cool morning, even though your fitness didn't change overnight.
Since the algorithm depends on pace, anything that corrupts your GPS data corrupts the estimate. Running in dense tree cover, between tall buildings, or on tight switchback trails creates GPS scatter that makes your pace data unreliable. Your watch might think you ran an 8:30 mile when you actually ran a 9:15, and it recalculates your fitness estimate accordingly.
Optical heart rate sensors have improved dramatically, but they still struggle with certain skin tones, tattoos, cold conditions, and high-intensity intervals where wrist motion creates noise. If your heart rate data is off by even five to eight beats per minute consistently, the entire VO₂ Max calculation shifts with it.
Apple Watch can estimate VO₂ Max from outdoor walks, which is genuinely useful for people who don't run. But walking-based estimates carry wider error margins because the heart rate-to-workload relationship is less linear at lower intensities. We've seen walking-derived estimates differ from lab results by as much as 20% in some of our clients.
Here's where we stop sounding like wearable skeptics and start sounding like coaches who actually use this data every day.
Trend tracking is the real superpower of a wearable VO₂ Max estimate. We don't care much whether your watch says 41 or 45 in absolute terms. We care whether the number is trending up, trending down, or flat over the course of weeks and months. That directional information is genuinely valuable, and your watch delivers it for free, passively, without you having to do anything.
But trend tracking only works if you have a baseline anchor. Without knowing your real VO₂ Max, you don't know what the watch number means relative to your actual physiology. You're tracking movement in a number without knowing where you actually stand.
This is where one lab-measured test changes the game. Once you know your real VO₂ Max from a controlled, calibrated protocol with direct gas analysis, you can map your watch estimate against it. If the lab says 36 and your watch says 42, you now know your watch runs about 6 points hot. That offset tends to stay fairly consistent over time. So when your watch shows 44 a few months later, you can reasonably infer that your actual VO₂ Max has moved up by about 2 points. That's meaningful, actionable information.
Without the anchor, you're just watching a number move and guessing what it means.
We talk to a lot of people who assume that if they get a VO₂ Max test, it's a one-time curiosity. They'll see a number, feel good or bad about it, and that's that. But a lab test does something a watch cannot: it gives you your ventilatory thresholds.
Your VO₂ Max is the ceiling. Your ventilatory thresholds, VT1 and VT2, are the floors and walls of the rooms you actually train in every day. VT1 tells you where easy breathing turns into moderate effort. VT2 tells you where sustainable effort turns into a redline you can only hold for minutes. Both of these thresholds correspond to specific heart rate zones that are unique to you.
And this is the real problem with wearable-only training. Your watch estimates your VO₂ Max using a generic algorithm, then it sets your training zones using that same generic framework. If the VO₂ Max estimate is off, your zones are off. And if your zones are off, your easy runs aren't easy enough, your tempo runs aren't at the right intensity, and your interval targets are guesswork.
Once you have lab-measured thresholds, you can manually set your heart rate zones on your watch or training platform to reflect your actual physiology. Now your Garmin's zone 2 is your real zone 2. Your Apple Watch's "moderate" intensity actually aligns with your VT1. You're no longer training in a fiction. You're training in your body.
We've had clients come in, get tested, adjust their zones, and within six to eight weeks see their watch VO₂ Max estimates start climbing, not because the test made them fitter, but because they finally started training at the right intensities. They were doing the same volume of work. They just stopped accidentally spending half their runs in no-man's-land, too hard to be aerobic development and too easy to be a real threshold session.
If you're thinking about getting tested, here's how to prepare for a VO₂ Max test so you get the most accurate results possible. And after the test, understanding how to read your VO₂ Max results makes the difference between a number on a report and a training plan that actually works.
Your Apple Watch, Garmin, COROS, or Polar is not lying to you. It's doing the best it can with the information it has. But "the best it can" still means an error margin that can put you in the wrong fitness category, set your training zones incorrectly, and give you false confidence or false concern about where you stand.
The fix isn't throwing your watch in a drawer. The fix is getting one real number to anchor everything else. One lab test, once, gives you a reference point that makes every watch estimate, every training zone, and every trend line more honest and more useful.
We run these tests regularly at Mavericks. They take about 30 minutes. You don't need to be an elite athlete. You just need to be someone who wants to know what their body is actually doing instead of what an algorithm guesses it's doing. And once you know, that number on your wrist finally starts telling you something true.
The series runs in order, but each post stands alone. Pick up wherever the title catches you.
Book a private session, rent the floor for your own clients, or tell us your goals and we'll match you with a coach.
Santa Cruz coastPrivate, one-on-one sessions with a coach matched to your goals.
Book a session →Tell us what you're working on. We'll point you at the right coach on the floor.
Start here →Browse the full directory of Mavericks Fitness coaches and their specialties.
See directory →Rent the studio at a flat rate. Bring your clients. Keep your rate.
See the offer →