The April Sell List — CalledThird

The hard part about April

It’s been 22 games. Andy Pages is running a .403 wOBA. Ben Rice leads the AL in batting average. Mike Trout just put 5 home runs in the bleachers at Yankee Stadium in four days. And Mason Miller has now thrown 30⅔ scoreless innings to start his season.

If you’ve been reading any baseball coverage this month, you already know all of this. What you don’t know is which of these performances will still look special when the leaves turn — and which ones we’ll all stop talking about by the end of May.

We tried to answer that. Twice. The first time, we got Ben Rice and Mike Trout wrong. (We’ll explain.) The second time, two completely different statistical methods independently arrived at the same answer for nearly every name on the leaderboard. That’s the version you’re reading.

Here’s what 22 games of baseball can — and can’t — tell you.

The first thing to know: April lies. A lot.

Across the past four seasons (2022-2025), we identified the five hottest 22-game wOBA starts each year: the players who were torching the league through April. Twenty player-seasons in total.

Of those 20, two maintained at least 85% of their April pace through the rest of the season. Eighteen didn’t. The median decline was .135 of wOBA (135 points), roughly the gap between an MVP candidate and a league-average bat.
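For readers who want to run the same kind of count on their own data, here is a minimal sketch. It assumes that "maintained 85% of the April pace" means rest-of-season wOBA divided by 22-game wOBA is at least 0.85; that definition, the function name, and the sample values are ours for illustration and are not taken from noise_floor.json.

```python
from statistics import median

def summarize_hot_starts(starts, keep_ratio=0.85):
    """Summarize (april_woba, rest_of_season_woba) pairs.

    keep_ratio is an assumed reading of "maintained 85% of the April pace".
    """
    declines = [april - ros for april, ros in starts]
    held = sum(1 for april, ros in starts if ros >= keep_ratio * april)
    return {
        "held": held,
        "faded": len(starts) - held,
        "median_decline": round(median(declines), 3),
    }

# Placeholder values for illustration only; these are NOT the article's data.
example = [
    (0.480, 0.420),
    (0.470, 0.330),
    (0.455, 0.335),
    (0.450, 0.310),
    (0.445, 0.340),
]
summary = summarize_hot_starts(example)
print(summary)
```

Swapping the placeholder pairs for real 22-game and rest-of-season values is the only change needed to reproduce a count like the one above.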


Chart: all 20 player-seasons that led 22-game wOBA in 2022-2025 (top 5 per season). Lines show the cumulative running wOBA at g22, g50, g100, and g162; interior points are computed from the actual April and rest-of-season data. Median full-season decline: -0.135 wOBA. Source: research/hot-start-half-life/data/noise_floor.json.

Each line is one of the past four seasons’ top wOBA starts. By July, almost none of them looks like an MVP anymore.

So when you look at this year’s leaderboard, the prior — the thing your eyes tell you when you ignore everything else — should be: most of this evaporates.

Where it gets interesting is asking which ones.

The names baseball is talking about

We ran the entire 287-hitter universe through two independent statistical pipelines: one Bayesian, one machine-learning. They disagreed plenty, but where they agreed, they agreed strongly.

For the published-leaderboard names, they agreed almost completely. Here’s the buy-and-sell sheet:

A clarification before you read the list. “Sell” doesn’t mean these players are bad. Aaron Judge projected at .395 is still elite, and Trout at .370 is still a top-30 hitter. “Sell” means: don’t pay them as if their April pace is the new baseline — the model says they’ll regress roughly to where preseason projections already had them. The April hot start adds essentially zero new information about who they are.

“Buy” works the same way in reverse. In absolute wOBA, the buy list projects lower than the sell list: Pereira’s projected .307 is well below Judge’s projected .395. The point is that Pereira’s preseason baseline was .220 (sub-replacement), and the model now thinks he’ll hit .307, which is 87 wOBA points above what anyone projected him to be. The market underrated him; the model has updated. The columns are about belief change, not raw talent.

↘ Sell (6 names): hot but regresses to baseline

Andy Pages (LAD): Prior .331, April .403, Projected .330. Badge: -1 vs prior.
Ben Rice (NYY): Prior .345, April .500, Projected .346.
Mike Trout (LAA): Prior .362, April .425, Projected .370.
Aaron Judge (NYY): Prior .402, April .435, Projected .395. Badge: -7 vs prior.
Corbin Carroll (ARI): Prior .380, April .421, Projected .388.
Max Muncy (LAD): Prior .355, April .407, Projected .365. Badge: +10 vs prior.

~ Hold (1 name): above baseline with caveats

Munetaka Murakami (CWS): Prior .291, April .418, Projected .348. Badge: +57 vs prior. NPB rookie; prior is league-average proxy.

↗ Buy (6 names): above baseline, under the radar

Jac Caglianone (LAA): Prior .240, April .320, Projected .310. Badge: +70 vs prior.
Everson Pereira (NYY): Prior .220, April .411, Projected .307. Badge: +87 vs prior.
Jorge Barrosa (ARI): Prior .184, April .322, Projected .265. Badge: +81 vs prior.
Samuel Basallo (BAL): Prior .246, April .334, Projected .312. Badge: +66 vs prior.
Coby Mayo (BAL): Prior .263, April .282, Projected .304. Badge: +41 vs prior.
Brady House (WSH): Prior .255, April .298, Projected .300. Badge: +45 vs prior.

Legend: Prior = preseason baseline. April = current pace. Projected = R3 model verdict. Badge = Projected − Prior, in wOBA points. The actual signal: how much the model updates above your preseason expectation.

Sleeper relievers (4)

Antonio Senzatela (COL) • Daniel Lynch (KC) • John King (TEX) • Caleb Kilian (CHC)

Each row shows three values: Prior (preseason baseline), April (current pace), and Projected (where both methods land for rest of season). The badge is Projected − Prior in wOBA points: how much the model updated above your preseason expectation. That’s the actual signal.
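The badge arithmetic is simple enough to check by hand. The sketch below recomputes it for five rows copied from the sheet above; the dictionary layout and the `badge` helper are our own illustration, not the code that produced the published table.

```python
def badge(prior, projected):
    """Projected minus Prior, expressed in wOBA points (thousandths)."""
    return round((projected - prior) * 1000)

# (prior, april, projected) copied from the buy-and-sell sheet.
rows = {
    "Andy Pages": (0.331, 0.403, 0.330),
    "Aaron Judge": (0.402, 0.435, 0.395),
    "Max Muncy": (0.355, 0.407, 0.365),
    "Munetaka Murakami": (0.291, 0.418, 0.348),
    "Everson Pereira": (0.220, 0.411, 0.307),
}

badges = {name: badge(prior, proj) for name, (prior, _april, proj) in rows.items()}

# Sort by belief change rather than by raw projected wOBA.
for name, pts in sorted(badges.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:20s} {pts:+d}")
```

Sorting by badge puts Pereira first and Judge last, even though Judge has the highest projected wOBA of the five. That is the sense in which the columns measure belief change rather than talent.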

Six MVP-pace starts. Six reasons they won’t last:

Andy Pages · LAD: April pace .403, Pre-2026 .331, Projected .330. xwOBA .360 vs wOBA .403: luck on placement, not skill.
Ben Rice · NYY: April pace .500, Pre-2026 .345, Projected .346. Real walk rate, real power, but a .378 BABIP is doing too much work.
Mike Trout · LAA: April pace .425, Pre-2026 .362, Projected .370. His preseason baseline is already this high. April adds no info.
Aaron Judge · NYY: April pace .435, Pre-2026 .402, Projected .395. Same as Trout: the prior absorbs the April excursion.
Corbin Carroll · ARI: April pace .421, Pre-2026 .380, Projected .388. Star whose April beats the high baseline only marginally.
Max Muncy · LAD: April pace .407, Pre-2026 .355, Projected .365. Power surge real-ish; closer to prior than to current pace.

Andy Pages. Statcast’s expected wOBA — what his contact quality alone would normally produce — is .360. His actual wOBA is .403. That gap is luck on where the balls are landing, not skill. The hard contact is real (60% hard-hit rate, in the elite tier). The .403 isn’t. Both methods project him to land near his pre-2026 baseline of .331 — essentially zero rest-of-season improvement over what we already expected.

Ben Rice. Genuinely good plate discipline (20% walk rate) on a real power surge. But the .378 BABIP is doing too much of the work. He’ll land closer to .345 — All-Star caliber, not MVP-caliber.

Mike Trout. This is the one our model wants to take seriously, and won’t. Trout’s preseason baseline is already so high that even a 5-HR series at Yankee Stadium doesn’t move our projection meaningfully. He’s going to keep being Mike Trout. Mike Trout running .425 is not new information.

Aaron Judge. Same story as Trout. His preseason baseline is .402 wOBA. He’s running .435. The math says: that’s mostly the same Aaron Judge.

Corbin Carroll, Max Muncy. Same math. Stars whose April production is strong but not above what their baselines already predicted.

The version of this article that says “Aaron Judge will collapse” would be wrong, and we’re not writing it. The honest version is: the hot-start portion isn’t adding much information beyond what we already knew about him.

“Why should I trust this?”

Fair question. Before we get to the sleeper picks, here’s the test we ran on data the model never saw.

We trained the model on 2022-2024 player-seasons only. Then we asked it to predict the rest-of-season wOBA delta vs preseason prior for every 2025 player-season — 426 of them — using only their first 22 games. Then we waited until the 2025 season ended and looked at what actually happened.

83% of top-decile predicted sleepers had positive ROS gain over prior in 2025
+57 wOBA points: average actual gain in top decile (vs +1 for the corpus average)
85% of actual outcomes fell inside the model's 80% prediction interval

Chart: predicted vs actual rest-of-season wOBA delta, by predicted decile (2025 holdout). For each predicted decile, we show the model’s mean prediction and what those player-seasons actually delivered. Monotonic = model ranking maps to reality.

r = 0.56: Pearson correlation, predicted vs actual
RMSE 0.041: on rest-of-season wOBA delta
n = 426: player-seasons in 2025 holdout
Training data: 2022-2024 only

Source: research/hot-start-half-life/codex-analysis/round3/tables/r3_qrf_hitter_raw_coverage_2025.csv. Holdout test: model trained on 2022-2024, never seen 2025 data during training, then asked to predict each 2025 player-season’s rest-of-season wOBA delta vs preseason prior. Numbers above are what happened.

Three things to take from this:

The top-decile predictions worked. Of the 42 player-seasons the model called as top-decile sleepers in 2025, 35 actually posted positive ROS gains over their preseason prior. The mean lift over the corpus average was +56 wOBA points.
The bottom-decile predictions also worked. The model’s “fade” calls had 64% actually negative ROS deltas, with a mean lift of −36 wOBA points vs corpus.
The deciles ramp monotonically. Decile 1’s actual outcome (+57) and decile 10’s (−36) sit roughly 90 wOBA points apart, and the deciles in between step down in order. The model’s ranking maps to reality across the entire distribution, not just the extremes.

None of this proves Caglianone or Pereira will outperform their preseason expectations — only that, on the most recent 426 player-seasons we could test against, the same methodology produced rankings that did. The 2026 picks below come from the same model, applied to 2026’s first 22 games. If you’re skeptical about the named picks, this is the evidence we have.
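The holdout checks described above (decile table, Pearson r, RMSE, interval coverage) can be sketched in a few lines. This is a simplified illustration in plain Python, not the code in the research repo; the function names are ours and the inputs at the bottom are synthetic.

```python
import math

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def rmse(pred, actual):
    """Root mean squared error of predictions against outcomes."""
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(pred, actual)) / len(pred))

def decile_table(pred, actual, n_bins=10):
    """Rank by prediction (highest first), cut into bins, summarize outcomes."""
    order = sorted(range(len(pred)), key=lambda i: pred[i], reverse=True)
    table = []
    for b in range(n_bins):
        idx = order[b * len(order) // n_bins:(b + 1) * len(order) // n_bins]
        table.append({
            "decile": b + 1,
            "n": len(idx),
            "mean_pred": sum(pred[i] for i in idx) / len(idx),
            "mean_actual": sum(actual[i] for i in idx) / len(idx),
            "share_positive": sum(actual[i] > 0 for i in idx) / len(idx),
        })
    return table

def interval_coverage(actual, lower, upper):
    """Share of outcomes that land inside their prediction interval."""
    return sum(lo <= a <= hi for a, lo, hi in zip(actual, lower, upper)) / len(actual)

# Synthetic inputs for illustration only; NOT the 2025 holdout.
pred = [(i - 50) / 1000 for i in range(100)]
actual = [p + ((i * 37) % 21 - 10) / 1000 for i, p in enumerate(pred)]
table = decile_table(pred, actual)
print(round(pearson(pred, actual), 3), round(rmse(pred, actual), 4))
```

With 426 rows and this binning rule, the first bin holds 42 player-seasons, which matches the top-decile count quoted above. Whether the real pipeline bins the same way is an assumption on our part.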

The names baseball isn’t talking about

This is the part nobody else is publishing.

For every Andy Pages on the ESPN leaderboard, there’s a player whose 22 games look quieter on the surface but whose underlying components — exit velocity, plate discipline, contact quality, prior skill — project meaningfully better than their preseason baseline.

We screened all 287 hitters with at least 50 plate appearances. We required a name to clear two bars: predicted rest-of-season delta in the top decile, AND not appear in any mainstream “April hot starter” leaderboard. Then we required both of our independent methods to surface the name.
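The screen is a set operation at heart: take each method's top decile, keep the overlap, and drop anyone already on a mainstream leaderboard. A minimal sketch follows; the function name, the `top_share` parameter, and the placeholder hitter IDs are hypothetical and do not come from the research code.

```python
def screen_sleepers(pred_a, pred_b, mainstream, top_share=0.10):
    """Names in the top share of BOTH methods, minus mainstream leaderboard names.

    pred_a / pred_b map name -> predicted rest-of-season delta for each method.
    """
    def top_names(pred):
        k = max(1, round(len(pred) * top_share))
        ranked = sorted(pred, key=pred.get, reverse=True)
        return set(ranked[:k])

    return (top_names(pred_a) & top_names(pred_b)) - set(mainstream)

# Placeholder universe of 20 anonymous hitters; values are illustrative only.
names = [f"H{i:02d}" for i in range(20)]
method_a = {name: i / 1000 for i, name in enumerate(names)}
method_b = dict(method_a, H18=0.0)  # method B disagrees about H18
leaderboard = {"H19"}               # already getting mainstream coverage

picks = screen_sleepers(method_a, method_b, leaderboard, top_share=0.20)
print(sorted(picks))
```

Requiring agreement before filtering on coverage means a name that only one method likes never reaches the list, however quiet its April looked.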

Six hitters cleared. Here they are:

Sleeper hitters

Jac Caglianone · LAA · OF: 95th-percentile hard-hit rate. April .320 → Projected .310. Contact quality this loud usually persists. Not a BABIP story.
Everson Pereira · NYY · OF: .411 wOBA, quietly behind Rice. April .411 → Projected .307. Power + plate discipline shape, not a luck story.
Jorge Barrosa · ARI · OF: Loud underlying contact metrics. April .322 → Projected .265. Quiet line, real bat. Both methods agree.
Samuel Basallo · BAL · C/1B: Top-decile prospect profile. April .334 → Projected .312. The Orioles have a habit. Basallo is the next iteration.
Coby Mayo · BAL · 3B: Top decile of the universe scan. April .282 → Projected .304. Same Orioles habit. Both methods rank him in the top 10%.
Brady House · WSH · 3B: Hasn’t shown up in early-season coverage. April .298 → Projected .300. Underlying contact metrics put him on this list.

Sleeper relievers

Antonio Senzatela · COL: Converted starter, K% jumped 11pp. April K% 23.5% → Projected K% ~22%. Coors is irrelevant for a single-inning reliever. Both methods like the strikeout shape.
Daniel Lynch · KC: K% rose from 17.7% prior to 37.1% in April. April K% 37.1% → Projected K% ~27%. The textbook sleeper shape: real underlying gain even after we shrink the April spike.
John King · TEX: Quieter K% jump, both methods clear. April K% 22.5% → Projected K% ~22%. Less dramatic than Lynch but same direction; both pipelines surface him.
Caleb Kilian · CHC: Same shape as the others. April K% 26.1% → Projected K% ~22%. Convergent across both methods on the K% direction.

Jac Caglianone (LAA, OF). 95th-percentile hard-hit rate behind a .320 wOBA. The contact quality is the kind of thing that survives April — it’s not a BABIP story.

Everson Pereira (NYY, OF). Quietly behind Rice on the Yankees’ depth chart with a .411 wOBA. Both methods like the underlying shape: power and plate discipline, not luck.

Jorge Barrosa (ARI, OF). Defensive utility outfielder whose 22-game line (.322 wOBA) doesn’t look loud, but the contact metrics and discipline both point at a real bat that’s about to surface in a regular role.

Samuel Basallo (BAL, prospect). The Orioles have a habit. Basallo is the next iteration.

Coby Mayo (BAL). Same Orioles habit. Both methods rank his 22-game profile in the top decile of the universe.

Brady House (WSH). A name that hasn’t shown up in early-season coverage at all. Underlying contact metrics put him there.

And four relievers. The reliever pool was 218 names; we required at least 25 batters faced and excluded everyone in the top 30 of last year’s saves leaders (so no Edwin Diaz, no Emmanuel Clase, no Jhoan Duran):

Antonio Senzatela (COL): converted starter whose K% jumped 11 percentage points against his 3-year prior. The Coors part is irrelevant for a single-inning reliever.
Daniel Lynch (KC): 17.7% prior K%, 37.1% in April, projects ~27% rest-of-season. The textbook sleeper shape: a real underlying gain even after we shrink the April spike.
John King (TEX): quieter K% jump, but the projection clears the bar in both methods.
Caleb Kilian (CHC): same shape, with a 26.1% April K% that projects to ~22% rest-of-season.

The one we couldn’t agree on

Munetaka Murakami is the only April performance that neither of our methods is willing to write off — which is interesting, because it’s also the one where our methodology is weakest.

Quick context if you haven’t been paying attention: Murakami is a 26-year-old corner infielder who spent eight seasons with the Tokyo Yakult Swallows, hitting 246 home runs, before signing a 2-year, $34M deal with the White Sox in December. In 2022 alone, he hit 56, breaking Sadaharu Oh’s NPB single-season record for a Japanese-born player and winning the Triple Crown at age 22. Through his first 21 MLB games: 8 home runs, a .418 wOBA, and a home run in each of his first three career games (a White Sox first; he is only the fourth player in MLB history to do it).

Both of our pipelines see signal in that profile. They disagree on how strong: our Bayesian model calls it “signal, medium confidence.” Our machine-learning model calls it “ambiguous, low confidence.” They agree on the direction (both think Murakami is the cleanest signal of any leaderboard name) and they agree on why they’re not more sure.

The reason is the comparable. We don’t have one.

The natural reference is Shohei Ohtani, who came over from the Hokkaido Nippon-Ham Fighters in 2018. But Ohtani’s NPB power profile doesn’t actually look like Murakami’s. Ohtani’s NPB peak was 22 home runs in a season (2016, his Pacific League MVP year); Murakami’s NPB peak was 56. Ohtani arrived in MLB and immediately hit 22 home runs in 104 games as a rookie — roughly the same pace as his NPB high. Apply that same logic to Murakami and you’d project him for 50+ MLB home runs, which neither of our models will say out loud.

The honest framing: nobody with Murakami’s peak NPB power has ever come over to MLB before. There’s no robust translation factor for someone with his profile, because the sample of Japanese-born hitters who reached 56 home runs in an NPB season and then came to MLB is one (him). Our model assigns him a league-average preseason baseline because we didn’t build an NPB-to-MLB power translator. That choice makes our “signal” partly a statement about how much of his 22 games clears a methodological low bar, rather than a confident MVP prediction.

Watch him. The plate discipline, the hard contact, the early home run pace — it all looks like a real bat. Whether it ends up looking like Ohtani’s rookie year (.285, 22 HR), the high end of what NPB hitters historically translate to, or something further up the curve nobody has seen yet — that’s the part we genuinely don’t know.

Mason Miller: what’s real and what isn’t

Miller has thrown 30⅔ consecutive scoreless innings. The streak is genuinely historic for a closer. But here’s the thing: most of what gets written about a streak like this confuses real talent with predictable streak length. Those are different questions.

What’s real

His strikeout rate is genuinely elite. Pre-2026: 40.7%. April 2026: 65.9%. Projected: 49.5%.

After shrinking April’s 65.9% K% toward his elite-but-not-supernatural prior, both methods land him at 49.0%–50.0% rest of season. That’s a real gain over an already-elite baseline.

What’s not predictable

How much longer the scoreless streak goes: 30⅔ IP and counting.

Honest answer: we don’t know. Predicting streak length requires modeling baserunner states we didn’t build. The “65% chance of 5 more IP” probabilities you’ll see elsewhere are mostly guesswork; they’re derived from “how often does he allow a home run,” which is one of three or four ways a reliever can give up a run.

What’s real: His strikeout rate. Miller’s pre-2026 K% was 40.7%. He’s running 65.9% in April. After we shrink that toward his prior — which is what statistical models do, because nobody actually maintains a 65% K% over a full season — we project his rest-of-season K% at 49–50%. That’s elite, and it’s a real gain over his already-elite baseline.
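To see how hard the April number is being pulled back, the published figures can be plugged into a plain linear shrinkage formula. This is a sketch under an assumption: the article does not say the projection is a single-weight blend of prior and April, so the back-solved weights below are arithmetic on the quoted numbers, not parameters read out of either model. Lynch's ~27% is an approximate figure, so his weight is approximate too.

```python
def shrink(prior, observed, weight):
    """Move from the prior toward the observed rate by `weight` (0 to 1)."""
    return prior + weight * (observed - prior)

def implied_weight(prior, observed, projected):
    """Back-solve the weight on April implied by a published projection."""
    return (projected - prior) / (observed - prior)

# K% figures quoted in the article, in percent.
miller_w = implied_weight(prior=40.7, observed=65.9, projected=49.5)
lynch_w = implied_weight(prior=17.7, observed=37.1, projected=27.0)

print(f"Miller: April carries about {miller_w:.0%} of the weight")
print(f"Lynch:  April carries about {lynch_w:.0%} of the weight")

# Round trip: applying the implied weight reproduces the published projection.
assert abs(shrink(40.7, 65.9, miller_w) - 49.5) < 1e-9
```

On these numbers, roughly a third of Miller's April jump survives into the projection, and a bit under half of Lynch's.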

What’s not predictable, despite what you’ve read elsewhere: how much longer the scoreless streak will go. The probabilities you’ll see floated — “65% chance he gets to 50 innings,” that sort of thing — are not derived from a model that actually understands how runs score in baseball. They’re derived from “how often does he allow a home run,” which is one of three or four ways a reliever can give up a run.

We didn’t build the right model for that question, and we’re not going to publish a number we don’t believe. What we can tell you: the strikeout rate is real, and as long as the strikeout rate stays elite, the streak length is mostly a function of luck and high-leverage usage, neither of which our model has access to.

What we got wrong (and why we’re showing you)

The first time we ran this analysis, we said Ben Rice and Mike Trout were both real signals. We said it because we’d added “contact-quality features” — exit velocity, hard-hit rate, barrel rate — to our projection model.

When our second statistical pipeline disagreed, we audited our own code. The “contact-quality features” turned out to be a hand-tuned 50/50 blend of two stats (wOBA and xwOBA), with the other contact features computed but not actually entering the projection. When we replaced the hand-tuned blend with a learned-coefficient blend fit on 2022-2024 player-seasons and validated on the 2025 holdout (the kind of thing we should have done the first time), Rice and Trout both collapsed back to noise.
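The difference between a hand-tuned blend and a learned one is easy to show in miniature. The sketch below fits a single weight for a two-stat blend by least squares; it is a deliberate simplification (the blend described in the Methodology section uses more inputs), and the function names and sample values are ours, chosen so the true weight is known.

```python
def fit_blend_weight(woba, xwoba, target):
    """Least-squares weight w for the blend w * wOBA + (1 - w) * xwOBA."""
    num = sum((w - x) * (t - x) for w, x, t in zip(woba, xwoba, target))
    den = sum((w - x) ** 2 for w, x in zip(woba, xwoba))
    return num / den

def sse(weight, woba, xwoba, target):
    """Sum of squared errors for a given blend weight."""
    return sum((weight * w + (1 - weight) * x - t) ** 2
               for w, x, t in zip(woba, xwoba, target))

# Illustrative values only. The target is built with a true weight of 0.3,
# so a correct fit should recover 0.3 rather than the hand-tuned 0.5.
woba = [0.400, 0.350, 0.300, 0.420]
xwoba = [0.360, 0.340, 0.330, 0.380]
target = [0.3 * w + 0.7 * x for w, x in zip(woba, xwoba)]

learned = fit_blend_weight(woba, xwoba, target)
print(round(learned, 3), sse(0.5, woba, xwoba, target) > sse(learned, woba, xwoba, target))
```

In practice the weight would be fit on training seasons and scored on a season the fit never saw, which is the step our first pass skipped.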

Most baseball analytics sites don’t show their retractions. We do.

Ben Rice (NYY). Earlier read: SIGNAL. Now: NOISE. Why: Our first pass put too much weight on Statcast “expected stats.” When we fit the weights on 2022-2024 data and checked them against the 2025 holdout, his hot start mostly disappears.

Mike Trout (LAA). Earlier read: SIGNAL. Now: NOISE. Why: Same story. His 5-HR Yankee Stadium series is selection-bias noise on a guy whose baseline is already All-Star caliber. The math says he’ll be Mike Trout. He was already Mike Trout.

We’re showing you this because most baseball analytics sites don’t show their retractions. They publish the version they thought was right and quietly stop talking about it when it isn’t. That feels worse than just being upfront: here’s what we said in our first pass, here’s what we say now, here’s what changed.

The version of this article that ran a week ago would have told you Mike Trout was real. We’d have been wrong. Worth knowing.

What this all means for the next 30 days

The shortest summary:

Don’t get attached to Pages, Rice, Trout, Judge, Carroll, or Muncy’s April lines. The data on April hot starts is brutal, and these are the most-talked-about names that the data thinks won’t last in their current form.
Watch six hitters nobody else is talking about — Caglianone, Pereira, Barrosa, Basallo, Mayo, House. Two methods independently said they’d be on this list before we filtered to “names not in the news.”
Murakami is the one big-name April signal that survives our analysis. With caveats.
Mason Miller’s strikeout rate is real. His streak length is not credibly forecastable, despite what you’ve read elsewhere.
April is short enough that the only thing you should be confident about is the shape of the noise. Almost nobody who looks like an MVP through 22 games is one in October.

We’ll re-check at the All-Star break.

Methodology

We ran two independent statistical pipelines on the same April 2026 universe (287 hitters with at least 50 plate appearances, 218 relievers with at least 25 batters faced through 2026-04-24). Both methods used Statcast data 2022-2025 as the historical training corpus.

Method A (Bayesian). Hierarchical Bayesian projection (NUTS in numpyro) with empirical-Bayes shrinkage. The shrinkage strength was anchored to a re-derived player-season-level stabilization rate for each component stat (the standard published Carleton 2007 stabilization rates for plate-discipline stats; corrected estimates for wOBA and ISO based on 2022-2025 data). Contact quality entered the projection via a learned linear blend of wOBA, xwOBA, EV p90, HardHit%, Barrel%, and prior wOBA, with coefficients fit on 2022-2024 player-seasons and validated on a 2025 holdout (RMSE 0.0359 vs 0.0371 for a wOBA-only baseline).

Method B (Machine learning). LightGBM gradient-boosted projection, trained on 2022-2024, validated on 2024, tested on 2025. Quantile regression forests for prediction intervals. Historical-analog retrieval via cosine similarity over standardized 22-game feature vectors.
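The analog-retrieval step in Method B can be illustrated with a small sketch: z-score each feature column, then rank other player-seasons by cosine similarity to the query. The helper names, the three-feature layout, and the sample rows are hypothetical; the real feature set is in the research repo.

```python
import math

def standardize(rows):
    """Column-wise z-scores; a zero-variance column is left centered at 0."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def nearest_analogs(query_idx, rows, k=3):
    """Indices and similarities of the k rows most similar to rows[query_idx]."""
    z = standardize(rows)
    scores = [(i, cosine(z[query_idx], z[i]))
              for i in range(len(rows)) if i != query_idx]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:k]

# Illustrative 22-game vectors: (wOBA, HardHit%, Barrel%). Not real players.
features = [
    [0.400, 50.0, 12.0],
    [0.395, 49.0, 11.5],
    [0.300, 35.0, 5.0],
    [0.310, 36.0, 20.0],
]
analogs = nearest_analogs(0, features, k=1)
print(analogs)
```

Standardizing first matters because raw columns sit on different scales; without it, a percentage column would dominate a wOBA column in the dot product.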

Cross-review. Both methods were independently cross-reviewed — each agent read the other’s code and report and wrote a skeptical peer review. Across three rounds of analysis, three rounds of cross-review, and the comparison memos, we retracted multiple findings (the R1 “stabilization rates shifted” headline, the R2 Rice/Trout SIGNAL claims, several specific sleeper picks) and resolved several disagreements (Lynch as a sleeper rather than fake-dominant; the Tristan Peters and Louis Varland names dropped from sleeper lists).

Caveats we owe you. Codex’s prediction intervals over-cover by ~5pp on validation; they’re deliberately conservative. Codex’s sleeper floor required a positive preseason wOBA baseline; some of our sleeper hitters (Pereira, Barrosa) have priors below the .250 line we’d ideally use. Codex’s “fake hot” rule used a population residual standard deviation (not a per-player Bayesian SD) as the threshold; only Carter Jensen cleared it. Claude’s contact-quality blend was validated on a single 2022-2024 → 2025 holdout, not a k-fold cross-validation; coefficient stability across folds is not established. Murakami’s prior is a 60-PA league-average proxy because no NPB-translation was used. We did not model defensive metrics, park factors, or causal mechanisms behind any individual player’s profile.

Code & reproducibility. Full code, both agent prompts, all three comparison memos, and the cross-reviews are in the CalledThird research repo under research/hot-start-half-life/.

Cite this analysis

CalledThird. "The April Sell List." CalledThird.com, April 25, 2026. https://calledthird.com/analysis/april-sell-list

All CalledThird analysis is original research. If you reference our findings, data, or charts in your work, please link back to the original article. For data inquiries: hello@calledthird.com
