How the leaderboards are computed

The headline ranking on every segment page is the per-segment residual of an OLS regression. This page explains what that means, why it's the metric of choice, and the caveats attached to it.

Why residuals, not raw watches

Letterboxd watch counts correlate strongly with theatrical reach: a film released in 50 countries with a $200M box office will collect millions of LB watches almost regardless of how good it is. Ranking by raw watches mostly ranks by marketing budget.

Residuals strip out the expected effect of theatrical reach and ask the more interesting question: given how widely this film was released, how much more (or less) did Letterboxd embrace it? A positive residual means LB punched above the regression line; a negative one means it punched below.

The two metrics

The site sorts segments by per-segment Metric B residual. Each release type fits its own regression, so a streaming-only film's residual is interpreted relative to other streaming-only films, not against blockbusters.

What `residual_b_segment` means

For a film in wide_theatrical:

A residual of +1.0 means its log-watches are 1 above what the wide-theatrical regression predicts from its country count, year, and age — i.e. it's roughly e¹ ≈ 2.7× more watched than expected.
0 is exactly on the regression line.
−1.0 means it's roughly 2.7× less watched than expected.

What different residual values look like

Concrete examples from the current dataset, picked to give the magnitudes some weight.

Alternative ranking metrics

The leaderboard sort selector exposes three additional metrics alongside the residual:

Correspondence ratio = letterboxd_watches / (domestic_gross_usd / $11). Direct proxy for "what fraction of estimated US theatrical viewers also Letterboxd-watched the film." A value of 0.10 means LB has roughly 10% of the US theatrical audience for that film; 1.0 means equal; > 1 means LB watched MORE than the theatrical run delivered. Median across our data is ~0.10. Available only for films with reported domestic gross > 0 (theatrical releases).
Like rate = letterboxd_likes / letterboxd_watches. Of the people who logged it on LB, what fraction tapped the heart? Sorts surface films with passionate reception relative to attention.
Watches — raw letterboxd_watches. Mainstream-skewed; useful when you just want "what's the most-watched film matching these filters" without normalisation.

Each metric tells a different story. Residual and correspondence aren't the same thing — a wide-theatrical blockbuster can have a low residual (LB watched roughly what was expected for a film of that reach) but an unusually high correspondence ratio (LB punched above the local theatrical run). Cross-reference both when looking at a film.

The $11 ticket price is the ~2024 NATO U.S. average. Tweak in src/pipeline/compute_residuals.py (AVG_US_TICKET_PRICE) if you want an inflation-adjusted version.

Important caveats (don't read these as causal)

Pandemic distortion: 2020-2022 films are handled by a year fixed effect, but the box-office collapse was uneven across segments. We don't add a covid_era × release_type interaction in v1; high-residual 2020-2022 films may be partly artefacts of comparing pandemic-suppressed grosses.
Watches are cumulative, not lifetime-normalised: a film released last month and a film released five years ago aren't directly comparable on watch count. The log(months_age) covariate partially adjusts. v2+ will switch to "watches at fixed film age" once enough monthly snapshots accumulate.
Letterboxd selection bias: LB's user base skews young, cinephile, English-fluent. Residuals reveal that demographic's preferences, not "audience appeal" generally.
No causal claim about features: directors and cast are not in the regression — this is intentional (Q15). Aggregate residuals per director/country/theme are descriptive in notebooks/04_aggregate_residuals.ipynb, not headline output.
Marketing, awards, critic reception confound everything: a film with a heavy critical campaign over-performs on LB even adjusting for theatrical reach. We can't separate that from LB-specific demographic preference without extra signals.

build_universe — TMDB discover ∪ Box Office Mojo, deduped by tmdb_id
enrich_metadata — TMDB per-film metadata + watch/providers (cached)
scrape_letterboxd — Playwright per-film page (watches, likes, themes)
the_numbers — best-effort budget gap-fill
classify_release_type — wide / limited / festival / streaming / hybrid
merge — single films table + validate outlier flags
compute_residuals — Metric A + B + per-segment, written back to films.parquet

Source: GitHub repo (private until publish-decision).

Explore — aggregate views (top countries, themes, genres, directors)
Per-segment leaderboards: Wide · Limited · Festival · Streaming · Hybrid