DECOMP — Cross-region shared subspaces in mouse cortex

Abstract

When the same behavioural information appears in many brain regions at once, the natural question is whether those regions share the same neural geometry — the same low-dimensional population state, expressed along the same axes — or whether each region independently arrives at a different code that just happens to decode to the same scalar. Decoding accuracy cannot tell these apart; only the shape of the population state space can. Across fifteen simultaneous dual-region recordings from the IBL Brain-Wide Map spanning mouse visual cortex (V1), motor cortex (M1), and cerebellum (CB), we measure the geometry shared between regional subspaces with canonical correlation analysis, then strip away every direction explainable by measured behaviour — wheel velocity, pupil, whisker and body motion energy, lick rate — and ask what geometry remains. It does not collapse. Across all three pairs, 0.69–0.85 of the pair-summed shared geometry survives even the richest behavioural partial. But what survives is doing three different things in the three pairs. V1↔CB traces a slow, task-flat, zero-lag shared mode — geometry consistent with a global state we still do not measure. V1↔M1 traces a zero-lag mode with large peri-movement and peri-feedback excursions — geometry consistent with a shared cognitive variable both regions reflect simultaneously. CB↔M1 traces a directed mode with cerebellum leading motor cortex by ~40 ms — geometry consistent with real cerebellothalamocortical transmission. One scalar survival ratio, three different representational geometries.

The question

One geometry inherited everywhere, or many that all decode the same?

A population of neurons does not encode the world one cell at a time. It encodes it geometrically — as a low-dimensional state that moves along a few preferred axes inside a much higher-dimensional firing-rate space, with the rest of the directions doing little of behavioural relevance. Two regions can land at the same decoding accuracy through completely different geometries: one might place running speed along a single arousal-driven gain axis while the other carries it as a high-dimensional pattern that a linear classifier happens to read just as well. Same information, different geometry. The shared-subspace methods used below separate these two pictures; a decoder cannot.

Stringer et al. (2019) and the IBL Brain-Wide Map (2025) both report that locomotion and orofacial movement variables are decodable from essentially every mouse brain region they record. The unresolved question is whether that ubiquity reflects one geometry — one low-dimensional global state, inherited along the same axes everywhere — or many region-specific geometries whose only commonality is that the same scalar can be read from each.

Schematic of three region-pair geometries: V1↔CB slow zero-lag mode, V1↔M1 zero-lag task-locked mode, CB↔M1 directed +40 ms mode

What we mean by representation geometry. Each region's reliable population activity lives on a low-d manifold in its own firing-rate space (coloured ellipsoids). The cross-region shared subspace is a plane connecting them — a set of directions along which the two regions' activity is geometrically aligned. The glyph drawn on the plane encodes what the shared mode is doing in time: a slow smooth oscillation (V1↔CB), a synchronous pulse with no flow (V1↔M1), or a directed flow with a non-zero lead time (CB↔M1). Three pairs, three substrate geometries — same plane, different dance on it.

Three pairs in the IBL Brain-Wide Map 2023_12 release let us probe this: V1↔CB (the canonical "is V1 just inheriting cerebellar arousal geometry?" question), V1↔M1 (cortical neighbours plausibly sharing task-state axes), and CB↔M1 (the most direct anatomical pathway, via the cerebellothalamocortical loop, where shared geometry would mean shared transmission). On fifteen dual-region recordings, we partial out movement covariates at two granularities — minimal (wheel + pupil) and rich (+ wheel acceleration, whisker and body motion energy, lick rate) — and probe the shape, flow, and task-locking of whatever geometry survives.

The dataset

What's actually in 15 simultaneous V1+CB / V1+M1 / CB+M1 sessions

We use the public IBL Brain-Wide Map 2023_12 freeze — 459 sessions, 1+ Neuropixels probe each, mice performing the IBL contrast-discrimination task. Every session ships: spike-sorted units with Beryl-atlas annotations, the wheel encoder at 1 kHz, three head-fixed cameras with DLC pose markers + whisker-pad motion energy + derived pupil diameter, and the trial table. Public ONE auth, no Alyx account needed.

The Beryl atlas is IBL's standard 308-region coarsening of the Allen Mouse Brain Common Coordinate Framework v3 (CCFv3, ~1300 leaf regions). Beryl rolls leaf labels up to a uniform mid-level hierarchy — primary visual cortex stays VISp; CCFv3's many cerebellar lobule sub-labels collapse into ~17 Beryl labels (cortex lobules LING, CENT, CUL, DEC, FOTU, PYR, UVU, NOD, SIM, AN, PRM, COPY, PFL, FL, plus nuclei FN, IP, DN). Each spike-sorted cluster carries a Beryl acronym after probe-to-CCF histology alignment — that's what we filter on to build per-region populations. The cerebellum ROI in this analysis is therefore the union of all 17 Beryl cerebellar labels, not a single acronym; visual cortex is the union of 10 dorsal-VIS Beryl labels.

3-second clip of the leftCamera video with DLC pose markers overlaid, showing the mouse running during a high-locomotion bout

What the mouse is doing. 3-second clip from the leftCamera of the focal V1+CB session, with DeepLabCut pose markers overlaid (pupil corners, whisker pad, paw, tongue, nose). The mouse is in a head-fixed running bout. The behavioural covariates the GLM and pCCA Z-matrix consume — wheel velocity, whisker motion energy, pupil diameter, lick rate — are all derived from this stream + the wheel encoder.

60-second window of session 41431f53: wheel velocity, whisker motion energy, pupil diameter, V1 raster (63 units), CB raster (46 units).

What one session looks like. 60 seconds of session 41431f53 (subject CSH_ZAD_022, ~88 min total). Top three rows: behavioural covariates on a 20-ms grid (wheel velocity, whisker motion energy, pupil diameter). Bottom two rows: spike rasters from the two simultaneously recorded probes — 63 V1 units (blue) and 46 cerebellar units (red). Wheel velocity and motion energy track each other tightly during running bouts; pupil dilates slowly across the window.

A single session at 20 ms binning has T ≈ 263 000 time bins, ~50–100 sorted units per region, ~22 M spike events total. Time is plentiful; what's scarce is simultaneous dual-region coverage — the IBL probe-placement protocol is independent across sessions, so the number of sessions containing both regions of any given pair is small and outside our control.

Left: per-region sessions in the GLM pool (10 each for VISp, CB, MO, CA1). Right: per-pair simultaneous-recording sessions — 10 for V1↔CB, 4 for V1↔MO, 1 for CB↔MO.

Recording funnel. Left: per-region single-region pools used for the per-neuron GLM (10 best sessions per region, 4 886 neurons total). Right: per-pair simultaneous-recording counts at minimum 5 units per region. VIS↔CB is the largest pair (n = 10), VIS↔MO is intermediate (n = 4), CB↔MO is a single session (n = 1). Zero sessions in BWM 2023_12 contain simultaneous V1+CB+M1 coverage.

4,886Neurons (GLM)

51Sessions used

15Dual-region pair sessions

~263k20-ms bins / session

The method

Per-neuron, within-region, cross-region, residual

The pipeline moves down a ladder of geometric resolution. The GLM gives a per-neuron scalar — how much spike-count variance lives along the movement regressors — to verify the dataset behaves the way the literature predicts. SVCA then drops the single-neuron view and asks: what is each region's reliable subspace — the axes inside its firing-rate space along which population activity moves reproducibly rather than just floats on noise? CCA asks the cross-region geometric question: which axes inside region A's subspace co-rotate with which axes inside region B's, and how strongly? Partial CCA strips out the axes linearly explainable by behaviour and re-asks the same question on the residual geometry. Dynamics asks what the surviving axes are doing in time — whether the shared mode flows from one region to the other and whether it locks to task events. The prose below sketches each step in turn; the full math, with equations and "what we need it for" notes per stage, lives in the Appendix, which the reader new to these methods may want to skim first.

GLM Per-neuron Ridge with raised-cosine kernels, leave-one-group-out ΔR² 4,886 neurons

SVCA Within-region split-half reliability (Stringer 2019); K = 8 components Stringer 2019 / neuropop

CCA + pCCA Cross-region canonical correlations + Frisch-Waugh-Lovell partialling 200 phase-shuffle surrogates

Dynamics Lag (cross-correlation) + peri-event averages of residual canonical variate τ ∈ [-500, +500] ms

Per-neuron GLM. Before any cross-region method runs, we need to know what fraction of each neuron's spike-count variance is uniquely explained by movement, after stimulus, choice, feedback, prior, wheel, motion-energy, and pupil regressors have all competed for the same variance. No simpler estimator gives that decomposition. We fit Ridge regression on 20-ms-binned spike counts with a raised-cosine basis expansion of the eight regressor groups (~60 design columns), compute leave-one-group-out cross-validated ΔR² as the per-group unique contribution, and average over 5 KFold splits. This mirrors paper-brain-wide-map/encoding almost verbatim.

Within-region SVCA. The cross-region CCA stage needs two trustworthy low-dimensional summaries — one per region. Naive PCA returns impressive-looking leading components on any matrix, including pure noise, when the population is small and the time axis is long. To avoid feeding CCA dimensions that wouldn't survive a re-sample, we use Stringer et al.'s split-half-cells × split-half-time procedure: components are fit on one cell-half over training time, then both halves are projected into the held-out time block and the cross-half product is divided by total energy. The resulting reliability ρ approaches 1 for reproducible population modes and 0 for noise.

Partial CCA via Frisch–Waugh–Lovell. Two regions bathed in the same global locomotion + arousal drive will share geometry for trivial reasons — both will carry a behavioural axis, both will look aligned along it. The actual question is whether the two subspaces share any direction that cannot be expressed as a linear function of wheel velocity and pupil diameter. To answer it, we regress each region's SVCA scores on Z, take the residuals — the components of each region's state orthogonal to the behavioural axes — and fit CCA on those residuals. Both the regression and the CCA fits are estimated on training folds only and applied to held-out folds — strict no-leakage cross-validation.

Phase-shuffle null. At T ≈ 263 000 bins with slow neural drift, two completely unrelated regions still produce non-trivial canonical correlations from autocorrelated noise alone. We need a noise floor that preserves each region's autocorrelation while breaking the cross-region temporal alignment, and phase-randomising the Fourier coefficients of each channel does exactly that. We generate 200 such surrogates per pair, take the 99% bound, and use it as the band against which raw and partialled canonical correlations are compared.

Result 3

The cross-pair convergence in survival ratio is not three identical measurements

Two further observations bear on how Fig. 2 should be read. First, SVCA reliability — the cross-validated reproducibility of each population component — is highly asymmetric across pairs.

Cross-pair SVCA reliability across sessions

Figure 3. SVCA reliability per region across pair-sessions, with Stringer et al.'s $\rho_1 = 0.5$ threshold marked.

Cerebellum on the V1+CB pair sessions is uniquely well-resolved (median $\rho_1^{\text{SVCA}} \approx 0.76$, with several sessions clearing Stringer et al.'s 2019 $\rho_1 = 0.5$ threshold on multiple components). Visual cortex and motor cortex on every other pair-session sit at $\rho_1 \approx 0.20$–$0.25$; the V1↔M1 pair has uniformly poor reliability on both sides. The CCA on V1↔CB is therefore being computed between a noisy V1 score and a relatively clean CB score, with the cerebellar side carrying the recoverable structure.

Cross-pair survival of shared subspace under wheel + pupil partial

Figure 4. Survival ratio $\sum_k \rho_k^{\text{pCCA}} / \sum_k \rho_k^{\text{CCA}}$ under wheel + pupil partialling, per pair-session.

Second, the pair-summed canonical correlations converge to nearly identical ratios across pairs — median $\sum_k \rho_k^{\text{pCCA}} / \sum_k \rho_k^{\text{CCA}} = 0.97, 0.98, 0.98$ for V1↔CB, V1↔M1, CB↔M1 — despite the underlying signal magnitudes ($\sum_k \rho_k^{\text{CCA}} \approx 0.80, 0.44, 0.62$) differing by nearly a factor of two. The convergence is a proportional statement about preserved-versus-discarded variance, not a magnitude statement about the size of the shared subspace itself. If wheel and pupil were exhaustive measures of global movement state, each region's projection of that state would be the dominant axis of cross-region coupling, partialling would cancel the leading canonical correlations, and the survival ratio would tend to zero. The fact that it tends to one for every pair is the headline of the partialling analysis under wheel + pupil — but the headline is only as strong as the assumption that wheel + pupil exhausts the relevant global state.

Result 5

Lag and task-locking discriminate three different mechanisms

The survival ratio under richer Z is a single number per session; collapsed to a median per pair, it forces three pairs whose biology is plausibly different to look the same. Two structural properties of the leading residual canonical variate $U(t) = U_A(t) \approx U_B(t)$ discriminate sharply between the candidate mechanisms — temporal flow between the two regions, and locking of $U(t)$ to task events.

We extract the leading residual canonical variate per session (cross-validated, fold-aligned for sign) and compute (i) the cross-correlation function $\mathrm{corr}(U_A(t), U_B(t + \tau))$ for $\tau \in [-500, +500]$ ms with a phase-shuffle null, and (ii) peri-event averages of $U(t)$ aligned to first-movement and feedback times from the trial table.

Cross-correlation function and peri-event averages of the leading residual canonical variate, per pair

Figure 6. Lag (top row) and task-locking (bottom two rows) of the leading residual canonical variate, per pair. V1↔CB: zero-lag coupling, flat peri-event averages. V1↔M1: zero-lag coupling, strong peri-movement and peri-feedback transients. CB↔M1 (n = 1): cerebellum leads motor cortex by ~40 ms, large transients around first-movement.

The three pairs separate.

V1↔CB

The cross-correlation function has a clear peak at $\tau = 0$ (median $\rho = 0.33$ across n = 10 sessions, above the phase-shuffle 99% null), but the peri-event averages are essentially flat at first-movement ($|\text{peak}| = 0.09$) and at feedback ($|\text{peak}| = 0.07$). The residual carries shared structure that has no temporal flow and no task-event signature — what a low-dimensional global state inherited identically by both regions would produce: a state that fluctuates on slow, non-event-locked timescales (basal arousal modes, brain-state oscillations, slow drift) and that whisker motion energy, lick rate, and wheel acceleration do not capture.

V1↔M1

The lag function also peaks at zero ($\rho \approx 0.17$, smaller magnitude than V1↔CB), but the peri-event averages show pronounced transients at first-movement ($|\text{peak}| = 0.37$, ~360 ms post-movement) and at feedback ($|\text{peak}| = 0.34$, ~80 ms post-feedback). The residual is task-locked but transmission-free — the signature of a shared internal state (attention, expectation, motor preparation) that both regions reflect simultaneously without one transmitting to the other.

CB↔M1 (n = 1)

The lag function peaks at $\tau = +40$ ms with cerebellum leading motor cortex ($\rho = 0.21$, above the surrogate band), and the peri-event averages show large transients around first-movement ($|\text{peak}| = 0.60$, ~140 ms pre-movement) and feedback ($|\text{peak}| = 0.65$, ~440 ms pre-feedback). The residual has both directional flow at the timescale of disynaptic cerebellothalamocortical transmission and pronounced motor-event-locked dynamics — the signature of genuine cross-region coupling along the cerebellothalamocortical pathway, with the lag magnitude consistent with the published latency of cerebellar output to motor cortex (≈ 30–60 ms).

The three pairs therefore look similar in proportion of shared variance preserved under behavioural partialling but different in what the preserved structure is doing. V1↔CB on this dataset is the strongest case for "global state we did not measure" rather than real cross-region coupling. CB↔M1, even at n = 1, is the strongest case for genuine cortico-cerebellar communication. V1↔M1 sits between, as a case of shared task-related cognitive state without measurable transmission.

Result 6

The geometry, made visible

The five preceding results live as scalars: ΔR² distributions, canonical correlations, survival ratios, lag-of-peak, peri-event amplitude. Numbers can rank, but they do not let the reader see what the shared geometry is doing. Two pictures below turn the central claim — three pairs reach similar survival through different geometric mechanisms — from a sentence into a shape, and add one piece of information the scalar analyses cannot give us: the shape persists under partialling, but what that shape means about behaviour does not.

What does the shared mode mean? Joint state-space, coloured by wheel velocity

Canonical correlation gives us a single number — how aligned the two regions' subspaces are — but it does not tell us what that alignment is. To see the shared mode itself, we plot the leading canonical variates of the two regions, $U_A(t)$ and $U_B(t)$, against each other on a 2-D plane. Each point in this plane is one 20-ms time bin: $U_A$ is region A's projection of its population activity onto the leading cross-region axis at that moment; $U_B$ is region B's projection onto its matched axis. A strong cross-region coupling shows up as a cloud aligned along the diagonal — when region A's projection is high, so is region B's. The shape of the cloud is the geometry of the shared mode.

The shape alone, however, says nothing about what the shared mode is about. So we colour each pixel of the 2-D histogram by the mean wheel velocity (z-scored) of the time bins that fall into it. If the shared mode is, in essence, the running-versus-quiet axis, the bins on one side of the diagonal will be times when the mouse is stationary (blue) and the bins on the other side will be times when the mouse is running (red): the cloud develops a clear blue-to-red gradient along its long axis. If the shared mode is not primarily about running, the wheel velocity is distributed indiscriminately across positions in the cloud and the colour is uniform grey. The shape tells us whether the geometry survives; the gradient tells us how much of that geometry was the wheel axis to begin with.

The left column of Figure 7 shows the raw CCA — no behavioural partialling. All three pairs carry a non-trivial wheel-velocity gradient: positions along the leading shared canonical axis are partly diagnostic of how fast the mouse was moving. The signed slope of this gradient is printed inside each panel as "diag slope". The right column shows pCCA under rich-Z partialling — wheel velocity, wheel acceleration, whisker motion energy, body motion energy, pupil diameter, and lick rate all linearly removed from each region's activity before CCA is fit. The cloud shape barely changes between the two columns — the cross-region canonical correlation $\rho$ printed in the corner moves by 0.02–0.04 across all three pairs — yet the wheel gradient collapses to near zero in every panel. The shared subspace did not collapse, but the wheel-velocity meaning of position along it did. The geometry persists, stripped of its behavioural meaning, and what remains must be aligned to something we have not measured.

Joint state-space heatmap per pair, coloured by mean wheel velocity in each (U_A, U_B) bin, raw CCA vs partial CCA

Figure 7. Joint state-space heatmap of the leading canonical variates per pair (rows) under raw CCA vs rich-Z pCCA (columns). Bin colour = mean z-scored wheel velocity of the time bins falling into that 2-D bin. Dotted ellipse = 2σ outline of the cloud. Dashed line = diagonal reference. The "diag slope" inset quantifies how strongly the diagonal coordinate $(U_A + U_B)/\sqrt{2}$ co-varies with wheel z-score per panel — large in the left column, ≈ 0 in the right.

The same effect is more visceral when shown as a continuous transition. The animation below sweeps a partial coefficient $\alpha \in [0, 1]$ — fraction of $Z$ partialled out — from raw CCA ($\alpha = 0$) to full rich-Z partialled ($\alpha = 1$) and back, side-by-side for all three pairs. Watch the wheel-velocity colour gradient flatten while the diagonal cloud structure stays where it was. The progress bar at the top is $\alpha$.

Partialling morph: three side-by-side pair panels animating from raw CCA to rich-Z partialled, wheel-velocity gradient fading while cloud shape persists

Figure 7b · Partialling morph. Continuous shrinkage residualisation $X_\alpha = X - \alpha \cdot \widehat{X}(Z)$, then CCA on $(X_\alpha, Y_\alpha)$, swept across $\alpha$ for each of the three pairs. The diag-slope annotation per panel quantifies the gradient and goes to ≈ 0 by $\alpha = 1$ in every pair. Cloud shape and tight diagonal alignment barely move — geometry persists, behavioural meaning evaporates.

How do the regions sort trial conditions? Cross-region RSA

CCA asks whether the two regions share axes. But two regions can encode the same task structure through completely different axes and still agree on which conditions are similar to which others. To probe that basis-free notion of shared geometry, we use representational similarity analysis (Kriegeskorte et al. 2008).

Every trial gets a condition label combining stimulus side (L/R), absolute-contrast bin (0, lo, mi, hi — 0 %, 0–6.25 %, 6.25–25 %, 25–100 %), and feedback outcome (+ correct, − incorrect) — e.g. Lhi+ = "high-contrast left, correct". For each condition we average the SVCA scores over the 400 ms post-stimulus window, then build a per-region similarity matrix whose rows and columns are both that label set, in the same order. Cell $(i, j)$ is the Pearson $r$ between the mean activity vectors for conditions $i$ and $j$ — deep red ($r \approx +1$) means "these two conditions evoked very similar population responses in this region", white ($r \approx 0$) means "uncorrelated", and deep blue ($r \approx -1$) means "anti-correlated". Each matrix is symmetric, with $r = 1$ on the diagonal (every condition is identical to itself), and is the region's geometric fingerprint of how it groups task conditions. The cross-region overlap is the Spearman correlation between the corresponding upper triangles of the two matrices (higher = more shared geometry).

Before reading the cross-region overlap, the within-matrix structure of each individual similarity matrix tells us how much that single region cares about the trial labels at all. A uniformly red matrix means the region treats every condition as evoking essentially the same population response — it has not committed any of its leading SVCA dimensions to discriminating side, contrast, or feedback in the post-stimulus window. Block structure means the region does discriminate, and the block boundaries reveal which variable it cares about: a 4×4 split along contrast bins says "this region encodes how bright the stimulus was"; checkerboard alternation along the +/− axis says "this region encodes whether the mouse got it right". The matrix is not just an input to the cross-region statistic — it is the per-region readout of representational specificity.

The three pair-rows do not show this in equal measure. The V1↔CB row has two near-uniformly red matrices: in the 400-ms post-stimulus window, neither V1's leading SVCA modes nor CB's strongly discriminate amongst left-side trial conditions at the population level — the matrices are featureless, the cross-region Spearman correlation ($r \approx +0.53$) is therefore measuring overlap between two relatively flat fingerprints. The V1↔M1 row is the most informative: both matrices show clear off-diagonal whitish cells, indicating that certain condition pairs evoke clearly distinct patterns (e.g. correct vs incorrect at the same contrast), and the two matrices' patterns visibly resemble each other — which is why their cross-region Spearman is the strongest in the dataset ($r \approx +0.72$ raw, $+0.65$ partial) despite V1↔M1 having the smallest leading canonical correlation of the three pairs. The CB↔M1 row sits in between on M1's side and again featureless on CB's: the cerebellum, on this fast post-stimulus window, simply does not sort these task labels along its reliable SVCA axes, so the cross-region Spearman ($r \approx +0.36$ raw, $+0.28$ partial) is dragged down by one region having no condition-specific structure to share.

The take-away is that axis-aligned overlap (CCA) and basis-free condition-structure overlap (RSA) are not the same thing, and per-pair they can be ranked in opposite orders. V1↔M1's shared geometry lives in the pattern of condition similarities more than in co-rotating axes; CB↔M1's shared geometry lives in fewer real axes (the directed +40-ms transmission seen in §5) but those axes do not organise the task-condition structure visibly within our window. The small drop under partialling in every pair mirrors the CCA finding: shared condition geometry persists after stripping the behavioural axes. One number cannot summarise that — two pictures and two metrics can.

Per-pair representational dissimilarity matrices over trial conditions for region A and region B, with cross-region Spearman correlation under raw and Z-partialled regimes

Figure 8. Cross-region RSA per pair. Left two columns: representational dissimilarity matrices (RDMs) for region A and region B over trial-condition labels (side L/R × contrast bin 0/lo/mi/hi × feedback +/−). Cell colour = $1 - r$, the correlation distance between mean SVCA-score vectors of the two conditions on the row and column. Right column: Spearman correlation between the upper triangles of the two RDMs, raw vs rich-Z partialled, per pair.

The two pictures together encode more than the survival ratio can: the shared axes persist under rich behavioural partialling but lose their wheel-velocity meaning, and the shared condition geometry is also robust but distributed across pairs in a way that the axis-aligned analysis would not have predicted. The headline of the partialling analysis is one number; the geometry of what survives needs two pictures and a different vocabulary for each.

Conclusion

Three pairs, three mechanisms, one constraint

In 15 dual-region IBL sessions, cross-region shared subspaces between V1, CB, and M1 partly survive even rich behavioural partialling — but the surviving structure does different things in different pairs. V1↔CB looks like an unmeasured global state (zero-lag, task-flat). V1↔M1 looks like shared task-related cognitive state (zero-lag, strongly task-locked). CB↔M1 looks like real cerebellothalamocortical transmission (+40 ms cerebellar lead, motor-event-locked). Survival ratios alone collapse three plausibly different biologies into one number; lag and task-locking pull them apart.

The methodological lesson: scalar survival ratios under partialling are necessary but not sufficient. The shape of the residual — its temporal flow and its locking to task events — is where the mechanism actually lives. A "shared subspace survives behavioural partialling" headline can hide three different structural facts, and the dataset has to be probed in the time domain to tell them apart.

Limitations

Sample sizes for V1↔M1 (n = 4) and especially CB↔M1 (n = 1) bound the strength of any distributional claim about those pairs, including the dynamics signatures. The "VIS" definition in this analysis is widened to all dorsal visual cortex; the V1-specific reading of the earlier 3-session MVP analysis is preserved only on the gold-ringed anchor session in the figures. The asymmetric SVCA reliability across pair-sessions (see §3) means the absolute size of the shared subspace is not directly comparable across pairs — only the ratio of preserved-to-raw shared variance is.

Key references

International Brain Laboratory et al. (2025). A brain-wide map of neural activity during complex behaviour. Nature, 645, 177–185.
Stringer C, Pachitariu M, Steinmetz N, Reddy CB, Carandini M, Harris KD (2019). Spontaneous behaviors drive multidimensional, brainwide activity. Science, 364(6437), eaav7893.
Wang Y, Kurgyis L, Druckmann S (2026). Brain-wide movement encoding is structured across and within brain areas. Nature Neuroscience (in press).
Musall S, Kaufman MT, Juavinett AL, Gluf S, Churchland AK (2019). Single-trial neural dynamics are dominated by richly varied movements. Nature Neuroscience, 22, 1677–1686.
Semedo JD, Zandvakili A, Machens CK, Yu BM, Kohn A (2019). Cortical areas interact through a communication subspace. Neuron, 102(1), 249–259.

Same information, different geometry: cross-region shared subspaces between mouse V1, M1, and cerebellum