[ICML 2025]MindAligner: Explicit Brain Functional Alignment for Cross-Subject Visual Decoding from

①Limitations of alignment in cross-subject model: scarce data in shared space construction, and subject respond differently in watching the same image

②Conception of MindAligner:

2.3. Related Work

2.3.1. fMRI-Based Brain Decoding

①Lists traditional methods and point out their work might be inefficient in cross subject reconstruction

2.3.2. Cross-Subject Functional Alignment

①Align limits inter subject specificity

2.4. Preliminary

①Only use one hour of data to pretrain, then test model in shared test set

2.5. MindAligner

2.5.1. Overview

①The overall framework of MindAligner:

2.5.2. Brain Transfer Matrix

①Given the the fMRI signal $\mathcal{F}_{N}$ for a subject $S_N$ , the brain transfer matrix (BTM) maps it to:

$\hat{\mathcal{F}_K}=\mathcal{M}\times\mathcal{F}_N$

②The $\mathcal{M}$ can be decomposed to two low-rank matrices:

$\mathcal{M}=\mathcal{A}\times\mathcal{B}$

where $\mathcal{A}\in\mathbb{R}^{n\times h}$ and $\mathcal{B}\in\mathbb{R}^{h\times k}$ , $n$ and $k$ denotes the fMRI voxel dimensions of unkown and kown subjects, $h$ is the hidden dimension

2.5.3. Brain Functional Alignment Module

①Generate the stimuli embedding $z_K$ of unkown subject by stimuli differential condition to align kown embedding $\mathcal{F}_N$ :

$z_{N}=\mathcal{A}\times\mathcal{F}_{N}$

$\begin{aligned} & E_{\mathrm{diff}}=\mathcal{E}_{image}(\mathcal{I}_{N})-\mathcal{E}_{image}(\mathcal{I}_{K}), \\ & z_{\mathrm{diff}}=E_\mathrm{diff}\times\mathcal{M}_\mathrm{diff}, \\ & \boldsymbol{z}_{K}=\mathcal{M}_{C}(z_{N},z_{\mathrm{diff}}), \end{aligned}$

where $\mathcal{E}_{image}$ is pretrained CLIP, as the image encoder. $\mathcal{M}_{C}$ is the cross-stimulus neural mapper, 它使用 $\mathcal{M}_{\mathrm{diff}}\in\mathbb{R}^{a\times2h}$ 将条件 $z_{\mathrm{diff}}$ 分解为缩放和移位参数??

②Further align by:

$\hat{\mathcal{F}}_{K}=z_{K}\times\mathcal{B}.$

③Reconstruction loss:

$\mathcal{L}_{rec}=||\mathcal{F}_{K}-\mathcal{F}_{K}||_{2}^{2}$

and distribution loss:

$\mathcal{L}_{KL}=\mathcal{KL}(\mathcal{F}_{K},\mathcal{F}_{K})$

④Loss between fMRI embedding pairs and stimuli pairs:

$\mathcal{L}_{latent}=\|(\mathcal{R}(\mathcal{E}_{f}(\boldsymbol{z}_{N}),\mathcal{E}_{f}(\boldsymbol{z}_{K}))-\mathcal{R}(\boldsymbol{E}_{N},\boldsymbol{E}_{K})\|_{2}^{2}$

where $E_N$ and $E_K$ are the image embeddings from CLIP, $\mathcal{R}(\cdot)$ denotes dissimilarity function

⑤The final loss:

$\mathcal{L}_{\mathrm{Align}}=\mathcal{L}_{Dec}+\alpha_{rec}\mathcal{L}_{rec}+\alpha_{KL}\mathcal{L}_{KL}+\alpha_{la}\mathcal{L}_{latent},$

where $\mathcal{L}_{Dec}$ denotes the decoding loss in the baseline method

2.5.4. Inference

①BTM only

2.6. Experiments

2.6.1. Implementation Details

①BTM: consist of 2 linear layers with hidden dim of 4096

②Dimension of $\mathcal{M}_\mathrm{diff}$ : 768

③Input and output dimension of functional embedder: 4096

④Loss coefficients: $\alpha_{rec}=1,\alpha_{la}=\alpha_{KL}=0.001,\alpha_{1}=0.033,\alpha_{2}=0.016$

⑤Learning rate: 1e-5

⑥Batch size: 16

2.6.2. Dataset

①Dataset: NSD

2.6.3. Metrics

①Lists metrics for performance and functional alignment measurement

2.6.4. fMRI-based Visual Decoding

①Visualization of reconstructed image:

②Quantitative performance:

③Loss ablation:

④Ablation of alignment:

⑤Parameter comparison:

2.6.5. Brain Functional Alignment Analysis

①Visualization transfer quantity in brain:

they define that early visualization region presents lower inter-subject variability, and higher visual regions (including OPA, FFA, PPA, and EBA) show larger variability

②Performance of different alignment:

③Transfer quantity in one hour:

2.7. Conclusion

脑启社区

脑启社区是一个专注类脑智能领域的开发者社区。欢迎加入社区，共建类脑智能生态。社区为开发者提供了丰富的开源类脑工具软件、类脑算法模型及数据集、类脑知识库、类脑技术培训课程以及类脑应用案例等资源。

更多推荐

RCX多架构支持揭秘：ARM、x86、64位设备的兼容性解决方案

RCX作为Android平台上的Rclone客户端，通过创新的多架构支持技术，实现了对ARM、x86和64位设备的全面兼容。本文将深入剖析RCX如何突破硬件限制，让不同架构的Android设备都能享受高效的云存储管理体验。## 多架构支持的核心价值在Android设备碎片化严重的今天，处理器架构的多样性给开发者带来了巨大挑战。RCX通过精细化的架构适配，确保从低端手机到高端平板的各类设备都

脑启社区

YOLOv11【第四章：巅峰前沿与融合篇·第13节】生物计算与神经形态硬件：Spike 脉冲神经网络替换 YOLOv11！

脑启社区

色彩与排版的魔法：Awesome Sass高级特性完全指南

Awesome Sass是一个精心策划的资源列表，汇集了最优秀的Sass和SCSS框架、库、样式指南、文章和工具。无论你是前端开发新手还是有经验的开发者，这个项目都能帮助你掌握Sass的核心功能，提升CSS开发效率与代码质量。Awesome Sass项目logo## 为什么选择Sass？Sass（Syntactically Awesome Style Sheets）是CSS的扩展语言，