Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning

Jie Guo; Junfeng Xia; Mengjiao Zhang; Wendu Li

arxiv: 2606.26279 · v1 · pith:X2CGD5O6new · submitted 2026-06-24 · 💻 cs.CV · q-bio.NC

Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning

Junfeng Xia , Wendu Li , Mengjiao Zhang , Jie Guo This is my paper

Pith reviewed 2026-06-26 01:33 UTC · model grok-4.3

classification 💻 cs.CV q-bio.NC

keywords fMRI taskonomymulti-source transfercognitive taskstransfer learningBoolean Integer Programmingmasked reconstructionworking memoryHuman Connectome Project

0 comments

The pith

Multi-source fMRI transfer shows task relations depend on source set composition rather than pairwise links alone.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper extends reconstruction-based fMRI taskonomy from single-source to multi-source transfer across 23 Human Connectome Project task states. Single-source transfer follows paradigm boundaries, with motor states supporting only other motor targets. Multi-source transfer performance changes with the particular mix of sources chosen, and Boolean Integer Programming for budget-constrained allocation repeatedly assigns direct supervision to 0-back and 2-back working-memory states. Readers would care because the results indicate that many-to-one neural relations require joint consideration of source combinations and point to working-memory tasks as high-priority under global allocation.

Core claim

We extend an fMRI cognitive taskonomy from single-source to multi-source transfer across 23 Human Connectome Project task states and use Boolean Integer Programming (BIP) to analyze budget-constrained task allocation. Single-source transfer is directional and paradigm structured: motor states transfer well within the motor paradigm but provide limited support to most non-motor targets. Multi-source transfer depends on the composition of the source set, suggesting that many-to-one task relations are not fully captured by pairwise taskonomy alone. Across supervision budgets, BIP repeatedly allocates direct supervision to several 0-back and 2-back working-memory states. Together, these findings

What carries the argument

Masked fMRI reconstruction used as a shared self-supervised objective to measure transfer, extended through Boolean Integer Programming to select source sets under supervision budgets.

If this is right

Single-source transfer remains directional and limited by paradigm boundaries, with motor states confined to motor targets.
Many-to-one task relations cannot be recovered from pairwise taskonomy alone.
BIP allocation favors 0-back and 2-back working-memory states across multiple supervision budgets even when they are not the strongest single sources.
The pattern may reflect integration of perceptual, attentional, and executive processes inside working-memory tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Task taxonomies built on brain data may need to model source combinations explicitly to capture integrative hubs.
The same allocation method could be tested on other self-supervised objectives to check whether working-memory priority generalizes beyond reconstruction.
Prioritizing working-memory states in limited-data regimes might improve downstream prediction of activity in non-memory tasks.
The motor cluster finding suggests that cross-paradigm motor representations are more effector-specific than previously assumed from single-source maps.

Load-bearing premise

Masked fMRI reconstruction supplies a valid common objective that quantifies the degree of shared neural processes across different cognitive task states.

What would settle it

A result in which multi-source model accuracy stays constant across different source-set compositions, or in which BIP never selects the 0-back and 2-back working-memory states, would falsify the dependence on source composition and the allocation pattern.

Figures

Figures reproduced from arXiv: 2606.26279 by Jie Guo, Junfeng Xia, Mengjiao Zhang, Wendu Li.

**Figure 2.** Figure 2: Reconstruction and representation analyses. (A) Examples of brain-region and temporal masking, model [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Directed single-source taskonomy and multi-source transfer profiles. (A) Single-source affinity across 23 task [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Budget-constrained task allocation. (A) BIP solutions for supervision budgets 4, 8, 12, and 16 at the displayed [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

read the original abstract

Cognitive tasks are organized by shared and specialized neural processes. Masked fMRI reconstruction provides a common self-supervised objective for quantifying transfer relations among task states, but existing reconstruction-based taskonomies mainly study one-to-one transfer from a single source task to a target. Here, we extend an fMRI cognitive taskonomy from single-source to multi-source transfer across 23 Human Connectome Project task states and use Boolean Integer Programming (BIP) to analyze budget-constrained task allocation. We train 1,127 task-specific and transfer models. Single-source transfer is directional and paradigm structured: motor states transfer well within the motor paradigm but provide limited support to most non-motor targets, consistent with a shared sensorimotor execution system and effector-specific representations. Multi-source transfer depends on the composition of the source set, suggesting that many-to-one task relations are not fully captured by pairwise taskonomy alone. Across supervision budgets, BIP repeatedly allocates direct supervision to several 0-back and 2-back working-memory states, although these states are not consistently the strongest individual sources. This pattern may reflect the integration of perceptual, attentional, and executive processes in working-memory tasks. Together, these findings reveal a cross-paradigm-limited motor cluster and working-memory states with high priority under the specified global allocation objective. Our study extends reconstruction-based fMRI taskonomy from one-to-one transfer to many-to-one task relations and budget-constrained task dependencies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The extension to multi-source transfer and BIP allocation is the actual addition, but the masked reconstruction proxy is still untested.

read the letter

The main thing here is the shift from single-source to multi-source fMRI taskonomy across 23 HCP states, plus the use of Boolean Integer Programming to pick which tasks get direct supervision under budget limits. They train 1,127 models and report that source-set composition affects transfer in ways pairwise relations miss, with BIP repeatedly favoring certain 0-back and 2-back working-memory states.

The concrete observations on motor-task clustering and the budget allocations are useful and go beyond the cited single-source work. The directional motor transfer and the note that working-memory tasks integrate multiple processes are clear enough from the abstract.

The soft spot is the unexamined premise that masked fMRI reconstruction error measures shared cognitive processes rather than low-level signal properties. No masking details, motion controls, or SNR checks appear in the abstract, so the multi-source discrepancy and BIP pattern could be artifacts. Lack of cross-validation, statistical tests, or sensitivity analysis also makes it hard to judge stability.

This is for cognitive neuroscientists already using taskonomy or planning multi-task scans who want a budget-allocation angle. A reader focused on resource optimization might pick up the BIP framing.

It is worth sending to peer review so the methods and controls can be checked; the core idea is straightforward enough to evaluate once the details are there.

Referee Report

2 major / 1 minor

Summary. The paper extends reconstruction-based fMRI taskonomy from single-source (one-to-one) to multi-source (many-to-one) transfer across 23 Human Connectome Project task states. It trains 1,127 task-specific and transfer models under a masked fMRI reconstruction objective, reports directional and paradigm-structured single-source transfers (e.g., motor within-paradigm), shows that multi-source transfer performance depends on source-set composition in ways not recoverable from pairwise relations, and uses Boolean Integer Programming (BIP) to identify budget-constrained optimal source allocations that repeatedly prioritize several 0-back and 2-back working-memory states.

Significance. If the masked-reconstruction objective is shown to validly index shared cognitive processes, the work would demonstrate that many-to-one task relations require explicit multi-source modeling and would identify working-memory states as high-priority under a global allocation objective. The scale (1,127 models) and use of BIP for budget-constrained optimization are concrete strengths that could inform experimental design in cognitive neuroscience if the proxy assumption holds.

major comments (2)

[Abstract / Methods] Abstract and Methods: The central premise that masked fMRI reconstruction error quantifies transfer relations among the 23 task states is stated without any description of the masking strategy, conditioning of the model on task labels, loss weighting, or controls for confounds (motion, SNR differences, paradigm-specific hemodynamic properties). This assumption is load-bearing for every reported transfer relation and BIP allocation.
[Abstract / Results] Abstract / Results: Claims that multi-source transfer depends on source-set composition (and cannot be recovered from pairwise taskonomy) and that BIP consistently allocates to 0-back/2-back states are presented without cross-validation details, statistical testing, effect sizes, or sensitivity analyses to hyperparameter choices. The abstract reports training 1,127 models but supplies no quantitative support for robustness of the multi-source vs. pairwise discrepancy.

minor comments (1)

[Abstract] Notation for the 23 task states (0-back vs. 2-back working-memory) is used without an explicit table or section defining the full set of HCP tasks and their paradigm groupings.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments emphasizing methodological transparency and quantitative robustness. We address each major comment below and commit to revisions that strengthen the manuscript without altering its core claims.

read point-by-point responses

Referee: [Abstract / Methods] Abstract and Methods: The central premise that masked fMRI reconstruction error quantifies transfer relations among the 23 task states is stated without any description of the masking strategy, conditioning of the model on task labels, loss weighting, or controls for confounds (motion, SNR differences, paradigm-specific hemodynamic properties). This assumption is load-bearing for every reported transfer relation and BIP allocation.

Authors: We agree that these details are essential for evaluating the central premise and were insufficiently described in the submitted version. In the revised manuscript we will expand the Methods section with a dedicated subsection specifying the masking strategy, task-label conditioning mechanism, loss formulation and weighting, and explicit controls for confounds including motion regression, SNR normalization, and checks for paradigm-specific hemodynamic effects. The Abstract will be updated to reference these controls. This revision directly addresses the load-bearing assumption. revision: yes
Referee: [Abstract / Results] Abstract / Results: Claims that multi-source transfer depends on source-set composition (and cannot be recovered from pairwise taskonomy) and that BIP consistently allocates to 0-back/2-back states are presented without cross-validation details, statistical testing, effect sizes, or sensitivity analyses to hyperparameter choices. The abstract reports training 1,127 models but supplies no quantitative support for robustness of the multi-source vs. pairwise discrepancy.

Authors: We agree that the abstract and results would benefit from explicit reporting of cross-validation, statistical tests, effect sizes, and sensitivity analyses. These elements were part of the experimental pipeline but not highlighted sufficiently. In the revision we will add a paragraph in Results detailing the cross-validation procedure across the 1,127 models, paired statistical comparisons of multi-source versus pairwise performance, effect sizes, and sensitivity checks on BIP budgets and masking hyperparameters. The abstract will be revised to note the robustness of the reported multi-source dependency. This provides the requested quantitative support. revision: yes

Circularity Check

0 steps flagged

No significant circularity; transfer metrics and BIP allocations derived from independent model trainings

full rationale

The paper trains 1,127 separate task-specific and transfer models under a masked fMRI reconstruction objective, then measures empirical transfer performance to populate the relations used by BIP. No equation or step defines a reported transfer ratio, source-set dependence, or BIP allocation in terms of fitted parameters from the same model; the reconstruction loss is applied as an external self-supervised proxy, and the multi-source vs. pairwise discrepancy follows directly from those measured performances rather than from any self-definitional reduction, fitted-input renaming, or self-citation chain. The derivation chain remains self-contained against the external training outcomes.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that reconstruction error under masked fMRI is a faithful proxy for shared neural processes; no free parameters or new entities are introduced in the abstract.

axioms (1)

domain assumption Masked fMRI reconstruction provides a common self-supervised objective for quantifying transfer relations among task states
Invoked in the first sentence of the abstract as the basis for both single-source and multi-source analyses.

pith-pipeline@v0.9.1-grok · 5795 in / 1281 out tokens · 23884 ms · 2026-06-26T01:33:21.492040+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

28 extracted references · 12 canonical work pages

[1]

Poldrack, Aniket Kittur, Donald Kalar, Eric Miller, Christian Seppa, Yolanda Gil, D

Russell A. Poldrack, Aniket Kittur, Donald Kalar, Eric Miller, Christian Seppa, Yolanda Gil, D. Stott Parker, Fred W. Sabb, and Robert M. Bilder. The cognitive atlas: Toward a knowledge foundation for cognitive neuroscience.Frontiers in Neuroinformatics, 5:17, 2011. doi:10.3389/fninf.2011.00017

work page doi:10.3389/fninf.2011.00017 2011
[2]

Turner and Angela R

Jessica A. Turner and Angela R. Laird. The cognitive paradigm ontology: Design and application. Neuroinformatics, 10(1):57–66, 2012. doi:10.1007/s12021-011-9126-x

work page doi:10.1007/s12021-011-9126-x 2012
[3]

Joglekar, H

Guangyu Robert Yang, Madhura R. Joglekar, H. Francis Song, William T. Newsome, and Xiao-Jing Wang. Task representations in neural networks trained to perform many cognitive tasks.Nature Neuroscience, 22(2):297–306,
[4]

10 Multi-Source Cognitive TaskonomyA PREPRINT

doi:10.1038/s41593-018-0310-2. 10 Multi-Source Cognitive TaskonomyA PREPRINT

work page doi:10.1038/s41593-018-0310-2
[5]

Learning neural representations of human cognition across many fMRI studies

Arthur Mensch, Julien Mairal, Danilo Bzdok, Bertrand Thirion, and Ga"el Varoquaux. Learning neural representations of human cognition across many fMRI studies. InAdvances in Neural Information Processing Systems, volume 30, 2017

2017
[6]

Representations and decodability of diverse cognitive functions are preserved across the human cortex, cerebellum, and subcortex.Communications Biology, 5:1245, 2022

Tomoya Nakai and Shinji Nishimoto. Representations and decodability of diverse cognitive functions are preserved across the human cortex, cerebellum, and subcortex.Communications Biology, 5:1245, 2022. doi:10.1038/s42003-022-04221-y

work page doi:10.1038/s42003-022-04221-y 2022
[7]

Zamir, Alexander Sax, William Shen, Leonidas J

Amir R. Zamir, Alexander Sax, William Shen, Leonidas J. Guibas, Jitendra Malik, and Silvio Savarese. Taskonomy: Disentangling task transfer learning. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3712–3722, 2018

2018
[8]

Representation similarity analysis for efficient task taxonomy and transfer learning

Kshitij Dwivedi and Gemma Roig. Representation similarity analysis for efficient task taxonomy and transfer learning. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12387–12396, 2019

2019
[9]

Trevor Standley, Amir Zamir, Dawn Chen, Leonidas Guibas, Jitendra Malik, and Silvio Savarese. Which tasks should be learned together in multi-task learning? InProceedings of the 37th International Conference on Machine Learning, volume 119 ofProceedings of Machine Learning Research, pages 9120–9132, 2020

2020
[10]

Efficiently identifying task groupings for multi-task learning

Christopher Fifty, Ehsan Amid, Zhe Zhao, Tianhe Yu, Rohan Anil, and Chelsea Finn. Efficiently identifying task groupings for multi-task learning. InAdvances in Neural Information Processing Systems, volume 34, 2021

2021
[11]

Uncovering cognitive taskonomy through transfer learning in masked autoencoder-based fmri reconstruction

Youzhi Qu, Junfeng Xia, Xinyao Jian, Wendu Li, Kaining Peng, Zhichao Liang, Haiyan Wu, and Quanying Liu. Uncovering cognitive taskonomy through transfer learning in masked autoencoder-based fmri reconstruction. In International Workshop on Human Brain and Artificial Intelligence, pages 35–50. Springer, 2024

2024
[12]

A genetic algorithms for optimizing structural brain network across cognitive tasks

Youzhi Qu, Wendu Lit, Junfeng Xia, Jiahao Tang, Kaining Peng, Zhichao Liang, Haiyan Wu, and Quanying Liu. A genetic algorithms for optimizing structural brain network across cognitive tasks. In2024 China Automation Congress (CAC), pages 5210–5215. IEEE, 2024

2024
[13]

Masked autoencoders are scalable vision learners

Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. Masked autoencoders are scalable vision learners. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16000–16009, 2022

2022
[14]

SimMIM: A simple framework for masked image modeling

Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, and Han Hu. SimMIM: A simple framework for masked image modeling. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9653–9663, 2022

2022
[15]

Rizvi, Matteo Rosati, Christopher Averill, James L

Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca, Syed A. Rizvi, Matteo Rosati, Christopher Averill, James L. Cross, Prateek Mittal, Emanuele Zappala, Rahul Madhav Dhodapkar, Chadi G. Abdallah, and David van Dijk. BrainLM: A foundation model for brain activity recordings. InThe Twelfth International Conference on Learning Representations, 2024. URL...

2024
[16]

Brain-dit: A universal multi-state fmri foundation model with metadata-conditioned pretraining.arXiv preprint arXiv:2604.12683, 2026

Junfeng Xia, Wenhao Ye, Xuanye Pan, Xinke Shen, Mo Wang, and Quanying Liu. Brain-dit: A universal multi-state fmri foundation model with metadata-conditioned pretraining.arXiv preprint arXiv:2604.12683, 2026

Pith/arXiv arXiv 2026
[17]

Brainworld: A structural-prior-conditioned generative model for whole-brain 4d fmri dynamics.arXiv preprint arXiv:2606.17742, 2026

Junfeng Xia, Wenhao Ye, Junxiang Zhang, Xuanye Pan, Mo Wang, and Quanying Liu. Brainworld: A structural-prior-conditioned generative model for whole-brain 4d fmri dynamics.arXiv preprint arXiv:2606.17742, 2026

Pith/arXiv arXiv 2026
[18]

Slim-brain: A data-and training-efficient foundation model for fmri data analysis.arXiv preprint arXiv:2512.21881, 2025

Mo Wang, Junfeng Xia, Wenhao Ye, Enyu Liu, Kaining Peng, Jianfeng Feng, Quanying Liu, and Hongkai Wen. Slim-brain: A data-and training-efficient foundation model for fmri data analysis.arXiv preprint arXiv:2512.21881, 2025

arXiv 2025
[19]

Omni-fmri: A universal atlas-free fmri foundation model.arXiv preprint arXiv:2601.23090, 2026

Mo Wang, Wenhao Ye, Junfeng Xia, Junxiang Zhang, Xuanye Pan, Minghao Xu, Haotian Deng, Hongkai Wen, and Quanying Liu. Omni-fmri: A universal atlas-free fmri foundation model.arXiv preprint arXiv:2601.23090, 2026

arXiv 2026
[20]

Flexibrain: Resolution-agnostic voxel-level encoding for native fmri.arXiv preprint arXiv:2606.11500, 2026

Mo Wang, Wenhao Ye, Junfeng Xia, Minghao Xu, Hongkai Wen, and Quanying Liu. Flexibrain: Resolution-agnostic voxel-level encoding for native fmri.arXiv preprint arXiv:2606.11500, 2026

Pith/arXiv arXiv 2026
[21]

Brain-JEPA: Brain dynamics foundation model with gradient positioning and spatiotemporal masking.arXiv preprint arXiv:2409.19407, 2024

Zijian Dong, Ruilin Li, Yilei Wu, Thuan Tinh Nguyen, Joanna Su Xian Chong, Fang Ji, Nathanael Ren Jie Tong, Christopher Li Hsian Chen, and Juan Helen Zhou. Brain-JEPA: Brain dynamics foundation model with gradient positioning and spatiotemporal masking.arXiv preprint arXiv:2409.19407, 2024. doi:10.48550/arXiv.2409.19407

work page doi:10.48550/arxiv.2409.19407 2024
[22]

NeuroImage , author =

David C. Van Essen, Stephen M. Smith, Deanna M. Barch, Timothy E. J. Behrens, Essa Yacoub, Kamil Ugurbil, et al. The WU-Minn human connectome project: An overview.NeuroImage, 80:62–79, 2013. doi:10.1016/j.neuroimage.2013.05.041. 11 Multi-Source Cognitive TaskonomyA PREPRINT

work page doi:10.1016/j.neuroimage.2013.05.041 2013
[23]

Barch, Gregory C

Deanna M. Barch, Gregory C. Burgess, Michael P. Harms, Steven E. Petersen, Bradley L. Schlaggar, Maurizio Corbetta, Matthew F. Glasser, Sandra Curtiss, Sachin Dixit, Cindy Feldt, et al. Function in the human connectome: Task-fMRI and individual differences in behavior.NeuroImage, 80:169–189, 2013. doi:10.1016/j.neuroimage.2013.05.033

work page doi:10.1016/j.neuroimage.2013.05.033 2013
[24]

Glasser, Stamatios N

Matthew F. Glasser, Stamatios N. Sotiropoulos, J. Anthony Wilson, Timothy S. Coalson, Bruce Fischl, Jesper L. Andersson, Junqian Xu, Saad Jbabdi, Matthew Webster, Jonathan R. Polimeni, et al. The minimal preprocessing pipelines for the human connectome project.NeuroImage, 80:105–124, 2013. doi:10.1016/j.neuroimage.2013.04.127

work page doi:10.1016/j.neuroimage.2013.04.127 2013
[25]

Glasser, Timothy S

Matthew F. Glasser, Timothy S. Coalson, Emma C. Robinson, Carl D. Hacker, John Harwell, Essa Yacoub, Kamil Ugurbil, Jesper Andersson, Christian F. Beckmann, Mark Jenkinson, et al. A multi-modal parcellation of human cerebral cortex.Nature, 536(7615):171–178, 2016. doi:10.1038/nature18933

work page doi:10.1038/nature18933 2016
[26]

Mark D’Esposito and Bradley R. Postle. The cognitive neuroscience of working memory.Annual Review of Psychology, 66:115–142, 2015. doi:10.1146/annurev-psych-010814-015031

work page doi:10.1146/annurev-psych-010814-015031 2015
[27]

The multiple-demand (MD) system of the primate brain: Mental programs for intelligent behaviour

John Duncan. The multiple-demand (MD) system of the primate brain: Mental programs for intelligent behaviour. Trends in Cognitive Sciences, 14(4):172–179, 2010. doi:10.1016/j.tics.2010.01.004

work page doi:10.1016/j.tics.2010.01.004 2010
[28]

A multi-demand operating system underlying diverse cognitive tasks.Nature Communications, 15:2185, 2024

Weidong Cai, Jalil Taghia, and Vinod Menon. A multi-demand operating system underlying diverse cognitive tasks.Nature Communications, 15:2185, 2024. doi:10.1038/s41467-024-46511-5. 12

work page doi:10.1038/s41467-024-46511-5 2024

[1] [1]

Poldrack, Aniket Kittur, Donald Kalar, Eric Miller, Christian Seppa, Yolanda Gil, D

Russell A. Poldrack, Aniket Kittur, Donald Kalar, Eric Miller, Christian Seppa, Yolanda Gil, D. Stott Parker, Fred W. Sabb, and Robert M. Bilder. The cognitive atlas: Toward a knowledge foundation for cognitive neuroscience.Frontiers in Neuroinformatics, 5:17, 2011. doi:10.3389/fninf.2011.00017

work page doi:10.3389/fninf.2011.00017 2011

[2] [2]

Turner and Angela R

Jessica A. Turner and Angela R. Laird. The cognitive paradigm ontology: Design and application. Neuroinformatics, 10(1):57–66, 2012. doi:10.1007/s12021-011-9126-x

work page doi:10.1007/s12021-011-9126-x 2012

[3] [3]

Joglekar, H

Guangyu Robert Yang, Madhura R. Joglekar, H. Francis Song, William T. Newsome, and Xiao-Jing Wang. Task representations in neural networks trained to perform many cognitive tasks.Nature Neuroscience, 22(2):297–306,

[4] [4]

10 Multi-Source Cognitive TaskonomyA PREPRINT

doi:10.1038/s41593-018-0310-2. 10 Multi-Source Cognitive TaskonomyA PREPRINT

work page doi:10.1038/s41593-018-0310-2

[5] [5]

Learning neural representations of human cognition across many fMRI studies

Arthur Mensch, Julien Mairal, Danilo Bzdok, Bertrand Thirion, and Ga"el Varoquaux. Learning neural representations of human cognition across many fMRI studies. InAdvances in Neural Information Processing Systems, volume 30, 2017

2017

[6] [6]

Representations and decodability of diverse cognitive functions are preserved across the human cortex, cerebellum, and subcortex.Communications Biology, 5:1245, 2022

Tomoya Nakai and Shinji Nishimoto. Representations and decodability of diverse cognitive functions are preserved across the human cortex, cerebellum, and subcortex.Communications Biology, 5:1245, 2022. doi:10.1038/s42003-022-04221-y

work page doi:10.1038/s42003-022-04221-y 2022

[7] [7]

Zamir, Alexander Sax, William Shen, Leonidas J

Amir R. Zamir, Alexander Sax, William Shen, Leonidas J. Guibas, Jitendra Malik, and Silvio Savarese. Taskonomy: Disentangling task transfer learning. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3712–3722, 2018

2018

[8] [8]

Representation similarity analysis for efficient task taxonomy and transfer learning

Kshitij Dwivedi and Gemma Roig. Representation similarity analysis for efficient task taxonomy and transfer learning. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12387–12396, 2019

2019

[9] [9]

Trevor Standley, Amir Zamir, Dawn Chen, Leonidas Guibas, Jitendra Malik, and Silvio Savarese. Which tasks should be learned together in multi-task learning? InProceedings of the 37th International Conference on Machine Learning, volume 119 ofProceedings of Machine Learning Research, pages 9120–9132, 2020

2020

[10] [10]

Efficiently identifying task groupings for multi-task learning

Christopher Fifty, Ehsan Amid, Zhe Zhao, Tianhe Yu, Rohan Anil, and Chelsea Finn. Efficiently identifying task groupings for multi-task learning. InAdvances in Neural Information Processing Systems, volume 34, 2021

2021

[11] [11]

Uncovering cognitive taskonomy through transfer learning in masked autoencoder-based fmri reconstruction

Youzhi Qu, Junfeng Xia, Xinyao Jian, Wendu Li, Kaining Peng, Zhichao Liang, Haiyan Wu, and Quanying Liu. Uncovering cognitive taskonomy through transfer learning in masked autoencoder-based fmri reconstruction. In International Workshop on Human Brain and Artificial Intelligence, pages 35–50. Springer, 2024

2024

[12] [12]

A genetic algorithms for optimizing structural brain network across cognitive tasks

Youzhi Qu, Wendu Lit, Junfeng Xia, Jiahao Tang, Kaining Peng, Zhichao Liang, Haiyan Wu, and Quanying Liu. A genetic algorithms for optimizing structural brain network across cognitive tasks. In2024 China Automation Congress (CAC), pages 5210–5215. IEEE, 2024

2024

[13] [13]

Masked autoencoders are scalable vision learners

Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. Masked autoencoders are scalable vision learners. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16000–16009, 2022

2022

[14] [14]

SimMIM: A simple framework for masked image modeling

Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, and Han Hu. SimMIM: A simple framework for masked image modeling. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9653–9663, 2022

2022

[15] [15]

Rizvi, Matteo Rosati, Christopher Averill, James L

Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca, Syed A. Rizvi, Matteo Rosati, Christopher Averill, James L. Cross, Prateek Mittal, Emanuele Zappala, Rahul Madhav Dhodapkar, Chadi G. Abdallah, and David van Dijk. BrainLM: A foundation model for brain activity recordings. InThe Twelfth International Conference on Learning Representations, 2024. URL...

2024

[16] [16]

Brain-dit: A universal multi-state fmri foundation model with metadata-conditioned pretraining.arXiv preprint arXiv:2604.12683, 2026

Junfeng Xia, Wenhao Ye, Xuanye Pan, Xinke Shen, Mo Wang, and Quanying Liu. Brain-dit: A universal multi-state fmri foundation model with metadata-conditioned pretraining.arXiv preprint arXiv:2604.12683, 2026

Pith/arXiv arXiv 2026

[17] [17]

Brainworld: A structural-prior-conditioned generative model for whole-brain 4d fmri dynamics.arXiv preprint arXiv:2606.17742, 2026

Junfeng Xia, Wenhao Ye, Junxiang Zhang, Xuanye Pan, Mo Wang, and Quanying Liu. Brainworld: A structural-prior-conditioned generative model for whole-brain 4d fmri dynamics.arXiv preprint arXiv:2606.17742, 2026

Pith/arXiv arXiv 2026

[18] [18]

Slim-brain: A data-and training-efficient foundation model for fmri data analysis.arXiv preprint arXiv:2512.21881, 2025

Mo Wang, Junfeng Xia, Wenhao Ye, Enyu Liu, Kaining Peng, Jianfeng Feng, Quanying Liu, and Hongkai Wen. Slim-brain: A data-and training-efficient foundation model for fmri data analysis.arXiv preprint arXiv:2512.21881, 2025

arXiv 2025

[19] [19]

Omni-fmri: A universal atlas-free fmri foundation model.arXiv preprint arXiv:2601.23090, 2026

Mo Wang, Wenhao Ye, Junfeng Xia, Junxiang Zhang, Xuanye Pan, Minghao Xu, Haotian Deng, Hongkai Wen, and Quanying Liu. Omni-fmri: A universal atlas-free fmri foundation model.arXiv preprint arXiv:2601.23090, 2026

arXiv 2026

[20] [20]

Flexibrain: Resolution-agnostic voxel-level encoding for native fmri.arXiv preprint arXiv:2606.11500, 2026

Mo Wang, Wenhao Ye, Junfeng Xia, Minghao Xu, Hongkai Wen, and Quanying Liu. Flexibrain: Resolution-agnostic voxel-level encoding for native fmri.arXiv preprint arXiv:2606.11500, 2026

Pith/arXiv arXiv 2026

[21] [21]

Brain-JEPA: Brain dynamics foundation model with gradient positioning and spatiotemporal masking.arXiv preprint arXiv:2409.19407, 2024

Zijian Dong, Ruilin Li, Yilei Wu, Thuan Tinh Nguyen, Joanna Su Xian Chong, Fang Ji, Nathanael Ren Jie Tong, Christopher Li Hsian Chen, and Juan Helen Zhou. Brain-JEPA: Brain dynamics foundation model with gradient positioning and spatiotemporal masking.arXiv preprint arXiv:2409.19407, 2024. doi:10.48550/arXiv.2409.19407

work page doi:10.48550/arxiv.2409.19407 2024

[22] [22]

NeuroImage , author =

David C. Van Essen, Stephen M. Smith, Deanna M. Barch, Timothy E. J. Behrens, Essa Yacoub, Kamil Ugurbil, et al. The WU-Minn human connectome project: An overview.NeuroImage, 80:62–79, 2013. doi:10.1016/j.neuroimage.2013.05.041. 11 Multi-Source Cognitive TaskonomyA PREPRINT

work page doi:10.1016/j.neuroimage.2013.05.041 2013

[23] [23]

Barch, Gregory C

Deanna M. Barch, Gregory C. Burgess, Michael P. Harms, Steven E. Petersen, Bradley L. Schlaggar, Maurizio Corbetta, Matthew F. Glasser, Sandra Curtiss, Sachin Dixit, Cindy Feldt, et al. Function in the human connectome: Task-fMRI and individual differences in behavior.NeuroImage, 80:169–189, 2013. doi:10.1016/j.neuroimage.2013.05.033

work page doi:10.1016/j.neuroimage.2013.05.033 2013

[24] [24]

Glasser, Stamatios N

Matthew F. Glasser, Stamatios N. Sotiropoulos, J. Anthony Wilson, Timothy S. Coalson, Bruce Fischl, Jesper L. Andersson, Junqian Xu, Saad Jbabdi, Matthew Webster, Jonathan R. Polimeni, et al. The minimal preprocessing pipelines for the human connectome project.NeuroImage, 80:105–124, 2013. doi:10.1016/j.neuroimage.2013.04.127

work page doi:10.1016/j.neuroimage.2013.04.127 2013

[25] [25]

Glasser, Timothy S

Matthew F. Glasser, Timothy S. Coalson, Emma C. Robinson, Carl D. Hacker, John Harwell, Essa Yacoub, Kamil Ugurbil, Jesper Andersson, Christian F. Beckmann, Mark Jenkinson, et al. A multi-modal parcellation of human cerebral cortex.Nature, 536(7615):171–178, 2016. doi:10.1038/nature18933

work page doi:10.1038/nature18933 2016

[26] [26]

Mark D’Esposito and Bradley R. Postle. The cognitive neuroscience of working memory.Annual Review of Psychology, 66:115–142, 2015. doi:10.1146/annurev-psych-010814-015031

work page doi:10.1146/annurev-psych-010814-015031 2015

[27] [27]

The multiple-demand (MD) system of the primate brain: Mental programs for intelligent behaviour

John Duncan. The multiple-demand (MD) system of the primate brain: Mental programs for intelligent behaviour. Trends in Cognitive Sciences, 14(4):172–179, 2010. doi:10.1016/j.tics.2010.01.004

work page doi:10.1016/j.tics.2010.01.004 2010

[28] [28]

A multi-demand operating system underlying diverse cognitive tasks.Nature Communications, 15:2185, 2024

Weidong Cai, Jalil Taghia, and Vinod Menon. A multi-demand operating system underlying diverse cognitive tasks.Nature Communications, 15:2185, 2024. doi:10.1038/s41467-024-46511-5. 12

work page doi:10.1038/s41467-024-46511-5 2024