Large language models surpass human experts in predicting neuroscience results

BrainBench's code base is currently divided into 3 parts, each of which is a tracked repo.

Repos:

experiments: repo hosting all analyses scripts that can be used to obtain raw results (pre-plotting) to replicate the findings from scratch.
finetuning: repo hosting the finetuning part of the paper.
plotting: repo hosting plotting scripts to remake all figures from the paper using results produced in experiments.

For step-by-step guidance, please refer to README in each dedicated repo.

To work with this repo locally:

git clone [email protected]:braingpt-lovelab/BrainBench.git --recursive

Hardware requirements:

Nvidia A100 (80GB) * 4
2TB storage

Attribution

@article{luo_large_2024,
	title = {Large language models surpass human experts in predicting neuroscience results},
	issn = {2397-3374},
	url = {https://www.nature.com/articles/s41562-024-02046-9},
	doi = {10.1038/s41562-024-02046-9},
	abstract = {Abstract
            Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. Here, to evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs indicated high confidence in their predictions, their responses were more likely to be correct, which presages a future where LLMs assist humans in making discoveries. Our approach is not neuroscience specific and is transferable to other knowledge-intensive endeavours.},
	language = {en},
	urldate = {2024-11-29},
	journal = {Nature Human Behaviour},
	author = {Luo, Xiaoliang and Rechardt, Akilles and Sun, Guangzhi and Nejad, Kevin K. and Yáñez, Felipe and Yilmaz, Bati and Lee, Kangjoo and Cohen, Alexandra O. and Borghesani, Valentina and Pashkov, Anton and Marinazzo, Daniele and Nicholas, Jonathan and Salatiello, Alessandro and Sucholutsky, Ilia and Minervini, Pasquale and Razavi, Sepehr and Rocca, Roberta and Yusifov, Elkhan and Okalova, Tereza and Gu, Nianlong and Ferianc, Martin and Khona, Mikail and Patil, Kaustubh R. and Lee, Pui-Shee and Mata, Rui and Myers, Nicholas E. and Bizley, Jennifer K. and Musslick, Sebastian and Bilgin, Isil Poyraz and Niso, Guiomar and Ales, Justin M. and Gaebler, Michael and Ratan Murty, N. Apurva and Loued-Khenissi, Leyla and Behler, Anna and Hall, Chloe M. and Dafflon, Jessica and Bao, Sherry Dongqi and Love, Bradley C.},
	month = nov,
	year = {2024},
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
experiments @ 9808e5d		experiments @ 9808e5d
finetuning @ 05f2d08		finetuning @ 05f2d08
plotting @ d988059		plotting @ d988059
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Large language models surpass human experts in predicting neuroscience results

Repos:

To work with this repo locally:

Hardware requirements:

Attribution

About

Releases

Packages

License

braingpt-lovelab/BrainBench

Folders and files

Latest commit

History

Repository files navigation

Large language models surpass human experts in predicting neuroscience results

Repos:

To work with this repo locally:

Hardware requirements:

Attribution

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages