Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问后续还会有哪些相关的陆续更新呢? #3

Closed
Henry-Avery opened this issue Jan 15, 2022 · 2 comments
Closed

请问后续还会有哪些相关的陆续更新呢? #3

Henry-Avery opened this issue Jan 15, 2022 · 2 comments

Comments

@Henry-Avery
Copy link

请问关于论文中LM和PLM模型的代码还有更多的说明吗?我正在复现源1.0论文中的模型和技术,希望能对论文中的Model Architecture有进一步的了解

@AmFe-GH
Copy link

AmFe-GH commented Jan 15, 2022

+1
论文没有详细说明,想清楚地知道哪些是打算开源的,那些是需要自己填补的

@Shawn-IEITSystems
Copy link
Owner

目前的代码已经可以用来预训练及微调百亿参数的模型,不需要自己填补内容。考虑到Transformer结构是较为经典的结构,所以在论文中并未详细说明。对于Transformer的学习建议参考:https://arxiv.org/abs/1706.03762
对于源1.0百亿参数训练脚本,可参考:https://github.com/Shawn-Inspur/Yuan-1.0/blob/main/src/pretrain_yuan_13B.sh

Shawn-IEITSystems pushed a commit that referenced this issue Aug 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants