Could a copy of the shibing624/medical dataset be uploaded to the ModelScope community as well? #423

Open
hecheng64 opened this issue Sep 26, 2024 · 2 comments
Labels
question Further information is requested

Comments

@hecheng64

Could a copy of the shibing624/medical dataset be uploaded to the ModelScope community as well?

@hecheng64 added the question label Sep 26, 2024
@hecheng64
Author

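About the pre-training stage: I read an article arguing that for vertical-domain models you should focus on the pre-training (PT) process rather than collecting millions or tens of millions of SFT samples for training. The usual recommendation is that large-scale pre-training + small-scale supervised fine-tuning = a very strong LLM. However, this project does not seem to recommend pre-training.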

@shibing624
Owner

Because the vast majority of people cannot do pre-training well, and there is no need for them to do it.


2 participants