We have released more complete Chinese data that is not machine-translated #3
Comments
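Looking forward to your work! I hope the weights will be open-sourced soon so that we can try out the model's impressive results!
Nice 👍. Could we perhaps collaborate and merge our data? I am also generating this kind of data.
Machine-translated data may act as a form of data contamination for small models.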
@Guanaco-Model When I use https://huggingface.co/nyanko7/alpaca-multilang/tree/main, the generated sentences repeat themselves. The config I used is as follows:
==================
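@Guanaco-Model @wac81 Have you evaluated the results on Chinese data? Could you share them?
You can refer to an implementation by one of our collaborators: https://colab.research.google.com/drive/1nn6TCAKyFrgDEgA6X3o3YbxfbMm8Skp4?usp=sharing
It looks like you have not set repetition_penalty.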
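For readers unfamiliar with the setting mentioned above, here is a minimal sketch of what a repetition penalty does to next-token scores. It is not the code from this repository or the linked notebook. It assumes the common scheme in which the scores of already-generated tokens are divided by the penalty when positive and multiplied by it when negative; the function name `apply_repetition_penalty` and the default value 1.2 are illustrative assumptions.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of `logits` with already-generated tokens made less likely.

    `logits` is a list of raw next-token scores indexed by token id.
    `generated_ids` holds the token ids produced so far.
    `penalty` > 1.0 discourages repeats; 1.0 leaves the scores unchanged.
    The name and the default value are assumptions for illustration only.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")
    adjusted = list(logits)
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        # Divide positive scores and multiply negative ones, so that in
        # both cases the score of a repeated token moves downward.
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted


# Tokens 0 and 1 were already generated; token 2 was not.
scores = apply_repetition_penalty([2.0, -1.0, 0.5], generated_ids=[0, 1], penalty=2.0)
print(scores)
```

In Hugging Face `transformers`, the corresponding generation parameter is named `repetition_penalty`, and a value of 1.0 means no penalty. Which value works well for this model is not stated in the thread.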
Hi @Guanaco-Model,
What does this dialogue task look like, and how was the data constructed?
https://guanaco-model.github.io/
https://huggingface.co/datasets/JosephusCheung/GuanacoDataset