Train only the last turn of the conversation. #4878

aofengdaxia · 2024-07-18T08:48:15Z

What does this PR do?

With RAG, the retrieved content is typically attached to the system role's prompt, so the system prompt changes frequently. Training only on the last turn of the conversation helps fine-tune the model's RAG capabilities.

I added a mode that fine-tunes only the last turn of the conversation.
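
To illustrate the idea, here is a minimal sketch of last-turn-only label masking. It is not the code from this PR: the function name `build_last_turn_labels` and the `(prompt_ids, response_ids)` turn layout are assumptions made for the example. It assumes the loss ignores label `-100`, which is the default `ignore_index` of PyTorch's `CrossEntropyLoss`.

```python
from typing import List, Tuple

# Label value skipped by the loss (default ignore_index of torch.nn.CrossEntropyLoss).
IGNORE_INDEX = -100


def build_last_turn_labels(
    turns: List[Tuple[List[int], List[int]]],
) -> Tuple[List[int], List[int]]:
    """Hypothetical helper: concatenate all turns, but supervise only the last response.

    Each turn is a (prompt_ids, response_ids) pair of already-tokenized text.
    """
    input_ids: List[int] = []
    labels: List[int] = []
    last = len(turns) - 1
    for i, (prompt_ids, response_ids) in enumerate(turns):
        input_ids += prompt_ids + response_ids
        # Prompt tokens never contribute to the loss.
        labels += [IGNORE_INDEX] * len(prompt_ids)
        if i == last:
            # Only the final response is used as a training target.
            labels += response_ids
        else:
            # Earlier responses stay in the context but are masked out.
            labels += [IGNORE_INDEX] * len(response_ids)
    return input_ids, labels
```

With two turns `[([1, 2], [3]), ([4], [5, 6])]`, the full sequence is kept as input, while only the tokens `5, 6` of the final response remain as labels; everything else is set to `IGNORE_INDEX`.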

Fix #4684

Before submitting

Did you read the contributor guideline?
Did you write any new necessary tests?

hiyouga

LGTM

Train only the last turn of the conversation

1e7b396

hiyouga approved these changes Jul 18, 2024


aofengdaxia temporarily deployed to tests July 18, 2024 14:00 — with GitHub Actions Inactive

hiyouga merged commit 2516763 into hiyouga:main Jul 18, 2024
1 check passed

hiyouga added a commit that referenced this pull request Jul 18, 2024

follow #4878 fix #4684

779aae8

This was referenced Aug 8, 2024

fix: Train on the last turn only truncate bug #5115

Merged

Custom multi-turn conversation dataset: learn only the last turn #5165

Closed
