Fork me on GitHub

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

Coding Poineer

10 2024 档案

摘要:iterative dpo : https://github.com/RLHFlow/Online-RLHF https://github.com/YuxiXie/MCTS-DPO (蒙特卡洛树dpo) longWriter:https://github.com/THUDM/LongWriter/t 阅读全文
posted @ 2024-10-26 16:59 365/24/60 阅读(5) 评论(0) 推荐(0) 编辑

点击右上角即可分享
微信分享提示