https://www.bilibili.com/video/BV1XY411B7nM/?spm_id_from=333.788&vd_source=4aed82e35f26bb600bc5b46e65e25c22
anthropic.ai: founded by a group of people who left OpenAI.
abstract: RLHF with online learning.
1. introduction