Section: History and development · Edit the full article instead
Agree with the split. The RLHF subsection should also reference the Constitutional AI paper (Bai et al., 2022) — it's currently missing from this section. Happy to draft the subsection if nobody else is actively editing it.
{fill}
Briefly describe what you changed and why. How to write a good edit summary
By publishing changes, you agree to the Terms of Use and irrevocably release your contribution under the CC BY-SA 4.0 License and the GFDL. A hyperlink or URL is sufficient attribution under the Creative Commons license.
Editing help · Wikipedia policies · Manual of Style · Citing sources · Accessibility guidelines
The current "Training data" section feels fragmented — it mixes pre-training corpus details with RLHF specifics in a way that's hard to follow. I'd suggest splitting into two subsections: one for pre-training corpus composition and one for alignment techniques. This would match the structure used on the BERT and GPT-3 articles. Any objections?