lucidia/lucidia-main

mirror of https://github.com/blackboxprogramming/lucidia.git synced 2026-03-17 09:37:56 -05:00

Files

blackboxprogramming fa4f69097f Add data/schemas.md with dataset schema descriptions

2025-08-08 01:18:53 -07:00

281 B

Raw Blame History

Dataset Schemas

Pretraining Dataset

Input text: Raw text for language modeling.

SFT Dataset

Instruction: User instruction text.
Response: Assistant response text.

RLHF Pairs

Chosen: Preferred assistant response.
Rejected: Less preferred assistant response.