1. 1.2243657 bpb — Naive Baseline (Baseline) 15.9MB
SP-1024 9x512 KV4 run on pgut1, using the published Hugging Face fineweb10B_sp1024 export and the current train_gpt.py. The score is the default final int8+zlib roundtrip metric, measured under the 16,000,000-byte cap.
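For readers unfamiliar with the terms, here is a minimal sketch of what an int8+zlib roundtrip and a bits-per-byte (bpb) score could look like. It is not taken from train_gpt.py: the symmetric per-tensor quantization scheme, the function names (quantize_int8, roundtrip, bits_per_byte), and the idea that only the compressed weights count toward the cap are all assumptions made for illustration.

```python
import math
import zlib
from array import array

SIZE_CAP_BYTES = 16_000_000  # cap quoted in the entry above


def quantize_int8(weights):
    """Hypothetical symmetric per-tensor int8 quantization.

    Returns (scale, signed-byte array); the real script may differ.
    """
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return scale, q


def roundtrip(weights):
    """Quantize, zlib-compress, decompress, and dequantize.

    Returns (restored_weights, compressed_size_in_bytes).
    """
    scale, q = quantize_int8(weights)
    blob = zlib.compress(q.tobytes(), 9)
    restored = array("b")
    restored.frombytes(zlib.decompress(blob))
    return [v * scale for v in restored], len(blob)


def bits_per_byte(total_nll_nats, total_bytes):
    """Convert a summed negative log-likelihood (in nats) to bits per byte."""
    return total_nll_nats / (math.log(2) * total_bytes)
```

Under these assumptions, the model would be evaluated with the restored weights rather than the originals, so the reported bpb reflects whatever accuracy is lost to quantization, and the compressed size is what gets compared against SIZE_CAP_BYTES.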
15 open PRs at https://github.com/openai/parameter-golf/pulls
https://github.com/openai/parameter-golf
top golfers — best: 1.2243657 bpb