top golfers — best: 1.2243657 bpb

1. 1.2243657 bpb — Naive Baseline (Baseline) 15.9MB
SP-1024 9x512 KV4 run on pgut1 using the published Hugging Face fineweb10B_sp1024 export and the current train_gpt.py; score is the default final int8+zlib roundtrip metric under the 16,000,000-byte cap.
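
For readers unfamiliar with the metric, here is a rough sketch of what an int8+zlib roundtrip and the byte-cap check could look like. It is not the code in train_gpt.py: the symmetric per-tensor quantization scheme, the zlib level, and every function name below are assumptions made for illustration, and bpb is taken to mean bits per byte in the usual sense (total negative log-likelihood in nats, divided by ln 2 times the byte count).

```python
import math
import zlib
from array import array

SIZE_CAP_BYTES = 16_000_000  # cap quoted in the entry above


def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization (assumed scheme, not confirmed)."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest step and clamp into the int8 range [-127, 127].
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def roundtrip(weights):
    """Quantize, compress, decompress, dequantize; return weights and blob size."""
    q, scale = quantize_int8(weights)
    blob = zlib.compress(q.tobytes(), 9)  # level 9 is an assumption
    restored = array("b")
    restored.frombytes(zlib.decompress(blob))
    return [v * scale for v in restored], len(blob)


def fits_cap(compressed_bytes, cap=SIZE_CAP_BYTES):
    """True if the compressed artifact is within the byte cap."""
    return compressed_bytes <= cap


def bits_per_byte(total_nll_nats, total_bytes):
    """Convert summed negative log-likelihood (nats) to bits per byte."""
    return total_nll_nats / (math.log(2) * total_bytes)
```

Under these assumptions, the reported score would be the bits per byte measured with the restored weights (after the roundtrip) rather than the original full-precision ones, and an entry counts only if `fits_cap` holds for the compressed size.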

15 open PRs at https://github.com/openai/parameter-golf/pulls

https://github.com/openai/parameter-golf
