Files
NowChessSystems/modules
Janis Eccarius 9d656624d8
Build & Test (NowChessSystems) TeamCity build finished
fix(official-bots): stream NNUE features as sparse indices to stop host OOM
Densifying the 98304-dim HalfKP vector per item filled host RAM and crashed the
Colab runtime even at small batch sizes. The dataset now yields only the ~64
active feature indices; a custom collate carries (row, col) pairs and the
training loop scatters them into a dense [B, INPUT_SIZE] tensor on the GPU. Host
RAM stays tiny; GPU holds one dense batch transiently.

- NNUEDataset.__getitem__ returns indices via new fen_to_indices.
- fen_to_features now derives from fen_to_indices (kept for external callers).
- _collate_sparse builds row/col index batches; loaders use it.
- train/val loops scatter to a GPU dense batch; loss weighting uses batch size.
- Notebook: BATCH_SIZE 4096 -> 8192 (host no longer the limit; GPU is).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-24 22:28:53 +02:00
..
2026-06-22 17:41:58 +00:00
2026-06-15 20:52:53 +00:00
2026-06-23 22:30:53 +00:00
2026-06-21 09:39:54 +00:00
2026-06-21 11:34:38 +00:00
2026-06-02 09:49:42 +00:00
2026-05-02 15:53:05 +00:00
2026-05-12 22:21:53 +00:00
2026-05-06 07:06:52 +00:00
2026-06-03 06:04:13 +00:00
2026-06-19 10:28:49 +00:00