Tech report
PDF from Overleaf. arXiv ID will be linked here after endorsement + submit.
arXiv: pending endorsement (cs.RO)
Metrics
Nebius H200 — ~159 vs ~108 env-steps/s; task reward ~0.70.
Learning curve (MLflow)
Homeostasis vs training steps — monolithic run.