2 lines
70 B
Plaintext
2 lines
70 B
Plaintext
step lambda train_loss train_ppl eval_ppl eval_bpb lr time_s best_ppl
|
step lambda train_loss train_ppl eval_ppl eval_bpb lr time_s best_ppl
|