sweep.py has duplicate STAGE2 and STAGE3 definitions — second silently overwrites first #11
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Problem
sweep.pydefinesSTAGE2andSTAGE3twice (lines 56-80). The second definition silently overwrites the first:First STAGE2:
mhc_streams=1,micro_batch_size=6,gradient_accumulation=4Second STAGE2:
mhc_streams=4,micro_batch_size=16,gradient_accumulation=2First STAGE3:
n_layers=16,max_seq_len=1024Second STAGE3:
n_layers=20,max_seq_len=2048The first set of values are completely lost.
Impact
Action needed
Files
scripts/sweep.py:56-80