Training Dynamics
Training dynamics: loss plateaus as phase transitions, saddle-to-saddle learning, and the random, organized, and structured states of a representation.
Training dynamics: loss plateaus as phase transitions, saddle-to-saddle learning, and the random, organized, and structured states of a representation.