(WIP) Rethinking Conventional Wisdom from the LLM perspective

28 Sep 2024

< 목차 >

tmp
(23th Sep 2024) Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling
- Weight Decay (and muP)
References

tmp

(23th Sep 2024) Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling

from_generalization_to_scaling_paper_fig1 Fig.

Weight Decay (and muP)

from_generalization_to_scaling_paper_weight_decay Fig.

from_generalization_to_scaling_paper_fig13 Fig.

References