(WIP) Rethinking Conventional Wisdom from the LLM perspective


< Table of Contents >


tmp

(23rd Sep 2024) Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling

from_generalization_to_scaling_paper_fig1 Fig.

Weight Decay (and muP)

from_generalization_to_scaling_paper_weight_decay Fig.

from_generalization_to_scaling_paper_fig13 Fig.

References