tmp

Sharpness and Hessian

References

Papers
  - Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Others
  - How to compute Hessian-vector products?