< 목차 > Overview tmp tmp Block Sparse References Overview Fig. Fig. tmp tmp Block Sparse Fig. Fig. References Papers Accelerating Sparse Deep Neural Networks Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Blogs Overview of Sparse NN Is the future of Neural Networks Sparse? An Introduction (1/N) Sparse Neural Networks (2/N): Understanding GPU Performance. NVIDIA Blogs Accelerating Matrix Multiplication with Block Sparse Format and NVIDIA Tensor Cores Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines from NVIDIA Blog NVIDIA Ampere Architecture In-Depth Pytorch Blogs Speeding up ViTs using Block Sparsity from PyTorch Blog Accelerating Neural Network Training with Semi-Structured (2:4) Sparsity from Pytorch Blog Block-Sparse Block-sparse GPU kernels Codes https://huggingface.co/microsoft/Phi-3-small-128k-instruct/tree/main