August 17, 2024

[Machine Learning] Note of RMSNorm

Clay
2024-08-17
Machine Learning, PyTorch

Last Updated on 2024-08-17 by Clay

Introduction to RMSNorm

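RMSNorm (Root Mean Square Normalization) is a simplified variant of LayerNorm, commonly used as the normalization layer in Transformer blocks, for example before the self-attention and feed-forward sublayers. It rescales activations by their root mean square without subtracting the mean. Like LayerNorm, it aims to mitigate vanishing and exploding gradients, helping the model converge faster and perform better.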
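To make the idea concrete, here is a minimal sketch of the computation in plain Python (no PyTorch, so it runs anywhere). It assumes the usual formulation, y_i = g_i * x_i / sqrt(mean(x^2) + eps). The names `rms_norm`, `weight`, and `eps`, and the default of 1e-6, are illustrative choices and not taken from any particular library.

```python
import math

def rms_norm(x, weight=None, eps=1e-6):
    """Illustrative RMSNorm over a single feature vector (a sketch, not a library API)."""
    # Mean of squares over the feature dimension. Unlike LayerNorm,
    # the mean is NOT subtracted first.
    mean_sq = sum(v * v for v in x) / len(x)
    # eps (assumed value) guards against division by zero for all-zero inputs.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    if weight is None:
        weight = [1.0] * len(x)  # assumed default: learnable gain initialized to ones
    return [w * v * inv_rms for v, w in zip(x, weight)]

y = rms_norm([3.0, 4.0])
# After normalization, the root mean square of y is approximately 1.
rms_y = math.sqrt(sum(v * v for v in y) / len(y))
```

In a PyTorch model, the same computation would typically be applied along the last dimension of the hidden-state tensor, with `weight` as a learnable parameter.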
