June 2024

[Machine Learning] Note Of SiLU Activation Function

Clay
2024-06-06
Machine Learning, PyTorch

Last Updated on 2024-06-06 by Clay

Introduction

The SiLU (Sigmoid Linear Unit) activation function is similar to the Swish function; Swish simply has an additional trainable beta parameter. Many large language models (LLMs) also adopt SiLU, mainly models that explore activation functions other than ReLU, such as the classic Llama architecture.
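
To make the relationship concrete, here is a minimal pure-Python sketch of both functions. SiLU is x * sigmoid(x), and Swish is x * sigmoid(beta * x), so Swish with beta = 1 reduces to SiLU. The helper names (`sigmoid`, `silu`, `swish`) are chosen for this illustration only, and `beta` is a plain argument here rather than a trained parameter.

```python
import math


def sigmoid(x: float) -> float:
    # Numerically stable logistic function: avoid exp() overflow for large |x|.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def silu(x: float) -> float:
    # SiLU(x) = x * sigmoid(x)
    return x * sigmoid(x)


def swish(x: float, beta: float = 1.0) -> float:
    # Swish(x) = x * sigmoid(beta * x); beta would be trainable in a real model.
    return x * sigmoid(beta * x)


for x in (-2.0, 0.0, 2.0):
    print(x, silu(x), swish(x, beta=1.0), swish(x, beta=2.0))
```

In PyTorch the same activation is available as `torch.nn.SiLU` (module) and `torch.nn.functional.silu` (function), so a hand-written version like this is only useful for understanding the formula.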

Note Of Unsloth Accelerate Fine-tuning Open Source Project

Clay
2024-06-04 (updated 2024-06-05)
Machine Learning, Python

Last Updated on 2024-06-05 by Clay

Introduction

For several months, I have benefited greatly from the Unsloth project, primarily because a significant part of my job involves fine-tuning large language models (LLMs). Fine-tuning LLMs is extremely time-consuming; aside from data collection, the biggest time sink is the endless GPU-powered fine-tuning process.

[Paper Reading] Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Clay
2024-06-03 (updated 2024-07-25)
Machine Learning, Python

Last Updated on 2024-07-25 by Clay

Introduction

Kangaroo is an acceleration framework proposed by Huawei Noah’s Ark Lab. It replaces the small draft model used in the original speculative decoding with a shallow sub-network of the large model itself. Additionally, it uses a separately trained adapter together with the model’s own decoding head to generate speculative tokens, which are then verified by the large model. The subsequent operations are quite similar to the original speculative decoding process.
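
The draft-then-verify loop described above can be sketched with toy stand-ins. This is not Kangaroo's implementation: there is no adapter and no early exiting here, and `target_next` and `draft_next` are hypothetical deterministic functions standing in for the full model and the shallow sub-network. The sketch only illustrates why greedy speculative decoding is lossless: every token that is kept is one the large model would have produced anyway.

```python
from typing import List

VOCAB = 50  # toy vocabulary size (assumption for this sketch)


def target_next(seq: List[int]) -> int:
    # Toy stand-in for the large model's greedy next-token choice.
    return (sum(seq) * 7 + len(seq)) % VOCAB


def draft_next(seq: List[int]) -> int:
    # Toy stand-in for the cheap drafter: built to disagree with the
    # target at every fourth position so that rejections actually occur.
    guess = target_next(seq)
    return (guess + 1) % VOCAB if len(seq) % 4 == 0 else guess


def greedy_decode(prompt: List[int], n_new: int) -> List[int]:
    # Baseline: one large-model step per generated token.
    seq = list(prompt)
    for _ in range(n_new):
        seq.append(target_next(seq))
    return seq


def speculative_decode(prompt: List[int], n_new: int, k: int = 4) -> List[int]:
    seq = list(prompt)
    goal = len(prompt) + n_new
    while len(seq) < goal:
        # 1) Draft k tokens cheaply.
        draft: List[int] = []
        for _ in range(k):
            draft.append(draft_next(seq + draft))
        # 2) Verify. A real system scores all k positions in a single
        #    batched forward pass of the large model; here we loop.
        accepted: List[int] = []
        for tok in draft:
            expected = target_next(seq + accepted)
            if tok != expected:
                accepted.append(expected)  # replace the first mismatch, stop
                break
            accepted.append(tok)
        else:
            # Every draft token matched: the verify pass yields one extra token.
            accepted.append(target_next(seq + accepted))
        seq.extend(accepted)
    return seq[:goal]


print(speculative_decode([1, 2, 3], 10) == greedy_decode([1, 2, 3], 10))
```

Whatever the drafter proposes, the output matches plain greedy decoding; a better drafter only changes how many tokens are accepted per verification step.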

Note Of RESTful (With Python FastAPI + CURL Example)

Clay
2024-06-02
Linux, Python

Last Updated on 2024-06-02 by Clay

Introduction

RESTful design (Representational State Transfer, REST) is an architectural style for designing network applications. It follows principles that make network applications simpler, more scalable, and easier to maintain.
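
As a rough illustration of the style, the sketch below maps HTTP methods onto operations on a single resource collection. It deliberately uses no web framework (so it is not FastAPI code); the `/items` path, the `handle` function, and the in-memory `items` store are all hypothetical names chosen for this example.

```python
from typing import Any, Dict, Optional, Tuple

# In-memory "database" of items, keyed by integer id (illustrative only).
items: Dict[int, Dict[str, Any]] = {}
_next_id = 1


def handle(method: str, path: str, body: Optional[Dict[str, Any]] = None) -> Tuple[int, Any]:
    """Dispatch a request to a resource action and return (status_code, payload)."""
    global _next_id
    parts = [p for p in path.split("/") if p]
    if not parts or parts[0] != "items" or len(parts) > 2:
        return 404, {"detail": "Not Found"}

    if len(parts) == 1:  # collection: /items
        if method == "GET":
            return 200, list(items.values())
        if method == "POST":
            item = {**(body or {}), "id": _next_id}
            items[_next_id] = item
            _next_id += 1
            return 201, item  # 201 Created
        return 405, {"detail": "Method Not Allowed"}

    # single resource: /items/{id}
    if not parts[1].isdigit() or int(parts[1]) not in items:
        return 404, {"detail": "Not Found"}
    item_id = int(parts[1])
    if method == "GET":
        return 200, items[item_id]
    if method == "PUT":
        items[item_id] = {**(body or {}), "id": item_id}  # full replacement
        return 200, items[item_id]
    if method == "DELETE":
        del items[item_id]
        return 204, None  # 204 No Content
    return 405, {"detail": "Method Not Allowed"}
```

Calling `handle("POST", "/items", {"name": "pen"})` creates a resource, and `handle("GET", "/items/1")` reads it back. The point is that the URL names the resource while the HTTP method names the action, and the status code reports the outcome.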

Note Of Universally Unique Identifier (UUID)

Clay
2024-06-01
Computer

Last Updated on 2024-06-01 by Clay

Introduction

When assigning identifiers to our data, if we want each record to have a unique identifier rather than a simple sequential number, a UUID is the most common method we use.

So, what is UUID?
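
As a quick illustration, a UUID is a 128-bit value usually written as 32 hexadecimal digits in five hyphen-separated groups. The short sketch below uses Python's standard `uuid` module; the variable names and the `"example.com"` input are arbitrary choices for this example.

```python
import uuid

# Version 4: generated from random bits, no coordination between machines needed.
random_id = uuid.uuid4()
print(random_id)
print(random_id.version)    # 4
print(len(str(random_id)))  # 36 characters: 32 hex digits + 4 hyphens

# Version 5: derived from a namespace and a name via SHA-1, so the same
# inputs always yield the same UUID.
name_id = uuid.uuid5(uuid.NAMESPACE_DNS, "example.com")
print(name_id == uuid.uuid5(uuid.NAMESPACE_DNS, "example.com"))  # True
```

Version 4 suits cases where any unique value will do, while version 5 is useful when the identifier must be reproducible from existing data.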
