Python

Note Of Unsloth Accelerate Fine-tuning Open Source Project

Clay
2024-06-042024-06-05
Machine Learning, Python

Introduction

For several months, I have benefited greatly from the Unsloth project, primarily because a significant part of my job involves fine-tuning large language models (LLMs). Fine-tuning LLMs is extremely time-consuming; aside from data collection, the biggest time sink is the endless GPU-powered fine-tuning process.

[Paper Reading] Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Clay
2024-06-032024-07-25
Machine Learning, Python

Introduction

The accelerated framework is proposed by Huawei Noah’s Ark Lab, it replaces the small model used in the original speculative decoding with the shallow sub-network of the large model. Additionally, it employs an extra-trained adapter and the model’s own decoding head to generate speculative tokens, which are then verified by the large model. The subsequent operations are quite similar to the original speculative decoding process.

Note Of RESTful (With Python FastAPI + CURL Example)

Clay
2024-06-022024-06-02
Linux, Python

Introduction

RESTful design (Representational State Transfer, REST) is an architectural style for designing network applications. It follows principles that make network applications simpler, more scalable, and easier to maintain.

[PyTorch] Release GPU / CPU Memory After Delete Model

Clay
2024-01-092024-01-15
Python, PyTorch

Problem

Yesterday, I developed a model merging program. This time I have no enough gpu memory to merge the models in only one time, so I tried to merge layer by layer. I found the memory of GPU is easily to release but CPU didn’t.

[Python] Creating and Auto-Removing Temporary Directories with Python’s `tempfile`

Clay
2024-01-072024-01-07
Python

Introduction

Today I was reading the training source code of DreamBooth, and I found the tempfile built-in module; I happened to reconstruct a script for model’s layer merging, suddenly thought that the module would make the code more elegant, so I made this note.

Direct Preference Optimization (DPO) Training Note

Clay
2024-01-022024-01-02
Machine Learning, Python, PyTorch

Introduction

DPO (Direct Preference Optimization) is a fine-tuning method that want to replace RLHF (Reinforcement Learning from Human Feedback).

[Solved] fatal error: portaudio.h: No such file or directory 9 | #include “portaudio.h” | ^~~~~~~~~~~~~ compilation terminated

Clay
2023-12-282023-12-28
Linux, Python

Problem

Today I installed pyaudio on my new Linux laptop (a python package for sound recording), I encountered an error I never met:

[Python] A Line Breaking In The IPython Interface

Clay
2023-12-182023-12-18
Python

Introduction

IPython is a system that provided a interactive interface, it could be integrated with shells or graphical interface; for example we can use ipython (if you had been installed it), or you can use some GUI editor/IDE such as VS Code and PyCharm.

LeetCode: 1913-Maximum Product Difference Between Two Pairs Solution

Clay
2023-12-182023-12-18
C++, LeetCode, Python

LeetCode: 242-Valid Anagram Solution

Clay
2023-12-162023-12-16
C++, LeetCode, Python

Problem

Given two strings s and t, return true if t is an anagram of s, and false otherwise.

« Previous
1
2
3
4
5
…
33
Next »