
LLM Fine-tuning Note – Differences Between SFT and DPO

Last Updated on 2024-08-02 by Clay

Introduction

When fine-tuning Large Language Models (LLMs), several methods are viable, including Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and Direct Preference Optimization (DPO). However, they differ in meaningful ways: in the data they require, in the objective they optimize, and in where they fit in the training pipeline.
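As a quick point of reference before going further, the standard training objectives make the contrast between SFT and DPO concrete (this is a sketch following the common formulations, not code or notation from this post: $\pi_\theta$ is the model being trained, $\pi_{\text{ref}}$ a frozen reference model, $y_w$ and $y_l$ the preferred and rejected responses, $\beta$ a temperature-like hyperparameter):

$$\mathcal{L}_{\text{SFT}}(\theta) = -\,\mathbb{E}_{(x,\,y)\sim\mathcal{D}}\big[\log \pi_\theta(y \mid x)\big]$$

$$\mathcal{L}_{\text{DPO}}(\theta) = -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}\left[\log \sigma\!\left(\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\text{ref}}(y_w \mid x)} - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\text{ref}}(y_l \mid x)}\right)\right]$$

In short, SFT only maximizes the likelihood of a single reference answer per prompt, while DPO needs paired preference data and pushes the model toward the chosen response relative to the rejected one, anchored to the reference model.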
