Blog

[PyTorch] BERT Architecture Implementation Note

Clay
2024-09-072024-09-07
Machine Learning, PyTorch

Introduction

My advisor used to tell me, “Don’t just use other people’s libraries; you have to write your own to truly understand.” Back then, I didn’t have much time to implement various technologies I was interested in since I was fully occupied with my dissertation. However, I often recall his earnest advice even now, and it prompted me to finally attempt the implementation of BERT, a classic encoder-only transformer model.

Using the Integrated Outlines Tool for Decoding Constraints in the vLLM Inference Acceleration Framework

Clay
2024-09-062024-09-07
Machine Learning, Python

Recently, I integrated several applications of Outlines into my current workflow. Among them, the one I use most frequently is with vLLM. However, for some reason, its documentation has not been merged into the vLLM GitHub repository, so while designing the process, I had to constantly refer to the source code of a rejected PR for guidance XD

Implementation of Using Finite-State Machine to Constrain Large Language Model Decoding

Clay
2024-09-052024-09-05
Machine Learning, Python

This is a simple Python implementation, used to test Finite-State Machine (FSM) constraints for a Large Language Model (LLM) to decode responses in a specific format. It also serves as an introduction to the concept behind the Outlines tool. Of course, my implementation is far simpler compared to the actual Outlines tool.

Structuring Model Outputs Using the Outlines Tool

Clay
2024-09-032024-09-03
AI, Machine Learning

When applying Large Language Models (LLMs) in real-world scenarios, it’s often not just about letting the model generate text freely. We might want the model to return specific structures, such as multiple-choice questions or providing a rating. In such cases, transformers-based models can directly use the outlines tool.

Implementing Streamed Output Token Generation Using TextStreamer and TextIteratorStreamer in HuggingFace Transformers

Clay
2024-09-012024-09-01
Machine Learning

Introduction

Generative models are becoming increasingly powerful, and independent researchers are deploying one open-source large language model (LLMs) after another. However, when using LLMs for inference or generating responses, waiting for a longer output can be quite time-consuming.

[Machine Learning] Note Of Variational AutoEncoder (VAE)

Clay
2024-08-312024-08-31
Machine Learning, Python, PyTorch

Introduction

Variational AutoEncoder (VAE) is an advanced variant of the AutoEncoder (AE). The architecture is similar to the original AutoEncoder, consisting of an encoder and a decoder.

Evaluating LLM Defense Capabilities Using the Microsoft BIPIA Framework

Clay
2024-08-302024-08-30
AI, Machine Learning

Currently, LLM services cover a wide range of fields, and Prompt Injection and Jailbreak threats to LLMs are growing by the day. A few months ago, a customer service LLM even provided incorrect information, leading to a loss of customer rights (although that wasn’t caused by a prompt attack).

Microsoft’s open-source BIPIA (Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models) evaluation method, although tested six months ago without significant updates since, remains a simple and convenient testing method for the tasks I have at hand.

[Python] Using the difflib Module to Compare Sequence Differences

Clay
2024-08-282024-08-28
Python

difflib is a module in the Python standard library used to compare differences between sequences (often text). Back when I was doing my thesis, I implemented this by hand. It’s funny and a bit frustrating to realize now in my work that there’s such a neat module for this.

[Python] Using @property Decorator To Convert Class Method Into Read-Only Attribute

Clay
2024-08-272024-08-27
Python

In Python class construction, the @property decorator is commonly used and has significant benefits. Its main purpose is to transform a class method into a read-only attribute, allowing users to retrieve computed results via attribute access.

Note Of Newton Polynomial

Clay
2024-08-262024-08-26
Math

Newton’s interpolation is a polynomial interpolation method that constructs a set of polynomial functions using multiple data points. A major advantage is that with the addition of new data, Newton’s interpolation method does not require recalculations from scratch but can instead expand on the existing function.

« Previous
1
…
4
5
6
7
8
…
81
Next »

M	T	W	T	F	S	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30