October 2024

Using Finite State Machine (FSM) and Rollback Mechanism to Restrict LLM from Generating Banned Words

Clay
2024-10-292024-10-29
AI, Machine Learning

Last Updated on 2024-10-29 by Clay

When implementing various services through LLMs, do you worry about uncontrolled language generation? Recently, at a critical juncture in wrapping up a project, I used tools like Outlines to constrain LLM decoding, which effectively controlled the model's output to follow the desired patterns. However, a colleague posed a deep question: What if I want it not to generate specific words?

Clay
2024-10-272024-10-27
Python

Last Updated on 2024-10-27 by Clay

bisect is a built-in Python module, primarily designed to maintain the order of a sorted list, allowing items to be inserted without the need to re-sort the entire list.

Clay
2024-10-262024-10-26
Linux, Python

Last Updated on 2024-10-26 by Clay

Introduction

Hydra is an open-source Python framework designed to simplify the research and deployment process, especially for complex applications. Hydra dynamically creates hierarchical configuration files during deployment and allows command line-based overwriting of these configurations.

Clay
2024-10-252024-10-25
Git, Github

Last Updated on 2024-10-25 by Clay

Problem Description

Today, while developing a web application with React.js for the frontend and Python Flask for the backend, I pushed the project to my GitHub repository after reaching a satisfactory milestone. However, upon checking the repository, I was surprised to find that I couldn’t access the folder my-app created by npx create-react-app my-app.

Clay
2024-10-242024-10-24
AI, Machine Learning

Last Updated on 2024-10-24 by Clay

I've always used rough formulas to estimate the relationship between the scale of my models and the GPU VRAM consumption; after all, there are too many variables involved—model architecture, number of layers, attention mechanism implementation, sequence length, batch size, data precision used in training or inference... all of these affect our final calculation results.

Clay
2024-10-222024-10-22
AI, Essay

Last Updated on 2024-10-22 by Clay

Today, while I was eating, I came across a video (the video is attached at the end of this article). Unlike many tech channels that jump straight into discussing AI, economics, and replacing humans, this video took a more careful approach. It explained in detail how hardware specifications have influenced algorithms (or AI model architectures) over time.

Clay
2024-10-202024-10-20
Linux

Last Updated on 2024-10-20 by Clay

Problem Description

We usually start various services on Linux servers, the most common being hosting a website or opening a port to allow us or users to test developing functionalities.

Clay
2024-10-192024-10-19
AI, Machine Learning

Last Updated on 2024-10-19 by Clay

I've been intermittently reading about a fine-tuning method called Kahneman-Tversky Optimization (KTO) from various sources like HuggingFace's official documents and other online materials. It's similar to DPO as a way to align models with human values, but KTO's data preparation format is much more convenient, so I'm quickly applying it to my current tasks before making time to study the detailed content in the related papers.

Clay
2024-10-172024-10-17
Linux

Last Updated on 2024-10-17 by Clay

Introduction

Podman is an open-source tool designed to manage containers and images. Its full name is Pod Manager tool (podman). While Podman is similar to Docker, there are a few key differences in its design.

Clay
2024-10-162024-10-16
AI, Machine Learning, Papers

Last Updated on 2024-10-16 by Clay

The following are some points in this paper:

October 2024

Using Finite State Machine (FSM) and Rollback Mechanism to Restrict LLM from Generating Banned Words

[Python] Array Bisection Algorithm bisect Note

Note Of Hydra: Environment Configure Manager Package

Introduction

[Solved] Unable to View Folder with Arrow Icon in GitHub Project

Problem Description

Note on Calculating VRAM Consumption for Training and Inference of AI Models

Here’s a thought: Will Transformers be replaced in the future?

[Linux] How To Check The Port is Used?

Problem Description

Note Of KTOTrainer (Kahneman-Tversky Optimization Trainer)

[Linux] [Linux] Podman Basic Command Note

Introduction

[Paper Reading] ENTP: ENCODER-ONLY NEXT TOKEN PREDICTION