Blog

[Linux] viddy: An Enhanced Version of the Watch Command

Clay
2024-09-262024-09-26
Linux

Introduction

viddy is a tool similar to watch for running a command at regular intervals in a Linux terminal and displaying the output.

Differences in Precision Representations in Deep Learning: Float32, Float16, Float8, and BFloat16

Clay
2024-09-252024-09-25
AI, Machine Learning

In the process of training and fine-tuning deep neural networks, the most important and scarce resource is undoubtedly the GPU’s VRAM. Therefore, making every bit perform at its best is a critical task.

[Linux] Using “The Fuck” Tool to Correct Mistyped Commands with fuck

Clay
2024-09-192024-09-19
Linux

Linux has so many useful tools, and I truly want to document every single one of them. To celebrate Linux reaching a usage rate of 4.55% on StatCounter (2024-09-18), I’ve decided to document another tool recommended by a colleague—the fuck command.

[Linux] Efficient Directory Navigation On Linux: Mastering The `z` Command

Clay
2024-09-172024-09-17
Linux

Introduction

z command is something I’ve wanted to write about for a long time! However, I’ve been busy with AI training (company work) and model acceleration (personal interest), so I haven’t had the time. Let’s put it this way, if someone asks me to recommend essential tools for a Linux system, I would undoubtedly place z in my top ten list.

[Python] How to Use FastAPI’s Auto-Generated Documentation (Including Exporting Static Files)

Clay
2024-09-152024-09-15
Python

Introduction

In the FastAPI official documentation, there’s a section about ‘Automatic Interactive API Documentation‘:

Robust: Get production-ready code. With automatic interactive documentation.

Troubleshooting Accelerated Inference of Gemma-2 on V100 GPUs Using vLLM

Clay
2024-09-142024-09-14
AI, Machine Learning

Problem Description

Recently, I’ve achieved some good application results by fine-tuning Gemma-2. However, I encountered various errors when deploying it on the client’s equipment, which was quite frustrating. Currently, there isn’t a systematic troubleshooting guide online, so I’m documenting it here.

[Python] How To Use @contextmanager Decorator

Clay
2024-09-122024-09-12
Python

In Python, the context manager decorator @contextmanager from the contextlib module allows developers to conveniently create our own context manager.

[PyTorch] Traversing Every Layer of a Neural Network in a Model

Clay
2024-09-102024-09-10
Machine Learning, PyTorch

Introduction

Recently, due to some serendipitous events, I had a chance to modify the architecture of a model slightly. I took this opportunity to explore how to iterate and print the layers of neural networks in PyTorch.

OpenAI Triton Note (2): Fused Softmax

Clay
2024-09-092024-09-09
Machine Learning, PyTorch

Introduction

Softmax is a commonly used activation function, and it is often employed as the last layer in multi-class classification.

OpenAI Triton Note (1): Vector Addition

Clay
2024-09-082024-09-08
Machine Learning, PyTorch

Introduction

Triton is an open-source GPU programming language compiler released by OpenAI in 2021. Over recent years, it has become increasingly popular among developers for writing and optimizing parallel programs on GPUs. Compared to traditional libraries such as CUDA or OpenCL, Triton offers a Python-like syntax, making it more readable and easier to learn.

« Previous
1
…
3
4
5
6
7
…
81
Next »