Blog

[Linux] `ncdu` Disk Usage Analyzer Tool Note

Clay
2024-10-122024-10-12
Linux

ncdu is a disk usage analyzer tool based on NCurses, mainly used to view and manage disk space. It provides a terminal interface to browse the directory structure, allowing users to quickly explore directories and see the size of each file or folder. This helps users identify directories or files that are taking up a large amount of space.

[Python] Using Locust Open Source Load Testing Framework for Stress Testing

Clay
2024-10-102024-10-23
Python

Locust is an open-source load testing tool that helps simulate heavy user traffic on web applications and APIs. Compared to traditional load testing tools, Locust offers more customization and scalability—it supports Python as the scripting language, allowing us to write tests specific to our API or web service use cases.

Notes on Fine-Tuning a Multi-Modal Large Language Model Using SFTTrainer (Taking LLaVa-1.5 as an Example)

Clay
2024-10-082024-10-08
AI, Machine Learning, PyTorch

A multi-modal large language model (Multi-Modal Large Language Model) isn’t limited to text only. I know this might sound contradictory, but this is a term that has become widely accepted. What I want to document today is how to fine-tune a multi-modal model using a script.

an artist s illustration of artificial intelligence ai this image represents how machine learning is inspired by neuroscience and the human brain it was created by novoto studio as par

“Common sense, as people call it, is merely the biases learned during youth”—the training data for AI models is no different

Clay
2024-10-062024-10-06
AI

This year, due to work, I tried annotating the data myself; it was only after diving into it personally that I truly understood just how profoundly training data affects an AI model.

[Python] Extracting Text from PPT Using the python-pptx Library

Clay
2024-10-042024-10-04
Python

Introduction

Recently, while handling some work-related matters, I noticed that the client might potentially need a way to extract text from PPT files. I discussed this with the PM and my supervisor, and they mentioned that the client could simply copy the text from the PPT slides manually. Unless the client explicitly requests us to extract it programmatically.

[Machine Learning] Vector Quantization (VQ) Notes

Clay
2024-10-032024-10-03
Machine Learning, Python, Scikit Learn

The first time I heard about Vector Quantization (VQ) was from a friend who was working on audio processing, which gave me a vague understanding that VQ is a technique used for data feature compression and representation. At that time, I still wasn’t clear on how it differed from dimensionality reduction techniques like PCA.

[Linux] Use the batcat Command as a Replacement for cat, Highlighting Code or Configurations

Clay
2024-10-022024-10-02
Linux

batcat or simply bat, is a replacement tool for the cat command. It retains the functionality of cat for displaying files, while also highlighting keywords in code or configuration files, making it more convenient for developers to browse daily tasks or code files (thus, it’s definitely a productivity tool!).

[Linux] Ripgrep (rg): A Super Fast File Search Tool

Clay
2024-09-292024-09-29
Linux

Ripgrep (rg) is a command-line tool used for quickly searching file contents, designed as a replacement for grep, addressing grep‘s efficiency issues with large-scale file searches.

[Linux] TL;DR: Replace man with tldr to Read Command Line Manuals

Clay
2024-09-282024-09-28
Linux

man is the traditional documentation tool for UNIX/Linux systems, but the detailed nature of its output can be overwhelming for users who just want a quick reference on how to use a command. Therefore, a simplified version called tldr was created (short for “too long, didn’t read”). It focuses on providing concise, easy-to-understand command documentation.

[Linux] bpytop: A More Modern and Visually Appealing Resource Monitoring Tool Compared to htop

Clay
2024-09-272024-09-27
Linux

I’ve been looking for a more visually appealing alternative to htop for a long time. A few years ago, during a gathering with friends, I happened to pull out my laptop to fix a docker segmentation fault issue in the lab. One of my friends saw my htop and remarked, “So primitive~ Engineers are so boring~” I still hold a grudge for that (just kidding, of course).

« Previous
1
2
3
4
5
6
…
81
Next »

M	T	W	T	F	S	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30