Blog

Note Of Hydra: Environment Configure Manager Package

Clay
2024-10-262024-10-26
Linux, Python

Introduction

Hydra is an open-source Python framework designed to simplify the research and deployment process, especially for complex applications. Hydra dynamically creates hierarchical configuration files during deployment and allows command line-based overwriting of these configurations.

[Solved] Unable to View Folder with Arrow Icon in GitHub Project

Clay
2024-10-252024-10-25
Git, Github

Problem Description

Today, while developing a web application with React.js for the frontend and Python Flask for the backend, I pushed the project to my GitHub repository after reaching a satisfactory milestone. However, upon checking the repository, I was surprised to find that I couldn’t access the folder my-app created by npx create-react-app my-app.

Note on Calculating VRAM Consumption for Training and Inference of AI Models

Clay
2024-10-242024-10-24
AI, Machine Learning

I’ve always used rough formulas to estimate the relationship between the scale of my models and the GPU VRAM consumption; after all, there are too many variables involved—model architecture, number of layers, attention mechanism implementation, sequence length, batch size, data precision used in training or inference… all of these affect our final calculation results.

Here’s a thought: Will Transformers be replaced in the future?

Clay
2024-10-222024-10-22
AI, Essay

Today, while I was eating, I came across a video (the video is attached at the end of this article). Unlike many tech channels that jump straight into discussing AI, economics, and replacing humans, this video took a more careful approach. It explained in detail how hardware specifications have influenced algorithms (or AI model architectures) over time.

[Linux] How To Check The Port is Used?

Clay
2024-10-202024-10-20
Linux

Problem Description

We usually start various services on Linux servers, the most common being hosting a website or opening a port to allow us or users to test developing functionalities.

Note Of KTOTrainer (Kahneman-Tversky Optimization Trainer)

Clay
2024-10-192024-10-19
AI, Machine Learning

I’ve been intermittently reading about a fine-tuning method called Kahneman-Tversky Optimization (KTO) from various sources like HuggingFace’s official documents and other online materials. It’s similar to DPO as a way to align models with human values, but KTO’s data preparation format is much more convenient, so I’m quickly applying it to my current tasks before making time to study the detailed content in the related papers.

[Linux] [Linux] Podman Basic Command Note

Clay
2024-10-172024-10-17
Linux

Introduction

Podman is an open-source tool designed to manage containers and images. Its full name is Pod Manager tool (podman). While Podman is similar to Docker, there are a few key differences in its design.

[Paper Reading] ENTP: ENCODER-ONLY NEXT TOKEN PREDICTION

Clay
2024-10-162024-10-16
AI, Machine Learning, Papers

The following are some points in this paper:

[Solved] Uvicorn Closed In Container – HTTP connection lost. Shutting down, exit 0

Clay
2024-10-142024-10-14
Linux

Problem Description

Today, while I was using podman to create a container (from a FastAPI image) to run my FastAPI service, I encountered an issue where the container would automatically stop if the user logged out or if there were no HTTP POST requests sent to the API for a while. After some time, the service would stop.

[Machine Learning] Note Of Kullback-Leibler Divergence

Clay
2024-10-132024-10-13
Machine Learning, Python

What is KL Divergence?

In machine learning, we often encounter the term KL Divergence (also known as Kullback-Leibler Divergence). KL Divergence is a metric used to evaluate the difference between two probability distributions P and Q.

« Previous
1
2
3
4
5
…
81
Next »