Clay

[C++] The meaning of ios_base::sync_with_stdio(false) and cin.tie(NULL) in programming competition

Clay
2024-08-25
C++

Explanation

On platforms like LeetCode, if you read the solutions posted by top coders after solving a problem, you will often encounter a peculiar piece of C++-only code: the calls ios_base::sync_with_stdio(false) and cin.tie(NULL).

Clay
2024-08-24
Machine Learning

Introduction

ColBERT is an embedding model designed specifically for retrieval tasks, transforming the tokens of Queries and Documents into embeddings and computing the maximum similarity.
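
The "maximum similarity" step can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function names (cosine, maxsim_score) and the toy 2-dimensional embeddings are hypothetical, and a real ColBERT model produces the token embeddings with BERT rather than by hand. For each query token, the sketch takes the best cosine similarity against all document tokens, then sums those maxima into one relevance score.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def maxsim_score(query_embs, doc_embs):
    """Sum, over query tokens, of the best similarity to any document token."""
    return sum(max(cosine(q, d) for d in doc_embs) for q in query_embs)

# Toy token embeddings (hypothetical values, one vector per token).
query = [[1.0, 0.0], [0.0, 1.0]]
doc = [[1.0, 0.0], [1.0, 1.0]]
score = maxsim_score(query, doc)
```

Because each query token is matched independently, document token embeddings can be computed ahead of time and only the cheap max-and-sum step runs at query time.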

Clay
2024-08-23
Linux

What is Hadolint?

Hadolint is a Dockerfile linter that helps you follow best practices and style guidelines when writing Dockerfiles.

Clay
2024-08-22
Machine Learning

To this day, many AI applications and open-source projects are developed based on the HuggingFace transformers package. A large number of models and packages are written to be compatible with the transformers format, and even share the same functions and methods, which makes them more widely accepted.

Against this background, I came across an open-source training framework that conveniently wraps the automatic loading of Transformer architectures. However, I ran into one unavoidable problem: I wanted to use my own custom model for experiments. I tried several solutions, hoping that simply passing the local path of my model to AutoModel.from_pretrained() would load my custom model architecture. This article records the method that worked.

Clay
2024-08-21
Machine Learning, PyTorch

Problem Description

When building deep learning models in PyTorch, adjusting the shapes of layers and input/output dimensions is something every AI engineer has to deal with. However, there is a small but interesting pitfall in the view() method of PyTorch:

RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(...) instead.
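
The idea behind this error can be illustrated without PyTorch. The sketch below is a simplified model, not PyTorch's actual check: it assumes row-major storage and treats a tensor as viewable only when its strides match the contiguous layout, whereas the real view() condition is somewhat more lenient. The helper names (contiguous_strides, is_contiguous) are hypothetical. It shows that transposing a (2, 3) tensor swaps the strides, so the memory is no longer laid out in the order its shape implies.

```python
def contiguous_strides(shape):
    """Strides (in elements) of a row-major, contiguous tensor of this shape."""
    strides = []
    step = 1
    for size in reversed(shape):
        strides.append(step)
        step *= size
    return tuple(reversed(strides))

def is_contiguous(shape, strides):
    """Simplified check: do the strides match the row-major layout?"""
    return tuple(strides) == contiguous_strides(shape)

# A fresh (2, 3) tensor has strides (3, 1).
shape = (2, 3)
strides = contiguous_strides(shape)

# Transposing only swaps shape and strides; the data itself does not move.
t_shape, t_strides = shape[::-1], strides[::-1]

print(is_contiguous(shape, strides))      # True
print(is_contiguous(t_shape, t_strides))  # False
```

In PyTorch itself, the usual ways out are to call .contiguous() before view(), or to use .reshape() as the error message suggests.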

Clay
2024-08-20
Flutter, Linux

Recently, I started experimenting with Flutter to develop desktop applications. Since I’m using the GNOME desktop on Linux, the default system window bar (the top strip) is completely black, which doesn’t match the light and playful tone of the app I’m developing. So, I found a great tool: bitsdojo_window.

Clay
2024-08-19
Machine Learning, PyTorch

The working principle of LayerNorm is as follows:

Calculate mean and variance
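
As a rough sketch of where the mean and variance go, the following plain-Python function normalizes a single feature vector. It assumes the standard LayerNorm formulation (subtract the mean, divide by the square root of the variance plus a small eps, then apply a learnable scale and shift); the names layer_norm, gamma and beta are illustrative, and a framework implementation operates on whole tensors instead of lists.

```python
import math

def layer_norm(x, gamma=None, beta=None, eps=1e-5):
    """Normalize one feature vector, then apply scale (gamma) and shift (beta)."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n  # biased variance over the features
    inv_std = 1.0 / math.sqrt(var + eps)
    normed = [(v - mean) * inv_std for v in x]
    if gamma is None:
        gamma = [1.0] * n  # identity scale
    if beta is None:
        beta = [0.0] * n   # no shift
    return [g * v + b for g, v, b in zip(gamma, normed, beta)]

out = layer_norm([1.0, 2.0, 3.0])
```

With the default gamma and beta, the output has mean close to 0 and variance close to 1.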

Clay
2024-08-18
Machine Learning, PyTorch

Introduction to Cross Entropy

Cross entropy is a very common loss function in machine learning, particularly for classification tasks, because it quantifies the difference between a model’s predicted class probabilities and the actual class labels.
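
A minimal sketch of the computation for a single example, assuming the model outputs raw logits and the label is a class index: apply softmax to get probabilities, then take the negative log of the probability assigned to the true class. The function names are illustrative only.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    """Negative log-probability that the model assigns to the true class."""
    return -math.log(softmax(logits)[target])

confident_right = cross_entropy([10.0, 0.0, 0.0], target=0)  # small loss
confident_wrong = cross_entropy([10.0, 0.0, 0.0], target=1)  # large loss
```

The loss approaches 0 as the probability of the correct class approaches 1, and grows without bound as that probability approaches 0.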

Clay
2024-08-18
Machine Learning, PyTorch

Gaussian Error Linear Unit (GELU) is an activation function used in machine learning. While it resembles the classic ReLU (Rectified Linear Unit), there are some key differences.
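
To make the comparison concrete, here is a small sketch using the exact GELU definition, x · Φ(x), where Φ is the standard normal CDF, written with math.erf. ReLU is included for contrast; the function names are just for illustration.

```python
import math

def gelu(x):
    """Exact GELU: x times the standard normal CDF evaluated at x."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

def relu(x):
    """ReLU: zero for negative inputs, identity otherwise."""
    return max(0.0, x)

for x in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(f"x={x:+.1f}  relu={relu(x):.4f}  gelu={gelu(x):.4f}")
```

Unlike ReLU, GELU is smooth around zero and lets small negative values through instead of cutting them off sharply.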

Clay
2024-08-17
Machine Learning, PyTorch

Introduction to RMSNorm

RMSNorm is a simplified variant of LayerNorm that skips mean-centering and rescales activations by their root mean square; it is often used in Transformer blocks around the self-attention mechanism. It aims to mitigate the issues of vanishing and exploding gradients, helping the model converge faster and improve performance.
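
A minimal sketch of the computation on one feature vector, assuming the usual formulation: divide each element by the root mean square of the vector (plus a small eps for stability) and multiply by a learnable gain. The names rms_norm and gain are illustrative, not taken from a specific library.

```python
import math

def rms_norm(x, gain=None, eps=1e-6):
    """Rescale a feature vector by its root mean square; no mean subtraction."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    if gain is None:
        gain = [1.0] * len(x)  # identity gain
    return [g * v / rms for g, v in zip(gain, x)]

out = rms_norm([3.0, 4.0])
```

Compared with LayerNorm, there is no mean to compute and no shift term, which makes the operation slightly cheaper.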


[C++] The meaning of ios_base::sync_with_stdio(false) and cin.tie(NULL) in programming competition

[Paper Reading] ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT

[Linux] How To Use Hadolint To Avoid Terrible Dockerfile

Using AutoModel.from_pretrained() In Transformers To Load Customized Model Architecture

[Solved] RuntimeError: view size is not compatible with input tensor’s size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(…) instead.

[Flutter] Using bitsdojo_window to Customize Window Title Bar

[Machine Learning] Note of LayerNorm

[Machine Learning] Note of Cross Entropy Loss

[Machine Learning] Note of Activation Function GELU

[Machine Learning] Note of RMSNorm