Machine Learning

[PyTorch] 遍歷模型每一層神經網路筆記

Clay
2023-09-122023-09-12
Machine Learning, PyTorch

介紹

最近在因緣際會下，有了簡單改一些模型架構的機會，於是便趁這個機會好好地摸索了下 PyTorch 中模型神經網路層的遍歷打印方式。

an artist s illustration of artificial intelligence ai this image represents how machine learning is inspired by neuroscience and the human brain it was created by novoto studio as par

[論文閱讀] The Forward-Forward Algorithm: Some Preliminary Investigation

Clay
2023-09-042023-09-04
Machine Learning

介紹

論文連結： https://arxiv.org/abs/2212.13345

本篇研究工作的作者是鼎鼎大名的深度學習之父 Geoffrey Hinton，最初撰寫本篇論文時仍是 Google Brain 的研究員（於 2023 年離開）。

[Machine Learning] KL 散度（Kullback-Leibler Divergence）筆記

Clay
2023-07-192023-07-19
Machine Learning, Python

什麼是 KL 散度（Divergence）？

我們經常會在機器學習中聽到『KL 散度』（Kullback Libler Divergence）這個詞，而 KL 散度其實就是評估兩個機率分佈 P 和 Q 之間『差異性』的一個評估值。

Consensus: 使用 AI 驅動的論文搜尋引擎

Clay
2023-06-282023-06-28
Tools

在研究一些 AI 模型的時候，或許是研究所時保留下來的習慣，我總是會去翻翻模型、系統、套件相關的論文，就算只是粗略掃過 Abstract、Introduction、Conclusion 以及圖表也好，至少我能對我正在使用什麼東西、這東西背後是怎麼驅動的有個底，不是只會直接把開發團隊附上的程式碼跑起來就好。

[已解決] Some weights of the model checkpoint at distilbert-base-multilingual-cased were not used when initializing DistilBertForSequenceClassification: [‘vocab_projector.bias’, ‘vocab_layer_norm.bias’, ‘vocab_layer_norm.weight’, ‘vocab_transform.weight’, ‘vocab_transform.bias’]

Clay
2023-06-192023-06-19
Machine Learning

問題描述

在使用 transformers 套件時，如果當我們使用如 AutoModelForSequenceClassification、AutoModelForSeq2SeqLM… 這樣不同 head 的任務模型時，經常我們會得到以下錯誤訊息：

使用 OpenAI Moderation Endpoint 偵測不適當內容

Clay
2023-06-152023-06-15
Machine Learning

介紹

Moderation 模型是一個 OpenAI 所提供的免費工具，用來審查所謂的『不適當內容』。詳細的禁止條例可以參考 https://openai.com/policies/usage-policies。

[已解決] UserWarning: CUDA initialization: CUDA unknown error – this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero. (Triggered internally at …/c10/cuda/CUDAFunctions.cpp:109.) return torch._C._cuda_getDeviceCount() > 0

Clay
2023-06-072023-06-07
Linux, Machine Learning

問題描述

今天我在我的伺服器訓練模型時，我寫好了一份多片 GPU 平行化訓練的腳本，接著把最新的資料輸入給模型開始訓練；但是在訓練過程中，我得到了沒有 GPU 的錯誤訊息。當我使用 torch.cuda.is_available() 確認時，得到了以下錯誤訊息。

ImageBind：能轉換多模態向量的模型使用心得筆記

Clay
2023-05-292023-05-29
Machine Learning

前言

最近的 Meta AI 真的是強到不行，似乎一瞬間站穩了 AI 研發巨擘的地位，而且還是樣樣開源的頂級標竿。從影像領域的能切割物件的 Segment Anything、到公開大型語言模型（Large Language Model）同時也是基石模型（fundational model）的 LLaMA（引起眾羊駝之亂的就是它！）、到最近能轉換 6 種模態（modals）的 ImageBind 跟語音模型計畫（MMS）…… 只能說像我這種平凡人要努力跟上怎麼去使用都很費勁，更遑論試圖追逐他們的技術了。

BigCode & StarCoder: 專精於程式的大型語言模型計畫

Clay
2023-05-102023-05-10
Machine Learning, NLP

介紹

BigCode

首先我們先來介紹 BigCode 吧！BigCode 是一個由 Hugging Face 和 ServiceNow 共同領導的開放性科學合作項目，目的是共同開發能夠應用於『程式設計』的大型語言模型（Large Language Model, LLM）。

[Machine Learning] 可以和 Accuracy、F1 一同用於模型評估的 MCC 指標筆記

Clay
2023-02-102023-04-13
Machine Learning

Introduction

每當我們訓練完一個深度學習模型後，就又到了我們評估模型好壞的時間。比方說分類任務所使用的分類模型，我們總是會拿各式各樣的指標來測試，比方說 precision、recall、F1-score、Accuracy… 指標間總是有好有壞，甚至有可能會互相衝突，所以大部分時候還是要依據任務的需求來選擇自己評估模型的指標的。

« 上一頁
1
...
6
7
8
9
10
...
17
下一頁 »

一	二	三	四	五	六	日
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31