Clay

[Linux] tty-clock: A Small Tool for Displaying a Digital Clock in the Terminal

Clay
2024-02-25
Linux

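Introduction

I often watch full-screen videos on my computer and need to get back to work on time once my break is over. The awkward part is that I don't like picking up my phone every time I want to check the time, because that is a hassle.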

[Solved] RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(...) instead.

Clay
2024-02-22
Machine Learning, PyTorch

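Problem Description

When building deep learning models with PyTorch, we inevitably adjust the shapes of layers and of their inputs and outputs again and again; it is a road every AI engineer has to travel. PyTorch's shape-changing view() method hides an interesting little pitfall along the way: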

RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(...) instead.
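A minimal sketch of one common way to run into this error, assuming the tensor became non-contiguous through a transpose. The tensor values and variable names are illustrative and not taken from the original post.

```python
import torch

# A 2x3 tensor laid out contiguously in memory.
x = torch.arange(6).reshape(2, 3)

# Transposing only swaps the strides; the underlying memory is not rearranged,
# so the result is no longer contiguous.
y = x.t()
print(y.is_contiguous())  # False

# view() never copies data, so it fails when the requested shape cannot be
# expressed on top of the existing memory layout.
view_error = None
try:
    y.view(6)
except RuntimeError as err:
    view_error = str(err)
    print(view_error)

# Option 1: reshape() returns a view when possible and copies otherwise.
flat_reshape = y.reshape(6)

# Option 2: copy into contiguous memory first, after which view() succeeds.
flat_view = y.contiguous().view(6)

print(torch.equal(flat_reshape, flat_view))  # True
```

Both options produce the same values. The difference is that reshape() decides on its own whether a copy is needed, while contiguous().view() makes the copy explicit.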

[Linux] Notes on Using Hadolint

Clay
2024-02-20
Linux

What Is Hadolint?

Hadolint is a Dockerfile linter that helps you follow best practices and style guidelines when writing Dockerfiles.

[Solved] When Using SFTTrainer, Where Does Loss Computation Start If the Training Data Contains Multiple response_template Occurrences?

Clay
2024-02-19, 2024-04-01
Machine Learning

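Problem Description

SFTTrainer is a training tool provided by HuggingFace for LLM fine-tuning tasks; it makes it quick to adjust the many hyperparameters and detailed settings involved in fine-tuning a large language model. Within it, response_template is a special template string that we must pass in for the training data: everything that follows this template string takes part in the loss computation during training.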
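The idea of "only tokens after the template contribute to the loss" can be sketched in plain Python as label masking. This is an illustration under stated assumptions, not the SFTTrainer implementation: the helper names and toy token ids are hypothetical, and which occurrence of the template the real trainer uses is not stated in this excerpt, so the sketch exposes that choice as a `use_last` parameter.

```python
IGNORE_INDEX = -100  # label value that PyTorch's CrossEntropyLoss ignores by default


def find_template_ends(input_ids, template_ids):
    """Return the index just past every occurrence of template_ids in input_ids."""
    if not template_ids:
        raise ValueError("template_ids must not be empty")
    n, m = len(input_ids), len(template_ids)
    return [i + m for i in range(n - m + 1) if input_ids[i:i + m] == template_ids]


def mask_labels(input_ids, template_ids, use_last=True):
    """Build labels where everything up to and including the chosen template is ignored.

    use_last is a hypothetical switch for this sketch: it selects the last
    occurrence of the template when True and the first one when False.
    """
    ends = find_template_ends(input_ids, template_ids)
    if not ends:
        # Assumption of this sketch: with no template found, no token is trained on.
        return [IGNORE_INDEX] * len(input_ids)
    start = ends[-1] if use_last else ends[0]
    return [IGNORE_INDEX] * start + list(input_ids[start:])


# Toy example: the "template" is the token pair [9, 9], and it appears twice.
input_ids = [5, 9, 9, 1, 2, 9, 9, 3, 4]
template_ids = [9, 9]

labels_last = mask_labels(input_ids, template_ids, use_last=True)
labels_first = mask_labels(input_ids, template_ids, use_last=False)
print(labels_last)
print(labels_first)
```

With two occurrences of the template, the two settings train on very different spans, which is why it matters to know where the loss computation actually starts.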