Stable Diffusion ComfyUI Note 02 – Build The Basic Workflow

Clay
2024-07-27
AI

Last Updated on 2024-08-12 by Clay

Introduction

Previously, we finished configuring ComfyUI; now we can try to build the simplest basic workflow. The workflow is the biggest difference from stable-diffusion-webui. ComfyUI uses a card-based (node-based) process that makes it easier to understand how the Stable Diffusion model actually performs inference, and also makes it easier to customize and achieve more advanced effects.

Clay
2024-07-25
Linux

Last Updated on 2024-07-25 by Clay

Introduction

On a Linux system, when we want to access data on a USB device, we usually connect the device to the computer, open the file manager, and click the USB device that the computer detected.

Clay
2024-07-25
AI

Last Updated on 2024-08-12 by Clay

What is ComfyUI?

Those who play with Stable Diffusion AI-generated images have likely heard of stable-diffusion-webui. It is a visual interface that supports the Stable Diffusion model framework, allowing users to perform inference with AI models without having to write code or deal with complicated command-line operations. ComfyUI, on the other hand, is a slightly more niche front-end interface, but it has quickly garnered a loyal fan base due to its flexibility and customizability. Essentially, it can be seen as a more advanced version of stable-diffusion-webui, though it is less user-friendly.

Clay
2024-07-22
Linux, Machine Learning, Python

Last Updated on 2024-07-22 by Clay

Introduction

The HuggingFace Model Hub is now a widely recognized and essential open-source platform for everyone. Every day, countless individuals and organizations upload their latest trained models (including those for text, images, speech, and other domains) to this platform. It can be said that anyone working in AI-related fields frequently browses the HuggingFace website.

Clay
2024-07-21
Machine Learning

Last Updated on 2024-07-25 by Clay

Introduction

Mistral 7B is a large language model (LLM) released on September 27, 2023. It was trained by the Mistral AI team, which also released its weights openly. Interestingly, it uses the highly permissive Apache 2.0 license, unlike Llama 2, which has its own Llama license terms. In that sense, Mistral 7B is truly “open source” (the Llama 2 license requires requesting a separate license from Meta once a service exceeds 700 million monthly active users).

Clay
2024-07-20
Machine Learning, Python

Last Updated on 2024-07-20 by Clay

Introduction

Recently, I have been exploring models used for Optical Character Recognition (OCR). In the past, OCR was a very popular research field as it was one of the earliest practical applications of computer vision. Today, OCR has become a very mature task, and you can easily find high-performance open-source models online.

Clay
2024-07-10
Machine Learning

Last Updated on 2024-07-20 by Clay

Introduction

In today’s era of flourishing large language models, researchers and companies are racking their brains to apply these models to their work. However, speaking personally, the performance of current language models is still not strong enough and their application scenarios are limited; in many tasks they still fall far short of humans.

But there is one type of task for which large language models are naturally well suited: information extraction, in almost any scenario. That is the purpose of the model I want to introduce today, NuExtract.

Clay
2024-06-06
Machine Learning, PyTorch

Last Updated on 2024-06-06 by Clay

Introduction

The SiLU (Sigmoid Linear Unit) activation function is similar to the Swish function; Swish just has an additional trainable beta parameter. Many large language models (LLMs) also adopt this approach, primarily models that explore activation functions other than ReLU, such as the classic Llama architecture.
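The relationship between the two functions can be sketched in plain Python. This is a minimal illustration rather than the post's own code: the function names are chosen here for clarity, and `beta` is passed as an ordinary argument instead of being a trained parameter.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic sigmoid."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)  # x is negative here, so exp(x) cannot overflow
    return z / (1.0 + z)


def swish(x: float, beta: float = 1.0) -> float:
    """Swish: x * sigmoid(beta * x). In a network, beta may be trainable."""
    return x * sigmoid(beta * x)


def silu(x: float) -> float:
    """SiLU: x * sigmoid(x), i.e. Swish with beta fixed to 1."""
    return swish(x, beta=1.0)
```

With `beta = 1` the two functions coincide, which is why SiLU is often described as a special case of Swish.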

Clay
2024-06-04
Machine Learning, Python

Last Updated on 2024-06-05 by Clay

Introduction

For several months, I have benefited greatly from the Unsloth project, primarily because a significant part of my job involves fine-tuning large language models (LLMs). Fine-tuning LLMs is extremely time-consuming; aside from data collection, the biggest time sink is the endless GPU-powered fine-tuning process.

Clay
2024-06-03
Machine Learning, Python

Last Updated on 2024-07-25 by Clay

Introduction

This acceleration framework was proposed by Huawei Noah’s Ark Lab. It replaces the small draft model used in the original speculative decoding with a shallow sub-network of the large model itself. It also employs an additionally trained adapter, together with the model’s own decoding head, to generate speculative tokens, which are then verified by the large model. The subsequent steps are quite similar to the original speculative decoding process.
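The draft-then-verify loop that this description builds on can be sketched with toy models. This is not the framework's implementation: `target_next` and `draft_next` are hypothetical stand-ins for the large model and its cheap drafter, both decode greedily, and the verifier is called once per position, whereas a real system would score all proposed positions in a single forward pass.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[int], max_new_tokens: int,
                       k: int = 4) -> List[int]:
    """Greedy speculative decoding: draft k tokens, keep the verified prefix."""
    tokens = list(prompt)
    limit = len(prompt) + max_new_tokens
    while len(tokens) < limit:
        # Draft phase: the cheap model proposes up to k tokens.
        proposal: List[int] = []
        for _ in range(min(k, limit - len(tokens))):
            proposal.append(draft(tokens + proposal))
        # Verify phase: the target checks each proposed token in order.
        for proposed in proposal:
            expected = target(tokens)
            tokens.append(expected)  # always keep the target's choice
            if expected != proposed:
                break  # first mismatch: discard the rest of the draft
    return tokens


def greedy_decode(model: NextToken, prompt: List[int],
                  max_new_tokens: int) -> List[int]:
    """Reference: plain greedy decoding with a single model."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(model(tokens))
    return tokens


def target_next(context: List[int]) -> int:
    """Toy stand-in for the large model (hypothetical)."""
    return (context[-1] * 3 + 1) % 7


def draft_next(context: List[int]) -> int:
    """Toy drafter (hypothetical): agrees with the target except after 5."""
    return 0 if context[-1] == 5 else target_next(context)
```

Because every kept token is the target's own greedy choice, `speculative_decode(target_next, draft_next, [1], 10)` yields the same sequence as `greedy_decode(target_next, [1], 10)`; the drafter only affects how many tokens are accepted per round.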
