Scikit Learn
[NLP] The TF-IDF In Text Mining
TF-IDF (Term Frequency – Inverse Document Frequency) is a well-known word weighting technique: it measures how important a word is to a document within a collection of texts.
[Python] Use ShuffleSplit() To Process Cross-Validation Step
Cross-validation is an important concept in data splitting for machine learning. Simply put, when we want to train a model, we need to split the data into training data and testing data.
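A short sketch of scikit-learn's ShuffleSplit, which draws several independent random train/test splits (the toy array below is made up):

```python
# ShuffleSplit: 5 independent random permutations, each holding out
# 30% of the samples as a test set.
import numpy as np
from sklearn.model_selection import ShuffleSplit

X = np.arange(20).reshape(10, 2)   # 10 toy samples, 2 features each

ss = ShuffleSplit(n_splits=5, test_size=0.3, random_state=0)
sizes = [(len(train_idx), len(test_idx)) for train_idx, test_idx in ss.split(X)]
print(sizes)   # each split: 7 training samples, 3 test samples
```

Unlike KFold, the test sets of different splits may overlap, since each split is an independent shuffle.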
[Solved] graphviz.backend.ExecutableNotFound: failed to execute [‘dot’, ‘-Tpdf’, ‘-O’, ‘Digraph.gv’], make sure the Graphviz executables are on your systems’ PATH
Today I was using PyTorch to build a model when I suddenly needed to submit my technical report, so I quickly found a tool to visualize the model: torchviz.
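The error above usually means the Graphviz Python package is installed but the Graphviz system binaries (the `dot` executable) are not on PATH. A typical fix, assuming a Debian/Ubuntu system (other platforms differ):

```shell
# The Python graphviz/torchviz packages only generate DOT source;
# rendering requires the Graphviz system binaries on PATH.
sudo apt-get install graphviz

# macOS (Homebrew) equivalent:
# brew install graphviz

# Verify that "dot" is now visible on PATH:
dot -V
```

On Windows, the Graphviz installer must be run and its bin directory added to PATH manually.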
[Machine Learning] Introduction to the Three Model Evaluation Metrics: Precision, Recall, and F1-score
Precision, Recall, and F1-score are three fairly well-known model evaluation metrics, mostly used for binary classification (for multi-class problems, the macro- and micro-averaged variants are suitable). Below is a brief description of each of these metrics.
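A minimal sketch of the three metrics with scikit-learn on a made-up binary classification result:

```python
# precision = TP / (TP + FP); recall = TP / (TP + FN);
# F1 is the harmonic mean of precision and recall.
from sklearn.metrics import precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1]   # one positive was missed (a false negative)

p = precision_score(y_true, y_pred)   # 3 / (3 + 0) = 1.0
r = recall_score(y_true, y_pred)      # 3 / (3 + 1) = 0.75
f = f1_score(y_true, y_pred)          # 2 * 1.0 * 0.75 / 1.75 ≈ 0.857
print(p, r, f)
```

For multi-class labels, passing `average="macro"` or `average="micro"` to these same functions gives the averaged variants mentioned above.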
[Scikit-Learn] Using “train_test_split()” to split your data
When we want to train a model, we need to split our data into training data and test data. We use the training data to train the model and make sure it never sees the test data. This is very important, because the test data is what we use to assess the quality of the model.
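A short sketch of scikit-learn's train_test_split on a toy dataset (the arrays and the 80/20 ratio below are illustrative choices):

```python
# train_test_split shuffles the data and carves off a held-out test set;
# random_state makes the shuffle reproducible.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(20).reshape(10, 2)   # 10 toy samples, 2 features each
y = np.arange(10)                  # matching toy labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
print(X_train.shape, X_test.shape)   # (8, 2) and (2, 2)
```

Passing `X` and `y` together keeps each sample aligned with its label across the split.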