
[Tool] Use ConvertZ to convert cp950 to Unicode

For a long time, whenever I used Python to process text, I would often receive files that other people had edited on Windows and then sent to my Linux server for processing.

If it is an English file, that is fine. But if it is a Chinese file, it is trouble.

The default encoding for Traditional Chinese on Windows is Big5 (code page cp950), while Linux systems almost always use a Unicode encoding such as UTF-8.

So when you open the file, you may see garbled characters.
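The mismatch can be reproduced in a few lines of Python. This is only an illustrative sketch: the sample string is my own, and `errors="replace"` is used so the failed decode is visible instead of raising an exception.

```python
# The same two characters, stored the way a Big5/cp950 editor would save them.
text = "中文"
raw = text.encode("cp950")

# Decoding with the matching codec recovers the original text.
as_cp950 = raw.decode("cp950")

# Decoding the same bytes as UTF-8 fails; errors="replace" substitutes
# U+FFFD for the bytes it cannot decode, which shows up as garbled output.
as_utf8 = raw.decode("utf-8", errors="replace")

print(as_cp950)
print(as_utf8)
```

The bytes on disk are the same in both cases; only the codec used to interpret them differs.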

Sometimes Python can convert the encoding, but this approach can still leave characters that are not converted correctly.

In my case, some garbled characters still remained at the beginning of the file.

However, opening the same file in Notepad++ on Windows shows no such problem.

Of course, we can convert files with Notepad++, but it is not practical for converting a large number of files.

Therefore, I looked for a better tool and found a very good one whose conversion quality is no worse than Notepad++. That is the tool I recommend today: ConvertZ.
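For reference, a Python conversion of this kind might look like the sketch below. The function name `convert_cp950_to_utf8` and the choice of `errors="replace"` are my own assumptions, not the exact script used for this post. The return value counts how many characters could not be decoded, which gives a rough measure of how much garbling is left.

```python
from pathlib import Path


def convert_cp950_to_utf8(src, dst):
    """Re-encode one cp950 file as UTF-8 (hypothetical helper).

    Returns the number of characters that could not be decoded.
    """
    raw = Path(src).read_bytes()
    # errors="replace" keeps going past bad bytes and marks them with U+FFFD.
    text = raw.decode("cp950", errors="replace")
    # Write bytes directly so line endings are left exactly as they were.
    Path(dst).write_bytes(text.encode("utf-8"))
    return text.count("\ufffd")
```

A non-zero return value means the file contains bytes that are not valid cp950, for example because it was actually saved in a different encoding.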


Introduction to ConvertZ

ConvertZ is a free tool. It works by calling the conversion interface built into Windows.

Unfortunately, the original author’s website is reportedly gone, and it is hard to find where the tool was originally published.

Fortunately, many backup copies still circulate on the Internet. I will not provide links here.

After starting ConvertZ, you will see a long graphical interface that looks quite dated.

Click “File” to select the files.

After opening, there will be a graphical interface like the following.

The interface is actually quite detailed; there is even a Chinese explanation in the lower-left corner.

The basic workflow is to select the directory to convert with the button at the upper left. ConvertZ then reads all the txt files in that directory and prepares them for conversion.

Note that some conversions, such as cp950 to Unicode, are not among the presets in the right half of the window. You have to click “Advanced Options” to see them.

Once everything is selected, click “Start Conversion”, and the program automatically converts all the text files in the directory you selected.

Now, when I read the converted files with Python, apart from a little noise, the text is all proper Chinese.
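For comparison, the same directory-wide workflow can be approximated in Python. This is a rough sketch and not how ConvertZ works internally: the function name `batch_convert`, the recursive search, and the separate output directory are all assumptions of mine. It decodes strictly and reports files that fail rather than writing garbled output.

```python
from pathlib import Path


def batch_convert(src_dir, dst_dir, src_encoding="cp950"):
    """Convert every .txt file under src_dir to UTF-8 in dst_dir (hypothetical helper).

    Returns (converted, failed) as two lists of paths.
    """
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    converted, failed = [], []
    # sorted() collects the file list up front, before anything is written.
    for src in sorted(src_dir.rglob("*.txt")):
        dst = dst_dir / src.relative_to(src_dir)
        try:
            # Strict decoding: any invalid byte raises instead of being hidden.
            text = src.read_bytes().decode(src_encoding)
        except UnicodeDecodeError:
            failed.append(src)
            continue
        dst.parent.mkdir(parents=True, exist_ok=True)
        dst.write_bytes(text.encode("utf-8"))
        converted.append(dst)
    return converted, failed
```

Files that end up in `failed` are the ones where a dedicated tool such as ConvertZ is worth trying.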
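Reading a converted file back might look like the sketch below. Which Unicode encoding the output uses depends on the option chosen during conversion, so the hypothetical helper `read_converted` checks for a byte order mark first and otherwise assumes plain UTF-8. That fallback is my assumption, not something ConvertZ guarantees.

```python
import codecs
from pathlib import Path


def read_converted(path):
    """Read a converted text file, using its BOM to pick the codec (hypothetical helper)."""
    data = Path(path).read_bytes()
    if data.startswith(codecs.BOM_UTF8):
        # "utf-8-sig" strips the leading BOM while decoding.
        return data.decode("utf-8-sig")
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec reads the BOM to determine byte order.
        return data.decode("utf-16")
    # No BOM: assume UTF-8 and let invalid bytes raise UnicodeDecodeError.
    return data.decode("utf-8")
```

Decoding strictly in the last branch means a file that was not actually converted fails loudly instead of being read as garbled text.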

