July 2024

[Paper Reading] Mistral 7B

Clay
2024-07-212024-07-25
Machine Learning

Last Updated on 2024-07-25 by Clay

Introduction

Mistral 7B is a large language model (LLM) proposed on September 27, 2023, trained by the Mistral AI team, which also released its weights as open source. Interestingly, it uses the highly permissive Apache 2.0 license, unlike Llama 2, which has its own Llama license terms. Therefore, Mistral 7B is truly “open source” (Llama’s license requires discussion with Meta AI when the service volume reaches 700 million).

PaddleOCR: A Framework and Model Specialized in Chinese Optical Character Recognition (OCR)

Clay
2024-07-202024-07-20
Machine Learning, Python

Last Updated on 2024-07-20 by Clay

Introduction

Recently, I have been exploring models used for Optical Character Recognition (OCR). In the past, OCR was a very popular research field as it was one of the earliest practical applications of computer vision. Today, OCR has become a very mature task, and you can easily find high-performance open-source models online.

NuExtract: A Large Language Model For Information Extraction

Clay
2024-07-102024-07-20
Machine Learning

Last Updated on 2024-07-20 by Clay

Introduction

In today’s era of flourishing large language models, researchers and companies are racking their brains to apply these models to their work. However, speaking personally, the performance of current language models is still not strong enough, and their application scenarios are limited, often far less than that of humans.

But there is one type of task for which large language models are naturally quite suitable: information extraction in any scenario, which is what I want to introduce today, the NuExtract model.

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30	31