The Concept and Practice of LLM

Introduction

Welcome to the Big Models Course! This course will take you deep into the realm of big models and help you master the skills to apply these powerful models. Whether you're interested in the field of deep learning or looking to apply big models in real-world projects, this course will provide you with valuable knowledge and hands-on experience.

Big models refer to deep learning models with enormous parameters and complex structures. These models perform exceptionally well when dealing with large-scale datasets and complex tasks like image recognition, natural language processing, speech recognition, and more. The emergence of big models has sparked significant changes in the field of deep learning, leading to breakthroughs in various domains.

In this course, you'll learn the fundamental concepts and principles of big models. We'll delve into the foundational theory, developmental history, commonly used big models, and the evolving techniques like Prompts and In-context learning within LLMs (Large Language Models). As the course progresses, we'll dive into the practical applications of big models. You'll learn how to deploy highly regarded big models such as Stable Diffusion and ChatGLM2-6B onto SOPHON's latest generation deep learning processor, the SOPHON BM1684X. The SOPHON BM1684X is the fourth-generation tensor processor specifically introduced by SOPHON for the field of deep learning, capable of 32TOPS computing power, supporting 32 channels of HD hardware decoding, and 12 channels of HD hardware encoding, applicable in environments such as deep learning, computer vision, high-performance computing, and more.

Whether you're inclined toward in-depth academic research on big models or their industrial applications, this course will provide you with a robust foundation and practical skills. Are you ready to take on the challenge of big models? Let's delve into this fascinating field together!

Chapters （ 10Lesson）

1_ Basic Concepts

Start Learning

1.1 LLM: Lossless compression of the world's knowledge

To do

Start Learning

1.2 The development of LLM

To do

Start Learning

1.3 Common large model introduction

To do

Start Learning

1.4 Evolving Prompt and In-context learning technologies

To do

Start Learning

1.5 RLHF technology

To do

Start Learning

1.6 Compression of LLM

To do

Start Learning

1.7 Training and inference acceleration technology for LLM

To do

Start Learning

2_ Practice and Deployment

Start Learning

2.1 BM1684X box Deployment Course：Stable Diffusion

To do

Start Learning

2.2 BM1684X Deployment Course：ChatGLM2-6B

To do

Start Learning

2.3 ChatGLM2_6B

To do

Start Learning

Objective

After completing this course, learners will acquire the following abilities, providing effective tools and approaches to tackle complex problems in natural language processing and computer vision:

Understanding the basic concepts and principles of big models, being familiar with the latest advancements and cutting-edge knowledge in this domain: Learners will grasp the underlying technological principles behind large-scale language models, including model structures, training methodologies, and inference algorithms. This knowledge will enable them to stay updated with the evolving dynamics of the field.
Mastering the application of big models: Learners will gain insight into the extensive applications of big models across various domains, such as machine translation, text generation, intelligent question-answering systems, and more.
Understanding the capabilities and limitations of big models: Learners will comprehend the abilities of big models in language understanding, generation, reasoning, as well as their limitations in handling long texts, multimodal data, and other aspects.
Mastering the computational requirements and technical challenges of big models: Learners will understand the computational demands posed by big models, along with relevant training and inference techniques, including distributed training, model compression, acceleration, and more.

Course Participants

This course is suitable for a wide range of learners, including but not limited to the following:

Students studying deep learning: For students in computer science, deep learning, or related disciplines, this course provides an in-depth understanding and application of big models.
Professionals: For professionals working in fields such as natural language processing, computer vision, data science, etc., this course can help them grasp the latest advancements and cutting-edge knowledge in the domain of big models.
Individuals interested in deep learning: For individuals intrigued by deep learning technology and seeking comprehensive insights into the principles and applications of big models, this course offers a thorough introduction.

Course Recommendation

Compiler development

As a bridge between the framework and hardware, the Deep learning compiler can realize the goal of one-time code development and reuse of various computing power processors. Recently, Computational Energy has also opened source its self-developed TPU compiler tool - TPU-MLIR (Multi-Level Intermediate Representation). Tpu-mlir is an open source TPU compiler for Deep learning processors. The project provides a complete tool chain, which converts the pre-trained neural network under various frameworks into a binary file bmodel that can operate efficiently in TPU to achieve more efficient reasoning. This course is driven by actual practice, leading you to intuitively understand, practice, and master the TPU compiler framework of intelligent Deep learning processors.

At present, the TPU-MLIR project has been applied to the latest generation of deep learning processor BM1684X, which is developed by Computational Energy. Combined with the high-performance ARM core of the processor itself and the corresponding SDK, it can realize the rapid deployment of deep learning algorithms. The course will cover the basic syntax of MLIR and the implementation details of various optimization operations in the compiler, such as figure optimization, int8 quantization, operator segmentation, and address allocation.

TPU-MLIR has several advantages over other compilation tools

1. Simple and convenient

By reading the development manual and the samples included in the project, users can understand the model conversion process and principles, and quickly get started. Moreover, TPU-MLIR is designed based on the current mainstream compiler tool library MLIR, and users can also learn the application of MLIR through it. The project has provided a complete set of tool chain, users can directly through the existing interface to quickly complete the model transformation work, do not have to adapt to different networks

2. General

At present, TPU-MLIR already supports the TFLite and onnx formats, and the models of these two formats can be directly converted into the bmodel available for TPU. What if it's not either of these formats? In fact, onnx provides a set of conversion tools that can convert models written by major deep learning frameworks on the market today to onnx format, and then proceed to bmodel

3, precision and efficiency coexist

During the process of model conversion, accuracy is sometimes lost. TPU-MLIR supports INT8 symmetric and asymmetric quantization, which can greatly improve the performance and ensure the high accuracy of the model combined with Calibration and Tune technology of the original development company. In addition, TPU-MLIR also uses a lot of graph optimization and operator segmentation optimization techniques to ensure the efficient operation of the model.

4. Achieve the ultimate cost performance and build the next generation of Deep learning compiler

In order to support graphic computation, operators in neural network model need to develop a graphic version; To adapt the TPU, a version of the TPU should be developed for each operator. In addition, some scenarios need to be adapted to different models of the same computing power processor, which must be manually compiled each time, which will be very time-consuming. The Deep learning compiler is designed to solve these problems. Tpu-mlir's range of automatic optimization tools can save a lot of manual optimization time, so that models developed on RISC-V can be smoothly and freely ported to the TPU for the best performance and price ratio.

5. Complete information

Courses include Chinese and English video teaching, documentation guidance, code scripts, etc., detailed and rich video materials detailed application guidance clear code script TPU-MLIR standing on the shoulders of MLIR giants to build, now all the code of the entire project has been open source, open to all users free of charge.

Code Download Link: https://github.com/sophgo/tpu-mlir

TPU-MLIR Development Reference Manual: https://tpumlir.org/docs/developer_manual/01_introduction.html

The Overall Design Ideas Paper: https://arxiv.org/abs/2210.15016

Video Tutorials: https://space.bilibili.com/1829795304/channel/collectiondetail?sid=734875"

Course catalog

序号	课程名	课程分类	课程资料
			视频	文档	代码
1.1	Deep learning编译器基础	TPU_MLIR基础	√	√	√
1.2	MLIR基础	TPU_MLIR基础	√	√	√
1.3	MLIR基本结构	TPU_MLIR基础	√	√	√
1.4	MLIR之op定义	TPU_MLIR基础	√	√	√
1.5	TPU_MLIR介绍（一）	TPU_MLIR基础	√	√	√
1.6	TPU_MLIR介绍（二）	TPU_MLIR基础	√	√	√
1.7	TPU_MLIR介绍（三）	TPU_MLIR基础	√	√	√
1.8	量化概述	TPU_MLIR基础	√	√	√
1.9	量化推导	TPU_MLIR基础	√	√	√
1.10	量化校准	TPU_MLIR基础	√	√	√
1.11	量化感知训练（一）	TPU_MLIR基础	√	√	√
1.12	量化感知训练（二）	TPU_MLIR基础	√	√	√
2.1	Pattern Rewriting	TPU_MLIR实战	√	√	√
2.2	Dialect Conversion	TPU_MLIR实战	√	√	√
2.3	前端转换	TPU_MLIR实战	√	√	√
2.4	Lowering in TPU_MLIR	TPU_MLIR实战	√	√	√
2.5	添加新算子	TPU_MLIR实战	√	√	√
2.6	TPU_MLIR图优化	TPU_MLIR实战	√	√	√
2.7	TPU_MLIR常用操作	TPU_MLIR实战	√	√	√
2.8	TPU原理（一）	TPU_MLIR实战	√	√	√
2.9	TPU原理（二）	TPU_MLIR实战	√	√	√
2.10	后端算子实现	TPU_MLIR实战	√	√	√
2.11	TPU层优化	TPU_MLIR实战	√	√	√
2.12	bmodel生成	TPU_MLIR实战	√	√	√
2.13	To ONNX format	TPU_MLIR实战	√	√	√
2.14	Add a New Operator	TPU_MLIR实战	√	√	√
2.15	TPU_MLIR模型适配	TPU_MLIR实战	√	√	√
2.16	Fuse Preprocess	TPU_MLIR实战	√	√	√
2.17	精度验证	TPU_MLIR实战	√	√	√

Milk-V Duo Development Board Pratical Course

This course introduces the hardware circuit design and basic environment set up, as well as provides some simple development examples and some basic Deep learning examples.

Milk-V Duo is an ultra-compact embedded development platform based on CV1800B. It has small size and comprehensive functionality, it is equipped with dual cores and can run linux and rtos systems separately, and has various connectable peripherals.

Scalability: The Milk-V Duo core board has various interfaces such as GPIO, I2C, UART, SDIO1, SPI, ADC, PWM, etc.
Diverse connectable peripherals: The Milk-V Duo core board can be expanded with various devices such as LED, portable screens, cameras, WIFI and so on.

Course features:

The content materials are rich and complete, including development board hardware design, peripheral interface instructions, basic environment set up method, and sample code scripts.
The learning path is scientifically reasonable, starting from the introduction and basic usage of the development board, and then leading to pratical projects to fully utilize the development board and provide reference for users' own development.
The pratical projects are rich, and the course provides many examples of practical code usage and function demonstrations. Different functions can be implemented by simply modifying and combining the code.

SE5 Development Series Course

The deep neural network model can be trained and tested quickly and then deployed by the industry to effectively perform tasks in the real world. Deploying such systems on small-sized, low-power Deep learning edge computing platforms is highly favored by the industry. This course takes a practice-driven approach to lead you to intuitively learn, practice, and master the knowledge and technology of deep neural networks.

The SOPHON Deep learning microserver SE5 is a high-performance, low-power edge computing product equipped with the third-generation TPU processor BM1684 developed independently by SOPHGO. With an INT8 computing power of up to 17.6 TOPS, it supports 32 channels of Full HD video hardware decoding and 2 channels of encoding. This course will quickly guide you through the powerful features of the SE5 server. Through this course, you can understand the basics of Deep learning and master its basic applications.

Course Features

1. One-stop service

All common problems encountered in SE5 applications can be found here.

• Provide a full-stack solution for Deep learning micro servers

• Break down the development process step by step, in detail and clearly

• Support all mainstream frameworks, easy to use products

2. Systematic teaching

It includes everything from setting up the environment, developing applications, converting models, and deploying products, as well as having a mirrored practical environment.

• How is the environment built?

• How is the model compiled?

• How is the application developed?

• How are scenarios deployed?

3. Complete materials

The course includes video tutorials, document guides, code scripts, and other comprehensive materials.

• Rich video materials

• Detailed application guidance

• Clear code scripts

Code download link: https://github.com/sophon-ai-algo/examples

4. Free cloud development resources

Online free application for using SE5-16 microserver cloud testing space

• SE5-16 microserver cloud testing space can be used for online development and testing, supporting user data retention and export

• SE5-16 microserver cloud testing space has the same resource performance as the physical machine environment

Cloud platform application link: https://account.sophgo.com/sign_in?service=https://cloud.sophgo.com&locale=zh-CN

Cloud platform usage instructions: https://cloud.sophgo.com/tpu.pdf