Posts by Tags

Mamba：Linear Sequence Modeling Model

less than 1 minute read

Published: April 29, 2024

This blog mainly focuses on the improvement of the Mamba model, involving underlying visual tasks, multimodal tasks, etc.

Mamba：Linear Sequence Modeling Model

less than 1 minute read

Published: April 29, 2024

This blog mainly focuses on the improvement of the Mamba model, involving underlying visual tasks, multimodal tasks, etc.

Mamba：Linear Sequence Modeling Model

less than 1 minute read

Published: April 29, 2024

This blog mainly focuses on the improvement of the Mamba model, involving underlying visual tasks, multimodal tasks, etc.

Classic Neural Network

less than 1 minute read

Published: November 07, 2022

This blog series mainly introduces the traditional neural network structure design, including convolutional neural network, visual Transformer, Mamba, etc.

MMSA from beginner to proficient

less than 1 minute read

Published: January 02, 2025

MMSA is a unified framework for multimodal emotion recognition developed by Tsinghua University. It supports three datasets, MOSI, MOSEI, and CH-SIMS, and 15 multimodal emotion analysis models, as well as tasks such as emotion recognition and emotion intensity regression. This column will use MMSA v1.0 as an example to implement framework analysis and customized modules.

Detectron2 Getting Started Tutorial

less than 1 minute read

Published: January 01, 2025

Due to the high degree of encapsulation of Detectron2 and the obscure syntax, it often takes time to find the corresponding modules when carrying out the project. In addition, there is a lack of easy-to-understand introductory tutorials, and the official website tutorials lack flexibility. This series will start with topics such as Detectron2 installation, custom data sets, custom networks, and validation set loss printing to get started with Detectron2 step by step.

Openstl from zero to master

less than 1 minute read

Published: April 28, 2024

Openstl is a third-party library developed by Westlake University for future frame prediction, which integrates multiple SOTA methods such as ConvLstm and simvip. This column will start from scratch and analyze the internal structure and customized modules of the entire Openstl project.

MMSA from beginner to proficient

less than 1 minute read

Published: January 02, 2025

MMSA is a unified framework for multimodal emotion recognition developed by Tsinghua University. It supports three datasets, MOSI, MOSEI, and CH-SIMS, and 15 multimodal emotion analysis models, as well as tasks such as emotion recognition and emotion intensity regression. This column will use MMSA v1.0 as an example to implement framework analysis and customized modules.

Detectron2 Getting Started Tutorial

less than 1 minute read

Published: January 01, 2025

Due to the high degree of encapsulation of Detectron2 and the obscure syntax, it often takes time to find the corresponding modules when carrying out the project. In addition, there is a lack of easy-to-understand introductory tutorials, and the official website tutorials lack flexibility. This series will start with topics such as Detectron2 installation, custom data sets, custom networks, and validation set loss printing to get started with Detectron2 step by step.

Classic Neural Network

less than 1 minute read

Published: November 07, 2022

This blog series mainly introduces the traditional neural network structure design, including convolutional neural network, visual Transformer, Mamba, etc.

Openstl from zero to master

less than 1 minute read

Published: April 28, 2024

Openstl is a third-party library developed by Westlake University for future frame prediction, which integrates multiple SOTA methods such as ConvLstm and simvip. This column will start from scratch and analyze the internal structure and customized modules of the entire Openstl project.

Openstl from zero to master

less than 1 minute read

Published: April 28, 2024

Openstl is a third-party library developed by Westlake University for future frame prediction, which integrates multiple SOTA methods such as ConvLstm and simvip. This column will start from scratch and analyze the internal structure and customized modules of the entire Openstl project.

Xin Li

Posts by Tags

Mamba

Mamba：Linear Sequence Modeling Model

Multimodal

Mamba：Linear Sequence Modeling Model

SSM

Mamba：Linear Sequence Modeling Model

deep learning

Classic Neural Network

detectron2

MMSA from beginner to proficient

Detectron2 Getting Started Tutorial

hands on

Openstl from zero to master

object detection

MMSA from beginner to proficient

Detectron2 Getting Started Tutorial

paper reading

Classic Neural Network

tutorial

Openstl from zero to master

video prediction

Openstl from zero to master