How to Use YOLOV8 Pose on C3V

This document will provide a detailed description of:

How to convert the YOLOV8 ONNX model into a model for use on the C3V platform

Write sample code for object pose based on YOLOV8

Execute object pose program and obtain recognition results in the C3V Linux environment

The tool versions involved in the current document are as follows:

NPU Kernel Driver	v6.4.15.9	v6.4.18.5
Acuity Toolkit	6.21.1	6.30.7
ViviantelIDE	5.8.2	5.10.1

1. Model Conversation

Before the conversion, it is necessary to first set up the environment for model conversion. Please refer to the following document to prepare the environment：NN Model Conversion

1.1. Project Preparation

Create Model folder

Create a folder yolov8s-pose in path ~/c3v/Models. Please ensure the folder name is the same as the ONNX file name.

~/c3v/Models$ mkdir yolov8s-pose && cd yolov8s-pose

Copy the ONNX file and input.jpg which resolution is 640x640 to the folder yolov8s. These two files will be used as input files during model conversion.

~/c3v/Models$ cp yolov8s-pose.onnx yolov8s-pose/
~/c3v/Models$ cp input.jpg yolov8s-pose/

Create a dataset.txt file, the content of dataset.txt is the input.jpg file name.

./input.jpg

Create inputs_outputs.txt file and get the information from yolov8s-pose .onnx via netron tool/webpage. Here is the onnx file: .

Select the three operators within the red box as the output. write --input-size-list and --outputs informations to inputs_outputs.txt:

--outputs  '/model.22/Sigmoid_output_0 /model.22/Mul_2_output_0 /model.22/Sigmoid_1_output_0 /model.22/Mul_4_output_0'

After completing the above steps, there will be the following files under the yolov8s-pose path：

1.2. Implementing

Using shell script tools to convert the model from ONNX to the NB file. There are 4 steps: import quantize inference and export. Tools are in ~/c3v/Models:

pegasus_import.sh
pegasus_quantize.sh
pegasus_inference.sh
pegasus_export_ovx.sh

Import

Execute the command in the console or terminal, and wait for it to complete. It will import and translate an NN model to NN formats.

./pegasus_import.sh yolov8s-pose

Wait until the tool execution is complete and check there are no errors like this:

Then we will see the following four files added under the folder ~/c3v/Models/yolov8s.

Quantize

Modify the scale value(1/255=0.003921569) of the yolov8s-pose_inputmeta.yml file, which is in ~/c3v/Models/yolov8s.

Select one quantized type for your need, such as uint8 / int16 / bf16 / pcq. In this sample we use int16.

./pegasus_quantize.sh yolov8s-pose int16

Wait until the tool execution is complete and check there are no errors like this:

Then we will see the following four files added under the folder ~/c3v/Models/yolov8s.

Inference

Inference the NN model with the quantization data type.

./pegasus_inference.sh yolov8s-pose int16

Wait until the tool execution is complete and check there are no errors like this:

Export

Export the quantized application for device deployment. Please modify the pegasus_export_ovx.sh for the nb file generating, and add both 3 lines marked in the red box.

./pegasus_export_ovx.sh yolov8s-pose int16

Wait until the tool execution is complete and check there are no errors like this:

In the path ~/c3v/Models/yolov8s-pose/wksp, you will find a folder named yolov8s-pose_uint16_nbg_unify.

We can get the nb file and a c file for NN graph setup information.

2. YoloV8 Pose Program

2.1. Post Processing

The post-processing of the example code automatically transferred out by the tool will print the top 5. We need to increase the parsing of the results to obtain complete results of target recognition. The relevant post-processing functions are located in the file vnn_post_process.c.

We provide an example function for post-processing, which can complete the parsing of NN processing results：

post_proc_init
post_proc_process
post_proc_deinit

The function needs to be modified：vnn_PostProcessYolov8sPoseInt16

vsi_status vnn_PostProcessYolov8sPoseInt16(vsi_nn_graph_t *graph)
{
    vsi_status status = VSI_FAILURE;

#if DETECT_RESULT_IMPL
    /*detect result sample implement*/
    post_proc_init(graph);
    post_proc_process(graph);
    post_proc_deinit();

#else
    /* Show the top5 result */
    status = show_top5(graph, vsi_nn_GetTensor(graph, graph->output.tensors[0]));
    TEST_CHECK_STATUS(status, final);

    /* Save all output tensor data to txt file */
    save_output_data(graph);

final:
#endif

    return VSI_SUCCESS;
}

For detailed function implementation, please refer to the following file:

we needs to be unzipped and placed in ~/c3v/Models/yolov8s-pose/wksp/yolov8s_int16_nbg_unify Folder.

2.2. Program Compile

When compiling NN-related applications, SDK's headers and libraries must be included.

Example of SDK Includes Path：

INCLUDES+=-I$(NN_SDK_DIR)/include/ \
-I$(NN_SDK_DIR)/include/CL \
-I$(NN_SDK_DIR)/include/VX \
-I$(NN_SDK_DIR)/include/ovxlib \
-I$(NN_SDK_DIR)/include/jpeg

Example of SDK Link Libraries：

LIBS+=-lOpenVX -lOpenVXU -lCLC -lVSC -lGAL -ljpeg -lovxlib

The following file is an example Makefile that needs to be unzipped and placed in ~/c3v/Models/yolov8s-pose/wksp/yolov8s_int16_nbg_unify Folder.

2.2.3 build in c3v

If you want to build the project in c3v directly, please modify these contents of Makefile:

BIN=yolov8s-detection-int16

# 2.build in c3v
NN_SDK_DIR=/usr

CC=gcc
CXX=g++

2.2.4 cross compile in Linux

If you want to build the project in host Linux, please modify these contents of Makefile:

BIN=yolov8s-detection-int16

# 1.cross compile
NN_SDK_DIR=Path to NN SDK directory
TOOLCHAIN=Path to toolchain directory

CROSS_COMPILE=$(TOOLCHAIN)/aarch64-none-linux-gnu-
CC=$(CROSS_COMPILE)gcc
CXX=$(CROSS_COMPILE)g++

you need to set the right path of NN_SDK_DIR and TOOLCHAIN

NN_SDK_DIR: The path to NPU SDK

TOOLCHAIN: The cross-compile toolchain path. which format may be like this:

TOOLCHAIN=/pub/toolchain/crossgcc/gcc-arm-9.2-2019.12-x86_64-aarch64-none-linux-gnu/bin

the brief folder of the project is like this:

then using make to compile the project.

make

3. Running on the C3V Linux

Insmod to the kernel if the driver is not probed or skip this step.

insmod ./galcore.ko
[14358.019373] galcore f8140000.galcore: NPU get power success
[14358.019458] galcore f8140000.galcore: galcore irq number is 44
[14358.020542] galcore f8140000.galcore: NPU clock: 900000000
[14358.026015] Galcore version 6.4.18.5

Copy the application and related libraries into C3V Linux and run:

param1 is the nb file that converts from the acuity toolkit.

param2 is the image that is for detection.

./yolov8s-pose-int16 ./network_binary.nb ./input.jpg

The result is like this:

/mnt/yolov8s-pose_int16_nbg_unify # ./yolov8s-pose-int16 ./network_binary.nb
../input.jpg
Create Neural Network: 59ms or 59044us
Verify...
Verify Graph: 24ms or 24933us
Start run graph [1] times...
Run the 1 time: 122.44ms or 122443.93us
vxProcessGraph execution time:
Total   122.66ms or 122658.48us
Average 122.66ms or 122658.48us
obj: L: 0 P:0.93, [(0, 42) - (200, 599)]
obj: L: 0 P:0.91, [(309, 279) - (180, 361)]
obj: L: 0 P:0.58, [(344, 171) - (170, 301)]