Using AI Models for Object Classification in MaixCAM MaixPy

Object Classification Concept

For example, if there are two images in front of you, one with an apple and the other with an airplane, the task of object classification is to input these two images into an AI model one by one. The model will then output two results, one for apple and one for airplane.

Using Object Classification in MaixPy

MaixPy provides a pre-trained 1000 classification model based on the imagenet dataset, which can be used directly:

from maix import camera, display, image, nn

classifier = nn.Classifier(model="/root/models/mobilenetv2.mud", dual_buff = True)
cam = camera.Camera(classifier.input_width(), classifier.input_height(), classifier.input_format())
disp = display.Display()

while 1:
    img = cam.read()
    res = classifier.classify(img)
    max_idx, max_prob = res[0]
    msg = f"{max_prob:5.2f}: {classifier.labels[max_idx]}"
    img.draw_string(10, 10, msg, image.COLOR_RED)
    disp.show(img)

Result video:

Here, the camera captures an image, which is then passed to the classifier for recognition. The result is displayed on the screen.

For more API usage, refer to the documentation for the maix.nn module.

dual_buff Dual Buffer Acceleration

You may have noticed that the model initialization uses dual_buff (which defaults to True). Enabling the dual_buff parameter can improve running efficiency and increase the frame rate. For detailed principles and usage notes, see dual_buff Introduction.

Training Your Own Classification Model on MaixHub

If you want to train a classification model for specific images, visit MaixHub to learn and train the model. When creating a project, select "Classification Model", then simply upload your images to train. There's no need to set up a training environment or spend money on expensive GPUs—training can be done quickly with one click.

Offline Training for Your Own Classification Model

For offline training, you need to set up your environment. Search for keywords such as PyTorch classification model training or Mobilenet for guidance.
After training the model, export it in ONNX format, then refer to the MaixCAM Model Conversion Documentation to convert it into a model format supported by MaixCAM. Finally, use the nn.Classifier class mentioned above to load the model.

The classification model can be Mobilenet or another model like ResNet. During model conversion, it's best to extract the layer just before softmax as the final output layer because the classifier.classify(img, softmax=True) function has softmax enabled by default—this means the function will perform a softmax calculation on the results. Therefore, the model itself doesn't need a softmax layer. However, if the model does include a softmax layer, you can specify not to execute it again by using: classifier.classify(img, softmax=False).

AI vision knowledge

YOLO11/v8/v5 object detection