Skip to content

Model Farm Preview User Guide

Introduction

Model Farm actively adapts to current cutting-edge or popular AI models. To help developers understand Model Farm's model adaptation and release plans as well as related model performance parameters in advance, we have launched Model Farm Preview.

Model Farm Preview uses the same methods as Model Farm with unified interface interaction. Developers can quickly view performance parameters of models that will be released in Model Farm, or contact APLUX for support to obtain related model files and inference code in advance.

💡Note

Model Farm Preview is only for viewing model adaptation status and related performance parameters. It cannot be downloaded or used. You need to wait for the model to be officially released before downloading and using it in Model Farm.

Quick Start

Model Farm Preview is only for previewing model performance parameters and cannot be downloaded, so developers can access all information on this page without login.

Accessing Model Farm Preview

  • Developers can directly access the Model Farm Preview interface by clicking the link: Model Farm Preview

  • Developers can also enter the Model Farm Preview interface through Model Farm interface interaction:

Viewing Models

Developers can search for models on Model Farm Preview according to their needs and understand detailed model information to make quick evaluations.

Model Farm Preview provides multiple ways to filter and search for models:

  • Filter by model type
  • Filter by model data precision
  • Filter by chip platform
  • Keyword search

Model Performance Reference

The model details page of Model Farm Preview provides actual performance measurements of AI models on corresponding hardware:

  • Device: Development board model and corresponding chip model used for model testing
  • AI Framework: Framework and version number used for model conversion and inference
  • Model Data Precision: Data precision used by the converted model
  • Inference Time: Actual measured runtime of the model, excluding pre/post-processing
  • Precision Loss: Cosine similarity between the output matrices of the source model (FP32) and the converted model
  • Model Size: File size of the converted model

💡Note

For the same SoC chip, model performance on devices with different hardware specifications should only be used as reference data

Taking SigLIP-so400m on MeiG SNM972 (QCS8550) as an example:

Performance Reference Display

Access to Model Files and Code

Models in the Preview Section of the Model Farm cannot be downloaded directly via the web interface. Developers can download these assets on APLUX boards using the mms command-line tool.

The following example demonstrates how to retrieve the MobileClip2-S3 model.

  1. Login via mms

Run the following command and enter your credentials:

bash
mms login

# Enter your username: 
# Enter your password:

# A successful login will return:
# Login successfully.
  1. Search for Models

Use the mms list command to search for models (supports keyword filtering):

bash
mms list mobileclip

# Expected output:
Model           Precision  Chipset           Backend
-----           ---------  -------           -------
MobileClip-S2   FP16       Qualcomm QCS8550  QNN2.31
MobileClip2-S3  FP16       Qualcomm QCS8550  QNN2.36
  1. Download MobileClip2-S3

Specify the model details using the flags below:

bash
# -m: Model Name
# -p: Precision
# -c: Chipset
# -b: QNN Version
mms get -m MobileClip2-S3 -p fp16 -c qcs8550 -b qnn2.36

# Model assets are downloaded by default to: /var/opt/modelfarm_models