app-mlperf-inference-qualcomm
Automatically generated README for this automation recipe: app-mlperf-inference-qualcomm
Category: Modular MLPerf benchmarks
License: Apache 2.0
- CM meta description for this script: _cm.yaml
- Output cached? False
Reuse this script in your project
Install MLCommons CM automation meta-framework
Pull CM repository with this automation recipe (CM script)
cm pull repo mlcommons@cm4mlops
Print CM help from the command line
cmr "reproduce mlcommons mlperf inference harness qualcomm-harness qualcomm kilt-harness kilt" --help
Run this script
Run this script via CLI
cm run script --tags=reproduce,mlcommons,mlperf,inference,harness,qualcomm-harness,qualcomm,kilt-harness,kilt[,variations] [--input_flags]
Run this script via CLI (alternative)
cmr "reproduce mlcommons mlperf inference harness qualcomm-harness qualcomm kilt-harness kilt [variations]" [--input_flags]
Run this script from Python
import cmind
r = cmind.access({'action':'run'
'automation':'script',
'tags':'reproduce,mlcommons,mlperf,inference,harness,qualcomm-harness,qualcomm,kilt-harness,kilt'
'out':'con',
...
(other input keys for this script)
...
})
if r['return']>0:
print (r['error'])
Run this script via Docker (beta)
cm docker script "reproduce mlcommons mlperf inference harness qualcomm-harness qualcomm kilt-harness kilt[variations]" [--input_flags]
Variations
-
No group (any combination of variations can be selected)
Click here to expand this section.
_activation-count.#
- ENV variables:
- CM_MLPERF_QAIC_ACTIVATION_COUNT:
#
- CM_MLPERF_QAIC_ACTIVATION_COUNT:
- ENV variables:
_num-devices.4
- ENV variables:
- CM_QAIC_DEVICES:
0,1,2,3
- CM_QAIC_DEVICES:
- ENV variables:
_pro
- ENV variables:
- qaic_queue_length:
10
- qaic_queue_length:
- ENV variables:
-
Group "batch-size"
Click here to expand this section.
_bs.#
- ENV variables:
- kilt_model_batch_size:
#
- kilt_model_batch_size:
- ENV variables:
_bs.0
- ENV variables:
- kilt_model_batch_size:
1
- kilt_model_batch_size:
- ENV variables:
-
Group "device"
Click here to expand this section.
_cpu
(default)- ENV variables:
- CM_MLPERF_DEVICE:
cpu
- kilt_backend_type:
cpu
- CM_MLPERF_DEVICE:
- ENV variables:
_cuda
- ENV variables:
- CM_MLPERF_DEVICE:
gpu
- CM_MLPERF_DEVICE_LIB_NAMESPEC:
cudart
- kilt_backend_type:
gpu
- CM_MLPERF_DEVICE:
- ENV variables:
_qaic
- ENV variables:
- CM_MLPERF_DEVICE:
qaic
- CM_MLPERF_DEVICE_LIB_NAMESPEC:
QAic
- kilt_backend_type:
qaic
- CM_MLPERF_DEVICE:
- ENV variables:
-
Group "framework"
Click here to expand this section.
_glow
- ENV variables:
- device:
qaic
- CM_MLPERF_BACKEND:
glow
- CM_MLPERF_BACKEND_LIB_NAMESPEC:
QAic
- device:
- ENV variables:
_onnxruntime
(default)- ENV variables:
- device:
onnxrt
- CM_MLPERF_BACKEND:
onnxruntime
- CM_MLPERF_BACKEND_LIB_NAMESPEC:
onnxruntime
- device:
- ENV variables:
_tensorrt
- ENV variables:
- CM_MLPERF_BACKEND:
tensorrt
- device:
tensorrt
- CM_MLPERF_BACKEND_NAME:
TensorRT
- CM_MLPERF_BACKEND:
- ENV variables:
-
Group "loadgen-batch-size"
Click here to expand this section.
_loadgen-batch-size.#
- ENV variables:
- CM_MLPERF_LOADGEN_BATCH_SIZE:
#
- CM_MLPERF_LOADGEN_BATCH_SIZE:
- ENV variables:
-
Group "loadgen-scenario"
Click here to expand this section.
_multistream
- ENV variables:
- CM_MLPERF_LOADGEN_SCENARIO:
MultiStream
- CM_MLPERF_LOADGEN_SCENARIO:
- ENV variables:
_offline
- ENV variables:
- CM_MLPERF_LOADGEN_SCENARIO:
Offline
- CM_MLPERF_LOADGEN_SCENARIO:
- ENV variables:
_server
- ENV variables:
- CM_MLPERF_LOADGEN_SCENARIO:
Server
- CM_MLPERF_LOADGEN_SCENARIO:
- ENV variables:
_singlestream
- ENV variables:
- CM_MLPERF_LOADGEN_SCENARIO:
SingleStream
- CM_MLPERF_LOADGEN_SCENARIO:
- ENV variables:
-
Group "model"
Click here to expand this section.
_bert-99
- ENV variables:
- CM_MODEL:
bert-99
- CM_SQUAD_ACCURACY_DTYPE:
float32
- CM_ML_MODEL_STARTING_WEIGHTS_FILENAME:
https://zenodo.org/record/3750364/files/bert_large_v1_1_fake_quant.onnx
- CM_MODEL:
- ENV variables:
_bert-99.9
- ENV variables:
- CM_MODEL:
bert-99.9
- CM_ML_MODEL_STARTING_WEIGHTS_FILENAME:
https://zenodo.org/record/3733910/files/model.onnx
- CM_MODEL:
- ENV variables:
_resnet50
(default)- ENV variables:
- CM_MODEL:
resnet50
- kilt_model_name:
resnet50
- kilt_input_count:
1
- kilt_output_count:
1
- kilt_input_format:
FLOAT32,-1,224,224,3
- kilt_output_format:
INT64,-1
- dataset_imagenet_preprocessed_input_square_side:
224
- ml_model_has_background_class:
YES
- ml_model_image_height:
224
- loadgen_buffer_size:
1024
- loadgen_dataset_size:
50000
- CM_BENCHMARK:
STANDALONE_CLASSIFICATION
- CM_MODEL:
- ENV variables:
_retinanet
- ENV variables:
- CM_MODEL:
retinanet
- CM_ML_MODEL_STARTING_WEIGHTS_FILENAME:
https://zenodo.org/record/6617981/files/resnext50_32x4d_fpn.pth
- kilt_model_name:
retinanet
- kilt_input_count:
1
- kilt_model_max_detections:
600
- kilt_output_count:
1
- kilt_input_format:
FLOAT32,-1,3,800,800
- kilt_output_format:
INT64,-1
- dataset_imagenet_preprocessed_input_square_side:
224
- ml_model_image_height:
800
- ml_model_image_width:
800
- loadgen_buffer_size:
64
- loadgen_dataset_size:
24576
- CM_BENCHMARK:
STANDALONE_OBJECT_DETECTION
- CM_MODEL:
- ENV variables:
-
Group "nsp"
Click here to expand this section.
_nsp.#
_nsp.14
_nsp.16
-
Group "power-mode"
Click here to expand this section.
_maxn
- ENV variables:
- CM_MLPERF_NVIDIA_HARNESS_MAXN:
True
- CM_MLPERF_NVIDIA_HARNESS_MAXN:
- ENV variables:
_maxq
- ENV variables:
- CM_MLPERF_NVIDIA_HARNESS_MAXQ:
True
- CM_MLPERF_NVIDIA_HARNESS_MAXQ:
- ENV variables:
-
Group "precision"
Click here to expand this section.
_fp16
_fp32
- ENV variables:
- CM_IMAGENET_ACCURACY_DTYPE:
float32
- CM_IMAGENET_ACCURACY_DTYPE:
- ENV variables:
_uint8
-
Group "run-mode"
Click here to expand this section.
_network-client
- ENV variables:
- CM_RUN_MODE:
network-client
- CM_RUN_MODE:
- ENV variables:
_network-server
- ENV variables:
- CM_RUN_MODE:
network-server
- CM_RUN_MODE:
- ENV variables:
_standalone
(default)- ENV variables:
- CM_RUN_MODE:
standalone
- CM_RUN_MODE:
- ENV variables:
-
Group "sut"
Click here to expand this section.
_dl2q.24xlarge
- ENV variables:
- CM_QAIC_DEVICES:
0,1,2,3,4,5,6,7
- qaic_queue_length:
4
- CM_QAIC_DEVICES:
- ENV variables:
_rb6
- ENV variables:
- CM_QAIC_DEVICES:
0
- qaic_queue_length:
6
- CM_QAIC_DEVICES:
- ENV variables:
Default variations
_cpu,_onnxruntime,_resnet50,_standalone
Script flags mapped to environment
--count=value
→CM_MLPERF_LOADGEN_QUERY_COUNT=value
--devices=value
→CM_QAIC_DEVICES=value
--max_batchsize=value
→CM_MLPERF_LOADGEN_MAX_BATCHSIZE=value
--mlperf_conf=value
→CM_MLPERF_CONF=value
--mode=value
→CM_MLPERF_LOADGEN_MODE=value
--multistream_target_latency=value
→CM_MLPERF_LOADGEN_MULTISTREAM_TARGET_LATENCY=value
--offline_target_qps=value
→CM_MLPERF_LOADGEN_OFFLINE_TARGET_QPS=value
--output_dir=value
→CM_MLPERF_OUTPUT_DIR=value
--performance_sample_count=value
→CM_MLPERF_LOADGEN_PERFORMANCE_SAMPLE_COUNT=value
--rerun=value
→CM_RERUN=value
--scenario=value
→CM_MLPERF_LOADGEN_SCENARIO=value
--server_target_qps=value
→CM_MLPERF_LOADGEN_SERVER_TARGET_QPS=value
--singlestream_target_latency=value
→CM_MLPERF_LOADGEN_SINGLESTREAM_TARGET_LATENCY=value
--skip_preprocess=value
→CM_SKIP_PREPROCESS_DATASET=value
--skip_preprocessing=value
→CM_SKIP_PREPROCESS_DATASET=value
--target_latency=value
→CM_MLPERF_LOADGEN_TARGET_LATENCY=value
--target_qps=value
→CM_MLPERF_LOADGEN_TARGET_QPS=value
--user_conf=value
→CM_MLPERF_USER_CONF=value
Default environment
These keys can be updated via --env.KEY=VALUE
or env
dictionary in @input.json
or using script flags.
- CM_BATCH_COUNT:
1
- CM_BATCH_SIZE:
1
- CM_FAST_COMPILATION:
yes
- CM_MLPERF_LOADGEN_SCENARIO:
Offline
- CM_MLPERF_LOADGEN_MODE:
performance
- CM_SKIP_PREPROCESS_DATASET:
no
- CM_SKIP_MODEL_DOWNLOAD:
no
- CM_MLPERF_SUT_NAME_IMPLEMENTATION_PREFIX:
kilt
- CM_MLPERF_SKIP_RUN:
no
- CM_KILT_REPO_URL:
https://github.com/GATEOverflow/kilt-mlperf
- CM_QAIC_DEVICES:
0
- kilt_max_wait_abs:
10000
- verbosity:
0
- loadgen_trigger_cold_run:
0
Native script being run
No run file exists for Windows
Script output
cmr "reproduce mlcommons mlperf inference harness qualcomm-harness qualcomm kilt-harness kilt [variations]" [--input_flags] -j