get-preprocessed-dataset-openimages
Automatically generated README for this automation recipe: get-preprocessed-dataset-openimages
Category: AI/ML datasets
License: Apache 2.0
-
Notes from the authors, contributors and users: README-extra
-
CM meta description for this script: _cm.json
- Output cached? True
Reuse this script in your project
Install MLCommons CM automation meta-framework
Pull CM repository with this automation recipe (CM script)
cm pull repo mlcommons@cm4mlops
Print CM help from the command line
cmr "get dataset openimages open-images object-detection preprocessed" --help
Run this script
Run this script via CLI
cm run script --tags=get,dataset,openimages,open-images,object-detection,preprocessed[,variations] [--input_flags]
Run this script via CLI (alternative)
cmr "get dataset openimages open-images object-detection preprocessed [variations]" [--input_flags]
Run this script from Python
import cmind
r = cmind.access({'action':'run'
'automation':'script',
'tags':'get,dataset,openimages,open-images,object-detection,preprocessed'
'out':'con',
...
(other input keys for this script)
...
})
if r['return']>0:
print (r['error'])
Run this script via Docker (beta)
cm docker script "get dataset openimages open-images object-detection preprocessed[variations]" [--input_flags]
Variations
-
No group (any combination of variations can be selected)
Click here to expand this section.
_filter
_for.retinanet.onnx
- ENV variables:
- CM_ML_MODEL_NAME:
retinanet
- CM_DATASET_SUBTRACT_MEANS:
1
- CM_DATASET_GIVEN_CHANNEL_MEANS:
0.485 0.456 0.406
- CM_DATASET_GIVEN_CHANNEL_STDS:
0.229 0.224 0.225
- CM_DATASET_NORMALIZE_DATA:
0
- CM_DATASET_NORMALIZE_LOWER:
0.0
- CM_DATASET_NORMALIZE_UPPER:
1.0
- CM_DATASET_CONVERT_TO_BGR:
0
- CM_DATASET_CROP_FACTOR:
100.0
- CM_ML_MODEL_NAME:
- ENV variables:
_nvidia
- ENV variables:
- CM_PREPROCESSING_BY_NVIDIA:
yes
- CM_PREPROCESSING_BY_NVIDIA:
- ENV variables:
_quant-offset.#
_quant-scale.#
-
Group "annotations"
Click here to expand this section.
_custom-annotations
_default-annotations
(default)
-
Group "dataset-count"
Click here to expand this section.
_50
(default)- ENV variables:
- CM_DATASET_SIZE:
50
- CM_DATASET_SIZE:
- ENV variables:
_500
- ENV variables:
- CM_DATASET_SIZE:
500
- CM_DATASET_SIZE:
- ENV variables:
_full
_size.#
- ENV variables:
- CM_DATASET_SIZE:
#
- CM_DATASET_SIZE:
- ENV variables:
-
Group "dataset-layout"
Click here to expand this section.
_NCHW
(default)- ENV variables:
- CM_DATASET_DATA_LAYOUT:
NCHW
- CM_DATASET_DATA_LAYOUT:
- ENV variables:
_NHWC
- ENV variables:
- CM_DATASET_DATA_LAYOUT:
NHWC
- CM_DATASET_DATA_LAYOUT:
- ENV variables:
-
Group "dataset-precision"
Click here to expand this section.
_fp32
(default)- ENV variables:
- CM_DATASET_DTYPE:
fp32
- CM_DATASET_INPUT_DTYPE:
fp32
- CM_DATASET_QUANTIZE:
0
- CM_DATASET_CONVERT_TO_UNSIGNED:
0
- CM_DATASET_DTYPE:
- ENV variables:
_int8
- ENV variables:
- CM_DATASET_DTYPE:
int8
- CM_DATASET_INPUT_DTYPE:
fp32
- CM_DATASET_QUANTIZE:
1
- CM_DATASET_CONVERT_TO_UNSIGNED:
0
- CM_DATASET_DTYPE:
- ENV variables:
_uint8
- ENV variables:
- CM_DATASET_DTYPE:
uint8
- CM_DATASET_INPUT_DTYPE:
fp32
- CM_DATASET_QUANTIZE:
1
- CM_DATASET_CONVERT_TO_UNSIGNED:
1
- CM_DATASET_DTYPE:
- ENV variables:
-
Group "dataset-type"
Click here to expand this section.
_calibration
- ENV variables:
- CM_DATASET_PATH:
<<<CM_CALIBRATION_DATASET_PATH>>>
- CM_DATASET_ANNOTATIONS_FILE_PATH:
<<<CM_DATASET_CALIBRATION_ANNOTATIONS_FILE_PATH>>>
- CM_DATASET_TYPE:
calibration
- CM_DATASET_PATH:
- ENV variables:
_validation
(default)- ENV variables:
- CM_DATASET_TYPE:
validation
- CM_DATASET_TYPE:
- ENV variables:
-
Group "extension"
Click here to expand this section.
_npy
- ENV variables:
- CM_DATASET_PREPROCESSED_EXTENSION:
npy
- CM_DATASET_PREPROCESSED_EXTENSION:
- ENV variables:
_raw
- ENV variables:
- CM_DATASET_PREPROCESSED_EXTENSION:
raw
- CM_DATASET_PREPROCESSED_EXTENSION:
- ENV variables:
_rgb32
- ENV variables:
- CM_DATASET_PREPROCESSED_EXTENSION:
rgb32
- CM_DATASET_PREPROCESSED_EXTENSION:
- ENV variables:
_rgb8
- ENV variables:
- CM_DATASET_PREPROCESSED_EXTENSION:
rgb8
- CM_DATASET_PREPROCESSED_EXTENSION:
- ENV variables:
-
Group "filter-size"
Click here to expand this section.
_filter-size.#
-
Group "interpolation-method"
Click here to expand this section.
_inter.area
- ENV variables:
- CM_DATASET_INTERPOLATION_METHOD:
INTER_AREA
- CM_DATASET_INTERPOLATION_METHOD:
- ENV variables:
_inter.linear
- ENV variables:
- CM_DATASET_INTERPOLATION_METHOD:
INTER_LINEAR
- CM_DATASET_INTERPOLATION_METHOD:
- ENV variables:
-
Group "preprocessing-source"
Click here to expand this section.
_generic-preprocessor
- ENV variables:
- CM_DATASET_REFERENCE_PREPROCESSOR:
0
- CM_DATASET_REFERENCE_PREPROCESSOR:
- ENV variables:
_mlcommons-reference-preprocessor
(default)- ENV variables:
- CM_DATASET_REFERENCE_PREPROCESSOR:
1
- CM_DATASET_REFERENCE_PREPROCESSOR:
- ENV variables:
Default variations
_50,_NCHW,_default-annotations,_fp32,_mlcommons-reference-preprocessor,_validation
Script flags mapped to environment
--dir=value
→CM_DATASET_PREPROCESSED_PATH=value
--threads=value
→CM_NUM_PREPROCESS_THREADS=value
Default environment
These keys can be updated via --env.KEY=VALUE
or env
dictionary in @input.json
or using script flags.
- CM_DATASET:
OPENIMAGES
- CM_DATASET_DTYPE:
fp32
- CM_DATASET_INPUT_SQUARE_SIDE:
800
- CM_DATASET_CROP_FACTOR:
100.0
- CM_DATASET_QUANT_SCALE:
1
- CM_DATASET_QUANTIZE:
0
- CM_DATASET_QUANT_OFFSET:
0
Native script being run
Script output
cmr "get dataset openimages open-images object-detection preprocessed [variations]" [--input_flags] -j