Text Summarization using LLAMA3.1-405b
Dataset
The benchmark implementation run command automatically downloads the validation and calibration datasets and performs the necessary preprocessing. To download only the datasets, use the commands below.
Get Validation Dataset
cm run script --tags=get,dataset,mlperf,inference,llama3,_validation --outdirname=<path to download> -j
Get Calibration Dataset
cm run script --tags=get,dataset,mlperf,inference,llama3,_calibration --outdirname=<path to download> -j
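If you want both splits under one parent directory, the two commands above can be wrapped in a small script. The following is only a sketch: `DATA_ROOT`, `DRY_RUN`, and `fetch_split` are hypothetical names introduced for illustration and are not part of CM. It uses only the tags and flags shown above, and by default it prints the commands instead of running them so they can be reviewed first.

```shell
#!/bin/sh
set -eu

# Hypothetical parent directory for both splits; substitute your own path.
DATA_ROOT="${DATA_ROOT:-./mlperf-llama3-data}"
# DRY_RUN=1 (the default in this sketch) only prints each command.
# Set DRY_RUN=0 to actually invoke cm.
DRY_RUN="${DRY_RUN:-1}"

# fetch_split <split>: <split> is "validation" or "calibration",
# matching the _validation / _calibration tag variations shown above.
fetch_split() {
  out_dir="${DATA_ROOT}/$1"
  tags="get,dataset,mlperf,inference,llama3,_$1"
  if [ "$DRY_RUN" = "1" ]; then
    echo "cm run script --tags=${tags} --outdirname=${out_dir} -j"
  else
    # Creating the directory up front is a precaution, not a documented requirement.
    mkdir -p "$out_dir"
    cm run script --tags="${tags}" --outdirname="${out_dir}" -j
  fi
}

for split in validation calibration; do
  fetch_split "$split"
done
```

Running it with `DRY_RUN=0 DATA_ROOT=<path to download> sh fetch_datasets.sh` (the script name is arbitrary) would place each split in its own subdirectory.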
Model
The benchmark implementation run command automatically downloads the required model and performs the necessary conversions. To download only the official model, use the command below.
Get the Official MLPerf LLAMA3.1-405b Model
PyTorch
cm run script --tags=get,ml-model,llama3 --outdirname=<path to download> --hf_token=<huggingface access token> -j
Tip
Downloading the llama3.1-405B model from Hugging Face requires an access token, which you can generate from your Hugging Face account. Also make sure that your account has been granted access to the llama3.1-405B model.
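To avoid typing the token directly on the command line, one option is to keep it in an environment variable and check that it is set before starting the download. This is a sketch under assumptions: `HF_TOKEN`, `MODEL_DIR`, `DRY_RUN`, and `download_model` are names chosen here for illustration, and the script simply passes the variable's value to the `--hf_token` flag shown above. The dry-run output hides the token so it does not end up in logs.

```shell
#!/bin/sh
set -eu

# Hypothetical download location; substitute your own path.
MODEL_DIR="${MODEL_DIR:-./llama3.1-405b}"
# DRY_RUN=1 (the default in this sketch) only prints the command.
DRY_RUN="${DRY_RUN:-1}"

download_model() {
  # Fail early if no token has been exported (variable name is an assumption).
  if [ -z "${HF_TOKEN:-}" ]; then
    echo "HF_TOKEN is not set; export a Hugging Face access token first" >&2
    return 1
  fi
  if [ "$DRY_RUN" = "1" ]; then
    # Print the command with the token hidden.
    echo "cm run script --tags=get,ml-model,llama3 --outdirname=${MODEL_DIR} --hf_token=<redacted> -j"
  else
    cm run script --tags=get,ml-model,llama3 --outdirname="${MODEL_DIR}" --hf_token="${HF_TOKEN}" -j
  fi
}

download_model || echo "model download skipped"
```

With the token exported in the current shell, setting `DRY_RUN=0` runs the same command as above with the real token substituted.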