https://platoaistream.net/plato-data/reduce-inference-time-for-bert-models-using-neural-architecture-search-and-sagemaker-automated-model-tuning-amazon-web-services/
Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning | Amazon Web Services