Cloud Mercato tested CPU performance using a range of encryption speed tests.
Cloud Mercato also tested the I/O performance of this instance using a 100GB General Purpose SSD. Below are the results:
I/O rate testing is conducted with local and block storage attached to the instance. Cloud Mercato uses the well-known open-source tool FIO. To express IOPS, the following parameters are used: 4K blocks, random access, no filesystem (except for write access on the root volume), and avoidance of cache and buffers.
[I/O benchmark results chart]
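For readers who want to reproduce the methodology described above, it maps roughly onto an FIO invocation like the one below. This is a sketch under assumptions: the device path, queue depth, and runtime are illustrative, not Cloud Mercato's actual job file.

```python
# Rough reconstruction of the described FIO setup: 4K blocks, random access,
# direct I/O (avoids cache and buffers), raw block device (no filesystem).
# Device path, queue depth, and runtime are assumptions for illustration.
import subprocess

fio_cmd = [
    "fio",
    "--name=randread-4k",
    "--filename=/dev/nvme1n1",   # raw attached EBS volume; adjust to your device
    "--rw=randread",             # random access (use randwrite for the write test)
    "--bs=4k",                   # 4K block size
    "--direct=1",                # O_DIRECT: bypass page cache and buffers
    "--ioengine=libaio",
    "--iodepth=32",
    "--runtime=60",
    "--time_based=1",
    "--group_reporting=1",
]
subprocess.run(fio_cmd, check=True)
```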


For pretraining a small model like nanoGPT on a p3.2xlarge instance, the AWS Deep Learning AMI is a suitable choice. It comes pre-configured with popular deep learning frameworks, NVIDIA drivers, and libraries, making it easy to get started.
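Before kicking off a run on the Deep Learning AMI, a quick sanity check that the framework actually sees the V100 can save time. A minimal sketch, assuming the AMI's preinstalled PyTorch environment is active:

```python
# Quick GPU sanity check before starting training on the Deep Learning AMI.
import torch

assert torch.cuda.is_available(), "CUDA not visible -- check the NVIDIA driver"
print(torch.cuda.get_device_name(0))        # expect a Tesla V100 on p3.2xlarge
print(torch.version.cuda)                   # CUDA version bundled with this PyTorch build
total_gb = torch.cuda.get_device_properties(0).total_memory / 1024**3
print(f"{total_gb:.1f} GB of GPU memory")   # ~16 GB on the V100
```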

One thing to note - although the p3 isn’t much better than a 1080ti you can buy yourself, it’s _much_ better than the p2. So if you need to use AWS, and need to run big models quickly, a p3 is a good option.

Don’t get suckered by AWS and Volta. The Amazon P3 instances (available in Oregon) feature the latest DL GPU tech at $3.06 an hour (2xlarge), but PyTorch, TF, et al. can’t utilize it fully yet.

Testing new Tesla V100 on AWS. Fine-tuning VGG on DeepSent dataset for 10 epochs.

If you really want to get into the black magic of speed-ups, these cards also feature full FP16 support, which means you can double your TFLOPS by dropping to FP16 from FP32.
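As a concrete illustration of what the FP16 path looks like, here is a minimal mixed-precision training loop sketch in PyTorch. The AMP API shown postdates the quote above, and the model and data are placeholders rather than a real workload.

```python
# Minimal PyTorch mixed-precision sketch: compute-heavy ops run in FP16,
# the rest stays in FP32. Model and data below are stand-ins.
import torch
from torch.cuda.amp import autocast, GradScaler

model = torch.nn.Linear(1024, 10).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
scaler = GradScaler()                       # rescales the loss so FP16 gradients don't underflow

for step in range(10):
    x = torch.randn(64, 1024, device="cuda")
    y = torch.randint(0, 10, (64,), device="cuda")
    optimizer.zero_grad()
    with autocast():                        # chooses FP16 vs FP32 per op
        loss = torch.nn.functional.cross_entropy(model(x), y)
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```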

For anyone using the standard set of frameworks (Tensorflow, Keras, PyTorch, Chainer, MXNet, DyNet, DeepLearning4j, ...) this type of speed-up will likely require you to do nothing - except throw more money at the P3 instance :)

Oh, and the V100 comes with 16GB of (faster) RAM compared to the K80's 12GB of RAM, so you win there too.

P3 (V100) with single GPU: ~20 seconds per epoch

I asked AWS to let me use a p3 instance. Their answer: no.

After I build PyTorch from source, there’s no initialization delay in conv_learner. Works smoothly.

p3 instances were not showing because of the AWS region that I was assigned. I changed that to Oregon and that fixed the problem.
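If you'd rather check availability programmatically than click through regions, a hedged boto3 sketch follows; the region and instance type here are just an example, and credentials are assumed to be configured in the environment.

```python
# Check whether p3.2xlarge is offered in a given region via the EC2 API.
import boto3

ec2 = boto3.client("ec2", region_name="us-west-2")   # us-west-2 = Oregon
resp = ec2.describe_instance_type_offerings(
    LocationType="region",
    Filters=[{"Name": "instance-type", "Values": ["p3.2xlarge"]}],
)
print("p3.2xlarge offered:", bool(resp["InstanceTypeOfferings"]))
```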

The P3 is 800% faster than P2 for training with fastai!

In my app, I repurposed a pre-trained vgg19 model. Inference time of one 256x256 color jpeg on p3.2xlarge with the Volta AMI was like 100 milliseconds or less.
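A rough way to reproduce that kind of single-image latency measurement is sketched below, using a random placeholder input rather than the poster's actual app, and untrained weights since they don't affect timing:

```python
# Time one forward pass of VGG19 on a single 256x256 image-sized tensor.
# Random weights/input are used purely for timing; a real app would load
# pretrained weights and a decoded JPEG.
import time
import torch
from torchvision import models

model = models.vgg19(weights=None).eval().cuda()
x = torch.randn(1, 3, 256, 256, device="cuda")

with torch.no_grad():
    model(x)                          # warm-up: CUDA context + cuDNN autotuning
    torch.cuda.synchronize()
    t0 = time.time()
    model(x)
    torch.cuda.synchronize()
    print(f"{(time.time() - t0) * 1000:.1f} ms per image")
```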

How about "NVIDIA Volta Deep Learning AMI" with p3.2xlarge (Tesla V100 GPU) instance?

I'm having the same issue on p3.2xlarge instances.
