I am seeing sharp drops and spikes in instance count every 30 minutes, although the request rate has been stable at 2.4k RPS for 2 hours. Periodically, many instances are shut down at the same time, which is followed by a burst of warmup requests. This also increases our operational costs because of the large number of idle instances.
- App Engine Release: 1.8.1
- Total number of instances: 235 total (15 Resident)
- Average QPS: 9.143
- Average Latency: 135.5 ms
- Average Memory: 157.9 MBytes
The app's performance settings are still at their defaults: F1 instances, with min/max pending latency and min/max idle instances set to automatic.
I will re-run the same test on F2 instances shortly. In the meantime:
- Is this a known problem on GAE?
- Is this caused by memory consumption being too high for F1 instances?
- What can I do to fix this problem, other than moving to F2s?
- How can the average memory be above 128 MB using F1 instances?
[Chart: instance count, F1]
[Chart: RPS, F1]
[Chart: total memory usage in MB, F1]
Update after running the test on F2 instances
During the first 2 hours of the test, instance churn was significantly reduced and the instance count was much more stable. In the last 2 hours, however, the instance count rose from 250 to 600, although the request rate remained stable at 2.4k RPS.
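For reference, here is a rough back-of-the-envelope check of the per-instance load implied by the numbers above. It is only a sketch: it assumes that "2.4k RPS" means exactly 2400 requests per second and that traffic is spread evenly across all running instances, which is probably not how the scheduler actually distributes requests. The helper name `per_instance_rps` is made up for illustration.

```python
def per_instance_rps(total_rps: float, instances: int) -> float:
    """Average requests per second handled by one instance,
    assuming (hypothetically) a perfectly even spread of traffic."""
    if instances <= 0:
        raise ValueError("instances must be positive")
    return total_rps / instances


TOTAL_RPS = 2400.0  # assumption: "2.4k RPS" taken as exactly 2400

# Instance counts reported in this post.
scenarios = [
    ("F1 run, 235 instances", 235),
    ("F2 run, 250 instances", 250),
    ("F2 run, 600 instances", 600),
]

for label, count in scenarios:
    print(f"{label}: {per_instance_rps(TOTAL_RPS, count):.1f} RPS per instance")
```

Under these assumptions, the jump from 250 to 600 instances means each instance goes from roughly 9.6 to 4.0 requests per second for the same total traffic, which is why the growth looks like idle capacity rather than a response to load.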
[Chart: instance count, F1 vs F2]
[Chart: RPS, F1 vs F2]
[Chart: total memory usage in MB, F1 vs F2]
[Chart: milliseconds per request, F1 vs F2]