My framework uses the Python API for GCE to create instances on the fly as needed. All the instances are based on a single machine image. The instance-creation operation is rate-limited. However, I have noticed that GCE sometimes allows several instances to be created in a row without much delay, so I do not want to hard-code a fixed delay between instance creations.
For now, I use exponentially increasing delays between instance-creation attempts. I could not find anything relevant in the documentation, so I am asking here: is there a way to query the API directly for when the next instance creation will be allowed?
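
For concreteness, the exponentially increasing delay can be sketched roughly as follows. This is only an illustration under stated assumptions: `create_instance` and `RateLimitError` are placeholder names standing in for the real API call and for whatever error the client raises when the rate limit is hit, and the delay values are arbitrary.

```python
import time


class RateLimitError(Exception):
    """Placeholder: stands in for the error raised when creation is rate-limited."""


def create_with_backoff(create_instance, max_attempts=6, base_delay=1.0,
                        max_delay=60.0, sleep=time.sleep):
    """Call create_instance(), doubling the wait after each rate-limit failure.

    create_instance is a hypothetical zero-argument callable wrapping the
    actual instance-creation request. sleep is injectable so the loop can be
    exercised without really waiting.
    """
    delay = base_delay
    for attempt in range(1, max_attempts + 1):
        try:
            return create_instance()
        except RateLimitError:
            if attempt == max_attempts:
                # Out of attempts: let the caller see the failure.
                raise
            sleep(delay)
            # Double the delay for the next attempt, capped at max_delay.
            delay = min(delay * 2, max_delay)
```

Because the delay starts small, a burst of creations that GCE happens to allow goes through with no waiting at all; the delay only grows after consecutive rejections.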