Gunicorn graceful restarts

How to configure the Gunicorn application server for graceful restarts, and what is the difference between regular, hot, and phased restarts?

Application restarts are necessary when things go wrong or if we need to push a new application version. A regular restart isn’t usually anything more than stopping and starting the server again. To keep clients connected or even keep serving requests, we need a better strategy.

Graceful restarts

Application servers like Gunicorn can be usually stopped and restarted in various ways:

regular restarts: connections are lost, and new ones are established after booting the new application process
hot restarts: connections are not lost but wait for the new application server workers to boot
phased restarts: current connections finish with old workers and new workers handle new connections

Hot and phased restarts is what we call graceful restarts, because they try to gracefully finish the already started requests first.

Gunicorn supports graceful restarts with the HUP signal, which reloads the configuration, starts the new worker processes, and gracefully shut- downs older workers. If the application is not preloaded, Gunicorn will also load the new version of it.

To send a signal, we use the famous kill program:

$ kill -SIGHUP 25197

If we need to replace the Gunicorn executable, we can issue the SIGUSR2 signal to the current master process after we replaced its executable with the new one. A new master process with new workers and a new PID will be started (.2 will be appended to the PID file).

You can try it out with:

$ kill -SIGUSR2 25197

Usually, we achieve zero downtime configuration change by leveraging systemd socket activation, swapping containers, or load balancing processes and servers. But as you can see, we can depend on SIGUSR2 as well.

To shut down the old master process, send the WINCH signal:

$ gunicorn -c config/gunicorn.conf app:app
$ #... some changes to Gunicorn like an update
$ sudo kill -SIGUSR2 $PID
$ sudo kill -WINCH $PID

Finally, Gunicorn configuration has a directive to set the maximum timeout for graceful shutdowns (in seconds):

$ config/gunicorn.conf
...
graceful_timeout = 30

Check out my book

Deployment from Scratch is unique Linux book about web application deployment. Learn how deployment works from the first principles rather than YAML files of a specific tool.

Published on 23 Sep 2021

Psst, I am preparing Kamal DevOps course