Configure systemd to always restart the inference engine and inference
replica services on failure.
`RestartSec=5` ensures a 5-second delay before restarting the service to
avoid tight restart loops.
`StartLimitIntervalSec=0` and `StartLimitBurst=0` disable start rate
limiting.