Configure systemd to restart the inference engine and inference replica services automatically whenever they fail. `RestartSec=5` makes systemd wait 5 seconds before each restart, which avoids tight restart loops. Setting `StartLimitIntervalSec=0` and `StartLimitBurst=0` disables start rate limiting, so systemd keeps retrying no matter how many times a service has already failed.
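
As a rough sketch, these settings could be applied through a drop-in file like the one below. The unit name `inference-engine.service` and the drop-in path are hypothetical placeholders, and `Restart=always` is an assumption based on the "always restart" wording; `Restart=on-failure` would restart only after unclean exits. On current systemd versions the two `StartLimit*` settings belong in the `[Unit]` section, while `Restart=` and `RestartSec=` belong in `[Service]`.

```ini
# Hypothetical drop-in: /etc/systemd/system/inference-engine.service.d/restart.conf
# Create an equivalent file for the inference replica service.

[Unit]
# Disable start rate limiting so systemd never stops retrying.
StartLimitIntervalSec=0
StartLimitBurst=0

[Service]
# Assumption: restart regardless of how the process exited.
Restart=always
# Wait 5 seconds between restart attempts to avoid a tight loop.
RestartSec=5
```

After adding or changing a drop-in, run `systemctl daemon-reload` so systemd picks up the new settings, then restart the affected service.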