Explanation

Running Simple Chat in Azure Production

Use the deployment model first, then choose the right Azure production startup pattern for web workers and scheduler work.

Container App Service by default Native Python is a separate path Separate scheduler work when scaling

Deployment reference Running locally

Production startup guidance only makes sense after you identify the deployment model. In this repo, most Azure deployment paths are container-based, and that changes the startup-command conversation immediately.

Repo deployers are container-first

AZD, Bicep, Terraform, and Azure CLI paths all assume the container image is what App Service runs in production.

Container entrypoint already starts Gunicorn

For the supported container paths, App Service does not need a second native Python startup command layered on top.

Native Python is intentional, not default

If you choose a native Python App Service deployment, you need to manage startup behavior explicitly and treat it as a different operating model.

Scheduler work deserves its own plan

Once you scale workers or instances, background task isolation becomes part of production correctness, not a tuning detail.

The key split

If the site runs the repo container image, let the image entrypoint launch Gunicorn. If the site runs native Python directly, set the startup command yourself. Most confusion in Azure production comes from mixing those two models.

This guide explains the supported production startup patterns for Simple Chat in Azure.

Current documentation version: 0.241.009

Default Azure Production Model in This Repo

The repo-provided Azure deployment paths are container-based App Service deployments.

That includes the deployers documented in this repository for:

azd
Bicep
Terraform
Azure CLI

In those deployment models:

Azure App Service runs the published container image
the container entrypoint already starts Gunicorn
you do not need to set an App Service Stack Settings Startup command

The web container entrypoint is:

python3 -m gunicorn -c /app/gunicorn.conf.py app:app

Native Python App Service Option

If you intentionally deploy Simple Chat as a native Python App Service instead of using the repo container image, deploy the application/single_app folder and set the web startup command explicitly.

Use this Startup command:

python -m gunicorn -c gunicorn.conf.py app:app

Background Scheduler Guidance

For production, keep scheduler-style work separate from multi-worker web processes when possible.

Recommended web-process setting when scheduler work runs elsewhere:

SIMPLECHAT_RUN_BACKGROUND_TASKS=0

Recommended scheduler command:

python simplechat_scheduler.py

Operationally, that scheduler can run as:

a separate App Service or worker process
a scheduled container or job
another automation path that launches the same codebase with the scheduler command

Gunicorn Guidance for Azure

Gunicorn is the production web server for Simple Chat in Azure-oriented deployments.

The shared runtime config supports these tuning variables:

GUNICORN_BIND
GUNICORN_WORKERS
GUNICORN_THREADS
GUNICORN_TIMEOUT
GUNICORN_GRACEFUL_TIMEOUT
GUNICORN_KEEPALIVE
GUNICORN_MAX_REQUESTS
GUNICORN_MAX_REQUESTS_JITTER

Use multiple workers only after you have decided how scheduler work is isolated.

Recommended Azure Production Pattern

For most production environments in this repository:

Deploy the container image through the repo-supported deployer.
Let the container entrypoint launch Gunicorn.
Do not configure an extra App Service Startup command.
Move scheduler work into a separate runtime if you want clean multi-worker web behavior.

What Not to Do

Do not configure a second Gunicorn startup layer on top of the container deployer.
Do not treat Windows local development startup as proof of Gunicorn production behavior.
Do not leave scheduler decisions implicit if you plan to scale out workers or instances.

Summary

Repo deployers: container-based, Gunicorn already handled.
Native Python App Service: set the Gunicorn startup command explicitly.
Multi-worker production: separate scheduler work deliberately.
Local developer startup and Azure production startup should be treated as different runtime concerns.