Intermediate 8 min · 2026-06-21

Ansible Docker Management: Production Patterns with community.docker 3.x

Q: What is the community.docker collection?

It's an Ansible collection that provides modules to manage Docker resources like containers, images, networks, volumes, and Compose stacks. It is the recommended way to manage Docker with Ansible.

Q: How do I install the community.docker collection?

Run `ansible-galaxy collection install community.docker`. To pin a version, use `ansible-galaxy collection install community.docker:==3.4.0`.

Q: What Python package is required on target hosts?

The `docker` Python package (e.g., pip install docker==6.1.0). This is the Docker SDK for Python.

Q: How do I make docker_container idempotent?

Set the `comparisons` parameter to a list of attributes to compare, such as `['*cmd*', '*env*', '*labels*']`. This tells the module to detect changes and trigger updates.

Q: Should I use docker_compose_v2 or raw docker-compose?

Use docker_compose_v2 for simple Compose stacks that don't require profiles or complex variable substitution. Use raw docker-compose for advanced use cases, but handle idempotency manually.

Q: How do I pass environment variables to a Compose stack via Ansible?

For docker_compose_v2, use the `environment` parameter of the task. For raw docker-compose, set environment variables in the task's `environment` field or use Ansible's `template` module to render the Compose file.

Q: How do I handle private registry authentication?

Use the `docker_login` module with `no_log: yes` to log into the registry before pulling images. Store credentials in Ansible Vault.

Q: What is the best way to update a running container with a new image?

Use a blue/green deployment: create a new container with the new image, verify health, switch traffic, then remove the old container. Alternatively, use `state: absent` then `state: started` for simple cases, but this causes downtime.

Master Ansible Docker management with community.docker collection.

Naren Founder & Principal Engineer

20+ years shipping production infrastructure and CI/CD at scale. Drawn from code that ran under real load.

✓ Production

production tested

June 21, 2026

last updated

1,596

articles · all by Naren

● Production Incident 🔎 Debug Guide ⚙ Triage Commands

⚡Quick Answer

Use community.docker 3.4.0+; install via ansible-galaxy collection install community.docker. docker_container module: use state: started with restart_policy: unless-stopped for production. docker_image: build with source: build and force_source: yes to rebuild on code changes. docker_network: create with driver: overlay for Swarm; use attachable: yes for standalone. docker_volume: prefer driver_opts: { type: nfs, o: addr=..., device=...} for shared storage. docker_compose module vs raw docker-compose: use community.docker.docker_compose_v2 for Ansible-native; fallback to command: docker compose up -d for complex stacks. Container lifecycle: use restart: yes only when config changes; prefer comparisons: ['cmd'] for idempotency. Bootstrap Docker hosts: run get_url for Docker GPG key, apt_repository for repo, then package for docker-ce.

✦ Definition~90s read

What is Ansible Docker Management?

The community.docker collection is Ansible's official way to manage Docker resources. It provides modules like docker_container, docker_image, docker_network, docker_volume, and docker_compose_v2 that map directly to Docker API objects. Unlike running shell commands, these modules are idempotent—they check the current state and only make changes if needed.

★

Imagine you're the manager of a busy restaurant kitchen.

This solves the problem of ad-hoc scripts that break on partial failures or drift. The collection communicates with the Docker daemon via the Docker SDK for Python, so you need the docker Python package on the control node or target host.

Plain-English First

Imagine you're the manager of a busy restaurant kitchen. You have a bunch of prep cooks (Docker containers) each making different dishes. Ansible is like your head chef who can instantly tell each cook to start, stop, or change their recipe. The community.docker collection is the set of standardized order slips that let Ansible communicate with the kitchen's computer system (Docker daemon). Instead of running around yelling, you give precise written instructions that work every time. When a cook needs a new cutting board (a Docker volume) or a new station layout (a Docker network), you just write it on the slip and it happens automatically. No more burnt dishes or missing ingredients.

It was 2 AM on a Tuesday when I got paged. Our microservices deployment had failed halfway through—half the containers were running old images, half the new ones, and the database container was missing entirely. The previous engineer had written a bash script that ran docker commands sequentially and it had a race condition. That's when I realized we needed a proper configuration management tool for Docker, not ad-hoc scripts. Ansible with the community.docker collection became our salvation. Over the next weeks, I rewrote our deployment pipeline to use Ansible's idempotent modules, and we never had a partial deploy again.

Installing the community.docker Collection

The community.docker collection is not included in ansible-base. You must install it explicitly. Use ansible-galaxy collection install community.docker to get the latest version (3.4.0 as of writing). For production, pin the version: ansible-galaxy collection install community.docker:==3.4.0. On the target hosts, you need the docker Python SDK: pip install docker==6.1.0. Alternatively, you can use the community.docker.docker_connection plugin if you want to run modules on the control node and connect to remote Docker daemons via TCP. However, I recommend running tasks on the target host to avoid latency and socket issues.

Example playbook snippet: ``yaml - hosts: docker_hosts tasks: - name: Install docker SDK pip: name: docker==6.1.0 state: present ``

If you're using Ansible Automation Platform or AWX, add the collection to requirements.yml: ``yaml collections: - name: community.docker version: 3.4.0 ``

Version Compatibility

community.docker 3.x requires ansible-core >=2.12 and Docker SDK for Python >=5.0.0. Check compatibility matrix at https://docs.ansible.com/ansible/latest/collections/community/docker/index.html.

Production Insight

In production, I once had a mismatch: control node had collection 2.x but target hosts had SDK 7.x. The docker_container module failed with cryptic JSON decode errors. Always keep collection and SDK versions in sync across all nodes.

Key Takeaway

Always pin the collection version and ensure Docker SDK version compatibility.

Docker volumes are persistent storage. Use docker_volume

state: present to create
driver: local (default) or rbd, nfs, etc.
driver_opts: { type: nfs, o: addr=192.168.1.100,rw,device=:/exported/path } for NFS

Example: ``yaml - name: Create shared data volume community.docker.docker_volume: name: shared_data driver: local driver_opts: type: nfs o: addr=192.168.1.100,rw device: :/mnt/nfs_share state: present ``

For local volumes, you can specify labels and labels_from_object. Volume removal is tricky: if a container is using it, the module will fail. Use force: yes to remove even if in use (but be careful).

Production tip: Use recreate: yes only when you need to purge data. Otherwise, volumes persist across container recreations, which is usually desired.

NFS Performance

For NFS volumes, ensure the NFS server is reliable and the network is low-latency. Consider using nfsvers=4 for better performance. Test with docker volume create --driver local --opt type=nfs --opt o=addr=... --opt device=:... testvol.

Production Insight

We had a volume that was created with driver_opts but later the NFS server IP changed. The module didn't update the volume—it just said 'ok'. We had to remove and recreate the volume. Now I use a unique volume name per deployment to avoid this.

Key Takeaway

Volumes are immutable after creation. If you need to change driver options, destroy and recreate the volume.

docker_compose Module vs Raw docker-compose

The community.docker.docker_compose_v2 module (introduced in collection 2.0) manages Docker Compose stacks natively. It is idempotent and integrates with Ansible's state model. However, it has limitations: it cannot handle all Compose features (e.g., extends, profiles). For complex stacks, I fall back to the command module running docker compose up -d.

Example using docker_compose_v2: ``yaml - name: Deploy stack with compose community.docker.docker_compose_v2: project_src: /opt/myapp files: - docker-compose.yml - docker-compose.prod.yml state: present recreate: always ``

When to use raw docker-compose

Need to use --profile or --with-registry-auth
Need to run docker compose exec for one-off tasks
Complex variable substitution

Example raw: ``yaml - name: Deploy with raw docker-compose command: docker compose -f docker-compose.yml -f docker-compose.prod.yml up -d --remove-orphans args: chdir: /opt/myapp ``

Production insight: Use docker_compose_v2 for simple stacks and raw for complex ones. The raw approach is less idempotent but more flexible.

Module vs Raw: Idempotency

The docker_compose_v2 module compares the current state with desired state and skips if unchanged. Raw docker-compose always runs, which can cause unnecessary service restarts. Use changed_when: false on raw tasks if you handle restarts separately.

Production Insight

We had a stack that needed --profile monitoring to start extra containers. The docker_compose_v2 module didn't support profiles, so we switched to raw. We added changed_when: false and used a separate task to check if containers were running.

Key Takeaway

Use docker_compose_v2 for basic stacks; fall back to raw docker-compose for advanced features like profiles or registry auth.

Container Lifecycle Management

Managing container lifecycle means handling creation, updates, restarts, and removal. Key patterns: - Use state: started for normal operation. - Use state: stopped to pause (but containers restart on daemon restart if restart policy is set). - Use state: absent to remove. - Use restart: yes only when you need to force restart (e.g., after config change).

Example with handlers: ```yaml - name: Update web config template: src: nginx.conf.j2 dest: /opt/nginx/nginx.conf notify: restart web

handlers: - name: restart web community.docker.docker_container: name: web state: started restart: yes ```

For zero-downtime deployments, use rolling updates with multiple containers and a load balancer. Ansible can orchestrate this by updating containers one by one.

Production insight: I once used state: started with restart: yes on every playbook run. This caused unnecessary container restarts and downtime. Now I only restart when configuration changes, using comparisons or handlers.

Restart vs Recreate

Restart stops and starts the same container (preserving its filesystem). Recreate (set state: absent then started) destroys and creates a new container. Use restart for config changes, recreate for image updates.

Production Insight

We had a container that needed to be recreated on every deploy to pick up new image tags. We used state: absent then state: started in two tasks. But this caused a brief downtime. We switched to using a blue-green pattern with two containers and a reverse proxy.

Key Takeaway

Use handlers to restart only on config changes. For image updates, prefer recreate (absent + started) or blue-green deployment.

Bootstraping Docker Hosts with Ansible

To manage Docker with Ansible, you first need Docker installed on the target hosts. Use Ansible to bootstrap:

``yaml - name: Install Docker on Ubuntu hosts: all tasks: - name: Install prerequisites apt: name: - apt-transport-https - ca-certificates - curl - gnupg - lsb-release state: present - name: Add Docker GPG key get_url: url: https://download.docker.com/linux/ubuntu/gpg dest: /usr/share/keyrings/docker-archive-keyring.gpg - name: Add Docker repository apt_repository: repo: "deb [arch=amd64 signed-by=/usr/share/keyrings/docker-archive-keyring.gpg] https://download.docker.com/linux/ubuntu {{ ansible_distribution_release }} stable" state: present - name: Install Docker packages apt: name: - docker-ce - docker-ce-cli - containerd.io - docker-compose-plugin state: present - name: Start and enable Docker service: name: docker state: started enabled: yes - name: Install Docker SDK for Python pip: name: docker==6.1.0 state: present ``

For CentOS/RHEL, use yum_repository and yum. For production, consider using the official Docker install script or a custom role from Ansible Galaxy.

Docker Compose Plugin

Install docker-compose-plugin (v2) instead of standalone docker-compose (v1). The plugin is maintained by Docker and integrates with docker compose command. The community.docker.docker_compose_v2 module requires the plugin.

Production Insight

We had a host where Docker was installed but the Python SDK was missing. The docker_container module failed with 'No module named docker'. I added a pip task to install the SDK, but it ran on every playbook run. I added when: docker_sdk_installed is not defined with a stat check to make it idempotent.

Key Takeaway

Always install the Docker Python SDK after Docker itself. Use idempotent checks to avoid reinstalling.

Deploying Compose Stacks with Ansible

Once Docker is installed, deploy Compose stacks. Example using docker_compose_v2:

``yaml - name: Deploy application stack hosts: docker_hosts vars: app_version: "1.2.3" tasks: - name: Ensure network exists community.docker.docker_network: name: app_net state: present - name: Ensure volumes exist community.docker.docker_volume: name: app_data state: present - name: Deploy stack community.docker.docker_compose_v2: project_src: /opt/myapp files: - docker-compose.yml state: present recreate: always environment: APP_VERSION: "{{ app_version }}" ``

If using raw docker-compose: ``yaml - name: Deploy with raw compose command: docker compose -f docker-compose.yml up -d --remove-orphans args: chdir: /opt/myapp environment: APP_VERSION: "{{ app_version }}" changed_when: false ``

Production tip: Use --remove-orphans to clean up containers not defined in the Compose file. This prevents drift.

Environment Variables

When using docker_compose_v2, environment variables from Ansible are not automatically passed to the Compose file. You must set them explicitly using the environment parameter of the task or define them in the Compose file's environment section.

Production Insight

We had a stack that used ${APP_VERSION} in the Compose file. With docker_compose_v2, the variable was not substituted because the module runs in its own environment. We switched to raw docker-compose with environment set, which worked. Later we moved to using Ansible templates to render the Compose file before deploying.

Key Takeaway

For variable substitution in Compose files, either use raw docker-compose with environment or template the Compose file with Ansible.

Advanced: Rolling Updates with Ansible

For zero-downtime deployments, implement rolling updates. Example with two containers and a load balancer:

``yaml - name: Rolling update web service hosts: docker_hosts serial: 1 tasks: - name: Pull new image community.docker.docker_image: name: myapp tag: "{{ new_version }}" source: pull state: present - name: Stop old container community.docker.docker_container: name: web state: stopped - name: Start new container with new image community.docker.docker_container: name: web image: "myapp:{{ new_version }}" state: started restart_policy: unless-stopped ``

But this pattern has downtime. Better: use two containers (blue/green): - Deploy new container with different name - Update load balancer to point to new container - Remove old container

Example: ``yaml - name: Deploy blue/green hosts: docker_hosts tasks: - name: Deploy green container community.docker.docker_container: name: web_green image: "myapp:{{ new_version }}" state: started ports: - "8081:80" - name: Switch load balancer to green template: src: nginx.conf.j2 dest: /etc/nginx/conf.d/app.conf notify: reload nginx - name: Remove blue container community.docker.docker_container: name: web_blue state: absent ``

Production insight: We used this pattern with HAProxy. The playbook would update the HAProxy config to point to the new container and then remove the old one. This gave us zero downtime.

Health Checks

Before switching traffic, ensure the new container is healthy. Use uri module to check a health endpoint: uri: url: http://{{ inventory_hostname }}:8081/health status_code: 200.

Production Insight

We forgot to add a health check once, and the load balancer started sending traffic to a container that was still starting up. We added a 5-second wait and a health check. Now we also use docker_container's healthcheck parameter in the container definition.

Key Takeaway

For zero-downtime, use blue/green or rolling updates with health checks. Avoid stopping the only container.

Security: Managing Secrets and Registries

When pulling images from private registries, you need authentication. Use docker_login module (or docker_config in newer versions) to log in:

``yaml - name: Log into private registry community.docker.docker_login: registry_url: https://registry.example.com username: "{{ registry_user }}" password: "{{ registry_pass }}" reauthorize: yes no_log: yes ``

Then pull images normally. For Kubernetes-style secrets, you can create Docker config secrets with docker_secret (community.docker 3.x).

For container secrets (e.g., database passwords), use Ansible Vault to encrypt variables and pass them as environment variables or mounted files. Avoid hardcoding secrets in Dockerfiles.

Example: ``yaml - name: Run container with secrets community.docker.docker_container: name: app image: myapp env: DB_PASSWORD: "{{ vault_db_password }}" secrets: - source: db_password target: /run/secrets/db_password ``

Production insight: We used no_log: yes on the login task, but the password still appeared in the module's return output. We had to set no_log: yes on the entire playbook. Now we use Ansible Vault for all secrets.

No Log on Secrets

Always set no_log: yes on tasks that handle passwords, API keys, or any sensitive data. Even with no_log, some modules may leak secrets in error messages. Test by running with -v and checking output.

Production Insight

We had a security audit that found our Docker registry password in Ansible logs. We had forgotten no_log: yes. We immediately rotated the password and added a pre-commit hook to flag missing no_log on tasks with password in the name.

Key Takeaway

Always use no_log: yes on tasks with secrets. Use Ansible Vault for variable encryption.

Monitoring and Idempotency Checks

After deploying, verify that containers are running. Use the docker_container_info module:

```yaml - name: Get container info community.docker.docker_container_info: name: web register: container_info

name: Fail if container not running
fail:
msg: "Container web is not running"
when: container_info.container.State.Status != 'running'
```

For idempotency, use docker_container_info to decide whether to recreate: ```yaml - name: Check if container exists community.docker.docker_container_info: name: web register: web_info

name: Remove old container if image changed
community.docker.docker_container:
name: web
state: absent
when: web_info.container.Image != 'myapp:{{ new_version }}'
```

Production insight: We had a playbook that always recreated containers because we didn't check the current image. Adding the docker_container_info check reduced deployment time by 50%.

Idempotency with Info

Use docker_container_info to avoid unnecessary container recreations. Compare container.Image or container.State.StartedAt to decide if an update is needed.

Production Insight

We used docker_container_info to check if a container was running the correct image tag. If not, we recreated. This made our deployments idempotent and fast.

Key Takeaway

Always verify container state after deployment. Use info modules to make playbooks idempotent.

● Production incidentPOST-MORTEMseverity: high

The Case of the Missing Container Restart

Symptom

After running playbook, containers were still running old config. No error, just no restart.

Assumption

Assumed restart: yes would always restart the container.

Root cause

The restart: yes parameter only triggers a restart if the container already exists and is running. But the module's idempotency check didn't detect the config change because comparisons was not set.

Fix

Add comparisons: ['cmd', 'env', 'labels'] to detect changes, and use restart: yes only when comparisons detect a change. Alternatively, use state: started with restart: yes and a handler.

Key lesson

Always use comparisons to define what constitutes a change for docker_container.
Never rely on restart: yes alone.

Production debug guideSymptom → Root cause → Fix4 entries

Symptom · 01

docker_container: 'Error: Connection to Docker daemon refused'

→

Fix

Root cause: Docker socket not accessible. Fix: set DOCKER_HOST env var or use docker_host: unix:///var/run/docker.sock in task. Ensure user has permissions. Run ansible -m shell -a 'docker ps' to test.

Symptom · 02

docker_image: 'Build failed: manifest for image:tag not found'

→

Fix

Root cause: Dockerfile path wrong or context missing. Fix: use build.path: /path/to/dir with Dockerfile in it. Add build.dockerfile: Dockerfile.prod if non-default. Run docker build -t test . locally first.

Symptom · 03

docker_network: 'Error response from daemon: network already exists'

→

Fix

Root cause: Module tried to create network that exists. Fix: use state: present which is idempotent. If using state: absent, ensure you have force: yes to remove in use networks.

Symptom · 04

docker_compose_v2: 'Failed to parse docker-compose.yml'

→

Fix

Root cause: Compose file syntax error or version incompatible. Fix: validate with docker compose config. Use project_src: /path and ensure files: [docker-compose.yml].

★ Ansible Docker Management Quick Referenceprint this for your desk

Container not created−

Immediate action

Check if image exists

Commands

ansible target -m community.docker.docker_image -a 'name=myimage state=present'

ansible target -m community.docker.docker_container -a 'name=mycontainer state=absent'

Fix now

ansible target -m community.docker.docker_container -a 'name=mycontainer image=myimage state=started'

Container not restarting on config change+

Network creation fails+

Volume mount not working+

Compose stack not updating+

docker_compose_v2 vs Raw docker-compose

Feature	docker_compose_v2 Module	Raw docker-compose (command module)	Recommendation
Idempotent	Yes (checks current state)	No (always runs unless you check manually)	Use module for idempotency
Profiles support	No	Yes (--profile)	Use raw for profiles
Variable substitution	Limited (must use environment parameter)	Full (shell environment)	Use raw for complex vars
Error handling	Ansible-native (fail on error)	Requires manual error handling	Module is easier
Performance	Faster (no subprocess)	Slower (spawns process)	Module is faster
Compose version	V2 only (plugin)	V1 or V2	Use module for V2, raw for V1

Key takeaways

Install community.docker collection and pin version in requirements.yml.

Always set comparisons on docker_container to detect changes.

Use docker_compose_v2 for simple stacks; raw docker-compose for advanced features.

Bootstrap Docker hosts with apt/yum and install Docker Python SDK.

Use blue/green or rolling updates for zero-downtime deployments.

Handle secrets with Ansible Vault and no_log

yes.

Symptom

Secrets leaked in logs

Fix

Add no_log: yes to tasks with passwords

INTERVIEW PREP · PRACTICE MODE

Interview Questions on This Topic

Q01SENIOR

How do you ensure idempotency when using docker_container module?

Q02SENIOR

What is the difference between docker_compose_v2 module and raw docker-c...

Q03SENIOR

How do you handle secrets when deploying containers with Ansible?

Q04SENIOR

How would you implement zero-downtime deployment for a Docker container ...

Q05SENIOR

What are common pitfalls when using Ansible to manage Docker networks?

Q06JUNIOR

How do you bootstrap a Docker host with Ansible?

Q07JUNIOR

What is the role of the community.docker collection in Ansible?

Q08SENIOR

How do you handle container lifecycle (start, stop, restart, remove) wit...

Q01 of 08SENIOR

How do you ensure idempotency when using docker_container module?

ANSWER

Use comparisons parameter to specify which attributes to compare (e.g., ['cmd', 'env']). Also use docker_container_info to check current state before making changes. Set restart: yes only when a change is detected via handlers.

FAQ · 8 QUESTIONS

Frequently Asked Questions

What is the community.docker collection?

How do I install the community.docker collection?

What Python package is required on target hosts?

How do I make docker_container idempotent?

Should I use docker_compose_v2 or raw docker-compose?

How do I pass environment variables to a Compose stack via Ansible?

How do I handle private registry authentication?

What is the best way to update a running container with a new image?

Naren Founder & Principal Engineer

20+ years shipping production infrastructure and CI/CD at scale. Drawn from code that ran under real load.

✓ Verified

production tested

June 21, 2026

last updated

1,596

articles · all by Naren

🔥

That's Ansible. Mark it forged?

8 min read · try the examples if you haven't