2026-05-28 · 9 min read

How to Monitor Kubernetes CronJobs Without Modifying Your Images

Kubernetes CronJob resources are the cluster equivalent of a crontab — and they fail the same way: silently. A CronJob that hasn't scheduled a successful Pod in two weeks looks identical, in kubectl get cj, to one that ran ten minutes ago. The only difference is the LAST SCHEDULE column, which nobody is paid to read.

You can solve this with external heartbeat monitoring: a pinger checks in on every successful run, and you get paged when the pings stop. The tricky question is where the ping comes from when the workload lives in a container you don't control. This post covers the three patterns we see most often.

Pattern 1: bake the ping into the image

The simplest approach: append a curl call to the end of your container's entrypoint.

# Dockerfile
FROM python:3.12-slim
RUN apt-get update && apt-get install -y curl
COPY backup.py /app/
# Shell form: the ping fires only if backup.py exits 0
CMD python /app/backup.py && \
    curl -fsS https://api.crond.io/ping/$PING_KEY

Works fine for greenfield images. Fails the moment you need to monitor a third-party container, a Helm chart you don't own, or a CronJob that shells out to multiple tools. The ping is also fragile: if the script returns 0 but didn't actually do its job, the ping still fires.
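If you do own the script, one way to make the ping less fragile is to send it only after checking that the job produced what it was supposed to. The following is a minimal sketch, assuming a hypothetical backup.py that writes an archive to a known path. The helper names (output_looks_valid, ping, finish) and the min_bytes threshold are illustrative; only the ping URL shape and the PING_KEY variable come from the Dockerfile above.

```python
import os
import urllib.request


def output_looks_valid(path: str, min_bytes: int = 1) -> bool:
    """Return True if the backup artifact exists and is at least min_bytes long."""
    try:
        return os.path.getsize(path) >= min_bytes
    except OSError:
        # Missing or unreadable file: treat the run as failed.
        return False


def ping(ping_key: str, timeout: float = 10.0) -> None:
    """Send the success heartbeat (URL shape taken from the Dockerfile)."""
    url = f"https://api.crond.io/ping/{ping_key}"
    # urlopen raises on HTTP error statuses, similar in spirit to curl -f.
    with urllib.request.urlopen(url, timeout=timeout):
        pass


def finish(artifact_path: str) -> int:
    """Ping only when the artifact passes the check; return a process exit code."""
    if not output_looks_valid(artifact_path):
        # No ping is sent, so the monitor alerts on the missing heartbeat.
        return 1
    ping(os.environ["PING_KEY"])
    return 0
```

Calling sys.exit(finish("/backups/latest.tar.gz")) at the end of the script (the path is made up here) would replace the trailing curl in CMD. What counts as "valid" is job-specific; a size check is only the cheapest possible stand-in.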

Pattern 2: an emptyDir sidecar with a wrapper script

When you can't (or don't want to) rebuild the image, you can mount a wrapper script via an init container, then exec the original entrypoint through it.

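apiVersion: batch/v1
kind: CronJob
metadata:
  name: backup                      # illustrative name
spec:
  schedule: "0 2 * * *"
  jobTemplate:
    spec:
      template:
        spec:
          restartPolicy: Never      # Job pods must use Never or OnFailure
          volumes:
            - name: shared          # emptyDir shared by the init container and the job
              emptyDir: {}
          initContainers:
            - name: install-wrapper
              image: ghcr.io/platops-security/crond-agent:latest
              command: ["cp", "/usr/local/bin/crond-agent", "/shared/"]
              volumeMounts: [{ name: shared, mountPath: /shared }]
          containers:
            - name: backup
              image: original-image:v1.2
              command: ["/shared/crond-agent", "wrap", "--name", "backup", "--", "/app/backup.sh"]
              volumeMounts: [{ name: shared, mountPath: /shared }]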

The wrapper captures exit code, duration, and stdout/stderr — so a backup script that fails partway through gets reported as a failure, not a success. Tradeoff: every CronJob manifest gets ~10 lines of boilerplate, and you have to remember to add the init container when you create new jobs. Our crond-agent Helm chart packages exactly this pattern behind a one-line crond-agent.wrap macro, so the boilerplate collapses to a single include.
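To make the wrapper's job concrete, here is a rough Python sketch of the wrap step: run the original command, then record exit code, duration, and output. This is not crond-agent's actual implementation. RunReport and run_wrapped are hypothetical names, and sending the report to the monitoring service is left out.

```python
import subprocess
import sys
import time
from dataclasses import dataclass


@dataclass
class RunReport:
    exit_code: int
    duration_s: float
    stdout: str
    stderr: str

    @property
    def succeeded(self) -> bool:
        # Only a zero exit code counts as a successful run.
        return self.exit_code == 0


def run_wrapped(argv: list[str]) -> RunReport:
    """Run argv to completion and capture what a heartbeat wrapper would report."""
    start = time.monotonic()
    proc = subprocess.run(argv, capture_output=True, text=True)
    return RunReport(
        exit_code=proc.returncode,
        duration_s=time.monotonic() - start,
        stdout=proc.stdout,
        stderr=proc.stderr,
    )
```

A real wrapper would also exit with report.exit_code after reporting, so the Job itself still fails when the wrapped script fails.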

Pattern 3: a mutating webhook (the Helm chart approach)

The boilerplate problem in pattern 2 goes away if a controller injects the wrapper automatically. A small mutating admission webhook can watch for CronJob resources with a specific annotation and inject the init container + command rewrite at admission time.

apiVersion: batch/v1
kind: CronJob
metadata:
  annotations:
    crond.io/inject: "true"
    crond.io/ping-key-env: PING_KEY_BACKUP
spec:
  # original spec, unchanged

Two annotations, no per-job boilerplate, no image rebuild, at the cost of running a webhook controller in every cluster. This is the crond-agent Helm chart's V2 path: an opt-in mutating admission webhook (injector.enabled, off by default) that auto-wraps any CronJob annotated with crond.io/inject: "true". It runs with failurePolicy: Ignore, so a webhook outage never blocks your CronJobs; they just run unmonitored until it recovers. The crond-agent.wrap macro (pattern 2) still ships for clusters that prefer no in-cluster controller.
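For a feel of what the injection involves, here is a sketch of the mutation logic in Python: given a CronJob object as a dict, it returns JSON Patch operations that add the shared volume and init container from pattern 2 and rewrite each container's command. It is not the chart's actual injector. build_patch and _append are hypothetical names, the crond.io/ping-key-env annotation is not handled, and containers without an explicit command are skipped on the assumption that the image ENTRYPOINT is unknown at admission time.

```python
INJECT_ANNOTATION = "crond.io/inject"
POD_SPEC_PATH = "/spec/jobTemplate/spec/template/spec"

# Init container, volume, and mount copied from the pattern 2 manifest.
INIT_CONTAINER = {
    "name": "install-wrapper",
    "image": "ghcr.io/platops-security/crond-agent:latest",
    "command": ["cp", "/usr/local/bin/crond-agent", "/shared/"],
    "volumeMounts": [{"name": "shared", "mountPath": "/shared"}],
}
SHARED_VOLUME = {"name": "shared", "emptyDir": {}}
SHARED_MOUNT = {"name": "shared", "mountPath": "/shared"}


def _append(parent: dict, field: str, path: str, value: dict) -> dict:
    """JSON Patch 'add' that appends to a list, creating the list if it is absent."""
    if parent.get(field):
        return {"op": "add", "path": f"{path}/-", "value": value}
    return {"op": "add", "path": path, "value": [value]}


def build_patch(cronjob: dict) -> list[dict]:
    """Return JSON Patch operations for an opted-in CronJob, or [] otherwise."""
    annotations = cronjob.get("metadata", {}).get("annotations") or {}
    if annotations.get(INJECT_ANNOTATION) != "true":
        return []

    pod_spec = cronjob["spec"]["jobTemplate"]["spec"]["template"]["spec"]
    ops = [
        _append(pod_spec, "volumes", f"{POD_SPEC_PATH}/volumes", SHARED_VOLUME),
        _append(pod_spec, "initContainers", f"{POD_SPEC_PATH}/initContainers", INIT_CONTAINER),
    ]
    for i, container in enumerate(pod_spec.get("containers", [])):
        original = container.get("command")
        if not original:
            # Assumption: no explicit command means the entrypoint is unknown here.
            continue
        base = f"{POD_SPEC_PATH}/containers/{i}"
        wrapped = ["/shared/crond-agent", "wrap", "--name", container["name"], "--", *original]
        ops.append({"op": "replace", "path": f"{base}/command", "value": wrapped})
        ops.append(_append(container, "volumeMounts", f"{base}/volumeMounts", SHARED_MOUNT))
    return ops
```

In an actual admission webhook these operations would be JSON-serialized, base64-encoded, and returned in the AdmissionReview response with patchType set to JSONPatch. A production injector would also need to be idempotent, which this sketch is not.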

A note on schedule drift

Kubernetes' CronJob controller does not guarantee on-time execution. If the API server is under load, or the controller is starved of resources, scheduled runs can be delayed by minutes. Set your grace period to account for this — we recommend at least 30s on top of expected runtime for sub-hour CronJobs, more for clusters under sustained pressure.
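As a worked example of that arithmetic, the sketch below computes the latest moment a success ping can arrive before a run should count as missed: scheduled time plus expected runtime plus grace. The function names are illustrative, and the 30-second default simply mirrors the minimum suggested above; it is not a statement of how any particular monitor evaluates deadlines.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

DEFAULT_GRACE = timedelta(seconds=30)


def alert_deadline(scheduled_at: datetime, expected_runtime: timedelta,
                   grace: timedelta = DEFAULT_GRACE) -> datetime:
    """Latest time a success ping may arrive before the run counts as missed."""
    return scheduled_at + expected_runtime + grace


def is_overdue(now: datetime, scheduled_at: datetime, expected_runtime: timedelta,
               grace: timedelta = DEFAULT_GRACE,
               last_ping: Optional[datetime] = None) -> bool:
    """True if no ping has arrived for this run and the deadline has passed."""
    if last_ping is not None and last_ping >= scheduled_at:
        return False  # this run already checked in
    return now > alert_deadline(scheduled_at, expected_runtime, grace)


# A 02:00 UTC job expected to take five minutes, with the default grace.
scheduled = datetime(2026, 5, 28, 2, 0, tzinfo=timezone.utc)
deadline = alert_deadline(scheduled, timedelta(minutes=5))
print(deadline.isoformat())
```

If the controller starts the run three minutes late, a five-minute job finishes at 02:08, well past this deadline, which is why clusters under sustained pressure need a larger grace value.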

When each pattern fits

Bake into image: 1–3 CronJobs, all under your control, no plans to scale the number of monitored workloads.
EmptyDir sidecar (crond-agent Helm chart): Third-party images you can't modify, but you still own the manifests. The chart's crond-agent.wrap macro is the shipped path here, a good fit from a handful of CronJobs up to many across namespaces.
Mutating webhook: Eliminates per-job boilerplate for dozens of CronJobs via annotation-based opt-in — a heavier pattern that runs a controller in the cluster. Shipped in the crond-agent Helm chart (V2, off by default via injector.enabled).

Read the Kubernetes setup guide →

Monitor your Kubernetes CronJobs — no image rebuilds.

Free tier: 10 monitors, no credit card. Wrap a CronJob with the crond-agent Helm macro and get alerted on missed or failed runs.

