Find the job
Open the job in the Tilebox Console, or use the Tilebox command-line tool if you already have the job ID.Check task state first
Start with the task graph. A failed task often points to task code or runtime dependencies. A queued task often points to cluster, runner, or task registration mismatch. Common checks:- The job was submitted to the intended cluster.
- A runner is connected to the same cluster.
- The runner advertises the submitted task identifier and compatible version.
- Any task dependencies are complete.
- Retry limits have not been exhausted.
Inspect logs
Logs show messages emitted by task code and runner context attached by Tilebox.Inspect traces
Traces show task timing, parent-child relationships, custom spans, and failures.Fix and rerun
For direct runners, fix the code and restart the runner process. For release runners, publish a fixed release and deploy it. If the fix is compatible with the failed task input schema and task major version, retry the job. If the change is breaking, submit a new job with a new task version.Retry with a compatible release
Publish a compatible fix, deploy it to the same cluster, and retry failed work.
Inspect workflow runs
Learn how logs, traces, task status, and runner context fit together.