Troubleshoot

Worker failure

Workers make state changes as late as possible, and acknowledge messages once processing is complete. There are exceptions:

To implement the Aggregator pattern, the check.dataset worker sets the dataset’s state to in-progress before doing work. The worker then acknowledges any further messages for the same dataset without processing them. As a result, if the worker fails partway through, the redelivered message is also acknowledged without processing, so the work is not retried. To reset a dataset’s state and re-publish the corresponding message, run, for example:
```
./manage.py dev restart-dataset-check 123
```
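The following is a minimal sketch of why the redelivery is ignored, not the project's actual worker code. It uses in-memory stand-ins for the database and the broker. The class `CheckDatasetWorker`, its `reset` method, and the helper functions are hypothetical names introduced for illustration, and `reset` only models the state-reset half of the command above.

```python
IN_PROGRESS = "in-progress"


class CheckDatasetWorker:
    """In-memory stand-in for a worker that aggregates messages per dataset."""

    def __init__(self):
        self.states = {}     # dataset_id -> state (a database in a real system)
        self.acked = []      # delivery tags that were acknowledged
        self.processed = []  # dataset ids whose work ran to completion

    def callback(self, delivery_tag, dataset_id, work):
        if self.states.get(dataset_id) == IN_PROGRESS:
            # Work already started for this dataset: acknowledge and skip.
            self.acked.append(delivery_tag)
            return
        # The state changes before the work, unlike in other workers.
        self.states[dataset_id] = IN_PROGRESS
        work(dataset_id)  # if this raises, the delivery is never acknowledged
        self.processed.append(dataset_id)
        self.acked.append(delivery_tag)

    def reset(self, dataset_id):
        # Assumed effect of resetting the dataset's state.
        self.states.pop(dataset_id, None)


def crash(dataset_id):
    raise RuntimeError("simulated worker failure")


def succeed(dataset_id):
    pass


def simulate_failed_check():
    worker = CheckDatasetWorker()
    try:
        worker.callback(1, 123, crash)  # first delivery fails mid-work
    except RuntimeError:
        pass
    worker.callback(2, 123, succeed)  # redelivery: acknowledged, not processed
    after_redelivery = (list(worker.acked), list(worker.processed))
    worker.reset(123)
    worker.callback(3, 123, succeed)  # re-published message is processed
    after_reset = (list(worker.acked), list(worker.processed))
    return after_redelivery, after_reset
```

In this sketch, the redelivered message (tag 2) is acknowledged but no work runs. Only after the state is reset does a new message for the dataset get processed.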
In the Extract workers, the message is acknowledged before a message is published for each batch of extracted data. This avoids cascading redelivery: there is logic that can fail between each publish, and redelivering the original message after such a failure would re-publish the batches already sent. As a result, if the worker fails, the remaining batches are never extracted. After fixing the issue, add a new dataset and remove the old dataset.
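As a rough illustration of the acknowledge-before-publish ordering, the sketch below simulates a failure between two publishes. It is not the Extract workers' implementation: `extract`, `transform`, the `ack` and `publish` callables, and the sample rows are all hypothetical.

```python
def transform(row):
    # Hypothetical per-row logic that can fail between publishes.
    return row * 2


def extract(delivery_tag, rows, batch_size, ack, publish):
    # Acknowledge first: a later failure will not cause redelivery.
    ack(delivery_tag)
    for start in range(0, len(rows), batch_size):
        batch = [transform(row) for row in rows[start:start + batch_size]]
        publish(batch)


def simulate_failed_extract():
    acked, published = [], []
    rows = [1, 2, 3, None, 5, 6]  # None makes transform() raise TypeError
    try:
        extract(1, rows, 2, acked.append, published.append)
    except TypeError:
        pass
    return acked, published
```

Here the message is acknowledged and the first batch is published, but the second and third batches are lost. Nothing is redelivered, which is why the remedy is to add a new dataset.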

Debugging workers

The RabbitMQ management interface makes it easy to add a message to a queue. For reference, see the sample messages in the RabbitMQ section. For the Extract workers, it might be easier to run the add command with --sample INTEGER.
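If a scripted alternative to the management interface is useful, a message can also be published from Python. The sketch below assumes a channel object that exposes `basic_publish(exchange, routing_key, body)`, as pika's `BlockingChannel` does. The function `publish_test_message`, the `RecordingChannel` test double, and the exchange, routing key and payload shown in the usage note are hypothetical. Take the real values from the sample messages in the RabbitMQ section.

```python
import json


def publish_test_message(channel, exchange, routing_key, payload):
    """Serialize `payload` as JSON and publish it through an open channel."""
    body = json.dumps(payload).encode()
    channel.basic_publish(exchange=exchange, routing_key=routing_key, body=body)
    return body


class RecordingChannel:
    """Test double that records publishes instead of contacting a broker."""

    def __init__(self):
        self.published = []

    def basic_publish(self, exchange, routing_key, body):
        self.published.append((exchange, routing_key, body))
```

For a dry run, pass a `RecordingChannel()` and inspect its `published` list, for example after `publish_test_message(channel, "my_exchange", "my_routing_key", {"dataset_id": 123})`. Against a real broker, a channel can be obtained with `pika.BlockingConnection(pika.URLParameters(url)).channel()`, where `url` is your broker's connection string.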