I think this is a good test to see what is the problem. The
deadlocks
and OOM's seem to happen at 0400 when other virtual systems are
Hrm... so all of these are xen instances and they're doing backups at
the same time. If the rsync processes are going into a D state I'd
think it's an I/O exhaustion problem. Would it be possible to alter
the backup schedule and stagger them if the scheduler change doesn't
work?
-Adam