Dynamic Failover Strategies for AI Workloads