Checklist for Optimizing AI Latency with Async Processing