Cache Hierarchy: Role in AI Model Inference