As per the release notes, “this option allows users to compile a repeated nn.Module (e.g., a transformer layer in LLM) without recompilations,” resulting in faster performance with minimal degradation ...