Since you didn't post the string that describes your target (e.g. `llvm -mcpu=xxx other flags`) I can only offer a wild guess. Under some circumstances I have found that forcing a particular layout (e.g. NHCW vs NCHW) and data types (int 8 vs int16) you will get a worse schedule which results in longer latencies. So choosing a particular optimization level may force the compiler to generate a worse schedule?
Choosing `metal` is more restrictive and thus guides the scheduling at compile time which gives better latencies. Again, it's a wild guess and I could be completely wrong. --- [Visit Topic](https://discuss.tvm.apache.org/t/why-ios-arm64-cpu-opt-level-0-is-faster-than-opt-level-3/10783/3) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/9cf7bf02c3cae4b6392f74bbe1822860235013aa9918c9a081295f6491b02df2).