NNAPI usage via onnxruntime #10692

The ORT NNAPI EP will be preferred over the ORT CPU EP when assigning nodes in the model.

If the NNAPI EP can handle a specific operator ('handle' meaning convert to the equivalent NNAPI operator), nodes involving that operator will be assigned to the NNAPI EP. Any remaining nodes will be handled by the ORT CPU EP.
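As a rough illustration of that assignment rule, here is a minimal Python sketch. It is not ORT's actual partitioning code: `NNAPI_SUPPORTED_OPS` and `assign_nodes` are hypothetical names, the supported set is made up for the example, and the real check is per node (attributes, shapes, Android API level) rather than per operator type alone.

```python
# Hypothetical set of operators the NNAPI EP can convert. The real list
# depends on the ONNX Runtime version and the Android API level.
NNAPI_SUPPORTED_OPS = {"Conv", "Relu", "Add", "MaxPool"}


def assign_nodes(op_types):
    """Return (op_type, provider) pairs, preferring NNAPI over CPU."""
    assignments = []
    for op_type in op_types:
        if op_type in NNAPI_SUPPORTED_OPS:
            # The NNAPI EP can convert this operator, so it claims the node.
            provider = "NnapiExecutionProvider"
        else:
            # Anything left over falls back to the CPU EP.
            provider = "CPUExecutionProvider"
        assignments.append((op_type, provider))
    return assignments


model_ops = ["Conv", "Relu", "NonMaxSuppression", "Add"]
for op_type, provider in assign_nodes(model_ops):
    print(f"{op_type:>18} -> {provider}")
```

In this toy model only `NonMaxSuppression` ends up on the CPU EP, because it is the one operator missing from the assumed supported set.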

At runtime, the NNAPI EP will create one or more NNAPI models from the nodes assigned to it. We don't have much control over how NNAPI itself executes the models we create; it internally picks whatever it thinks is best.

Setting the NNAPI_FLAG_CPU_DISABLED flag will prevent NNAPI from running operators in the NNAPI model that only have a CPU implementation. If there are operat…
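The NNAPI flags are combined into a single bitmask that is passed when the NNAPI EP is registered. The sketch below only shows how such a bitmask could be built. `NnapiFlags` and `build_nnapi_flags` are hypothetical helpers, and the numeric values are assumed to match ORT's `nnapi_provider_factory.h`, so check them against the header of the ORT version you build with.

```python
from enum import IntFlag


class NnapiFlags(IntFlag):
    # Assumed to mirror the NNAPIFlags enum in nnapi_provider_factory.h.
    # Verify these values against your ORT version before relying on them.
    USE_NONE = 0x000
    USE_FP16 = 0x001
    USE_NCHW = 0x002
    CPU_DISABLED = 0x004
    CPU_ONLY = 0x008


def build_nnapi_flags(cpu_disabled=False, use_fp16=False):
    """Combine the requested options into one bitmask."""
    flags = NnapiFlags.USE_NONE
    if cpu_disabled:
        flags |= NnapiFlags.CPU_DISABLED
    if use_fp16:
        flags |= NnapiFlags.USE_FP16
    return flags


flags = build_nnapi_flags(cpu_disabled=True)
print(f"nnapi_flags = {int(flags):#05x}")
```

The resulting integer is what you would hand to the EP registration call in whichever language binding you use.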

Replies: 2 comments

Answer selected by skottmckay
Labels
platform:mobile issues related to ONNX Runtime mobile; typically submitted using template
3 participants