Skip to content

Latest commit

 

History

History
43 lines (26 loc) · 3.06 KB

ABI.md

File metadata and controls

43 lines (26 loc) · 3.06 KB

ONNX Runtime ABI

We release ONNX Runtime as both static library and shared library on Windows, Linux and Mac OS X. ABI (Application Binary Interface) is only for the shared library. It allows you upgrade ONNX Runtime to a newer version without recompiling.

The ABI contains:

  1. A set of C functions for creating an inference session and load/run a model. No global variables would be directly exported. All these functions can be directly used in .net and .net core through PInvoke. All these public symbols are C symbols, no C++.
  2. (TODO) A C++ API for authoring custom ops and put them into a separated dll.

Functionality

C API

Integration

Q: Should I statically link to ONNX Runtime or dynamically? A: On Windows, Any custom op DLL must dynamically link to ONNX Runtime. Dynamical linking also helps on solving diamond dependency problem. For example, if part of your program depends on ONNX 1.2 but ONNX Runtime depends on ONNX 1.3, then dynamically linking to them would be better.

Q: Any requirement on CUDA version? My program depends on CUDA 9.0, but the ONNX Runtime binary was built with CUDA 9.1. Is it ok to put them together? A: Yes. Because ONNX Runtime statically linked to CUDA.

Dev Notes

Global Variables

Global variables may get constructed or destructed inside "DllMain". There are significant limits on what you can safely do in a DLL entry point. See 'DLL General Best Practices'. For example, you can't put a ONNX Runtime InferenceSession into a global variable.

Undefined symbols in a shared library

On Windows, you can't build a DLL with undefined symbols. Every symbol must be get resolved at link time. On Linux, you can.

In this project, we setup a rule: when building a shared library, every symbol must get resolved at link time, unless it's a custom op.

For custom op, on Linux, don't pass any libraries(except libc, pthreads) to linker. So that, even the application is statically linked to ONNX Runtime, they can still use the same custom op binary.

Default visibility

On POSIX systems, please always specify "-fvisibility=hidden" and "-fPIC" when compiling any code in ONNX Runtime shared library.

See pybind11 FAQ

RTLD_LOCAL vs RTLD_GLOBAL

RTLD_LOCAL and RTLD_GLOBAL are two flags of dlopen(3) function on Linux. By default, it's RTLD_LOCAL. And basically you can say, there no corresponding things like RTLD_GLOBAL on Windows.

If your application is a shared library, which statically linked to ONNX Runtime, and your application needs to dynamically load a custom op, then your application must be loaded with RTLD_GLOBAL. In all other cases, you should use RTLD_LOCAL. ONNX Runtime python binding is a good example of why sometimes RTLD_GLOBAL is needed.