- Support for allocating input/output tensors together
- Support for allocating input/output tensors in external memory
- Support for paging tensors to external memory
- Support for asynchronous loading of weights from flash
- Switched PyPI release to Jenkins from Github actions