@tqchen yes, absolutely. When we talked yesterday I hadn't thought of the 
uTVM application, but it could certainly be interesting. One possible 
improvement in that direction would be to create an mmap'able representation of 
the parsed graph_json, i.e. these fields of `MinimalGraphRuntime`:

```
  DynArray<Node> nodes_;
  DynArray<uint32_t> input_nodes_;
  DynArray<uint32_t> node_row_ptr_;
  DynArray<NodeEntry> outputs_;
```

That would let us construct the GraphRuntime without any allocations (and 
eliminate the code-size cost of the JSON parser). The only remaining 
allocations would be the NDArray tensor allocations themselves, which could 
perhaps be handled via a static storage plan or similar?
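
To make the idea a bit more concrete, here is a rough sketch of what such a flat layout could look like. Everything in it is hypothetical and not part of this PR: `FlatGraphHeader`, `FlatNodeEntry`, `Span`, `OpenFlatGraph`, `BuildFlatGraph`, and the magic value are names I made up for illustration, and the three `FlatNodeEntry` fields are an assumption about what an output entry carries. It only covers the three plain arrays; `nodes_` holds strings and nested input lists, so it would need the same offset trick applied recursively. It also assumes the blob is written and read on machines with the same endianness.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical magic value used to reject blobs that are not in this format.
constexpr uint32_t kFlatGraphMagic = 0x47525048;

// Assumed shape of an output entry; plain data with no pointers.
struct FlatNodeEntry { uint32_t node_id; uint32_t index; uint32_t version; };

// Fixed-size header at byte 0 of the blob. Offsets are in bytes from the start.
struct FlatGraphHeader {
  uint32_t magic;
  uint32_t total_size;
  uint32_t input_nodes_offset, input_nodes_count;
  uint32_t node_row_ptr_offset, node_row_ptr_count;
  uint32_t outputs_offset, outputs_count;
};

// Non-owning view of an array that lives inside the mapped blob.
template <typename T>
struct Span {
  const T* data = nullptr;
  uint32_t size = 0;
  const T& operator[](uint32_t i) const { assert(i < size); return data[i]; }
};

struct FlatGraphView {
  Span<uint32_t> input_nodes;
  Span<uint32_t> node_row_ptr;
  Span<FlatNodeEntry> outputs;
};

// Points `out` at [offset, offset + count * sizeof(T)) after checking bounds.
template <typename T>
bool BindSpan(const uint8_t* blob, size_t len, uint32_t offset, uint32_t count,
              Span<T>* out) {
  uint64_t end = uint64_t{offset} + uint64_t{count} * sizeof(T);
  if (offset % alignof(T) != 0 || end > len) return false;
  out->data = reinterpret_cast<const T*>(blob + offset);
  out->size = count;
  return true;
}

// Runtime side: no heap allocation, only validation and pointer arithmetic.
// `blob` could come from mmap() or simply be an array linked into flash.
bool OpenFlatGraph(const uint8_t* blob, size_t len, FlatGraphView* out) {
  if (len < sizeof(FlatGraphHeader)) return false;
  if (reinterpret_cast<std::uintptr_t>(blob) % alignof(uint32_t) != 0) return false;
  FlatGraphHeader h;
  std::memcpy(&h, blob, sizeof(h));
  if (h.magic != kFlatGraphMagic || h.total_size != len) return false;
  return BindSpan(blob, len, h.input_nodes_offset, h.input_nodes_count,
                  &out->input_nodes) &&
         BindSpan(blob, len, h.node_row_ptr_offset, h.node_row_ptr_count,
                  &out->node_row_ptr) &&
         BindSpan(blob, len, h.outputs_offset, h.outputs_count, &out->outputs);
}

// Appends the raw bytes of `v` to the blob and returns where they start.
template <typename T>
uint32_t AppendArray(std::vector<uint8_t>* blob, const std::vector<T>& v) {
  uint32_t offset = static_cast<uint32_t>(blob->size());
  if (!v.empty()) {
    const uint8_t* p = reinterpret_cast<const uint8_t*>(v.data());
    blob->insert(blob->end(), p, p + v.size() * sizeof(T));
  }
  return offset;
}

// Offline side (e.g. at compile time on the host), where allocating is fine.
std::vector<uint8_t> BuildFlatGraph(const std::vector<uint32_t>& input_nodes,
                                    const std::vector<uint32_t>& node_row_ptr,
                                    const std::vector<FlatNodeEntry>& outputs) {
  std::vector<uint8_t> blob(sizeof(FlatGraphHeader));  // reserve header space
  FlatGraphHeader h{};
  h.magic = kFlatGraphMagic;
  h.input_nodes_offset = AppendArray(&blob, input_nodes);
  h.input_nodes_count = static_cast<uint32_t>(input_nodes.size());
  h.node_row_ptr_offset = AppendArray(&blob, node_row_ptr);
  h.node_row_ptr_count = static_cast<uint32_t>(node_row_ptr.size());
  h.outputs_offset = AppendArray(&blob, outputs);
  h.outputs_count = static_cast<uint32_t>(outputs.size());
  h.total_size = static_cast<uint32_t>(blob.size());
  std::memcpy(blob.data(), &h, sizeof(h));
  return blob;
}
```

The host would run something like `BuildFlatGraph` once and ship the bytes next to the params; the device would only ever call `OpenFlatGraph`, so the JSON parser never has to be linked in.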


https://github.com/dmlc/tvm/pull/3567#issuecomment-512623009
