Re: [apache/incubator-tvm] TVM v0.7 Release Note Candidate (#6486)

ziheng Thu, 24 Sep 2020 17:53:08 -0700

## Operator Coverage

- Allow empty tensor for reshape, tile and strided_slice 
[#4618](https://github.com/apache/incubator-tvm/issues/4618)
- Fix meaning of conv2d_transpose output_padding parameter 
[#4708](https://github.com/apache/incubator-tvm/issues/4708)
- Remove cpp upsampling and resize op 
[#4769](https://github.com/apache/incubator-tvm/issues/4769)
- upsample operator 'NCHWinic' format support. 
[#4791](https://github.com/apache/incubator-tvm/pull/4791)
- Injective schedule improvement 
[#4786](https://github.com/apache/incubator-tvm/pull/4786)
- Enable vectorization on fp16 type 
[#4867](https://github.com/apache/incubator-tvm/pull/4867)
- Support for Int8 schedules - CUDA/x86 
[#5031](https://github.com/apache/incubator-tvm/pull/5031)
- New PR to re-add tan to TVM 
[#5025](https://github.com/apache/incubator-tvm/pull/5025)
- Register topi schedule for Relay fast_exp and fast_tanh 
[#5131](https://github.com/apache/incubator-tvm/pull/5131)
- Move Dilation2d from nn to image namespace 
[#5110](https://github.com/apache/incubator-tvm/pull/5110)
- Use Thrust sort for argsort and topk 
[#5097](https://github.com/apache/incubator-tvm/pull/5097)
- Conv2d and Dense ops support on Tensor Core 
[#5099](https://github.com/apache/incubator-tvm/pull/5099)
- Setting workload correctly for Depthwise Spatial conv ARM. 
[#5182](https://github.com/apache/incubator-tvm/pull/5182)
- Adding a few missing math intrin 
[#5011](https://github.com/apache/incubator-tvm/pull/5011)
- Missing vectorize for depthwise conv2d. 
[#5196](https://github.com/apache/incubator-tvm/pull/5196)
- [TOPI] Using x86 schedules for ARM conv2d 
([#5334](https://github.com/apache/incubator-tvm/pull/5334))
- [TOPI-ARM] Do not alter layout if layout is NHWC 
([#5350](https://github.com/apache/incubator-tvm/pull/5350))
- [Relay][OP] Add fast_erf implementation 
([#5241](https://github.com/apache/incubator-tvm/pull/5241))
- [Topi] Tensorcore support for Conv3D 
([#5284](https://github.com/apache/incubator-tvm/pull/5284))
- [intrin] a few more math functions 
([#5468](https://github.com/apache/incubator-tvm/pull/5468))
- [Intrinsic] Add log1p, ldexp, atan2, hypot, nextafter, copysign 
([#5312](https://github.com/apache/incubator-tvm/pull/5312))
- [relay][topi] Add operation relay.nn.dilate() which calls topi.nn.dilate() 
([#5331](https://github.com/apache/incubator-tvm/pull/5331))
- [TOPI x86] Adding unroll_kw config option for depthwise conv2d. (#5197)
- [Topi] Break down topi.cc into smaller files 
([#5253](https://github.com/apache/incubator-tvm/pull/5253))
- ReduceLogSumExp Operator support 
[#5453](https://github.com/apache/incubator-tvm/pull/5453)
- Math ops added [#5502](https://github.com/apache/incubator-tvm/pull/5502)
- Enable blocking format in x86 conv2d and fold scale axis 
[#5357](https://github.com/apache/incubator-tvm/pull/5357)
- Add operation gather to relay. 
[#5716](https://github.com/apache/incubator-tvm/pull/5716)
- Add storage_order ignore in pooling layer. 
[#5781](https://github.com/apache/incubator-tvm/pull/5781)
- Fix bifrost spatial packing conv2d auto tune 
[#5684](https://github.com/apache/incubator-tvm/pull/5684)
- Fix reshape usage in ARM schedule 
[#5732](https://github.com/apache/incubator-tvm/pull/5732)
- Block sparse dense on cuda 
[#5746](https://github.com/apache/incubator-tvm/pull/5746)
- Improve CUDA softmax scheduling 
[#5600](https://github.com/apache/incubator-tvm/pull/5600)
- pass-by-value -> pass-by-const-reference 
[#5783](https://github.com/apache/incubator-tvm/pull/5783)
- Using MKL blas for quantized dense 
[#6115](https://github.com/apache/incubator-tvm/pull/6115)
- topi -> tvm/topi [#6186](https://github.com/apache/incubator-tvm/pull/6186)
- Use auto-tuner to improve conv2d_gemm performance 
[#6117](https://github.com/apache/incubator-tvm/pull/6117)
- Improve CUDA conv2d_transpose_nchw 
[#4762](https://github.com/apache/incubator-tvm/issues/4762)
- Add CUDA conv2d for NHWC layout 
[#4737](https://github.com/apache/incubator-tvm/issues/4737)
- conv3d_ndhwc schedule 
[#4775](https://github.com/apache/incubator-tvm/issues/4775)
- Fast exponent [#4790](https://github.com/apache/incubator-tvm/pull/4790)
- Add Scatter to Topi/Relay/ONNX via hybrid script 
[#5619](https://github.com/apache/incubator-tvm/pull/5619)
- Split MKL from BLAS. 
[#6182](https://github.com/apache/incubator-tvm/pull/6182)
- Change the meaning of conv3d_transpose output_padding to match 
conv{1,2}d_transpose [#6065](https://github.com/apache/incubator-tvm/pull/6065)
- Gather op support added 
[#6013](https://github.com/apache/incubator-tvm/pull/6013)
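
Two entries above change what `output_padding` means for transposed convolutions. As a rough illustration only, the sketch below assumes the conventional definition used by frameworks such as PyTorch, where `output_padding` adds extra elements to one side of the output along each spatial axis. The helper name `conv_transpose_out_size` is hypothetical and is not a TVM API.

```python
def conv_transpose_out_size(in_size, kernel, stride=1, padding=0,
                            output_padding=0, dilation=1):
    """Output length along one spatial axis of a transposed convolution.

    Assumption: the conventional (PyTorch-style) formula. This is a
    hypothetical helper for illustration, not part of TVM.
    """
    return ((in_size - 1) * stride        # spacing introduced by the stride
            - 2 * padding                 # padding trims both sides
            + dilation * (kernel - 1)     # effective kernel span
            + output_padding              # extra elements on one side
            + 1)


if __name__ == "__main__":
    # Same layer, with and without output_padding.
    print(conv_transpose_out_size(4, kernel=3, stride=2, padding=1))
    print(conv_transpose_out_size(4, kernel=3, stride=2, padding=1,
                                  output_padding=1))
```

Under this definition, `output_padding` only changes the output shape; it can be used to disambiguate between the several input sizes that a strided forward convolution maps to the same output size.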


## Runtime and Backend

- Cythonize NDArray.copyto 
([#4549](https://github.com/apache/incubator-tvm/pull/4549))
- Unified Object System runtime refactor 
([#4578](https://github.com/apache/incubator-tvm/pull/4578), 
[#4581](https://github.com/apache/incubator-tvm/pull/4581), 
[#4603](https://github.com/apache/incubator-tvm/pull/4603))
- VM profiler: sort VM stats by time 
([#4601](https://github.com/apache/incubator-tvm/pull/4601))
- Update RPC runtime to allow remote module as arg 
([#4462](https://github.com/apache/incubator-tvm/pull/4462))
- Refactoring system lib and dso lib into library module 
([#4481](https://github.com/apache/incubator-tvm/pull/4481))
- Improve TSIM virtual memory mapping 
([#4545](https://github.com/apache/incubator-tvm/pull/4545))
- make adt tag signed 
[#4605](https://github.com/apache/incubator-tvm/issues/4605)
- Improve TVMBackendPackedCFunc to allow return val 
[#4637](https://github.com/apache/incubator-tvm/issues/4637)
- EdgeTPU runtime for Coral Boards 
[#4698](https://github.com/apache/incubator-tvm/issues/4698)
- Fix memory leak when using openMP 
[#4811](https://github.com/apache/incubator-tvm/issues/4811)
- Fix memory leakage of TVMByteArray 
[#4856](https://github.com/apache/incubator-tvm/pull/4856)
- Fix TVM_DLL_EXPORT_TYPED_FUNC to work on Windows 
[#4955](https://github.com/apache/incubator-tvm/pull/4955)
- Export GraphRuntime in tvm_runtime.dll 
[#5002](https://github.com/apache/incubator-tvm/pull/5002)
- MISRA-C compliant TVM runtime 
[#3934](https://github.com/apache/incubator-tvm/pull/3934)
- Update the type_keys to reflect the code-org 
[#5074](https://github.com/apache/incubator-tvm/pull/5074)
- Fix AttrEqual for Array and StrMap, double 
[#5054](https://github.com/apache/incubator-tvm/pull/5054)
- Fix unused-value warning 
[#5140](https://github.com/apache/incubator-tvm/pull/5140)
- crt error handling [#5147](https://github.com/apache/incubator-tvm/pull/5147)
- Bundle deployment with static linking 
[#5158](https://github.com/apache/incubator-tvm/pull/5158)
- Implemented kDLCPUPinned (cudaMallocHost) 
[#4985](https://github.com/apache/incubator-tvm/pull/4985)
- Explicitly cast min/max operands 
[#5090](https://github.com/apache/incubator-tvm/pull/5090)
- ref_counter -> ref_counter_ 
[#5184](https://github.com/apache/incubator-tvm/pull/5184)
- Expose runtime::String to Python 
([#5212](https://github.com/apache/incubator-tvm/pull/5212))
- [PY][FFI] Refactor runtime.String to subclass str 
([#5426](https://github.com/apache/incubator-tvm/pull/5426))
- [RUNTIME] Auto conversion from str to runtime::String in PackedFunc 
([#5251](https://github.com/apache/incubator-tvm/pull/5251))
- [RUNTIME] Improved Packed FFI for optional. 
([#5478](https://github.com/apache/incubator-tvm/pull/5478))
- [Hexagon] Add hexagon_posix.cc to TVM/RT sources in the right place (#5346)
- Fix workspace [#5503](https://github.com/apache/incubator-tvm/pull/5503)
- Store nullptr PackedFunc as nullptr for better error propagation 
[#5540](https://github.com/apache/incubator-tvm/pull/5540)
- Improve PackedFunc robustness 
[#5517](https://github.com/apache/incubator-tvm/pull/5517)
- Seg fault in WorkspacePool's destructor (#5632) 
[#5636](https://github.com/apache/incubator-tvm/pull/5636)
- Resolve constexpr issue in debug mode. 
[#5651](https://github.com/apache/incubator-tvm/pull/5651)
- Add compile_shared option to linux compile utility fn 
[#5751](https://github.com/apache/incubator-tvm/pull/5751)
- Call sync in CopyFromRemote and CopyToRemote 
[#5512](https://github.com/apache/incubator-tvm/pull/5512)
- Fix the multihop cpu case 
[#5522](https://github.com/apache/incubator-tvm/pull/5522)
- Improve RPCServer AsyncIO support. 
[#5544](https://github.com/apache/incubator-tvm/pull/5544)
- Modularize the RPC infra 
[#5484](https://github.com/apache/incubator-tvm/pull/5484)
- Overload string operators 
[#5806](https://github.com/apache/incubator-tvm/pull/5806)
- Only initialize required module 
[#5926](https://github.com/apache/incubator-tvm/pull/5926)
- If a param is not in the input, still consume its data 
[#5990](https://github.com/apache/incubator-tvm/pull/5990)
- init TVMPackedFunc’s name 
[#6044](https://github.com/apache/incubator-tvm/pull/6044)
- Enable auto conversion String->DLDataType 
[#6214](https://github.com/apache/incubator-tvm/pull/6214)
- Support random fill [#5913](https://github.com/apache/incubator-tvm/pull/5913)
- Use new to avoid exit-time de-allocation order 
[#6292](https://github.com/apache/incubator-tvm/pull/6292)
- Add parallel_for support to run a loop in parallel 
[#6275](https://github.com/apache/incubator-tvm/pull/6275)
- Solve ARM BIG.LITTLE heterogeneous multicores 
[#4747](https://github.com/apache/incubator-tvm/issues/4747)
- [RUNTIME] Quick fix PackedFunc String passing 
([#5266](https://github.com/apache/incubator-tvm/pull/5266))
- Introduce runtime::String::CanConvertFrom 
[#5718](https://github.com/apache/incubator-tvm/pull/5718)
- Restore the StrMap behavior in JSON/SHash/SEqual 
[#5719](https://github.com/apache/incubator-tvm/pull/5719)
- Support overriding RPCWatchdog termination behavior on Android and other 
platforms [#6216](https://github.com/apache/incubator-tvm/pull/6216)
- Set NDArray::Container.shape_ in NDArray::FromDLPack 
([#5301](https://github.com/apache/incubator-tvm/pull/5301))
- Enable x86 cpu cache flush 
[#5914](https://github.com/apache/incubator-tvm/pull/5914)

## Quantization

- Conv2D type checking for kernel per-channel scales. 
[#4732](https://github.com/apache/incubator-tvm/issues/4732)
- Add missing nullptr check 
[#4773](https://github.com/apache/incubator-tvm/issues/4773)
- Doc fix on convolution and dequantize 
[#4799](https://github.com/apache/incubator-tvm/issues/4799)
- Conv2D with dilation support. 
[#4796](https://github.com/apache/incubator-tvm/issues/4796)
- Making scale/zero_points as expr instead of attrs. 
[#4611](https://github.com/apache/incubator-tvm/issues/4611)
- Make calibration faster and more memory usage friendly 
[#4589](https://github.com/apache/incubator-tvm/issues/4589)
- Optimize lowering for requantize and FixedPointMultiply. 
[#4798](https://github.com/apache/incubator-tvm/pull/4798)
- More doc fix on quantize and convolution 
[#4874](https://github.com/apache/incubator-tvm/pull/4874)
- Add support for per channel weight scale in dense op 
[#4880](https://github.com/apache/incubator-tvm/pull/4880)
- Add support for quantized models via QNN 
[#4977](https://github.com/apache/incubator-tvm/pull/4977) 
[#5013](https://github.com/apache/incubator-tvm/pull/5013)
- Support 4D padding. [#5036](https://github.com/apache/incubator-tvm/pull/5036)
- [Requantize] Cleanup and Optimize Lowering 
([#5286](https://github.com/apache/incubator-tvm/pull/5286))
- [Topi, ARM] Disable Winograd for quantized tensors. 
([#5363](https://github.com/apache/incubator-tvm/pull/5363))
- Adding support for TFLite QnnSubtract operator. 
([#5230](https://github.com/apache/incubator-tvm/pull/5230))
- Remove developer facing api from frontend exports. 
([#5375](https://github.com/apache/incubator-tvm/pull/5375))
- Add Quantize/Dequantize Partitioning 
[#5940](https://github.com/apache/incubator-tvm/pull/5940)
- Add support for quantized models via QNN 
[#5016](https://github.com/apache/incubator-tvm/pull/5016)
- Quantize operation expanded to take const argument 
[#6127](https://github.com/apache/incubator-tvm/pull/6127)
- FP32 and Quantized Object Detection Model 
[#5479](https://github.com/apache/incubator-tvm/pull/5479)
- Support CallNode inputs in qnn.concatenate 
[#5360](https://github.com/apache/incubator-tvm/pull/5360)
- QNN support for TFLite 2.1.0 quantized models 
[#5848](https://github.com/apache/incubator-tvm/pull/5848)

## TE

- Tighten split's extent 
[#4931](https://github.com/apache/incubator-tvm/pull/4931)
- Set split node's range to minimum of ext and split factor or split np… 
[#5044](https://github.com/apache/incubator-tvm/pull/5044)
- Support mixing normal and cross-thread reduction 
([#5193](https://github.com/apache/incubator-tvm/pull/5193))
- Inline -> te/schedule/operation_inline.h 
([#5386](https://github.com/apache/incubator-tvm/pull/5386))
- Create loops according to storage scope and thread hierarchies 
([#5190](https://github.com/apache/incubator-tvm/pull/5190))
- Fix import in dump pass ir (#5327)
- Scalar support for `te.extern` 
[#6079](https://github.com/apache/incubator-tvm/pull/6079)
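
The two split-related entries above concern how tightly the ranges of the loops produced by a split are bounded. The following is a minimal sketch of that arithmetic, assuming a split by `factor` yields an outer loop of `ceil(extent / factor)` iterations and an inner loop capped at `min(extent, factor)`. `split_extents` is a hypothetical helper for illustration, not a TVM function.

```python
def split_extents(extent, factor):
    """Return (outer, inner) loop extents for splitting `extent` by `factor`.

    Assumption: the inner extent is clamped to the original extent, so a
    factor larger than the loop does not widen the inner range.
    """
    if extent <= 0 or factor <= 0:
        raise ValueError("extent and factor must be positive")
    inner = min(extent, factor)               # never wider than the original loop
    outer = (extent + factor - 1) // factor   # ceiling division
    return outer, inner


if __name__ == "__main__":
    print(split_extents(10, 4))
    print(split_extents(3, 8))
```

With this clamping, splitting a loop of extent 3 by a factor of 8 gives an inner extent of 3 rather than 8.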

## TIR

- IR readability enhancement 
([#4501](https://github.com/apache/incubator-tvm/pull/4501))
- Introduce tir::PrimFunc 
[#5070](https://github.com/apache/incubator-tvm/pull/5070)
- Introduce PrimFuncPass. 
[#5139](https://github.com/apache/incubator-tvm/pull/5139)
- [TIR] Enhance Substitute, python bindings for Substitute/PostOrderVisit 
([#5400](https://github.com/apache/incubator-tvm/pull/5400))
- [TIR] Remove ProducerConsumer and AllocateNode::new_expr 
([#5333](https://github.com/apache/incubator-tvm/pull/5333))
- [IR][TRANSFORM] Enable CopyOnWrite for TIR passes. 
([#5309](https://github.com/apache/incubator-tvm/pull/5309))
- [REFACTOR][TIR] Migrate LowerTVMBuiltin, InferFragment, LowerThreadAllreduce, 
ThreadSync to Pass Manager 
([#5213](https://github.com/apache/incubator-tvm/pull/5213))
- [TIR][REFACTOR] Remove te::Tensor dependencies from TIR passes. 
([#5372](https://github.com/apache/incubator-tvm/pull/5372))
- [TIR] Refactor MakePackedAPI to target dependent stage. 
([#5326](https://github.com/apache/incubator-tvm/pull/5326))
- [REFACTOR] tvm.hybrid -> te.hybrid 
([#5223](https://github.com/apache/incubator-tvm/pull/5223))
- [REFACTOR][TIR] Migrate most of low-level build to use the Pass Manager. 
([#5225](https://github.com/apache/incubator-tvm/pull/5225))
- [TIR][REFACTOR] Migrate low-level passes in tvm.lower to the Pass Manager 
([#5364](https://github.com/apache/incubator-tvm/pull/5364))
- [TIR] Migrate VTA TIR passes to the new pass manager. 
([#5397](https://github.com/apache/incubator-tvm/pull/5397))
- [REFACTOR][TIR] Migrate all low-level passes to the Pass Manager. 
([#5233](https://github.com/apache/incubator-tvm/pull/5233))
- [REFACTOR][TIR] Introduce ExprDeepEqual, Remove IRDeepCompare (#5206)
- [TIR][REFACTOR] RewriteForTensorCore -> te/schedule (#5379)
- [TIR][REFACTOR] Remove ir_pass in favor of analysis/transform. (#5415)
- text format printer considering future parsing use 
[#5483](https://github.com/apache/incubator-tvm/pull/5483)
- Remove buffer params from pass config. 
[#5652](https://github.com/apache/incubator-tvm/pull/5652)
- std::string -> String Migration in TIR nodes 
[#5596](https://github.com/apache/incubator-tvm/pull/5596)
- Remove CallNode.call_type in favor of attribute. 
[#5937](https://github.com/apache/incubator-tvm/pull/5937)
- Remove legacy HoistIfThenElse 
[#5944](https://github.com/apache/incubator-tvm/pull/5944)
- Improve Let/LetStmt support. 
[#5949](https://github.com/apache/incubator-tvm/pull/5949)
- Refine side effect analysis. 
[#5954](https://github.com/apache/incubator-tvm/pull/5954)
- Provide->ProducerStore, Realize->ProducerRealize. 
[#5750](https://github.com/apache/incubator-tvm/pull/5750)
- Migrate the tvm/tir/expr.h to constructor 
[#5773](https://github.com/apache/incubator-tvm/pull/5773)
- Migrate tir/stmt.h to use constructor. 
[#5778](https://github.com/apache/incubator-tvm/pull/5778)
- Cleanup unused classes 
[#5789](https://github.com/apache/incubator-tvm/pull/5789)
- Add tir prefix to type keys 
[#5802](https://github.com/apache/incubator-tvm/pull/5802)
- Enhance VerifyGPUCode 
[#6194](https://github.com/apache/incubator-tvm/pull/6194)
- Enforce buffer pointer var type to be consistent with dtype. 
[#6317](https://github.com/apache/incubator-tvm/pull/6317)
- Create a StringImm reference type 
[#4806](https://github.com/apache/incubator-tvm/issues/4806)
- Add `init` member to ReduceNode 
[#6138](https://github.com/apache/incubator-tvm/pull/6138)
- Add dump and print for debugging (NFC) 
[#5207](https://github.com/apache/incubator-tvm/pull/5207)
- Streamline Function Attr interface. 
[#5045](https://github.com/apache/incubator-tvm/pull/5045)
- alpha_equal to structural_equal 
[#5161](https://github.com/apache/incubator-tvm/pull/5161)
- Remove AttrsEqual and AttrsHash related code 
[#5169](https://github.com/apache/incubator-tvm/pull/5169)
- [NODE] General serialization of leaf objects into bytes. 
([#5299](https://github.com/apache/incubator-tvm/pull/5299))
- [POC][IR] Initial stab at std::string->String upgrade 
([#5438](https://github.com/apache/incubator-tvm/pull/5438))
- [TIR] Make lower_warp_memory support extent(threadIdx.x) < warp_size 
([#5307](https://github.com/apache/incubator-tvm/pull/5307))
- [TIR][PASS] dtype rewrite for indexing variables 
([#5092](https://github.com/apache/incubator-tvm/pull/5092))
- [PYTHON] Enhance with_attr API, cleanup MakeAPILegacy in testcases 
([#5335](https://github.com/apache/incubator-tvm/pull/5335))
- [PYTHON] Make IntImm more like an integer 
([#5232](https://github.com/apache/incubator-tvm/pull/5232))
- [REFACTOR][IR] Move to runtime::String 
([#5276](https://github.com/apache/incubator-tvm/pull/5276))
- [REFACTOR][IR] kExternalSymbol -> kGlobalSymbol 
([#5211](https://github.com/apache/incubator-tvm/pull/5211))
- [REFACTOR][IR] Remove PrimExpr from String 
([#5311](https://github.com/apache/incubator-tvm/pull/5311))
- IRModule is updated with String 
[#5523](https://github.com/apache/incubator-tvm/pull/5523)
- IR is updated with String 
[#5547](https://github.com/apache/incubator-tvm/pull/5547)
- Streamline ir/op Registry 
[#5609](https://github.com/apache/incubator-tvm/pull/5609)
- Migrate IRModule ObjectRef to not-null 
[#5654](https://github.com/apache/incubator-tvm/pull/5654)
- Migrate BuildConfig to PassContext. 
[#5668](https://github.com/apache/incubator-tvm/pull/5668)
- relay.op.Op -> tvm.ir.Op 
[#5705](https://github.com/apache/incubator-tvm/pull/5705)
- Separate ArgTypeCode from DLDataTypeCode 
[#5730](https://github.com/apache/incubator-tvm/pull/5730)
- Remove legacy compute_expr.h 
[#5738](https://github.com/apache/incubator-tvm/pull/5738)
- Call::Halide => ProducerLoad, DSL/TIR decouple. 
[#5743](https://github.com/apache/incubator-tvm/pull/5743)
- Migrate all Object construction to constructor. 
[#5784](https://github.com/apache/incubator-tvm/pull/5784)
- Finish std::string->String updates 
[#5793](https://github.com/apache/incubator-tvm/pull/5793)
- Change Call.name to Call.op(RelayExpr) 
[#5863](https://github.com/apache/incubator-tvm/pull/5863)
- Range/IntSet API style consistency. 
[#5953](https://github.com/apache/incubator-tvm/pull/5953)
- Unify StrMapNode and MapNode 
[#5687](https://github.com/apache/incubator-tvm/pull/5687)
