# Performance Improvements - Int8 GEMM performance enhancement using Cublas ([#4550](https://github.com/apache/incubator-tvm/pull/4550)) - Speedup TSIM with multi-threading ([#4491](https://github.com/apache/incubator-tvm/pull/4491)) - [Runtime][Contrib] Support cudnn softmax (#5214) - [cuDNN] Add cuDNN grouped convolution support (#5319) - [Relay][Topi][AutoTVM] Winograd support for Conv3D (#5186) - [TOPI] Improve get_valid_count and nms performance for CUDA (#5339) - [Topi][Cuda]Optimizations of global_ave_pool for NHWC layout (#5450) - Optimization of Conv2d Winograd algorithm on Tensor … [#5485](https://github.com/apache/incubator-tvm/pull/5485) - Some performance improvement to VM [#5901](https://github.com/apache/incubator-tvm/pull/5901) - Optimize x86 conv3d_ndhwc using data packing approach. [#4866](https://github.com/apache/incubator-tvm/pull/4866) - Improve NHWC depthwise convolution for AArch64 [#6095](https://github.com/apache/incubator-tvm/pull/6095) - Improve quantized convolution performance for armv8 architectures [#5754](https://github.com/apache/incubator-tvm/pull/5754)
# Documentation - Adding benchmark log format doc ([#4366](https://github.com/apache/incubator-tvm/pull/4366)) - Add Ninja build system to installation docs ([#4554](https://github.com/apache/incubator-tvm/pull/4554)) - Doc/comment fixes ([#4452](https://github.com/apache/incubator-tvm/pull/4452), [#4463](https://github.com/apache/incubator-tvm/pull/4463), [#4469](https://github.com/apache/incubator-tvm/pull/4469), [#4493](https://github.com/apache/incubator-tvm/pull/4493), [#4397](https://github.com/apache/incubator-tvm/pull/4397), [#4580](https://github.com/apache/incubator-tvm/pull/4580), [#4585](https://github.com/apache/incubator-tvm/pull/4585), [#4591](https://github.com/apache/incubator-tvm/pull/4591)) - Fix doc after moving to unified IR [#4835](https://github.com/apache/incubator-tvm/pull/4835) - Introduction to module serialization [#4564](https://github.com/apache/incubator-tvm/pull/4564) - ConvertLayout - Call RemoveUnunsedFunctions. [#4834](https://github.com/apache/incubator-tvm/pull/4834) - Fix bugs that override n_trials [#4842](https://github.com/apache/incubator-tvm/pull/4842) - Update the vm doc [#4868](https://github.com/apache/incubator-tvm/pull/4868) - Refine the example description of max/min/sum/tag_scope [#4974](https://github.com/apache/incubator-tvm/pull/4974) - Fix vta tutorial [#4809](https://github.com/apache/incubator-tvm/pull/4809) - Introduce how to add hardware backend to FAQ [#4898](https://github.com/apache/incubator-tvm/pull/4898) - Update API docs to reflect the status after the refactor. [#4907](https://github.com/apache/incubator-tvm/pull/4907) - Fix sphinx warnings [#4917](https://github.com/apache/incubator-tvm/pull/4917) - Fix Sphinx Warnings (RST indent, cross-ref, and image scale) [#4920](https://github.com/apache/incubator-tvm/pull/4920) - Fix Sphinx Warning: the target found for cross-reference [#4925](https://github.com/apache/incubator-tvm/pull/4925) - Sphinx -- Introduce alias detection. [#4954](https://github.com/apache/incubator-tvm/pull/4954) - Fix Warnings from #4942 [#4959](https://github.com/apache/incubator-tvm/pull/4959) - Fix sphinx precheck [#4967](https://github.com/apache/incubator-tvm/pull/4967) - Move git_howto to rst, add Stage documents to te [#5055](https://github.com/apache/incubator-tvm/pull/5055) - Add doc for Relay op strategy [#5078](https://github.com/apache/incubator-tvm/pull/5078) - Update relay docs [#5112](https://github.com/apache/incubator-tvm/pull/5112) - Include a tarball of docs, add a security faq [#5119](https://github.com/apache/incubator-tvm/pull/5119) - Cleanup docs before rebuild [#5127](https://github.com/apache/incubator-tvm/pull/5127) - Minimize necessary doc change [#5129](https://github.com/apache/incubator-tvm/pull/5129) - Various sphinx related fix. [#5168](https://github.com/apache/incubator-tvm/pull/5168) - Point docs to the ASF site. [#5178](https://github.com/apache/incubator-tvm/pull/5178) - Use https link [#5183](https://github.com/apache/incubator-tvm/pull/5183) - Reduce artifcats generated by sphinx gallery [#5208](https://github.com/apache/incubator-tvm/pull/5208) - Refine the example description of max/min/sum/tag_scope [#4974](https://github.com/apache/incubator-tvm/pull/4974) - Description updated for pooling attributes [#5091](https://github.com/apache/incubator-tvm/pull/5091) - [DOCS] Migrate some markdowns to rst, fix sphinx3 warnings ([#5416](https://github.com/apache/incubator-tvm/pull/5416)) - [DOCS] Misc docs improvements ([#5222](https://github.com/apache/incubator-tvm/pull/5222)) - [DOCS] Bring relay docs to the top-level flat view ([#5343](https://github.com/apache/incubator-tvm/pull/5343)) - [DOCS] Reduce artifcats generated by sphinx gallery ([#5208](https://github.com/apache/incubator-tvm/pull/5208)) - [DOCS] Use https link ([#5183](https://github.com/apache/incubator-tvm/pull/5183)) - [DOCSTRING]missing function parameters updated ([#5228](https://github.com/apache/incubator-tvm/pull/5228)) - [DOCS] Migrate HLS documents from md to rst ([#5419](https://github.com/apache/incubator-tvm/pull/5419)) - [Tutorial, QNN] Add tutorial for loading quantized PyTorch model ([#5321](https://github.com/apache/incubator-tvm/pull/5321)) - [Docs] VTA install doc migration from md to rst ([#5442](https://github.com/apache/incubator-tvm/pull/5442)) - [TVM][docs] compiler version in docs ([#5281](https://github.com/apache/incubator-tvm/pull/5281)) - Remove legacy compute_expr.h [#5738](https://github.com/apache/incubator-tvm/pull/5738) - TVM_REGISTER_API -> TVM_REGISTER_GLOBAL [#4768](https://github.com/apache/incubator-tvm/issues/4768) # Bug Fixes - Add bfloat16 typeflag support ([#4525](https://github.com/apache/incubator-tvm/pull/4525)) - MSVC / Windows fixes ([#4455](https://github.com/apache/incubator-tvm/pull/4455), [#4569](https://github.com/apache/incubator-tvm/pull/4569)) - Fix Makefile for howto_deploy ([#4457](https://github.com/apache/incubator-tvm/pull/4457/files)) - Fix GCC 4.8 compact ([#4461](https://github.com/apache/incubator-tvm/pull/4461)) - Fix search path to build libtvm_topi.so ([#4467](https://github.com/apache/incubator-tvm/pull/4467)) - Fix for conv2d_transpose CUDA compilation ([#4472](https://github.com/apache/incubator-tvm/pull/4472)) - Fix for LLVM 10.0 codegen ([#4480](https://github.com/apache/incubator-tvm/pull/4480), [#4515](https://github.com/apache/incubator-tvm/pull/4515)) - Fix alter op layout when calling global var ([#4454](https://github.com/apache/incubator-tvm/pull/4454)) - Fix float2half_rn support for cuda compute capabilities < 53 ([#4489](https://github.com/apache/incubator-tvm/pull/4489)) - Fix compile errors for OpenCL backends ([#4492](https://github.com/apache/incubator-tvm/pull/4492)) - Fix serialization precision loss ([#4503](https://github.com/apache/incubator-tvm/pull/4503)) - Fix hybrid script to support array of tensors ([#4494](https://github.com/apache/incubator-tvm/pull/4494)) - Fix annotation for multiply op ([#4458](https://github.com/apache/incubator-tvm/pull/4458)) - Fix Dockerfile for linter CI ([#4506](https://github.com/apache/incubator-tvm/pull/4506)) - Fix TF resize for dynamic size models ([#4510](https://github.com/apache/incubator-tvm/pull/4510)) - Fix bias_add gradient ([#4516](https://github.com/apache/incubator-tvm/pull/4516)) - Fix tanH unit test function call ([#4517](https://github.com/apache/incubator-tvm/pull/4517)) - Fix extra reshape parameter for ONNX ([#4524](https://github.com/apache/incubator-tvm/pull/4524)) - Fix crash caused by empty TOPI config ([#4520](https://github.com/apache/incubator-tvm/pull/4520)) - Fix ONNX shape op type to use int64 ([#4528](https://github.com/apache/incubator-tvm/pull/4528)) - Fix crash in TSIM virtual memory driver ([#4527](https://github.com/apache/incubator-tvm/pull/4527)) - Replace deprecated python library in setup script ([#4533](https://github.com/apache/incubator-tvm/pull/4533)) - Fix NMS max_output_size loop ([#4541](https://github.com/apache/incubator-tvm/pull/4541)) - Fix style in IR mutator and IR visitor ([#4561](https://github.com/apache/incubator-tvm/pull/4561)) - Fix compiler warning ([#4559](https://github.com/apache/incubator-tvm/pull/4559)) - Fix to get end to end inference on Chisel VTA ([#4574](https://github.com/apache/incubator-tvm/pull/4574)) - Fix LLVM build by adding missing intrinsics headers ([#4575](https://github.com/apache/incubator-tvm/pull/4575)) - Fix context creation in quantization ([#4582](https://github.com/apache/incubator-tvm/pull/4582)) - Fix NDArray SaveDLTensor signature ([#4586](https://github.com/apache/incubator-tvm/pull/4586)) - Fix dense pack schedule for x86 ([#4539](https://github.com/apache/incubator-tvm/pull/4539)) - Fix for broadcast tensor of scalar type ([#4577](https://github.com/apache/incubator-tvm/pull/4577)) - Datatype refactor ([#4513](https://github.com/apache/incubator-tvm/pull/4513/files), [#4560](https://github.com/apache/incubator-tvm/pull/4560)) - Add const qualifiers for NDArray container ([#4590](https://github.com/apache/incubator-tvm/pull/4590)) - Fix TF <= 1.12 compatibility ([#4593](https://github.com/apache/incubator-tvm/pull/4593)) - Fix for graph debug runtime ([#4598](https://github.com/apache/incubator-tvm/pull/4598/files)) - Disable copy constructor for external codegen ([#4597](https://github.com/apache/incubator-tvm/pull/4597)) - Make ADT tag signed ([#4605](https://github.com/apache/incubator-tvm/pull/4605)) - Added declare of aluBits for TensorAlu [#4624](https://github.com/apache/incubator-tvm/issues/4624) - Get around limitation of g++-4.8 [#4626](https://github.com/apache/incubator-tvm/issues/4626) - Bugfix StmtMutator IfThenElse [#4609](https://github.com/apache/incubator-tvm/issues/4609) - Remove unecessary rdynamic [#4613](https://github.com/apache/incubator-tvm/issues/4613) - Resolve constexpr related link error in debug mode [#4641](https://github.com/apache/incubator-tvm/issues/4641) - Asymmetric padding [#4511](https://github.com/apache/incubator-tvm/issues/4511) - Reduce data size of asymmetric padding testcase [#4658](https://github.com/apache/incubator-tvm/issues/4658) - Fix Base64OutStream portability issue [#4668](https://github.com/apache/incubator-tvm/issues/4668) - Fix topi.nn.global_pool layout="NHWC" [#4656](https://github.com/apache/incubator-tvm/issues/4656) - Also package core.rly [#4679](https://github.com/apache/incubator-tvm/issues/4679) - `fskip` of EliminateCommonSubexpr cannot always return false [#4620](https://github.com/apache/incubator-tvm/issues/4620) - Fix Python syntax error in start_rpc_server_to_tracker.py [#4682](https://github.com/apache/incubator-tvm/issues/4682) - os.path --> osp to match the import [#4681](https://github.com/apache/incubator-tvm/issues/4681) - GitHub actions/checkout@v1 --> v2 [#4680](https://github.com/apache/incubator-tvm/issues/4680) - Fix Python syntax error AGAIN in start_rpc_server_to_tracker.py [#4685](https://github.com/apache/incubator-tvm/issues/4685) - Use ==/!= to compare str, bytes, and int literals [#4686](https://github.com/apache/incubator-tvm/issues/4686) - Rename start_rpc_server_to_tracker.py to start_rpc_server_to_tracker.sh [#4689](https://github.com/apache/incubator-tvm/issues/4689) - GitHub Action lint Python code for syntax errors [#4688](https://github.com/apache/incubator-tvm/issues/4688) - Generate blob use LLVM directly [#4657](https://github.com/apache/incubator-tvm/issues/4657) - Reduce input size to fix oom [#4653](https://github.com/apache/incubator-tvm/issues/4653) - Fix RemoveUnusedFunctions pass [#4700](https://github.com/apache/incubator-tvm/issues/4700) - Link the math library by default [#4713](https://github.com/apache/incubator-tvm/issues/4713) - Update mainline version to 0.7.dev0 [#4720](https://github.com/apache/incubator-tvm/issues/4720) - Add SizeVar representing non-neg valued variable in a tensor shape [#4684](https://github.com/apache/incubator-tvm/issues/4684) - Fix the compile problem of cpp_rpc [#4725](https://github.com/apache/incubator-tvm/issues/4725) - JSON upgrader to upgrade serialized json. [#4730](https://github.com/apache/incubator-tvm/issues/4730) - Fallback schedule for Int8 depthwise. [#4733](https://github.com/apache/incubator-tvm/issues/4733) - Fix dense x86 schedule [#4728](https://github.com/apache/incubator-tvm/issues/4728) - Fix demo dockerfile build failed [#4744](https://github.com/apache/incubator-tvm/issues/4744) - Improve CUDA vectorizer [#4736](https://github.com/apache/incubator-tvm/issues/4736) - Add .asf.yaml for github info [#4761](https://github.com/apache/incubator-tvm/issues/4761) - Fix padding in pooling op [#4738](https://github.com/apache/incubator-tvm/issues/4738) - Remove run_infer_type duplicates [#4766](https://github.com/apache/incubator-tvm/issues/4766) - [pooling.cc](http://pooling.cc/) improvements [#4767](https://github.com/apache/incubator-tvm/issues/4767) - Export builtin_fp16 on Windows [#4731](https://github.com/apache/incubator-tvm/issues/4731) - Fix Tensorflow conv3d pad bug, add non-cubic data and kernel tests [#4772](https://github.com/apache/incubator-tvm/issues/4772) - Bump prebuilt-image version in demo dockerfile [#4770](https://github.com/apache/incubator-tvm/issues/4770) - Update tune_simple_template.py [#4778](https://github.com/apache/incubator-tvm/issues/4778) - Explicitly link to cublasLt if it exists [#4776](https://github.com/apache/incubator-tvm/issues/4776) - Fix hasattr by extracting Python error type from Windows error message [#4780](https://github.com/apache/incubator-tvm/issues/4780) - Replace os.path.exists with try...except...else [#4784](https://github.com/apache/incubator-tvm/issues/4784) - Make sure to visit the arguments of inlined functions [#4783](https://github.com/apache/incubator-tvm/issues/4783) - Parse additional exception strings [#4785](https://github.com/apache/incubator-tvm/issues/4785) - Fix #4670: add bias for fc layer [#4801](https://github.com/apache/incubator-tvm/issues/4801) - Change color channel from BGR to RGB for darknet preprocessing [#4794](https://github.com/apache/incubator-tvm/issues/4794) - Fix -Wextra [#4804](https://github.com/apache/incubator-tvm/issues/4804) - Fix vta tutorial [#4809](https://github.com/apache/incubator-tvm/issues/4809) - Minor bug fixes in AutoTVM for QNN graphs [#4797](https://github.com/apache/incubator-tvm/pull/4797) - Fixed subprocess creation under windows [#4820](https://github.com/apache/incubator-tvm/pull/4820) - Improve tol to resolve flaky case [#4836](https://github.com/apache/incubator-tvm/pull/4836) - Fixed process termination routine in windows [#4844](https://github.com/apache/incubator-tvm/pull/4844) - test_cuddn flaky [#4846](https://github.com/apache/incubator-tvm/pull/4846) - Mxnet parser for Qnn dialect [#4714](https://github.com/apache/incubator-tvm/pull/4714) - Enhance cc.cross_compiler [#4817](https://github.com/apache/incubator-tvm/pull/4817) - Fixed crash caused by reversing bitwise operations [#4852](https://github.com/apache/incubator-tvm/pull/4852) - Reverse some changes made for intel_graphics/conv2d.py in PR #4849 [#4853](https://github.com/apache/incubator-tvm/pull/4853) - const auto p -> const auto& p [#4861](https://github.com/apache/incubator-tvm/pull/4861) - Fix onnx import bugs [#4750](https://github.com/apache/incubator-tvm/pull/4750) - Explicit llvm::StringRef to std::string conversion [#4859](https://github.com/apache/incubator-tvm/pull/4859) - Update the runtime PackedFunc for module [#4871](https://github.com/apache/incubator-tvm/pull/4871) - Improve antlr import error message [#4888](https://github.com/apache/incubator-tvm/pull/4888) - Fix alpha_equal bug for attribute check [#4897](https://github.com/apache/incubator-tvm/pull/4897) - Fix issues in cuda codegen [#4876](https://github.com/apache/incubator-tvm/pull/4876) - Fixed: Bitwise ops on floats causing wrong code generation and crashes. [#4892](https://github.com/apache/incubator-tvm/pull/4892) - Fix tvm.target.generic_func runtime detection [#4910](https://github.com/apache/incubator-tvm/pull/4910) - topi/tests/python/test_topi_sort.py::test_argsort [#4891](https://github.com/apache/incubator-tvm/pull/4891) - Use opencv reisze method for preprocessing of image in darknet [#4883](https://github.com/apache/incubator-tvm/pull/4883) - Fix build breaks with StringRef changes [#4923](https://github.com/apache/incubator-tvm/pull/4923) - Remove unnecessary spliting in the cached chunk [#4935](https://github.com/apache/incubator-tvm/pull/4935) - Fixing an Infinite Loop case in UnmatchedChecker. [#4881](https://github.com/apache/incubator-tvm/pull/4881) - Remove SGX toolchain installation from CI Dockerfile [#4948](https://github.com/apache/incubator-tvm/pull/4948) - Fix tedd tutorial after strategy change [#4947](https://github.com/apache/incubator-tvm/pull/4947) - Allow customize MKLDNN library location [#4814](https://github.com/apache/incubator-tvm/pull/4814) - Added CopyFromBytes and CopyToBytes convenience methods to NDArray. Fixed typos. [#4970](https://github.com/apache/incubator-tvm/pull/4970) - Fix gcn tutorial failure [#4994](https://github.com/apache/incubator-tvm/pull/4994) - Fix stride default value None in torch.nn.functional.avg_pool [#4984](https://github.com/apache/incubator-tvm/pull/4984) - Fix ROCm strategy for winograd conv selection [#5001](https://github.com/apache/incubator-tvm/pull/5001) - Fix get_valid_count flaky test for cuda [#4901](https://github.com/apache/incubator-tvm/pull/4901) - Change Scala Linter scalafmt => scalastyle [#4998](https://github.com/apache/incubator-tvm/pull/4998) - Kill from tvm import te [#5007](https://github.com/apache/incubator-tvm/pull/5007) - Chisel fixes and de10nano support [#4986](https://github.com/apache/incubator-tvm/pull/4986) - Fix gpu not found when running TVM docker [#4975](https://github.com/apache/incubator-tvm/pull/4975) - Fixes for pylint==2.4.4 [#4849](https://github.com/apache/incubator-tvm/pull/4849) - Fix unordered dictionary problem for python version under 3.6 [#4982](https://github.com/apache/incubator-tvm/pull/4982) - Fix gcn tutorial failure [#4994](https://github.com/apache/incubator-tvm/pull/4994) - Fix stride default value None in torch.nn.functional.avg_pool [#4984](https://github.com/apache/incubator-tvm/pull/4984) - Fix ROCm strategy for winograd conv selection [#5001](https://github.com/apache/incubator-tvm/pull/5001) - Early checking added and new test cases added for schedule fuse [#5010](https://github.com/apache/incubator-tvm/pull/5010) - Fixed div by zero core dump. Fixed rounding intrinsics on int crash [#5026](https://github.com/apache/incubator-tvm/pull/5026) - Test case modified for int type [#5012](https://github.com/apache/incubator-tvm/pull/5012) - Bug Fix for ARM CPUs. Lower strict assumption. [#5063](https://github.com/apache/incubator-tvm/pull/5063) - Triage the testcases to fit the the new namespaces [#5071](https://github.com/apache/incubator-tvm/pull/5071) - Add colors to compute_at edges and thread/block indices. [#5111](https://github.com/apache/incubator-tvm/pull/5111) - Temporary fix to the stack overflow issue in autotvm task extraction [#5019](https://github.com/apache/incubator-tvm/pull/5019) - Fix compilation of If-Elses [#5040](https://github.com/apache/incubator-tvm/pull/5040) - Fix CompilerAttrs [#5109](https://github.com/apache/incubator-tvm/pull/5109) - Fix the existing test cases before refactoring. [#5122](https://github.com/apache/incubator-tvm/pull/5122) - Fixed bug where shifting by out-of-bounds value results in no compute code being emitted. [#5115](https://github.com/apache/incubator-tvm/pull/5115) - Fix for issue #4831. The data_min_idx and data_max_idx were flipped. [#5136](https://github.com/apache/incubator-tvm/pull/5136) - Duplicate likely nodes added when loop axis split unevenly [#5084](https://github.com/apache/incubator-tvm/pull/5084) - Fix incorrect name of calibration mode [#5150](https://github.com/apache/incubator-tvm/pull/5150) - Remove contrib spatial pack schedule of depthwise convolution [#5148](https://github.com/apache/incubator-tvm/pull/5148) - Fix annotate pass static variable [#5023](https://github.com/apache/incubator-tvm/pull/5023) - Fixed ConvTranspose2D parsing [#5157](https://github.com/apache/incubator-tvm/pull/5157) - Nullptr check [#5176](https://github.com/apache/incubator-tvm/pull/5176) - rocm: fix miopen convolutions [#5179](https://github.com/apache/incubator-tvm/pull/5179) - rocm: fix dense_rocblas in strategy, topi [#5191](https://github.com/apache/incubator-tvm/pull/5191) - Bugfix in tensorflow space_to_batch_nd [#5175](https://github.com/apache/incubator-tvm/pull/5175) - Fix CRT static test bug ([#5293](https://github.com/apache/incubator-tvm/pull/5293)) - Fix perf regression of tir refactor ([#5258](https://github.com/apache/incubator-tvm/pull/5258)) - Bugfix in tensorflow space_to_batch_nd ([#5175](https://github.com/apache/incubator-tvm/pull/5175)) - Compilation warnings fixed for 32bit and 64bit compilation ([#5349](https://github.com/apache/incubator-tvm/pull/5349)) - Fix hang in MergeCompilerRegions ([#5227](https://github.com/apache/incubator-tvm/pull/5227)) - Fixes to MergeCompilerRegions ([#5195](https://github.com/apache/incubator-tvm/pull/5195)) - Fix generation of LLVM intrinsics ([#5282](https://github.com/apache/incubator-tvm/pull/5282)) - Fix setting up hints for getaddrinfo ([#2872](https://github.com/apache/incubator-tvm/pull/2872)) - Add ConstantNode to IsAtomic ([#5457](https://github.com/apache/incubator-tvm/pull/5457)) - Fix String SEqual ([#5275](https://github.com/apache/incubator-tvm/pull/5275)) - Fix fuse over functions that are handled by external codegen ([#5365](https://github.com/apache/incubator-tvm/pull/5365)) - Fix memory leak when accessing NDArray ([#5413](https://github.com/apache/incubator-tvm/pull/5413)) - Remove the duplicate PrintIR pass in Relay ([#5403](https://github.com/apache/incubator-tvm/pull/5403)) - Fix lower_warp_memory ([#5247](https://github.com/apache/incubator-tvm/pull/5247)) - Fix lower_warp_memory when there are >1 warp buffers ([#5368](https://github.com/apache/incubator-tvm/pull/5368)) - Fix intel conv2d auto tune ([#5200](https://github.com/apache/incubator-tvm/pull/5200)) - Fix FuseBatchNorm output cast error if need_cast is True [#4894](https://github.com/apache/incubator-tvm/pull/4894) - Fix an assertion exposed by loop vectorizer [#4916](https://github.com/apache/incubator-tvm/pull/4916) - Fix error message [#4945](https://github.com/apache/incubator-tvm/pull/4945) - Fix for recursive let [#5757](https://github.com/apache/incubator-tvm/pull/5757) - Fix Calibration Pass to Support Modules with Multiple Functions [#5768](https://github.com/apache/incubator-tvm/pull/5768) - Fix what looks like bizzare copy-paste issue [#6010](https://github.com/apache/incubator-tvm/pull/6010) - Fix bug in transpose_shape_func [#6180](https://github.com/apache/incubator-tvm/pull/6180) - Fix bugs in CUDA codegen ([#5209](https://github.com/apache/incubator-tvm/pull/5209)) - Don’t remove() TemporaryFile in **del**. ([#5414](https://github.com/apache/incubator-tvm/pull/5414)) - Fix test_ir_type. ([#5390](https://github.com/apache/incubator-tvm/pull/5390)) - Fix multiple identical inputs bug ([#5389](https://github.com/apache/incubator-tvm/pull/5389)) - Add cuda target check to dense tensorcore schedule. ([#5376](https://github.com/apache/incubator-tvm/pull/5376)) - T2 test fixups ([#5391](https://github.com/apache/incubator-tvm/pull/5391)) - Fix miopen padding ([#5433](https://github.com/apache/incubator-tvm/pull/5433)) - Misc fixes for ROCm ([#5431](https://github.com/apache/incubator-tvm/pull/5431)) - Fix copy constructor (#5237) - Corrected TVM autotuning on GPU ([#5432](https://github.com/apache/incubator-tvm/pull/5432)) - Fix vector load ([#5226](https://github.com/apache/incubator-tvm/pull/5226)) - Minor bugfix in message_passing.cc ([#5254](https://github.com/apache/incubator-tvm/pull/5254)) - Fix a bug when vectorized load&store was involved for… ([#5428](https://github.com/apache/incubator-tvm/pull/5428)) - Fix to skip node not in graph. ([#5238](https://github.com/apache/incubator-tvm/pull/5238)) - Fix [#5388](https://github.com/apache/incubator-tvm/pull/5388) [RUNTIME][VULKAN] vkBuffer released before memory copy command se… ([#5418](https://github.com/apache/incubator-tvm/pull/5418)) - Fix a minor error in device_annotation ([#5291](https://github.com/apache/incubator-tvm/pull/5291)) - Fix scalar’s ndim is 0 ([#5344](https://github.com/apache/incubator-tvm/pull/5344)) - Fix the runtime raise error [#5586](https://github.com/apache/incubator-tvm/pull/5586) - Fixed bug in attribute parsing for pool layers. #[5582](https://github.com/apache/incubator-tvm/pull/5582) - AutoTVM incorrect measurement [#5511](https://github.com/apache/incubator-tvm/pull/5511) - fix a min/max simplify bug [#5761](https://github.com/apache/incubator-tvm/pull/5761) - Rename tvm_dso_op to libtvm_dso_op [#5714](https://github.com/apache/incubator-tvm/pull/5714) - Fix generating types like float44 and float88 [#5722](https://github.com/apache/incubator-tvm/pull/5722) - Avoid downloading when TOPHUB_LOCATION is NONE [#5720](https://github.com/apache/incubator-tvm/pull/5720) - codegen llvm: move nvptx-specific intrinsic handling into codegen_nvptx [#5726](https://github.com/apache/incubator-tvm/pull/5726) - ROCm warp shuffles and reductions [#5727](https://github.com/apache/incubator-tvm/pull/5727) - fix small bug about dense_grad [#5695](https://github.com/apache/incubator-tvm/pull/5695) - Clarify downstream consistency of TVMArgTypeCode [#5742](https://github.com/apache/incubator-tvm/pull/5742) - Fix gelu in PyTorch frontend, tighten numerical checks [#5763](https://github.com/apache/incubator-tvm/pull/5763) - Make batch matrix multiplication on GPU tunable [#5752](https://github.com/apache/incubator-tvm/pull/5752) - update vulkan build rule [#5777](https://github.com/apache/incubator-tvm/pull/5777) - aten::norm support added [#5776](https://github.com/apache/incubator-tvm/pull/5776) - Edit onnx parser to infer values in post order [#5755](https://github.com/apache/incubator-tvm/pull/5755) - Support symbolic inputs of Fill [#5762](https://github.com/apache/incubator-tvm/pull/5762) - support aten::type_as in the pytorch frontend [#5787](https://github.com/apache/incubator-tvm/pull/5787) - Temporary disable fp16 type_as test for PyTorch Frontend [#5799](https://github.com/apache/incubator-tvm/pull/5799) - Add config switch for nn.dense layer type. [#5801](https://github.com/apache/incubator-tvm/pull/5801) - Move cpu-only frontend tests to a CPU stage [#5807](https://github.com/apache/incubator-tvm/pull/5807) - Pin hand landmark network to version 0.7.4. [#5813](https://github.com/apache/incubator-tvm/pull/5813) - Limit number of threads in all jobs [#5815](https://github.com/apache/incubator-tvm/pull/5815) - Error msg update [#5818](https://github.com/apache/incubator-tvm/pull/5818) - fix relay.build to not change the module argument in place [#5822](https://github.com/apache/incubator-tvm/pull/5822) - Fix InferType when module contains Prelude [#5797](https://github.com/apache/incubator-tvm/pull/5797) - Add a combine batch_matmul pass [#5791](https://github.com/apache/incubator-tvm/pull/5791) - RepeatVector, Conv3DTranspose op support added [#5833](https://github.com/apache/incubator-tvm/pull/5833) - Fix converting serialized quantized models [#5839](https://github.com/apache/incubator-tvm/pull/5839) - ffi (Object): make class dict visible in instances [#5843](https://github.com/apache/incubator-tvm/pull/5843) - Additional canonicalization added for AddNode [#5846](https://github.com/apache/incubator-tvm/pull/5846) - Suppress the warning messages when compile engine selects impls [#5821](https://github.com/apache/incubator-tvm/pull/5821) - fix #5849 [#5851](https://github.com/apache/incubator-tvm/pull/5851) - Introduce POD-C Compliant tvm::Map [#5740](https://github.com/apache/incubator-tvm/pull/5740) - Add bfloat16 [#5601](https://github.com/apache/incubator-tvm/pull/5601) - Add Python Classes for all Attrs [#5853](https://github.com/apache/incubator-tvm/pull/5853) - Fix map assign issue in CI test [#5854](https://github.com/apache/incubator-tvm/pull/5854) - Introduce Target Id Registry [#5838](https://github.com/apache/incubator-tvm/pull/5838) - Update has_dtype/has_shape to pattern lang doc [#5847](https://github.com/apache/incubator-tvm/pull/5847) - Add nn.batch_flatten as quantizable. [#5805](https://github.com/apache/incubator-tvm/pull/5805) - Fail early before running invalid dynamic graphs [#5856](https://github.com/apache/incubator-tvm/pull/5856) - Improve type handling in PyTorch frontend [#5834](https://github.com/apache/incubator-tvm/pull/5834) - HotFix the python intrin rule [#5895](https://github.com/apache/incubator-tvm/pull/5895) - add a few gradients [#5899](https://github.com/apache/incubator-tvm/pull/5899) - Add Binary Intrinsic ops to TIR Ops in C++ [#5900](https://github.com/apache/incubator-tvm/pull/5900) - Allow implicit conversion in TVM FFI to tvm::Bool [#5907](https://github.com/apache/incubator-tvm/pull/5907) - PyTorch frontend: fix handling of duplicate use of a model weight [#5897](https://github.com/apache/incubator-tvm/pull/5897) - Don’t multiply by constant 1 uselessly in dense [#5911](https://github.com/apache/incubator-tvm/pull/5911) - Support any index matching for TupleGetItem [#5909](https://github.com/apache/incubator-tvm/pull/5909) - Add MicroTVM tutorial using the STM32F746 discovery board [#5655](https://github.com/apache/incubator-tvm/pull/5655) - Fix serialization of inf float value [#5912](https://github.com/apache/incubator-tvm/pull/5912) - Fix CPU Thread Binding for Multiple Sockets [#5918](https://github.com/apache/incubator-tvm/pull/5918) - CUDA device API & VerifyGPUCode pass update [#5898](https://github.com/apache/incubator-tvm/pull/5898) - Update install.rst [#5858](https://github.com/apache/incubator-tvm/pull/5858) - Two small fixes to AMDCPU codegen for LLVM 10+ and ROCm 3.5+ [#5920](https://github.com/apache/incubator-tvm/pull/5920) - Add LegalizeInvalidAttach to legalize the compute_at location after split or fuse [#591](https://github.com/apache/incubator-tvm/pull/5917) - Don’t rewrite expressions used outside of the pattern [#5930](https://github.com/apache/incubator-tvm/pull/5930) - Add TupleGetItem to CSE [#5931](https://github.com/apache/incubator-tvm/pull/5931) - Various update for CoreML codegen [#5934](https://github.com/apache/incubator-tvm/pull/5934) - Update date in the NOTICE [#5943](https://github.com/apache/incubator-tvm/pull/5943) - Raise right error in tensorflow split op [#5951](https://github.com/apache/incubator-tvm/pull/5951) - Add rm xla attributes in tf docs [#5950](https://github.com/apache/incubator-tvm/pull/5950) - Fix OpenCL get_valid_counts errors due to intrinsic atomic_add [#5857](https://github.com/apache/incubator-tvm/pull/5857) - Amendments for gradients [#5941](https://github.com/apache/incubator-tvm/pull/5941) - Fix the meaning of conv{1,2}d_transpose output_padding parameter. [#5758](https://github.com/apache/incubator-tvm/pull/5758) - Make first order gradient graphs more efficient [#5959](https://github.com/apache/incubator-tvm/pull/5959) - Raise an exception when extern function does not return Stmt [#5964](https://github.com/apache/incubator-tvm/pull/5964) - Improve docker/bash.sh to handle git worktrees [#5970](https://github.com/apache/incubator-tvm/pull/5970) - Install DNNL (OneDNN) to CI Environment [#5936](https://github.com/apache/incubator-tvm/pull/5936) - Add Dynamic reshape to a dynamic namespace and add DynamicToStatic Pass [#5826](https://github.com/apache/incubator-tvm/pull/5826) - Add meshgrid op in Relay, TOPI, Pytorch frontend [#5961](https://github.com/apache/incubator-tvm/pull/5961) - Print right number of parentheses for LoadNode [#5965](https://github.com/apache/incubator-tvm/pull/5965) - Migrate data structure of TargetNode [#5960](https://github.com/apache/incubator-tvm/pull/5960) - Remove redundant function CreateBufferVecPtr [#5982](https://github.com/apache/incubator-tvm/pull/5982) - Fix string argument mismatch in GraphRuntimeCodegen [#5933](https://github.com/apache/incubator-tvm/pull/5933) - VectorType::get with two parameters is deprecated in LLVM 11+ [#5984](https://github.com/apache/incubator-tvm/pull/5984) - Fix Compilation Error in CRT [#5713](https://github.com/apache/incubator-tvm/pull/5713) - Fix runtime::String backward compatibility in JSON [#5725](https://github.com/apache/incubator-tvm/pull/5725) - Allow RPCWrappedFunc to rewrite runtime::String as std::string [#5796](https://github.com/apache/incubator-tvm/pull/5796) - Fix reshape [#5739](https://github.com/apache/incubator-tvm/pull/5739) - Fix building with LLVM-10 on macOS [#5859](https://github.com/apache/incubator-tvm/pull/5859) - Add cuda 11 to contrib.nvcc.find_libdevice_path() [#5902](https://github.com/apache/incubator-tvm/pull/5902) - Fix sequential cpp test [#5745](https://github.com/apache/incubator-tvm/pull/5745) - Infer types in MergeComposite [#5766](https://github.com/apache/incubator-tvm/pull/5766) - Fix recursive let for well formed check [#5780](https://github.com/apache/incubator-tvm/pull/5780) - Recover global state after test_util.py [#5824](https://github.com/apache/incubator-tvm/pull/5824) - Fix bug in rpc ring buffer shrink [#5516](https://github.com/apache/incubator-tvm/pull/5516) - Fix remote device sync [#5538](https://github.com/apache/incubator-tvm/pull/5538) - Fix bug in rpc ring buffer shrink (#5516) [#5537](https://github.com/apache/incubator-tvm/pull/5537) - RPC Server error fix on Pynq FPGA [#5607](https://github.com/apache/incubator-tvm/pull/5607) - Fix FloorMod Simplifier [#5509](https://github.com/apache/incubator-tvm/pull/5509) - Fix Python debugger segfaults with TVM built with LLVM [#5685](https://github.com/apache/incubator-tvm/pull/5685) - Fix Compilation Error in CRT [#5713](https://github.com/apache/incubator-tvm/pull/5713) - Fix runtime::String backward compatibility in JSON [#5725](https://github.com/apache/incubator-tvm/pull/5725) - Allow RPCWrappedFunc to rewrite runtime::String as std::string [#5796](https://github.com/apache/incubator-tvm/pull/5796) - Fix reshape [#5739](https://github.com/apache/incubator-tvm/pull/5739) - Make "none" DataType explicit [#5491](https://github.com/apache/incubator-tvm/pull/5491) - Change "scalar" and "stack" in IDL from "inrout" to "in" [#5487](https://github.com/apache/incubator-tvm/pull/5487) - Link necessary libraries when building runtime for Android [#5496](https://github.com/apache/incubator-tvm/pull/5496) - Fixes for wasm32 target [#5489](https://github.com/apache/incubator-tvm/pull/5489) - Reset target and wait for runtime initialization on connect. [#5499](https://github.com/apache/incubator-tvm/pull/5499) - Bump tophub rocm version [#5504](https://github.com/apache/incubator-tvm/pull/5504) - Improve commentary for RingBuffer [#5518](https://github.com/apache/incubator-tvm/pull/5518) - Add unit tests for ONNX PRelu and fix importer to pass them. [#5521](https://github.com/apache/incubator-tvm/pull/5521) - LRN only supports 4D tensors, remove it from alter_op_layout [#5520](https://github.com/apache/incubator-tvm/pull/5520) - Fix an issue with ONNX Upsample [#5530](https://github.com/apache/incubator-tvm/pull/5530) - Cache PrimExpr instead of raw pointers in bound analyzer [#5533](https://github.com/apache/incubator-tvm/pull/5533) - fix a few bugs with shape inference and types in the ONNX importer [#5534](https://github.com/apache/incubator-tvm/pull/5534) - Add Onnx Pad v11 [#5539](https://github.com/apache/incubator-tvm/pull/5539) - Changes to cpp_rpc to make it work on Android (+ Hexagon offloading) [#5535](https://github.com/apache/incubator-tvm/pull/5535) - Fix to reduce RAM size during loading model [#5507](https://github.com/apache/incubator-tvm/pull/5507) - Fix MakeLoopNest for warp memory [#5382](https://github.com/apache/incubator-tvm/pull/5382) - Load platform specific lib for tvmdsoop instead of the hard-coded tvm_dso_op.so [#5542](https://github.com/apache/incubator-tvm/pull/5542) - Add tests for running micro on native arm hardware [#5546](https://github.com/apache/incubator-tvm/pull/5546) - Apparently, ONNX Conv with no 'pads' defaults to zero padding [#5548](https://github.com/apache/incubator-tvm/pull/5548) - clang-format the h,cc,m files. [#5557](https://github.com/apache/incubator-tvm/pull/5557) - Fix conv2d alter op for arm cpu [#5532](https://github.com/apache/incubator-tvm/pull/5532) - Fix topi test (/topi/tests/python/test_topi_conv2d_nhwc_winograd.py) for non tensorcore CI. [#5563](https://github.com/apache/incubator-tvm/pull/5563) - Add clang-format and nodejs to ci-lint [#5567](https://github.com/apache/incubator-tvm/pull/5567) - Enable clang-format. [#5572](https://github.com/apache/incubator-tvm/pull/5572) - Allow ubuntu_install_darknet.sh to work in both 18.04 and 16.04 [#5574](https://github.com/apache/incubator-tvm/pull/5574) - Add a quantized conv2 unit test for the tflite front-end [#5558](https://github.com/apache/incubator-tvm/pull/5558) - Fix JSON graph dumping. [#5591](https://github.com/apache/incubator-tvm/pull/5591) - Warp level reduction support for CUDA [#5498](https://github.com/apache/incubator-tvm/pull/5498) - One more fix for concurrency count [#5589](https://github.com/apache/incubator-tvm/pull/5589) - Improve robustness of the docs build [#5583](https://github.com/apache/incubator-tvm/pull/5583) - Phase out WebGL [#5570](https://github.com/apache/incubator-tvm/pull/5570) - Fix vulkansdk in the ci-gpu and upgrade to 1.2.135 [#5566](https://github.com/apache/incubator-tvm/pull/5566) - Update ci-cpu to bionic [#5554](https://github.com/apache/incubator-tvm/pull/5554) - Overestimate binary size for microTVM compiled binaries. [#5590](https://github.com/apache/incubator-tvm/pull/5590) - Fix bug and re-enable RPC execution test [#5436](https://github.com/apache/incubator-tvm/pull/5436) - Add ostream formatters for TargetPtr/TargetVal. [#5592](https://github.com/apache/incubator-tvm/pull/5592) - Fix cross thread reduction [#5551](https://github.com/apache/incubator-tvm/pull/5551) - Fix TVMArray layout on device [#5599](https://github.com/apache/incubator-tvm/pull/5599) - Add debug mode to tempdir() [#5581](https://github.com/apache/incubator-tvm/pull/5581) - Represent alignment information in LLVM IR [#5598](https://github.com/apache/incubator-tvm/pull/5598) - Fix codegen for warp shuffle intrinsics [#5606](https://github.com/apache/incubator-tvm/pull/5606) - Fix Topological Order calculation for DFPattern Language [#5612](https://github.com/apache/incubator-tvm/pull/5612) - Global MaxPool3d and AvgPool3d support [#5098](https://github.com/apache/incubator-tvm/pull/5098) - Fix build error of iOS RPC [#5621](https://github.com/apache/incubator-tvm/pull/5621) - isn't a CallNode sometimes [#5623](https://github.com/apache/incubator-tvm/pull/5623) - Introduce config to PassContext. [#5631](https://github.com/apache/incubator-tvm/pull/5631) - CMAKE fix [#5630](https://github.com/apache/incubator-tvm/pull/5630) - Label Pattern Partitions [#5627](https://github.com/apache/incubator-tvm/pull/5627) - Extend AttrPattern to support CallNode and FunctionNode attributes [#5637](https://github.com/apache/incubator-tvm/pull/5637) - Increase bss section size. [#5660](https://github.com/apache/incubator-tvm/pull/5660) - Add buffer name when creating tensor bindings [#5670](https://github.com/apache/incubator-tvm/pull/5670) - µtvm debug improvements [#5648](https://github.com/apache/incubator-tvm/pull/5648) - enable amd_apu device on vulkan target [#5659](https://github.com/apache/incubator-tvm/pull/5659) - Support TupleWrapper as direct ancestor of control flow ops [#5639](https://github.com/apache/incubator-tvm/pull/5639) - add tvm.micro pydoc to sphinx [#5661](https://github.com/apache/incubator-tvm/pull/5661) - Add a regression testcase for #5674 [#5677](https://github.com/apache/incubator-tvm/pull/5677) - Fix C++ RPC build problem on Linux [#5671](https://github.com/apache/incubator-tvm/pull/5671) - Add a check Callback to the Pattern Paritioner [#5646](https://github.com/apache/incubator-tvm/pull/5646) - Call previous excepthook in tvm_excepthook. [#5675](https://github.com/apache/incubator-tvm/pull/5675) - Fix the shift column for scale_shift_nchw and scale_shift_nhwc in C topi [#5679](https://github.com/apache/incubator-tvm/pull/5679) - Support more dtypes for TVMDSOOp [#5694](https://github.com/apache/incubator-tvm/pull/5694) - In memory_plan, check if value is not None, instead of just checking value as boolean. [#5700](https://github.com/apache/incubator-tvm/pull/5700) - Fix flaky test_topi_pooling.py:test_adaptive_pool [#5736](https://github.com/apache/incubator-tvm/pull/5736) - Fix the values for test_fmod since it fails way too often otherwise [#5723](https://github.com/apache/incubator-tvm/pull/5723) - fix small bug about dense_grad [#5695](https://github.com/apache/incubator-tvm/pull/5695) - Fix sequential cpp test [#5745](https://github.com/apache/incubator-tvm/pull/5745) - Add Scatter to Topi/Relay/ONNX via hybrid script [#5619](https://github.com/apache/incubator-tvm/pull/5619) - Clean WASM environment before build [#5759](https://github.com/apache/incubator-tvm/pull/5759) - Fix gelu in PyTorch frontend, tighten numerical checks [#5763](https://github.com/apache/incubator-tvm/pull/5763) - fix #5686: remove a overstrict assert in MakeAllreduce (#5686) [#5785](https://github.com/apache/incubator-tvm/pull/5785) - Improve Pattern Language Docs [#5676](https://github.com/apache/incubator-tvm/pull/5676) - Add missing expr visitor for any [#6082](https://github.com/apache/incubator-tvm/pull/6082) - Remove the tvm web from version update [#6122](https://github.com/apache/incubator-tvm/pull/6122) - Clear relay cache after every build & Clear warning message cache after autotvm task extraction [#6131](https://github.com/apache/incubator-tvm/pull/6131) - avoid unexpected throw in AttrInitEntry [#6128](https://github.com/apache/incubator-tvm/pull/6128) - Verify that tensor reshape is valid. [#6215](https://github.com/apache/incubator-tvm/pull/6215) - Use LocalRunner by default in the tutorial tune_relay_cuda.py [#6001](https://github.com/apache/incubator-tvm/pull/6001) - Undefined names: import os for line 324 & import re for line 308 [#6003](https://github.com/apache/incubator-tvm/pull/6003) - GitHub Actions upgrade to actions/setup-python@v2 [#6002](https://github.com/apache/incubator-tvm/pull/6002) - Only pass pythonpath for ci images [#6005](https://github.com/apache/incubator-tvm/pull/6005) - Auto-convert shuffle with single index to “extract element” [#6006](https://github.com/apache/incubator-tvm/pull/6006) - Cache object refs in loop partitioner instead of object pointers [#6004](https://github.com/apache/incubator-tvm/pull/6004) - Fix test_arith_solve_linear_inequality.py::test_multi_equal [#6014](https://github.com/apache/incubator-tvm/pull/6014) - MXNet frontend support for AMP cast op [#5976](https://github.com/apache/incubator-tvm/pull/5976) - Demo showing how to run a pruned model. [#5975](https://github.com/apache/incubator-tvm/pull/5975) - Move compiler related registry items to vta/build_module.py [#6012](https://github.com/apache/incubator-tvm/pull/6012) - Pin keras version [#6032](https://github.com/apache/incubator-tvm/pull/6032) - Fix in arm_cpu/conv2d_alter_op for NHWC quantized [#6027](https://github.com/apache/incubator-tvm/pull/6027) - Add creation of Hexagon device in RPC client [#6035](https://github.com/apache/incubator-tvm/pull/6035) - Terminate basic block after “ret” instruction [#6036](https://github.com/apache/incubator-tvm/pull/6036) - µTVM CRT modifications for on-device RPC server [#5921](https://github.com/apache/incubator-tvm/pull/5921) - Create TBAA information based on the unrelying buffer type [#6046](https://github.com/apache/incubator-tvm/pull/6046) - Add support for tflite arg_min and arg_max [#5992](https://github.com/apache/incubator-tvm/pull/5992) - Fix fully_connected converter when batch size is not 1 [#6038](https://github.com/apache/incubator-tvm/pull/6038) - Fix a primitive check error [#5991](https://github.com/apache/incubator-tvm/pull/5991) - Refactor to expose MakeOp functions to C++ [#6047](https://github.com/apache/incubator-tvm/pull/6047) - Fix conv2_gemm after target structure update [#6037](https://github.com/apache/incubator-tvm/pull/6037) - Remove use of designated initializers from hexagon_module.cc [#6055](https://github.com/apache/incubator-tvm/pull/6055) - Build crttest and cpptest separately. [#6057](https://github.com/apache/incubator-tvm/pull/6057) - Fix pytorch frontend prim::Constant issue [#6051](https://github.com/apache/incubator-tvm/pull/6051) - update frontend tutorials to new model based runtime interface [#6063](https://github.com/apache/incubator-tvm/pull/6063) - Remove unnecessary std::cout [#6072](https://github.com/apache/incubator-tvm/pull/6072) - Fix error message in Buffer::vstore, NFC [#6056](https://github.com/apache/incubator-tvm/pull/6056) - Fix FSIM Compile Error. [#6070](https://github.com/apache/incubator-tvm/pull/6070) - Improve vector simplification for float operands [#6043](https://github.com/apache/incubator-tvm/pull/6043) - Fix LocalBuilder on macOS with python 3.8. [#6083](https://github.com/apache/incubator-tvm/pull/6083) - Add missing test for fast erf [#6058](https://github.com/apache/incubator-tvm/pull/6058) - Fixed point multiplication improvements for AArch64 [#5980](https://github.com/apache/incubator-tvm/pull/5980) - Fix code generation bugs for C/CUDA & Improve VerifyGPUCode pass [#6041](https://github.com/apache/incubator-tvm/pull/6041) - Delete declaration of unused op_node [#6102](https://github.com/apache/incubator-tvm/pull/6102) - Load configs even it has no entity [#6100](https://github.com/apache/incubator-tvm/pull/6100) - Update SGX example Cargo.toml [#6067](https://github.com/apache/incubator-tvm/pull/6067) - Add default value for option USE_DNNL_CODEGEN in the cmake [#6099](https://github.com/apache/incubator-tvm/pull/6099) - Update installation doc with minor improvements [#6104](https://github.com/apache/incubator-tvm/pull/6104) - lint: add opencl .cl file type [#6092](https://github.com/apache/incubator-tvm/pull/6092) - Clean up conversions between TVM and Rust functions [#6114](https://github.com/apache/incubator-tvm/pull/6114) - Improve reduction schedule on arm CPUs [#6110](https://github.com/apache/incubator-tvm/pull/6110) - Register Shape Func for Some Operators to Handle Dynamic Shapes [#5955](https://github.com/apache/incubator-tvm/pull/5955) - Fix variable name conflict with OpenCL keyword [#6048](https://github.com/apache/incubator-tvm/pull/6048) - Some rust cleanups [#6116](https://github.com/apache/incubator-tvm/pull/6116) - Option to specify alternate directory to output build to [#6016](https://github.com/apache/incubator-tvm/pull/6016) - Add ‘get_num_inputs’ to GraphRuntime [#6118](https://github.com/apache/incubator-tvm/pull/6118) - TFLite quantized conv test [#6084](https://github.com/apache/incubator-tvm/pull/6084) - Fix autotvm on the conv2d_nchw_winograd.mali operator [#6130](https://github.com/apache/incubator-tvm/pull/6130) - add attr option mfloat-abi for arm32 [#6123](https://github.com/apache/incubator-tvm/pull/6123) - Fix CUDA Library Tuning [#6132](https://github.com/apache/incubator-tvm/pull/6132) - Add missing RPC sources after refactor [#6113](https://github.com/apache/incubator-tvm/pull/6113) - Correct runtime.load_module [#6161](https://github.com/apache/incubator-tvm/pull/6161) - Improve error messages in graph tuner, graph runtime, and module loader. [#6148](https://github.com/apache/incubator-tvm/pull/6148) - Fix some shape mismatches between TF and Relay [#6166](https://github.com/apache/incubator-tvm/pull/6166) - Improve doc string [#6176](https://github.com/apache/incubator-tvm/pull/6176) - Fix incorrect function signature in header [#6172](https://github.com/apache/incubator-tvm/pull/6172) - Fix alignment of note [#6181](https://github.com/apache/incubator-tvm/pull/6181) - Implemented PADV2 Operator for TFLite and added support for constant values in PAD. [#6167](https://github.com/apache/incubator-tvm/pull/6167) - Unary ops support added in frontend [#6196](https://github.com/apache/incubator-tvm/pull/6196) - Change the meaning of conv3d_transpose output_padding to match conv{1,2}d_transpose [#6065](https://github.com/apache/incubator-tvm/pull/6065) - Fix compile warnings. [#6204](https://github.com/apache/incubator-tvm/pull/6204) - Fix -mfloat-abi=soft compilation for ARM with OpenCL target [#6150](https://github.com/apache/incubator-tvm/pull/6150) - Match pytorch 1.6 googlenet pretrained model (#6201) [#6212](https://github.com/apache/incubator-tvm/pull/6212) - Mod operator, bug fix [#6160](https://github.com/apache/incubator-tvm/pull/6160) - RESHAPE with dynamic shape arg in TFLite frontend [#6208](https://github.com/apache/incubator-tvm/pull/6208) - Fix compilation error with cuda 11 [#6213](https://github.com/apache/incubator-tvm/pull/6213) - Fix port_end wrong default value 9199 to 9099 for keeping same with source code [#6220](https://github.com/apache/incubator-tvm/pull/6220) - Std op without specified dimensions support [#6226](https://github.com/apache/incubator-tvm/pull/6226) - fix crt building and running error [#6231](https://github.com/apache/incubator-tvm/pull/6231) - Implemented ONE_HOT Operator for TFLite. [#6223](https://github.com/apache/incubator-tvm/pull/6223)) - Avoid unexpected throw in AttrInitEntry [#6128](https://github.com/apache/incubator-tvm/pull/6128) - Added casting to hybrid script doc and fixed pass infra doc [#6174](https://github.com/apache/incubator-tvm/pull/6174) - Fix compile warnings. [#6204](https://github.com/apache/incubator-tvm/pull/6204) - Fix -mfloat-abi=soft compilation for ARM with OpenCL target [#6150](https://github.com/apache/incubator-tvm/pull/6150) - Mod operator, bug fix [#6160](https://github.com/apache/incubator-tvm/pull/6160) - Fix compilation error with cuda 11 [#6213](https://github.com/apache/incubator-tvm/pull/6213) - Fix port_end wrong default value 9199 to 9099 for keeping same with source code [#6220](https://github.com/apache/incubator-tvm/pull/6220) - Std op without specified dimensions support [#6226](https://github.com/apache/incubator-tvm/pull/6226) - Verify that tensor reshape is valid. [#6215](https://github.com/apache/incubator-tvm/pull/6215) - Fix crt building and running error [#6231](https://github.com/apache/incubator-tvm/pull/6231) - Fix conv2d_transpose output padding [#6236](https://github.com/apache/incubator-tvm/pull/6236) - Fix cuda half math function is undefined: hpow, htanh [#6225](https://github.com/apache/incubator-tvm/pull/6225) - Fix division range estimation error in simplifier [#6244](https://github.com/apache/incubator-tvm/pull/6244) - Fix newer GCC compiler warnings. [#6257](https://github.com/apache/incubator-tvm/pull/6257) - Support _contrib_SyncBatchNorm [#6245](https://github.com/apache/incubator-tvm/pull/6245) - Fix reduction [#6250](https://github.com/apache/incubator-tvm/pull/6250) - Add apt repository for clang-11 and llvm-11 [#6256](https://github.com/apache/incubator-tvm/pull/6256) - Update tutorial to new TARGET as micro_dev is no more [#6262](https://github.com/apache/incubator-tvm/pull/6262) - Fix clang-format [#6264](https://github.com/apache/incubator-tvm/pull/6264) - Trivial fix, up the rodata section for the discovery board to 512 bytes. [#6259](https://github.com/apache/incubator-tvm/pull/6259) - Fix cuda half math function is undefined: hpow, htanh [#6253](https://github.com/apache/incubator-tvm/pull/6253) - Add dilation in x86 NCHWc depthwise conv support [#6267](https://github.com/apache/incubator-tvm/pull/6267) - Decrease test times by introducing testing model [#6235](https://github.com/apache/incubator-tvm/pull/6235) - Add support for parsing the any dimension. [#6277](https://github.com/apache/incubator-tvm/pull/6277) - Improve error messages for memory verifier and gpu memory verifier [#6281](https://github.com/apache/incubator-tvm/pull/6281) - Reflect Compile-Time CMake Options into [libtvm.so](http://libtvm.so/) [#6280](https://github.com/apache/incubator-tvm/pull/6280) - Add cmake options into libinfo [#6286](https://github.com/apache/incubator-tvm/pull/6286) - Update slice to infer attributes when not graph inputs [#6276](https://github.com/apache/incubator-tvm/pull/6276) - Use rpc.LocalSession for simple tests [#6294](https://github.com/apache/incubator-tvm/pull/6294) - Fix random fail [#6312](https://github.com/apache/incubator-tvm/pull/6312) - Fix resize test [#6298](https://github.com/apache/incubator-tvm/pull/6298) - Fix cython FFI compact with np.int64 [#6321](https://github.com/apache/incubator-tvm/pull/6321) - Fix relay vm optimize [#6322](https://github.com/apache/incubator-tvm/pull/6322) - Changed TVMCTVMContext to TVMContext [#6306](https://github.com/apache/incubator-tvm/pull/6306) - Make able to compile with MSVC [#6341](https://github.com/apache/incubator-tvm/pull/6341) - ROCm changed name of library and removed the old one in ROCm 3.7 release. [#6345](https://github.com/apache/incubator-tvm/pull/6345) - Compatible for ROCm before 3.7 [#6359](https://github.com/apache/incubator-tvm/pull/6359) - Use clear name that is separate from ASF brand for cache [#6360](https://github.com/apache/incubator-tvm/pull/6360) - Fix Dockerfile.demo_android [#6361](https://github.com/apache/incubator-tvm/pull/6361) - Fx sparse dense schedule on cuda [#5803](https://github.com/apache/incubator-tvm/pull/5803) - Fix strategy for sparse dense cuda [#5782](https://github.com/apache/incubator-tvm/pull/5782) - Fix x86 conv2d template when tuning with unpacked layout [#5938](https://github.com/apache/incubator-tvm/pull/5938) - Fix the filter width parameter in depthwise_conv2d [#6081](https://github.com/apache/incubator-tvm/pull/6081) - Fix reshape usage in ARM schedule [#5732](https://github.com/apache/incubator-tvm/pull/5732) - Missing header [#4865](https://github.com/apache/incubator-tvm/pull/4865) - Fix conv2d_transpose output padding [#6236](https://github.com/apache/incubator-tvm/pull/6236) # Known Issues # Deprecation - Deprecating NNVM ([#4535](https://github.com/apache/incubator-tvm/pull/4535), [#4562](https://github.com/apache/incubator-tvm/pull/4562), [#4565](https://github.com/apache/incubator-tvm/pull/4565), [#4571](https://github.com/apache/incubator-tvm/pull/4571)) - Deprecate FreeStmt [#5890](https://github.com/apache/incubator-tvm/pull/5890) - Remove legacy compute_expr.h [#5738](https://github.com/apache/incubator-tvm/pull/5738) - Deprecate OpenGL [#5711](https://github.com/apache/incubator-tvm/pull/5711), [#5712](https://github.com/apache/incubator-tvm/pull/5712) -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/apache/incubator-tvm/issues/6486#issuecomment-698661521