|
792 | 792 | Signed-off-by: Naren Dasan <[email protected]>
|
793 | 793 |
|
794 | 794 |
|
| 795 | +# 1.1.0 (2022-05-09) |
| 796 | + |
| 797 | + |
| 798 | +### Bug Fixes |
| 799 | + |
| 800 | +* add at::adaptive_avg_pool1d in interpolate plugin and fix [#791](https://github.com/NVIDIA/TRTorch/issues/791) ([deb9f74](https://github.com/NVIDIA/TRTorch/commit/deb9f74)) |
| 801 | +* Added ipywidget dependency to notebook ([0b2040a](https://github.com/NVIDIA/TRTorch/commit/0b2040a)) |
| 802 | +* Added test case names ([296e98a](https://github.com/NVIDIA/TRTorch/commit/296e98a)) |
| 803 | +* Added truncate_long_and_double ([417c096](https://github.com/NVIDIA/TRTorch/commit/417c096)) |
| 804 | +* Adding truncate_long_and_double to ptq tests ([3a0640a](https://github.com/NVIDIA/TRTorch/commit/3a0640a)) |
| 805 | +* Avoid resolving non-tensor inputs to torch segment_blocks unneccessarily ([3e090ee](https://github.com/NVIDIA/TRTorch/commit/3e090ee)) |
| 806 | +* Considering rtol and atol in threshold comparison for floating point numbers ([0b0ba8d](https://github.com/NVIDIA/TRTorch/commit/0b0ba8d)) |
| 807 | +* Disabled mobilenet_v2 test for DLFW CI ([40c611f](https://github.com/NVIDIA/TRTorch/commit/40c611f)) |
| 808 | +* fix bug that python api doesn't pass truncate_long_and_double value to internal.partition_info ([828336d](https://github.com/NVIDIA/TRTorch/commit/828336d)) |
| 809 | +* fix bugs in aten::to ([2ecd187](https://github.com/NVIDIA/TRTorch/commit/2ecd187)) |
| 810 | +* Fix BUILD file for tests/accuracy ([8b0170e](https://github.com/NVIDIA/TRTorch/commit/8b0170e)) |
| 811 | +* Fix existing uninstallation of Torch-TRT ([9ddd7a8](https://github.com/NVIDIA/TRTorch/commit/9ddd7a8)) |
| 812 | +* Fix for torch scripted module faiure with DLFW ([88c02d9](https://github.com/NVIDIA/TRTorch/commit/88c02d9)) |
| 813 | +* Fix fuse addmm pass ([58e9ea0](https://github.com/NVIDIA/TRTorch/commit/58e9ea0)) |
| 814 | +* Fix pre_built name change in bazelrc ([3ecee21](https://github.com/NVIDIA/TRTorch/commit/3ecee21)) |
| 815 | +* fix the bug that introduces kLong Tensor in prim::NumToTensor ([2c3e1d9](https://github.com/NVIDIA/TRTorch/commit/2c3e1d9)) |
| 816 | +* Fix when TRT prunes away an output ([9465e1d](https://github.com/NVIDIA/TRTorch/commit/9465e1d)) |
| 817 | +* Fixed bugs and addressed review comments ([588e1d1](https://github.com/NVIDIA/TRTorch/commit/588e1d1)) |
| 818 | +* Fixed failures for host deps sessions ([ec2232f](https://github.com/NVIDIA/TRTorch/commit/ec2232f)) |
| 819 | +* Fixed typo in the path ([43fab56](https://github.com/NVIDIA/TRTorch/commit/43fab56)) |
| 820 | +* Getting unsupported ops will now bypass non-schema ops avoiding redundant failures ([d7d1511](https://github.com/NVIDIA/TRTorch/commit/d7d1511)) |
| 821 | +* Guard test activation for CI testing ([6d1a1fd](https://github.com/NVIDIA/TRTorch/commit/6d1a1fd)) |
| 822 | +* Implement a patch for gelu schema change in older NGC containers ([9ee3a04](https://github.com/NVIDIA/TRTorch/commit/9ee3a04)) |
| 823 | +* Missing log severity ([6a4daef](https://github.com/NVIDIA/TRTorch/commit/6a4daef)) |
| 824 | +* Preempt torch package override via timm in nox session ([8964d1b](https://github.com/NVIDIA/TRTorch/commit/8964d1b)) |
| 825 | +* refactor the resegmentation for TensorRT segments in ResolveNonTensorInput ([3cc2dfb](https://github.com/NVIDIA/TRTorch/commit/3cc2dfb)) |
| 826 | +* remove outdated member variables ([0268da2](https://github.com/NVIDIA/TRTorch/commit/0268da2)) |
| 827 | +* Removed models directory dependencies ([c4413e1](https://github.com/NVIDIA/TRTorch/commit/c4413e1)) |
| 828 | +* Resolve issues in exception elmination pass ([99cea1b](https://github.com/NVIDIA/TRTorch/commit/99cea1b)) |
| 829 | +* Review comments incorporated ([962660d](https://github.com/NVIDIA/TRTorch/commit/962660d)) |
| 830 | +* Review comments incorporated ([e9865c2](https://github.com/NVIDIA/TRTorch/commit/e9865c2)) |
| 831 | +* support dict type for input in shape analysis ([630f9c4](https://github.com/NVIDIA/TRTorch/commit/630f9c4)) |
| 832 | +* truncate_long_and_double incur torchscript inference issues ([c83aa15](https://github.com/NVIDIA/TRTorch/commit/c83aa15)) |
| 833 | +* Typo fix for test case name ([2a516b2](https://github.com/NVIDIA/TRTorch/commit/2a516b2)) |
| 834 | +* Update "reduceAxes" variable in GlobalPoolingConverter function and add corresponding uTests ([f6f5e3e](https://github.com/NVIDIA/TRTorch/commit/f6f5e3e)) |
| 835 | +* **//core/conversion/evaluators:** Change how schemas are handled ([20e5d41](https://github.com/NVIDIA/TRTorch/commit/20e5d41)) |
| 836 | +* Update base container for dockerfile ([1b3245a](https://github.com/NVIDIA/TRTorch/commit/1b3245a)) |
| 837 | +* **//core:** Take user setting in the case we can't determine the ([01c89d1](https://github.com/NVIDIA/TRTorch/commit/01c89d1)), closes [#814](https://github.com/NVIDIA/TRTorch/issues/814) |
| 838 | +* Update test for new Exception syntax ([2357099](https://github.com/NVIDIA/TRTorch/commit/2357099)) |
| 839 | +* **//core/conversion:** Add special case for If and Loop ([eacde8d](https://github.com/NVIDIA/TRTorch/commit/eacde8d)) |
| 840 | +* **//core/runtime:** Support more delimiter variants ([819c911](https://github.com/NVIDIA/TRTorch/commit/819c911)) |
| 841 | +* **//cpp/bin/torchtrtc:** Fix mbs ([aca175f](https://github.com/NVIDIA/TRTorch/commit/aca175f)) |
| 842 | +* **//docsrc:** Fix dependencies for docgen ([806e663](https://github.com/NVIDIA/TRTorch/commit/806e663)) |
| 843 | +* **//notebooks:** Render citrinet ([12dbda1](https://github.com/NVIDIA/TRTorch/commit/12dbda1)) |
| 844 | +* **//py:** Constrain the CUDA version in container builds ([a21a045](https://github.com/NVIDIA/TRTorch/commit/a21a045)) |
| 845 | +* Use user provided dtype when we can't infer it from the graph ([14650d1](https://github.com/NVIDIA/TRTorch/commit/14650d1)) |
| 846 | + |
| 847 | + |
| 848 | +### Code Refactoring |
| 849 | + |
| 850 | +* removing the strict_types and max_batch_size apis ([b30cbd9](https://github.com/NVIDIA/TRTorch/commit/b30cbd9)) |
| 851 | +* Rename enabled precisions arugment to ([10957eb](https://github.com/NVIDIA/TRTorch/commit/10957eb)) |
| 852 | +* Removing the max-batch-size argument ([03bafc5](https://github.com/NVIDIA/TRTorch/commit/03bafc5)) |
| 853 | + |
| 854 | +### Features |
| 855 | + |
| 856 | +* **//core/conversion:** Better tooling for debugging ([c5c5c47](https://github.com/NVIDIA/TRTorch/commit/c5c5c47)) |
| 857 | +* **//core/conversion/evaluators:** aten::pow support ([c4fdfcb](https://github.com/NVIDIA/TRTorch/commit/c4fdfcb)) |
| 858 | +* **//py:** Context managers to quickly switch logging level ([12e470f](https://github.com/NVIDIA/TRTorch/commit/12e470f)) |
| 859 | +* **//tests:** Adding BERT to the test suite ([7996a10](https://github.com/NVIDIA/TRTorch/commit/7996a10)) |
| 860 | +* **aten::__range_length:** Adding range length evaluator ([11c4608](https://github.com/NVIDIA/TRTorch/commit/11c4608)) |
| 861 | +* **aten::add:** adding string concat evaluator ([65dbf90](https://github.com/NVIDIA/TRTorch/commit/65dbf90)) |
| 862 | +* Add converter files for reflection pad 1d and 2d ([406d860](https://github.com/NVIDIA/TRTorch/commit/406d860)) |
| 863 | +* Add converter files for torch::max ([569bcde](https://github.com/NVIDIA/TRTorch/commit/569bcde)) |
| 864 | +* Add converter files for torch::max ([f628aca](https://github.com/NVIDIA/TRTorch/commit/f628aca)) |
| 865 | +* Add converter files for torch::max ([dd7a44e](https://github.com/NVIDIA/TRTorch/commit/dd7a44e)) |
| 866 | +* Add converter for reflection pad 1d and 2d operation ([2484a43](https://github.com/NVIDIA/TRTorch/commit/2484a43)) |
| 867 | +* Added comprehensive perf benchmark script ([a8016ff](https://github.com/NVIDIA/TRTorch/commit/a8016ff)) |
| 868 | +* Added compute capability for Orin ([af3d0ff](https://github.com/NVIDIA/TRTorch/commit/af3d0ff)) |
| 869 | +* Added env var for TOP_DIR ([c26180e](https://github.com/NVIDIA/TRTorch/commit/c26180e)) |
| 870 | +* Added Python accuracy tests using Nox ([6ae8652](https://github.com/NVIDIA/TRTorch/commit/6ae8652)) |
| 871 | +* Enable prim::DictConstruct to fallback without conversion check error ([01d98c7](https://github.com/NVIDIA/TRTorch/commit/01d98c7)) |
| 872 | +* Handle empty schemas for unsupported ops ([bf6c929](https://github.com/NVIDIA/TRTorch/commit/bf6c929)) |
| 873 | +* Implement fast approximation of Gelu as lowering pass to improve performance ([8024ea2](https://github.com/NVIDIA/TRTorch/commit/8024ea2)) |
| 874 | +* Implement lowering for aten::to.dtype schema ([4b3ae3a](https://github.com/NVIDIA/TRTorch/commit/4b3ae3a)) |
| 875 | +* Implement test case for aten::to.dtype lowering ([bde8ee0](https://github.com/NVIDIA/TRTorch/commit/bde8ee0)) |
| 876 | +* Perf benchmark initial draft ([f2d1655](https://github.com/NVIDIA/TRTorch/commit/f2d1655)) |
| 877 | +* replace view with reshape during lowering ([d39b918](https://github.com/NVIDIA/TRTorch/commit/d39b918)) |
| 878 | +* Review comment incorporated ([161ef3d](https://github.com/NVIDIA/TRTorch/commit/161ef3d)) |
| 879 | +* support aten::adaptive_max_pool1d, aten::adaptive_avg_pool3d and aten::adaptive_max_pool3d operators ([e554dbd](https://github.com/NVIDIA/TRTorch/commit/e554dbd)) |
| 880 | +* support aten::div.Tensor_mode ([bb3046a](https://github.com/NVIDIA/TRTorch/commit/bb3046a)) |
| 881 | +* support aten::extend evaluator ([33c523d](https://github.com/NVIDIA/TRTorch/commit/33c523d)) |
| 882 | +* support aten::format evaluator ([3a33d33](https://github.com/NVIDIA/TRTorch/commit/3a33d33)) |
| 883 | +* **aten::Int:** Adding a new pass to remove single use ([46ac757](https://github.com/NVIDIA/TRTorch/commit/46ac757)) |
| 884 | +* **aten::Int:** Lowers out aten::Int ([908340f](https://github.com/NVIDIA/TRTorch/commit/908340f)) |
| 885 | +* **core//conversion:** Implement converter for torch unbind ([268a49b](https://github.com/NVIDIA/TRTorch/commit/268a49b)) |
| 886 | +* Update Pytorch version to 1.11 ([c009a1f](https://github.com/NVIDIA/TRTorch/commit/c009a1f)) |
| 887 | +* Upgrade TensorRT to 8.2.4.2 ([f1f151b](https://github.com/NVIDIA/TRTorch/commit/f1f151b)) |
| 888 | + |
| 889 | + |
| 890 | +### BREAKING CHANGES |
| 891 | + |
| 892 | +* This commit removes the strict types and max_batch_size apis. We are doing this because the functionality of these APIs in TRT is convoluted and likely to be ignored during building. A replacement for strict types with actual guarantees will be added at a later date. |
| 893 | + |
| 894 | +Signed-off-by: Dheeraj Peri < [email protected]> |
| 895 | +* This is a minor change but may cause scripts |
| 896 | +using torchtrtc to fail. We are renaming enabled-precisions to |
| 897 | +enable-precision since it makes more sense as the argument can |
| 898 | +be repeated |
| 899 | + |
| 900 | +Signed-off-by: Naren Dasan <[email protected]> |
| 901 | +Signed-off-by: Naren Dasan <[email protected]> |
| 902 | +* This PR removes `--max-batch-size` from the CLI |
| 903 | +as it has no real functional effect |
| 904 | + |
| 905 | +Signed-off-by: Naren Dasan <[email protected]> |
| 906 | +Signed-off-by: Naren Dasan <[email protected]> |
| 907 | + |
| 908 | + |
795 | 909 |
|
0 commit comments