Update dependency com.microsoft.onnxruntime:onnxruntime to v1.24.1#35
Open
renovate[bot] wants to merge 1 commit intomasterfrom
Open
Update dependency com.microsoft.onnxruntime:onnxruntime to v1.24.1#35renovate[bot] wants to merge 1 commit intomasterfrom
renovate[bot] wants to merge 1 commit intomasterfrom
Conversation
f8df994 to
78cd577
Compare
78cd577 to
7844bd6
Compare
7844bd6 to
7d887de
Compare
7d887de to
12fd980
Compare
12fd980 to
b49fce6
Compare
b49fce6 to
f4f9479
Compare
f4f9479 to
64c33ef
Compare
64c33ef to
7763049
Compare
7763049 to
b83b0bf
Compare
b83b0bf to
82e667f
Compare
82e667f to
3642954
Compare
3642954 to
dcc0d72
Compare
dcc0d72 to
c63b039
Compare
c63b039 to
4ed4331
Compare
4ed4331 to
c90fcd2
Compare
c90fcd2 to
e22a391
Compare
e22a391 to
5f1b66d
Compare
5f1b66d to
05a7348
Compare
05a7348 to
47d97dc
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
1.13.1→1.24.1Release Notes
microsoft/onnxruntime (com.microsoft.onnxruntime:onnxruntime)
v1.24.1: ONNX Runtime v1.24.1Compare Source
📢 Announcements & Breaking Changes
Platform Support Changes
API Version
✨ New Features
🤖 Execution Provider (EP) Plugin API
A major infrastructure enhancement enabling plugin-based EPs with dynamic loading:
OrtKernelInfoAPIs for kernel-based plugin EPs (#26803)🔧 Core APIs
OrtApi::CreateEnvWithOptions()andOrtEpApi::GetEnvConfigEntries()(#26971)KernelInfo(#26589)📊 Dependencies & Integration
🖥️ Execution Provider Updates
NVIDIA
Qualcomm QNN EP
Intel & AMD
ArmNN EP
Arm is formally deprecating the Arm NN Execution Provider (EP) in ONNX Runtime. The Arm NN EP is still experimental and depends on technology that is no longer actively maintained. Keeping it available now only adds complexity and potential confusion for users.
What to expect:
🌐 Web & JavaScript
🧠 CPU Improvements
--enable_arm_neon_nchwcto enable this feature (#25580 #26838 #26691 #26171). This feature may be turned ON by default in a future release based on community feedback.SiLUactivation perf improvement (#26753)🔌 Language Bindings
C#
Python
add_external_initializers_from_files(#26012)Java
OrtCompiledModelCompatibility(#26028)🐛 Bug Fixes
Critical Fixes
FuseReluClip(#26878)KernelContext_GetAllocator(#26883)EP-Specific Fixes
🙏 Contributors
Thanks to our 170 contributors for this release!
@fs-eire, @tianleiwu, @edgchen1, @qjia7, @yuslepukhin, @hariharans29, @Honry, @qti-yuduo, @adrianlizarraga, @snnn, @eserscor, @vraspar, @xiaofeihan1, @guschmue, @daijh, @quic-muchhsu, @qti-jkilpatrick, @tirupath-qti, @Jiawei-Shao, @qti-hungjuiw, @quic-ashwshan, @titaiwangms, @qti-mattsinc, @chilo-ms, @jchen10, @xhcao, @skottmckay, @quic-calvnguy, @JonathanC-ARM, @Rohanjames1997, @sushraja-msft, @jambayk, @adrastogi, @xenova, @quic-tirupath, @justinchuby, @HectorSVC, @kunal-vaishnavi, @wenqinI, @prathikr, @baijumeswani, @preetha-intel, @jatinwadhwa921, @umangb-09, @qti-ashwshan, @carzh, @bachelor-dou, @ranjitshs, @gedoensmax, @xadupre, @nenad1002, @TedThemistokleous, @keshavv27, @zpye, @jnagi-intel, @jiafatom, @mingyueliuh, @Colm-in-Arm, @borg323, @chunghow-qti, @Craigacp, @BODAPATIMAHESH, @AlekseiNikiforovIBM, @hans00, @thevishalagarwal, @MaanavD, @qti-kromero, @damdoo01-arm, @BoarQing, @naomiOvad, @yuhuchua-qti, @hadiFute, @vishalpandya1990, @rivkastroh, @minfhong-qti, @kuanyul-qti, @xieofxie, @ankitm3k, @RyanMetcalfeInt8, @MayureshV1, @bopeng1234, @vthaniel, @mdvoretc-intel, @ericcraw, @javier-intel, @saurabhkale17, @sfatimar, @Kotomi-Du, @intbf, @n1harika, @TejalKhade28, @gupta-pallavi, @cbourjau, @nieubank, @r-devulap, @wszqkzqk, @sanketkaleoss, @amancini-N, @fanchenkong1, @meakbiyik, @hisham-hchowdhu, @shaoboyan091, @Stonesjtu, @qwu16, @wangw-1991, @bonktree, @naetherm, @nikhilfujitsu, @Panxuefeng-loongson, @selenayang888, @moyo1997, @chwarr, @patryk-kaiser-ARM, @fdwr, @SavaLione, @shiyi9801, @mcost45, @aciddelgado, @prudhvi-qti, @Jonahcb, @lifang-zhang, @zhaoxul-qti, @gaugarg-nv, @cocotdf, @WangFengtu1996, @orlmon01, @weidu-tpvision, @theHamsta, @kevinch-nv, @XXXXRT666, @movedancer, @melkap01-Arm, @KingSora, @urpetkov-amd, @junchao-loongson, @jixiongdeng, @wcy123, @GrigoryEvko, @anujj, @peishenyan, @quic-ankus, @jchen351, @yihonglyu, @satyajandhyala, @co63oc, @mschofie, @quic-ashigarg, @asoldano, @nproshun, @jiangzhaoming, @seungtaek94, @liqunfu, @jaholme, @hanbitmyths, @quic-boyuc, @rM-planet, @qti-vaiskv, @AndreyOrb, @pkubaj, @xhan65, @Jaswanth51, @quic-hungjuiw, @jywu-msft, @mklimenk, @derdeljan-msft, @ianfhunter, @NingW101, @feich-ms, @Akupadhye, @wschin
Full Changelog: v1.23.2...rel-1.24.1
v1.23.2: ONNX Runtime v1.23.2Compare Source
v1.23.1: ONNX Runtime v1.23.1Compare Source
What's Changed
Full Changelog: microsoft/onnxruntime@v1.23.0...v1.23.1
v1.23.0: ONNX Runtime v1.23.0Compare Source
Announcements
This release introduces Execution Provider (EP) Plugin API, which is a new infrastructure for building plugin-based EPs. (#24887 , #25137, #25124, #25147, #25127, #25159, #25191, #2524)
This release introduces the ability to dynamically download and install execution providers. This feature is exclusively available in the WinML build and requires Windows 11 version 25H2 or later. To leverage this new capability, C/C++/C# users should use the builds distributed through the Windows App SDK, and Python users should install the onnxruntime-winml package(will be published soon). We encourage users who can upgrade to the latest Windows 11 to utilize the WinML build to take advantage of this enhancement.
Upcoming Changes
Execution & Core Optimizations
Shutdown logic on Windows is simplified
Now on Windows some global object will be not destroyed if we detect that the process is being shutting down(#24891) . It will not cause memory leak as when a process ends all the memory will be returned to the operating system. This change can reduce the chance of having crashes on process exit.
AutoEP/Device Management
Now ONNX Runtime has the ability to automatically discovery computing devices and select the best EPs to download and register. The EP downloading feature currently only works on Windows 11 version 25H2 or later.
Execution Provider (EP) Updates
ROCM EP was removed from the source tree. Users are recommended to use Migraphx or Vitis AI EPs from AMD.
A new EP, Nvidia TensorRT RTX, was added.
Web
EMDSK is upgraded from 4.0.4 to 4.0.8
WebGPU EP
Added WGSL template support.
QNN EP
SDK Update: Added support for QNN SDK 2.37.
KleidiAI
Enhanced performance for SGEMM, IGEMM, and Dynamic Quantized MatMul operations, especially for Conv2D operators on hardware that supports SME2 (Scalable Matrix Extension v2).
Known Problems
Contributions
Contributors to ONNX Runtime include members across teams at Microsoft, along with our community members:
@1duo, @Akupadhye, @amarin16, @AndreyOrb, @ankan-ban, @ankitm3k, @anujj, @aparmp-quic, @arnej27959, @bachelor-dou, @benjamin-hodgson, @Bonoy0328, @chenweng-quic, @chuteng-quic, @clementperon, @co63oc, @daijh, @damdoo01-arm, @danyue333, @fanchenkong1, @gedoensmax, @genarks, @gnedanur, @Honry, @huaychou, @ianfhunter, @ishwar-raut1, @jing-bao, @joeyearsley, @johnpaultaken, @jordanozang, @JulienMaille, @keshavv27, @kevinch-nv, @khoover, @krahenbuhl, @kuanyul-quic, @mauriciocm9, @mc-nv, @minfhong-quic, @mingyueliuh, @MQ-mengqing, @NingW101, @notken12, @omarhass47, @peishenyan, @pkubaj, @qc-tbhardwa, @qti-jkilpatrick, @qti-yuduo, @quic-ankus, @quic-ashigarg, @quic-ashwshan, @quic-calvnguy, @quic-hungjuiw, @quic-tirupath, @qwu16, @ranjitshs, @saurabhkale17, @schuermans-slx, @sfatimar, @stefantalpalaru, @sunnyshu-intel, @TedThemistokleous, @thevishalagarwal, @toothache, @umangb-09, @vatlark, @VishalX, @wcy123, @xhcao, @xuke537, @zhaoxul-qti
v1.22.0: ONNX Runtime v1.22Compare Source
Announcements
GenAI & Advanced Model Features
Execution & Core Optimizations
Core
Execution Provider (EP) Updates
CPU EP/MLAS
MatMulNBits, enabling matrix multiplication with weights quantized to 8 bits.OpenVINO EP
QNN EP
TensorRT EP
NV TensorRT RTX EP
CUDA EP
MatMulNBits.VitisAI EP
Infrastructure & Build Improvements
Build System & Packages
Dependencies / Version Updates
Web
Mobile
Contributions
Contributors to ONNX Runtime include members across teams at Microsoft, along with our community members:
Yulong Wang, Jian Chen, Changming Sun, Satya Kumar Jandhyala, Hector Li, Prathik Rao, Adrian Lizarraga, Jiajia Qin, Scott McKay, Jie Chen, Tianlei Wu, Edward Chen, Wanming Lin, xhcao, vraspar, Dmitri Smirnov, Jing Fang, Yifan Li, Caroline Zhu, Jianhui Dai, Chi Lo, Guenther Schmuelling, Ryan Hill, Sushanth Rajasankar, Yi-Hong Lyu, Ankit Maheshkar, Artur Wojcik, Baiju Meswani, David Fan, Enrico Galli, Hans, Jambay Kinley, John Paul, Peishen Yan, Yateng Hong, amarin16, chuteng-quic, kunal-vaishnavi, quic-hungjuiw, Alessio Soldano, Andreas Hussing, Ashish Garg, Ashwath Shankarnarayan, Chengdong Liang, Clément Péron, Erick Muñoz, Fanchen Kong, George Wu, Haik Silm, Jagadish Krishnamoorthy, Justin Chu, Karim Vadsariya, Kevin Chen, Mark Schofield, Masaya, Kato, Michael Tyler, Nenad Banfic, Ningxin Hu, Praveen G, Preetha Veeramalai, Ranjit Ranjan, Seungtaek Kim, Ti-Tai Wang, Xiaofei Han, Yueqing Zhang, co63oc, derdeljan-msft, genmingz@AMD, jiangzhaoming, jing-bao, kuanyul-quic, liqun Fu, minfhong-quic, mingyue, quic-tirupath, quic-zhaoxul, saurabh, selenayang888, sfatimar, sheetalarkadam, virajwad, zz002, Ștefan Talpalaru
v1.21.1: ONNX Runtime v1.21.1Compare Source
What's new?
v1.21.0: ONNX Runtime v1.21.0Compare Source
Announcements
GenAI & Advanced Model Features
Enhanced Decoding & Pipeline Support
API & Compatibility Updates
Bug Fixes for Model Output
top_kon CPU.Execution & Core Optimizations
Core Refinements
Execution Provider (EP) Updates
General
TensorRT EP Improvements
NMS,RoiAlign,NonZero) to TensorRT by default.trt_op_types_to_excludeto exclude specific ops from TensorRT assignment.CUDA EP Improvements
QNN EP Improvements
--use_qnn static_lib.DirectML EP Support & Upgrades
OpenVINO EP Improvements
SkipLayerNormalization,MatMulNBits,FusedGemm,FusedConv,EmbedLayerNormalization,BiasGelu,Attention,DynamicQuantizeMatMul,FusedMatMul,QuickGelu,SkipSimplifiedLayerNormalizationVitisAI EP Improvements
Mobile Platform Enhancements
CoreML Updates
Extensions & Tokenizer Improvements
Expanded Tokenizer Support
ChatGLM,Baichuan2,Phi-4, etc.Phi-4pre/post-processing support for text, vision, and audio.tokenizer.json.Image Codec Enhancements
ImageCodecnow links to native APIs if available; otherwise, falls back to built-in libraries.Unified Tokenizer API
Infrastructure & Build Improvements
Runtime Requirements
All the prebuilt Windows packages now require VC++ Runtime version >= 14.40(instead of 14.38). If your VC++ runtime version is lower than that, you may see a crash when ONNX Runtime was initializing. See https://github.com/microsoft/STL/wiki/Changelog#vs-2022-1710 for more details.
Updated minimum iOS and Android SDK requirements to align with React Native 0.76:
All macOS packages now require macOS version >= 13.3.
CMake File Changes
CMake Version: Increased the minimum required CMake version from 3.26 to 3.28. Added support for CMake 4.0.
Python Version: Increased the minimum required Python version from 3.8 to 3.10 for building ONNX Runtime from source.
Improved VCPKG support
Added the following cmake options for WebGPU EP
Added cmake option onnxruntime_BUILD_QNN_EP_STATIC_LIB for building with QNN EP as a static library.
Removed cmake option onnxruntime_USE_PREINSTALLED_EIGEN.
Fixed a build issue with Visual Studio 2022 17.3 (#23911)
Modernized Build Tools
onnxruntime_USE_CUDA_NHWC_OPSby default for CUDA builds.Dependency Cleanup
nsyncfrom dependencies.Others
Updated Node.js installation script to support network proxy usage (#23231)
Web
Contributors
Contributors to ONNX Runtime include members across teams at Microsoft, along with our community members:
Changming Sun, Yulong Wang, Tianlei Wu, Jian Chen, Wanming Lin, Adrian Lizarraga, Hector Li, Jiajia Qin, Yifan Li, Edward Chen, Prathik Rao, Jing Fang, shiyi, Vincent Wang, Yi Zhang, Dmitri Smirnov, Satya Kumar Jandhyala, Caroline Zhu, Chi Lo, Justin Chu, Scott McKay, Enrico Galli, Kyle, Ted Themistokleous, dtang317, wejoncy, Bin Miao, Jambay Kinley, Sushanth Rajasankar, Yueqing Zhang, amancini-N, ivberg, kunal-vaishnavi, liqun Fu, Corentin Maravat, Peishen Yan, Preetha Veeramalai, Ranjit Ranjan, Xavier Dupré, amarin16, jzm-intel, kailums, xhcao, A-Satti, Aleksei Nikiforov, Ankit Maheshkar, Javier Martinez, Jianhui Dai, Jie Chen, Jon Campbell, Karim Vadsariya, Michael Tyler, PARK DongHa, Patrice Vignola, Pranav Sharma, Sam Webster, Sophie Schoenmeyer, Ti-Tai Wang, Xu Xing, Yi-Hong Lyu, genmingz@AMD, junchao-zhao, sheetalarkadam, sushraja-msft, Akshay Sonawane, Alexis Tsogias, Ashrit Shetty, Bilyana Indzheva, Chen Feiyue, Christian Larson, David Fan, David Hotham, Dmitry Deshevoy, Frank Dong, Gavin Kinsey, George Wu, Grégoire, Guenther Schmuelling, Indy Zhu, Jean-Michaël Celerier, Jeff Daily, Joshua Lochner, Kee, Malik Shahzad Muzaffar, Matthieu Darbois, Michael Cho, Michael Sharp, Misha Chornyi, Po-Wei (Vincent), Sevag H, Takeshi Watanabe, Wu, Junze, Xiang Zhang, Xiaoyu, Xinpeng Dou, Xinya Zhang, Yang Gu, Yateng Hong, mindest, mingyue, raoanag, saurabh, shaoboyan091, sstamenk, tianf-fff, wonchung-microsoft, xieofxie, zz002
v1.20.0: ONNX Runtime v1.20.0Compare Source
Release Manager: @apsonawane
Announcements
Build System & Packages
Core
Performance
EPs
CPU
CUDA
TensorRT
QNN
OpenVINO
DirectML
Mobile
Web
GenAI
Full release notes for ONNX Runtime generate() API v0.5.0 can be found here.
Extensions
Full release notes for ONNX Runtime Extensions v0.13 can be found here.
Olive
Full release notes for Olive v0.7.0 can be found here.
Contributors
Big thank you to the release manager @apsonawane, as well as @snnn, @jchen351, @sheetalarkadam, and everyone else who made this release possible!
Tianlei Wu, Yi Zhang, Yulong Wang, Scott McKay, Edward Chen, Adrian Lizarraga, Wanming Lin, Changming Sun, Dmitri Smirnov, Jian Chen, Jiajia Qin, Jing Fang, George Wu, Caroline Zhu, Hector Li, Ted Themistokleous, mindest, Yang Gu, jingyanwangms, liqun Fu, Adam Pocock, Patrice Vignola, Yueqing Zhang, Prathik Rao, Satya Kumar Jandhyala, Sumit Agarwal, Xu Xing, aciddelgado, duanshengliu, Guenther Schmuelling, Kyle, Ranjit Ranjan, Sheil Kumar, Ye Wang, kunal-vaishnavi, mingyueliuh, xhcao, zz002, 0xdr3dd, Adam Reeve, Arne H Juul, Atanas Dimitrov, Chen Feiyue, Chester Liu, Chi Lo, Erick Muñoz, Frank Dong, Jake Mathern, Julius Tischbein, Justin Chu, Xavier Dupré, Yifan Li, amarin16, anujj, chenduan-amd, saurabh, sfatimar, sheetalarkadam, wejoncy, Akshay Sonawane, AlbertGuan9527, Bin Miao, Christian Bourjau, Claude, Clément Péron, Emmanuel, Enrico Galli, Fangjun Kuang, Hann Wang, Indy Zhu, Jagadish Krishnamoorthy, Javier Martinez, Jeff Daily, Justin Beavers, Kevin Chen, Krishna Bindumadhavan, Lennart Hannink, Luis E. P., Mauricio A Rovira Galvez, Michael Tyler, PARK DongHa, Peishen Yan, PeixuanZuo, Po-Wei (Vincent), Pranav Sharma, Preetha Veeramalai, Sophie Schoenmeyer, Vishnudas Thaniel S, Xiang Zhang, Yi-Hong Lyu, Yufeng Li, goldsteinn, mcollinswisc, mguynn-intc, mingmingtasd, raoanag, shiyi, stsokolo, vraspar, wangshuai09
Full changelog: v1.19.2...v1.20.0
v1.19.2: ONNX Runtime v1.19.2Compare Source
Announcements
Build System & Packages
Training
Mobile
Generative AI
Contributors
@prathikr, @mszhanyi, @edgchen1, @tianleiwu, @wangyems, @aciddelgado, @mindest, @snnn, @baijumeswani, @MaanavD
Thanks to everyone who helped ship this release smoothly!
Full Changelog: microsoft/onnxruntime@v1.19.0...v1.19.2
v1.19.0: ONNX Runtime v1.19.0Compare Source
Announcements
26250ae. This shouldn't effect much, but sorry for the inconvenience!Build System & Packages
Core
Performance
Execution Providers
TensorRT
CUDA
CPU
QNN
OpenVINO
DirectML
Mobile
Web
Training
GenAI
Extensions
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.