Intel® oneAPI Base Toolkit Release Notes

ID 768214
Updated 3/15/2024
Version 2024.1.0
Public

author-image

By

System Requirements

Please see Intel oneAPI Base Toolkit System Requirements

Intel® oneAPI Base Toolkit Major Component Versions

Please visit Intel® oneAPI Toolkit and Component Versioning Schema for semantic versioning schema detail. 

The following table contains major versions of components in the latest oneAPI Base Toolkit 2024.1.0

Component Name Version
Intel® oneAPI DPC++ Compiler 2024.1.0
Intel® oneAPI DPC++ Library 2022.5.0
Intel® DPC++ Compatibility Tool 2024.1.0
Intel® oneAPI Math Kernel Library 2024.1.0
Intel® Distribution for GDB* 2024.1.0
Intel® VTune™ Profiler 2024.1.0
Intel® Advisor 2024.1.0
Intel® oneAPI Threading Building Blocks 2021.12.0
Intel® Integrated Performance Primitives 2021.11.0
Intel® Integrated Performance Primitives Cryptography 2021.10.0
Intel® oneAPI Collective Communications Library 2021.12.0
Intel® oneAPI Data Analytics Library 2024.2.0
Intel® oneAPI Deep Neural Networks Library 2024.1.0

 

New in Intel® oneAPI Base Toolkit 2024.1.0

Toolkit Level Updates

  • Intel® oneAPI Base Toolkit 2024.1.0 now supports Fedora 39 for CPU.
  • The Intel® oneAPI DPC++/C++ Compiler is the industry's first compiler conformant with SYCL 2020 allowing developers to write code once and run it on a variety of different processors using standard C++ making developers more productive by reducing development time and effort.
  • Enhanced SYCL Graph, allowing developers to use multi-threaded work generation and thread-safe functions seamlessly integrate with applications. SYCL ensures thread safety for all member functions, enhancing performance and reliability in parallel computing. SYCL Graph is now available on multiple SYCL backends, allowing developers to tune once and deploying anywhere. Additionally, our CUDA graph alternative offers and open, multi-platform solution, minimizing kernel dispatching overhead and ensuring adaptability across diverse software and hardware stacks.
  • Migrate to SYCL, build and deploy easier with Intel® DPC++ Compatibility Tool migrating more CUDA APIs, and now also migrating the project CMake file as a technology preview
  • Develop more future-proof code with the Data Parallel Control library (dpctl), providing 100% conformance to the Python Array API standard and offers new support for Nvidia* devices. New functions include types for reduction, statistics, sorting, set, elementwise, linear algebra, and in-place elementwise operations.   
  • Intel® oneAPI Math Kernel Library 2024.1.0 introduces several optimizations and new functionalities able to reduce the data transfer between Intel GPUs and the host CPU including batched Singular Value Decomposition, batched solver for linear systems and addition of Bessel functions of first and second kinds.
  • Unlock performance enhancements with the latest Intel® oneAPI Deep Neural Network Library (oneDNN) including improvements in graphics processing for Intel data center GPUs and Intel® Arc™ Graphics, perfect for complex models like Large Language Models and Stable Diffusion and increased performance for Intel Xeon Scalable processors.
  • Ensure accuracy & consistency of your computations with reproducibility of BLAS level 3 operations on Intel GPUs now, using Conditional Numeric Reproducibility (CNR).
  • Speed up gradient boosting inference across XGBoost, LightGBM, and CatBoost* without sacrificing accuracy with new fast tree inference1 in Intel® oneAPI Data Analytics Library (oneDAL).
  • Enhance your security with Intel IPP Cryptography’s compliance to FIPS 140-3, a U.S govt standard. Ideal for govt agencies and industries that handle sensitive data.

Intel® oneAPI DPC++ Compiler 2024.1.0

  • The Intel® oneAPI DPC++/C++ Compiler is the industry's first compiler conformant with SYCL 2020 allowing developers to write code once and run it on a variety of different processors using standard C++ making developers more productive by reducing development time and effort.
  • Enhanced SYCL Graph, allowing developers to use multi-threaded work generation and thread-safe functions seamlessly integrate with applications. SYCL ensures thread safety for all member functions, enhancing performance and reliability in parallel computing. SYCL Graph is now available on multiple SYCL backends, allowing developers to tune once and deploying anywhere. Additionally, our CUDA graph alternative offers and open, multi-platform solution, minimizing kernel dispatching overhead and ensuring adaptability across diverse software and hardware stacks. 
  • Intel® oneAPI DPC++/C++ Compiler enhances OpenMP 5.0, 5.1, 5.2, and TR12 standards compliance.

Intel® oneAPI DPC++ Library 2022.5.0

  • Intel® oneAPI DPC++ Library adds specialized sort algorithm to improve PVC app performance
  • Intel® oneAPI DPC++ Library adds transform_if variant with mask input for stencil computation needs
  • Intel® oneAPI DPC++ Library extends C++ STL style programming with histogram algorithms to accelerate scientific, AI & other apps

Intel® DPC++ Compatibility Tool 2024.1.0

  • Automatically captures CUDA workload signature to validate migrated SYCL code using Intel® DPC++ Compatibility Tool “CodePin” technology preview
  • Migrate to SYCL, build and deploy easier with Intel® DPC++ Compatibility Tool migrating more CUDA APIs, and now also migrating the project CMake file as a technology preview

Intel® oneAPI Math Kernel Library 2024.1.0

  • Intel® oneAPI Math Kernel Library 2024.1.0 introduces several optimizations and new functionalities able to reduce the data transfer between Intel GPUs and the host CPU including batched Singular Value Decomposition, batched solver for linear systems and addition of Bessel functions of first and second kinds.
  • This release provides users the ability to reproduce results of BLAS level 3 operations on Intel GPUs from run to run through Conditional Numerical Reproducibility (CNR) that was previously available only for x86 CPUs. Users can configure Intel® oneMKL  to ensure bitwise reproducible results.
  • Intel® oneMKL 2024.1.0 makes it easier to port CUDA applications to SYCL by adding multiple functions equivalent to those available in cuSolver*, cuBLAS* and CUDA Math Library*.
  • Improved performance of QR factorization, a key computation in LAPACK, by taking advantage of both the Intel Xeon Processor Family and the Intel® Data Center GPU Max Series.

Intel® Distribution for GDB* 2024.1.0

  • Intel Distribution for GDB* rebases to GDB* 14 staying current and aligned with the latest enhancements supporting effective application debug.
  • Intel Distribution for GDB* adds online page fault handling for GPUs allowing developers to monitor and troubleshoot memory access issues in real-time, while also providing insight into the behavior of the GPU driver, resulting in improved application performance and reliability. 
  • Intel Distribution for GDB* adds large General Purpose Register File (GRF) debug mode support for GPUs providing developers with more visibility into the GPU's internal state and allowing for more comprehensive debugging and optimization of GPU-accelerated applications. This mode is particularly useful for debugging complex or performance-critical code.

Intel® VTune™ Profiler 2024.1.0

  • Intel® VTune™ Profiler 2024.1.0 adds capability to identify and understand the reasons of implicit Unified Shared Memory data movements between Host and GPU causing performance inefficiencies in SYCL* applications. It also correlates the data transfers with compute tasks execution on GPU. 

  • Intel® VTune™ Profiler 2024.1.0 adds support for .NET 8, Ubuntu 23.10 and FreeBSD 14.0.

Intel® Advisor 2024.1.0

  • Intel® Advisor 2024.1.0 adds stability, quality improvements and better performance of CPU and GPU Roofline Analysis.

Intel® oneAPI Threading Building Blocks 2021.12.0

  • Intel® oneAPI Threading Building Blocks 2021.12.0 provides several improvements and bug fixes

Intel® Integrated Performance Primitives 2021.11.0

  • Added the verification part of post-quantum eXtended Merkle Signature Scheme (XMSS) algorithm as a tech preview feature.
  • Added FIPS-compliance mode for the library (open-source distribution). More information can be found in the Intel(R) IPP Cryptography FIPS Guide.  
  • The version of LZ4 (lossless data compression algorithm) in IPP has been updated to v1.9.4.
  • IPP NuGet packages have been improved to support .NET Standard 2.0 which allows them to be used in .NET projects (useful for .NET developers.)

Intel® oneAPI Collective Communications Library 2021.12.0

  • The 2024.1 update to oneCCL delivers even more performance for distributed Deep Learning and Machine Learning Training and Inference workloads. All key communication patterns have been further optimized to not only speed up message passing but also to do so in a memory efficient manner. This release in particular improves Inference performance.   

Intel® oneAPI Data Analytics Library 2024.2.0

  • Speed up gradient boosting inference across XGBoost, LightGBM, and CatBoost* without sacrificing accuracy with new fast tree inference in Intel® oneAPI Data Analytics Library (oneDAL).​
  • oneDAL improves clustering by adding spare K-Means support to automatically identify a subset of the features to use in clustering observations and improving K-Means performance.

Intel® oneAPI Deep Neural Networks Library 2024.1.0

  • Improved performance on Intel Architecture, Graphics, Aarch64-based processors. 
  • Introduced GPT-Q support to improve Large Language Models (LLMs) performance with compressed weights. Optimized implementation is available for Intel Graphics Products and support matmul with int8 wight compression.
  • Introduced fp8 data type support in primitives and Graph API. Optimized implementation is available for Intel Data Center GPU Max Series (formerly Ponte Vecchio).
  • Introduced support for fp16 and bf16 scale and shift arguments for layer normalization. Optimized implementation is available for Intel Graphics Products.
  • Added opt-in deterministic mode support. Deterministic mode guarantees that results are bitwise identical between runs in a fixed environment.
  • For Intel Graphics Products, introduced PReLU post-op support for inner product and matmul primitives.

Deprecation Notices

  • Intel® Fortran Compiler Classic (ifort) is now deprecated and will be discontinued in late 2024. Intel recommends that customers transition now to using the LLVM-based Intel® Fortran Compiler (ifx) for continued Windows* and Linux* support, new language support, new language features, and optimizations.
    For more information on ifx, see the Intel® Fortran Compiler Developer Guide and Reference and the Porting Guide for ifort Users to ifx.
  • Flow Graph Analyzer feature of Intel Advisor will be discontinued in 2025 or later. Customers who have purchased Intel® Priority Support will continue to receive support.
  • 32-bit support for Intel® Analyzers is deprecated and will be discontinued in the 2024.2 release. 
  • Fedora Linux version 38 support on CPU is deprecated with 2024.1 and will be discontinued in a future release.

 

Toolkit Level Updates

  • Intel oneAPI Base Toolkit 2024.0.1 now includes recent component patch releases. 
  • Patches are built on top of previous patch releases as needed.

Intel® oneAPI DPC++ Compiler 2024.0.2

  • Minor bug fixes

Intel® Integrated Performance Primitives Cryptography 2021.9.1 

  • This patch release fixes an algorithmic issue in the AES-XTS Intel® Advanced Vector Extensions 512 (Intel® AVX-512) code path.

Intel® oneAPI Collective Communications Library 2021.11.2

  • This update provides bug fixes to maintain driver compatibility for Intel® Data Center GPU Max Series.

Intel® oneAPI Data Analytics Library 2024.0.1

  • New features and bug fixes. See the Release Notes for more information.

Installation Instructions

Please visit Installation Guide for Intel oneAPI Toolkits

How to Start Using the Tools

Please reference:

Known Issues, Limitations and Workarounds

  1. Known Issue: The modulefiles included with Intel® VTune™ Profiler and Intel® Advisor incorrectly derive the component root path as "/". Workarounds for this issue include:
    • For Intel VTune Profiler:
      1. Click here to download the fixed tcl file and replace the existing 2024.0 file located in <install-dir>/vtune/2024.0/etc/modulefiles/vtune/
      2. Instead of using "module load" to set up the environment variables, run:

        $ source <install-dir>/vtune/latest/vtune-vars.sh

    • For Intel Advisor:

      1. Click here to download the fixed tcl file and replace the existing 2024.0 file located in <install-dir>/advisor/2024.0/etc/modulefiles/advisor/

      2. Instead of using "module load" to set up the environment variables, run:

        $ source <install-dir>/advisor/latest/advisor-vars.sh

  2. Known Issue: There is a known issue integrating Intel software developer tools (Intel® oneAPI Base Toolkit, Intel® HPC Toolkit, or their component products) into Microsoft Visual Studio* 2022 (17.7 or higher) on offline systems with the Windows Performance Toolkit (Win11SDK_WindowsPerformanceToolkit) installed. This results in an incomplete integration. To work around the issue, either enable an Internet connection during Intel developer tools installation or uninstall the Windows Performance Toolkit before installing Intel developer tools, after Intel developer tools are installed, reinstall the Windows Performance Toolkit.
  3. Known Issue: When using Intel® oneAPI DPC++/C++ Compiler on a Linux machine, users may run into an issue if the highest version of GNU gcc detected doesn't have the equivalent g++ package installed. More details on the error and workarounds can be found here.
  4. Please read the whitepaper on Challenges, tips, and known issues when debugging heterogeneous programs using DPC++ or OpenMP offload 
  5. Limitations
    1. Running any GPU code on a Virtual Machine is not supported at this time.
    2. If you have chosen to download the Get Started Guide to use offline, viewing it in Chrome may cause the text to disappear when the browser window is resized. To fix this problem, resize your browser window again, or use a different browser.
    3. Eclipse* 4.12: the code sample project created by IDE plugin from Makefile will not build. It is a known issue with Eclipse 4.12. Please use Eclipse 4.9, 4.10 or 4.11.


Release Notes for All Tools included in Intel® oneAPI Base Toolkit

Previous oneAPI Releases

Notices and Disclaimers

Intel technologies may require enabled hardware, software or service activation.

No product or component can be absolutely secure.

Your costs and results may vary.

© Intel Corporation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its subsidiaries. Other names and brands may be claimed as the property of others.

No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.

The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade.