mesytec-mnode/external/taskflow-3.8.0/docs/release-3-0-0.html
2025-01-04 01:25:05 +01:00

132 lines
18 KiB
HTML

<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<title>Release Notes &raquo; Release 3.0.0 (2021/01/01) | Taskflow QuickStart</title>
<link rel="stylesheet" href="https://fonts.googleapis.com/css?family=Source+Sans+Pro:400,400i,600,600i%7CSource+Code+Pro:400,400i,600" />
<link rel="stylesheet" href="m-dark+documentation.compiled.css" />
<link rel="icon" href="favicon.ico" type="image/vnd.microsoft.icon" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<meta name="theme-color" content="#22272e" />
</head>
<body>
<header><nav id="navigation">
<div class="m-container">
<div class="m-row">
<span id="m-navbar-brand" class="m-col-t-8 m-col-m-none m-left-m">
<a href="https://taskflow.github.io"><img src="taskflow_logo.png" alt="" />Taskflow</a> <span class="m-breadcrumb">|</span> <a href="index.html" class="m-thin">QuickStart</a>
</span>
<div class="m-col-t-4 m-hide-m m-text-right m-nopadr">
<a href="#search" class="m-doc-search-icon" title="Search" onclick="return showSearch()"><svg style="height: 0.9rem;" viewBox="0 0 16 16">
<path id="m-doc-search-icon-path" d="m6 0c-3.31 0-6 2.69-6 6 0 3.31 2.69 6 6 6 1.49 0 2.85-0.541 3.89-1.44-0.0164 0.338 0.147 0.759 0.5 1.15l3.22 3.79c0.552 0.614 1.45 0.665 2 0.115 0.55-0.55 0.499-1.45-0.115-2l-3.79-3.22c-0.392-0.353-0.812-0.515-1.15-0.5 0.895-1.05 1.44-2.41 1.44-3.89 0-3.31-2.69-6-6-6zm0 1.56a4.44 4.44 0 0 1 4.44 4.44 4.44 4.44 0 0 1-4.44 4.44 4.44 4.44 0 0 1-4.44-4.44 4.44 4.44 0 0 1 4.44-4.44z"/>
</svg></a>
<a id="m-navbar-show" href="#navigation" title="Show navigation"></a>
<a id="m-navbar-hide" href="#" title="Hide navigation"></a>
</div>
<div id="m-navbar-collapse" class="m-col-t-12 m-show-m m-col-m-none m-right-m">
<div class="m-row">
<ol class="m-col-t-6 m-col-m-none">
<li><a href="pages.html">Handbook</a></li>
<li><a href="namespaces.html">Namespaces</a></li>
</ol>
<ol class="m-col-t-6 m-col-m-none" start="3">
<li><a href="annotated.html">Classes</a></li>
<li><a href="files.html">Files</a></li>
<li class="m-show-m"><a href="#search" class="m-doc-search-icon" title="Search" onclick="return showSearch()"><svg style="height: 0.9rem;" viewBox="0 0 16 16">
<use href="#m-doc-search-icon-path" />
</svg></a></li>
</ol>
</div>
</div>
</div>
</div>
</nav></header>
<main><article>
<div class="m-container m-container-inflatable">
<div class="m-row">
<div class="m-col-l-10 m-push-l-1">
<h1>
<span class="m-breadcrumb"><a href="Releases.html">Release Notes</a> &raquo;</span>
Release 3.0.0 (2021/01/01)
</h1>
<nav class="m-block m-default">
<h3>Contents</h3>
<ul>
<li><a href="#release-3-0-0_download">Download</a></li>
<li><a href="#release-3-0-0_system_requirements">System Requirements</a></li>
<li><a href="#release-3-0-0_working_items">Working Items</a></li>
<li>
<a href="#release-3-0-0_new_features">New Features</a>
<ul>
<li><a href="#release-3-0-0_taskflow_core">Taskflow Core</a></li>
<li><a href="#release-3-0-0_cudaflow">cudaFlow</a></li>
<li><a href="#release-3-0-0_utilities">Utilities</a></li>
<li><a href="#release-3-0-0_profiler">Taskflow Profiler (TFProf)</a></li>
</ul>
</li>
<li>
<a href="#release-3-0-0_new_algorithms">New Algorithms</a>
<ul>
<li><a href="#release-3-0-0_cpu_algorithms">CPU Algorithms</a></li>
<li><a href="#release-3-0-0_gpu_algorithms">GPU Algorithms</a></li>
</ul>
</li>
<li><a href="#release-3-0-0_bug_fixes">Bug Fixes</a></li>
<li><a href="#release-3-0-0_breaking_changes">Breaking Changes</a></li>
<li><a href="#release-3-0-0_deprecated_items">Deprecated and Removed Items</a></li>
<li><a href="#release-3-0-0_documentation">Documentation</a></li>
<li><a href="#release-3-0-0_miscellaneous_items">Miscellaneous Items</a></li>
</ul>
</nav>
<p>Taskflow 3.0.0 is the 1st release in the 3.x line! This release includes several new changes such as CPU-GPU tasking, algorithm collection, enhanced web-based profiler, documentation, and unit tests.</p><aside class="m-note m-info"><h4>Note</h4><p>Starting from v3, we have migrated the codebase to the <a href="https://en.wikipedia.org/wiki/C%2B%2B17">C++17</a> standard to largely improve the expressivity and efficiency of the codebase.</p></aside><section id="release-3-0-0_download"><h2><a href="#release-3-0-0_download">Download</a></h2><p>Taskflow 3.0.0 can be downloaded from <a href="https://github.com/taskflow/taskflow/releases/tag/v3.0.0">here</a>.</p></section><section id="release-3-0-0_system_requirements"><h2><a href="#release-3-0-0_system_requirements">System Requirements</a></h2><p>To use Taskflow v3.0.0, you need a compiler that supports C++17:</p><ul><li>GNU C++ Compiler at least v7.0 with -std=c++17</li><li>Clang C++ Compiler at least v6.0 with -std=c++17</li><li>Microsoft Visual Studio at least v19.27 with /std:c++17</li><li>AppleClang Xcode Version at least v12.0 with -std=c++17</li><li>Nvidia CUDA Toolkit and Compiler (nvcc) at least v11.1 with -std=c++17</li><li>Intel C++ Compiler at least v19.0.1 with -std=c++17</li></ul><p>Taskflow works on Linux, Windows, and Mac OS X.</p></section><section id="release-3-0-0_working_items"><h2><a href="#release-3-0-0_working_items">Working Items</a></h2><ul><li>enhancing the taskflow profiler (<a href="https://github.com/taskflow/tfprof">TFProf</a>)</li><li>adding methods for updating <a href="classtf_1_1cudaFlow.html" class="m-doc">tf::<wbr />cudaFlow</a> (with unit tests)</li><li>adding support for <a href="https://docs.nvidia.com/cuda/cublas/index.html">cuBLAS</a></li><li>adding support for <a href="https://developer.nvidia.com/cudnn">cuDNN</a></li><li>adding support for SYCL (ComputeCpp and DPC++)</li></ul></section><section id="release-3-0-0_new_features"><h2><a href="#release-3-0-0_new_features">New Features</a></h2><section id="release-3-0-0_taskflow_core"><h3><a href="#release-3-0-0_taskflow_core">Taskflow Core</a></h3><ul><li>replaced all non-standard libraries with C++17 STL (e.g., <a href="https://en.cppreference.com/w/cpp/utility/optional">std::<wbr />optional</a>, <a href="https://en.cppreference.com/w/cpp/utility/variant">std::<wbr />variant</a>)</li><li>added <a href="classtf_1_1WorkerView.html" class="m-doc">tf::<wbr />WorkerView</a> for users to observe the running works of tasks</li><li>added asynchronous tasking (see <a href="AsyncTasking.html" class="m-doc">Asynchronous Tasking</a>)</li><li>modified <a href="classtf_1_1ObserverInterface.html#a8225fcacb03089677a1efc4b16b734cc" class="m-doc">tf::<wbr />ObserverInterface::<wbr />on_entry</a> and <a href="classtf_1_1ObserverInterface.html#aa22f5378154653f08d9a58326bda4754" class="m-doc">tf::<wbr />ObserverInterface::<wbr />on_exit</a> to take <a href="classtf_1_1WorkerView.html" class="m-doc">tf::<wbr />WorkerView</a></li><li>added a custom graph interface to support dynamic polymorphism for tf::cudaGraph</li><li>supported separate compilations between Taskflow and CUDA (see <a href="CompileTaskflowWithCUDA.html" class="m-doc">Compile Taskflow with CUDA</a>)</li><li>added <a href="classtf_1_1Semaphore.html" class="m-doc">tf::<wbr />Semaphore</a> and tf::CriticalSection to limit the maximum concurrency</li><li>added <a href="classtf_1_1Future.html" class="m-doc">tf::<wbr />Future</a> to support cancellation of submitted tasks (see <a href="RequestCancellation.html" class="m-doc">Request Cancellation</a>)</li></ul></section><section id="release-3-0-0_cudaflow"><h3><a href="#release-3-0-0_cudaflow">cudaFlow</a></h3><ul><li>added <a href="classtf_1_1cudaFlowCapturer.html" class="m-doc">tf::<wbr />cudaFlowCapturer</a> for building a cudaFlow through stream capture (see <a href="GPUTaskingcudaFlowCapturer.html" class="m-doc">GPU Tasking (cudaFlowCapturer)</a>)</li><li>added tf::cudaFlowCapturerBase for creating custom capturers</li><li>added <a href="classtf_1_1cudaFlow.html#a89c389fff64a16e5dd8c60875d3b514d" class="m-doc">tf::<wbr />cudaFlow::<wbr />capture</a> for capturing a cudaFlow within a parent cudaFlow</li><li>added tf::Taskflow::emplace_on to place a cudaFlow on a GPU</li><li>added <a href="classtf_1_1cudaFlow.html#a7f97b68fa7c889db49b26aa71a46a7cf" class="m-doc">tf::<wbr />cudaFlow::<wbr />dump</a> and <a href="classtf_1_1cudaFlowCapturer.html#a90d1265bcc27647906bed6e6876c9aa7" class="m-doc">tf::<wbr />cudaFlowCapturer::<wbr />dump</a> to visualize cudaFlow</li><li>added tf::cudaFlow::offload and update methods to run and update a cudaFlow explicitly</li><li>supported standalone cudaFlow</li><li>supported standalone cudaFlowCapturer</li><li>added tf::cublasFlowCapturer to support <a href="https://docs.nvidia.com/cuda/cublas/index.html">cuBLAS</a> (see LinearAlgebracublasFlowCapturer)</li></ul></section><section id="release-3-0-0_utilities"><h3><a href="#release-3-0-0_utilities">Utilities</a></h3><ul><li>added utility functions to grab the cuda device properties (see <a href="cuda__device_8hpp.html" class="m-doc">cuda_<wbr />device.hpp</a>)</li><li>added utility functions to control cuda memory (see <a href="cuda__memory_8hpp.html" class="m-doc">cuda_<wbr />memory.hpp</a>)</li><li>added utility functions for common mathematics operations</li><li>added serializer and deserializer libraries to support tfprof</li><li>added per-thread pool for CUDA streams to improve performance</li></ul></section><section id="release-3-0-0_profiler"><h3><a href="#release-3-0-0_profiler">Taskflow Profiler (TFProf)</a></h3><ul><li>added visualization for asynchronous tasks</li><li>added server-based profiler to support large profiling data (see <a href="Profiler.html" class="m-doc">Profile Taskflow Programs</a>)</li></ul></section></section><section id="release-3-0-0_new_algorithms"><h2><a href="#release-3-0-0_new_algorithms">New Algorithms</a></h2><section id="release-3-0-0_cpu_algorithms"><h3><a href="#release-3-0-0_cpu_algorithms">CPU Algorithms</a></h3><ul><li>added parallel sort (see <a href="ParallelSort.html" class="m-doc">Parallel Sort</a>)</li></ul></section><section id="release-3-0-0_gpu_algorithms"><h3><a href="#release-3-0-0_gpu_algorithms">GPU Algorithms</a></h3><ul><li>added single task (see <a href="SingleTaskCUDA.html" class="m-doc">Single Task</a>)</li><li>added parallel iterations (see <a href="ForEachCUDA.html" class="m-doc">Parallel Iterations</a>)</li><li>added parallel transforms</li><li>added parallel reduction</li></ul></section></section><section id="release-3-0-0_bug_fixes"><h2><a href="#release-3-0-0_bug_fixes">Bug Fixes</a></h2><ul><li>fixed the bug in stream capturing (need to use <code>ThreadLocal</code> mode)</li><li>fixed the bug in reporting wrong worker ids when compiling a shared library due to the use of <code>thread_local</code> (now with C++17 <code>inline</code> variable)</li></ul></section><section id="release-3-0-0_breaking_changes"><h2><a href="#release-3-0-0_breaking_changes">Breaking Changes</a></h2><ul><li>changed the returned values of asynchronous tasks to be <a href="https://en.cppreference.com/w/cpp/utility/optional">std::<wbr />optional</a> in order to support cancellation (see <a href="AsyncTasking.html" class="m-doc">Asynchronous Tasking</a> and <a href="RequestCancellation.html" class="m-doc">Request Cancellation</a>)</li></ul></section><section id="release-3-0-0_deprecated_items"><h2><a href="#release-3-0-0_deprecated_items">Deprecated and Removed Items</a></h2><ul><li>removed tf::cudaFlow::device; users may call tf::Taskflow::emplace_on to associate a cudaflow with a GPU device</li><li>removed tf::cudaFlow::join, use tf::cudaFlow::offload instead</li><li>removed the legacy tf::Framework</li><li>removed external mutable use of <a href="classtf_1_1TaskView.html" class="m-doc">tf::<wbr />TaskView</a></li></ul></section><section id="release-3-0-0_documentation"><h2><a href="#release-3-0-0_documentation">Documentation</a></h2><ul><li>added <a href="CompileTaskflowWithCUDA.html" class="m-doc">Compile Taskflow with CUDA</a></li><li>added <a href="BenchmarkTaskflow.html" class="m-doc">Benchmark Taskflow</a></li><li>added <a href="LimitTheMaximumConcurrency.html" class="m-doc">Limit the Maximum Concurrency</a></li><li>added <a href="AsyncTasking.html" class="m-doc">Asynchronous Tasking</a></li><li>added <a href="GPUTaskingcudaFlowCapturer.html" class="m-doc">GPU Tasking (cudaFlowCapturer)</a></li><li>added <a href="RequestCancellation.html" class="m-doc">Request Cancellation</a></li><li>added <a href="Profiler.html" class="m-doc">Profile Taskflow Programs</a></li><li>added <a href="cudaFlowAlgorithms.html" class="m-doc">cudaFlow Algorithms</a><ul><li><a href="SingleTaskCUDA.html" class="m-doc">Single Task</a> to run a kernel function in just a single thread</li><li><a href="ForEachCUDA.html" class="m-doc">Parallel Iterations</a> to perform parallel iterations over a range of items</li><li><a href="ParallelTransformsCUDA.html" class="m-doc">Parallel Transforms</a> to perform parallel transforms over a range of items</li></ul></li><li>added <a href="Governance.html" class="m-doc">Governance</a><ul><li><a href="rules.html" class="m-doc">Rules</a></li><li><a href="team.html" class="m-doc">Team</a></li><li><a href="codeofconduct.html" class="m-doc">Code of Conduct</a></li></ul></li><li>added <a href="Contributing.html" class="m-doc">Contributing</a><ul><li><a href="guidelines.html" class="m-doc">Guidelines</a></li><li><a href="contributors.html" class="m-doc">Contributors</a></li></ul></li><li>revised <a href="ConditionalTasking.html" class="m-doc">Conditional Tasking</a></li><li>revised documentation pages for files</li></ul></section><section id="release-3-0-0_miscellaneous_items"><h2><a href="#release-3-0-0_miscellaneous_items">Miscellaneous Items</a></h2><p>We have presented Taskflow in the following C++ venues with recorded videos:</p><ul><li><a href="https://www.youtube.com/watch?v=MX15huP5DsM">2020 CppCon Taskflow Talk</a></li><li><a href="https://www.youtube.com/watch?v=u8Mc_WgGwVY">2020 MUC++ Taskflow Talk</a></li></ul><p>We have published Taskflow in the following conferences and journals:</p><ul><li>Tsung-Wei Huang, &quot;<a href="iccad20.pdf">A General-purpose Parallel and Heterogeneous Task Programming System for VLSI CAD</a>,&quot; <em>IEEE/ACM International Conference on Computer-aided Design (ICCAD)</em>, CA, 2020</li><li>Chun-Xun Lin, Tsung-Wei Huang, and Martin Wong, &quot;<a href="icpads20.pdf">An Efficient Work-Stealing Scheduler for Task Dependency Graph</a>,&quot; <em>IEEE International Conference on Parallel and Distributed Systems (ICPADS)</em>, Hong Kong, 2020</li><li>Tsung-Wei Huang, Dian-Lun Lin, Yibo Lin, and Chun-Xun Lin, &quot;Cpp-Taskflow: A General-purpose Parallel Task Programming System at Scale,&quot; <em>IEEE Transactions on Computer-aided Design of Integrated Circuits and Systems (TCAD)</em>, to appear, 2020</li></ul></section>
</div>
</div>
</div>
</article></main>
<div class="m-doc-search" id="search">
<a href="#!" onclick="return hideSearch()"></a>
<div class="m-container">
<div class="m-row">
<div class="m-col-m-8 m-push-m-2">
<div class="m-doc-search-header m-text m-small">
<div><span class="m-label m-default">Tab</span> / <span class="m-label m-default">T</span> to search, <span class="m-label m-default">Esc</span> to close</div>
<div id="search-symbolcount">&hellip;</div>
</div>
<div class="m-doc-search-content">
<form>
<input type="search" name="q" id="search-input" placeholder="Loading &hellip;" disabled="disabled" autofocus="autofocus" autocomplete="off" spellcheck="false" />
</form>
<noscript class="m-text m-danger m-text-center">Unlike everything else in the docs, the search functionality <em>requires</em> JavaScript.</noscript>
<div id="search-help" class="m-text m-dim m-text-center">
<p class="m-noindent">Search for symbols, directories, files, pages or
modules. You can omit any prefix from the symbol or file path; adding a
<code>:</code> or <code>/</code> suffix lists all members of given symbol or
directory.</p>
<p class="m-noindent">Use <span class="m-label m-dim">&darr;</span>
/ <span class="m-label m-dim">&uarr;</span> to navigate through the list,
<span class="m-label m-dim">Enter</span> to go.
<span class="m-label m-dim">Tab</span> autocompletes common prefix, you can
copy a link to the result using <span class="m-label m-dim"></span>
<span class="m-label m-dim">L</span> while <span class="m-label m-dim"></span>
<span class="m-label m-dim">M</span> produces a Markdown link.</p>
</div>
<div id="search-notfound" class="m-text m-warning m-text-center">Sorry, nothing was found.</div>
<ul id="search-results"></ul>
</div>
</div>
</div>
</div>
</div>
<script src="search-v2.js"></script>
<script src="searchdata-v2.js" async="async"></script>
<footer><nav>
<div class="m-container">
<div class="m-row">
<div class="m-col-l-10 m-push-l-1">
<p>Taskflow handbook is part of the <a href="https://taskflow.github.io">Taskflow project</a>, copyright © <a href="https://tsung-wei-huang.github.io/">Dr. Tsung-Wei Huang</a>, 2018&ndash;2024.<br />Generated by <a href="https://doxygen.org/">Doxygen</a> 1.9.1 and <a href="https://mcss.mosra.cz/">m.css</a>.</p>
</div>
</div>
</div>
</nav></footer>
</body>
</html>