Releases · facebookincubator/dispenso

03 Jan 00:35

graphicsMan

v1.4.0

f5d0dc5

Version 1.4 Release Latest

Latest

Efficiency improvements, bug and warning fixes

Added some benchmarks and comparison with TaskFlow (thanks @andre-nguyen!)
Fixed compilation when compiling with DISPENSO_DEBUG (thanks @EscapeZero!)
Improved efficiency on Linux for infrequent thread pool usage. Reduces polling overhead by 10x by switching to event-based wakeup instead of spin polling.
Fix C++20 compilation issues (thanks @aavbsouza!)
Fix several build warnings (thanks @SeaOtocinclus!)
Add conda package badge, disable gtest install (thanks JeongSeok Lee!)
Solved rare post-main shutdown issues with NewThreadInvoker
Fixed test issues for 32-bit builds
Fixed broken test logic for test thread IDs
Fixed various build warnings

Contributors

andre-nguyen, EscapeZero, and 2 other contributors

Assets 2

25 Apr 23:12

graphicsMan

v1.3.0

d56d92e

Version 1.3 Release

Bug fixes, portability enhancements, and small functionality enhancements

Fixed several generic warnings (thanks michel-slm!)
cpuRelax added for PowerPC and ARM (thanks barracuda156!)
Added missing header (thanks ryandesign!)
Try to detect and add libatomic when required (thanks for discussions barracuda156!)
Enable small buffers from small buffer allocators to go down to 4 bytes (thanks for discussion David Caruso!). This is handy for 32-bit builds where pointers are typically 4 bytes
Ensure that NOMINMAX is propagated for CMake Windows builds (thanks SeaOtocinclus!)
Fix some cases using std::make_shared for types requiring large alignment, which is a bug prior to C++17 (thanks for help finding these SeaOtocinclus!)
Set up CI on GitHub Actions, including builds for Mac and Windows in addition to Linux (thanks SeaOtocinclus!)
Add an envinronment variable DISPENSO_MAX_THREADS_PER_POOL to limit max number of threads available to any thread pool. In the spirit of OMP_NUM_THREADS. (thanks Yong-Chull Jang!)
Slight change of behavior w.r.t. use of maxThreads option in ForEachOptions and ParForOptions to limit concurrency the same way in both blocking and non-blocking for_each and parallel_for (thanks Arnie Yuan!)
Various fixes to enable CMake builds on various 32-bit platforms (thanks for discussions barracuda156!)
Updates to README

Known Issues:

Large subset of dispenso tests are known to fail on 32-bit PPC Mac. If you have access to such a machine and are willing to help debug, it would be appreciated!
NewThreadInvoker can have a program shutdown race on Windows platforms if the threads launched by it are not finished running by end of main()

Assets 2

28 Dec 16:23

graphicsMan

v1.2.0

fd85899

Version 1.2 Release

Several small bug fixes, especially around 32-bit builds and at-exit shutdown corner cases, and TSAN finding benign races and/or causing timeout due to pathological lock-free behaviors in newer versions of TSAN
Improve accuracy of dispenso::getTime
Add C++-20-like Latch functionality
Add mechanism for portable thread priorities
Add a timed task/periodically scheduled task feature. Average and standard deviation of the accuracy of dispenso::TimedTaskScheduler are both much better than folly::FunctionScheduler (from 2x to 10x+ depending on settings and platform)
Enhancements to parallel_for
- Add an option that allows to automatically reduce the number of threads working on a range if the work is too cheap to justify parallelization. This can result in 3000x+ speedups for very lightweight loops
- Resuse per-thread state containers across parallel for calls (these must block in-between, or be thread-safe types)
- parallel_for functors may now be called with an input range directly instead of requiring a ChunkedRange. This is as simple as providing a functor/lambda that takes the additional argument, just as was previously done with ChunkedRange. ChunkedRanges still work, and this is fully backward compatible
ThreadPools have a new option for full spin polling. This is generally best avoided, and I'd argue never to use this for the default Global thread pool, but can be useful for a subset of threads in systems that require real-time responsivity (especially, can be combined with the thread priority feature also found in this release)
Task graph execution (thanks @RomanFedotovFB!). Building and running dispenso task graphs is typically 25%+ faster than the (already excellent) TaskFlow library in our benchmarks. Additionally, we have a partial update feature that can enable much faster (e.g. 50x faster) execution in cases where only a small percentage of task inputs are updated (think of per-frame partial scene updates in a game)
Builds for EPEL and Fedora (thanks @michel-slm )

Contributors

michel-slm and RomanFedotovFB

Assets 2

01 Nov 15:43

graphicsMan

v1.1.0

9360e1e

Version 1.1 Release

Many small bug fixes. Additionally:

CMake changes to allow install of targets and CMake dispenso target exports (thanks jeffamstutz!)
Addition of typical container type definitions for ConcurrentVector (thanks Michael Jung!)
Large performance improvements for Futures and CompletionEvents on MacOs. Resulted in order-of-magnitude speedups for those use cases on MacOs.
Addition of new benchmark for performance with infrequent use of parallel_for, for_latency_benchmark
Fixes to ensure parallel_for works with thread pools with zero threads (thanks kevinbchen!). Further work has been done to ensure that thread pools with zero threads simply always run code inline.
By default, the global thread pool uses one fewer thread than the machine has hardware threads. This behavior was introduced because dispenso very often runs on the calling thread as well as pool threads, and so one fewer thread in the pool can lead to better performance.
Update googletest version to 1.12.1 (thanks porumbes!)
Add a utility in dispenso to get a thread ID, threadId. These 64-bit IDs are unique per thread, and will not be recyled. These values grow from zero, ensuring the caller can assume they are small if number of threads also is small (e.g. you won't have an ID of 0xdeadbeef if you only run hundreds or thousands of threads in the lifetime of the process).
Add a utility, getTime, to get time quickly. This provides the double-precision time in seconds since the first call to getTime after process start.
Use a new scheduling mechanism in the thread pool when in Windows. This resulted in up to a 13x improvement in latency between putting items in the pool and having those items run. This scheduling is optional, but turned off for Linux and MacOs since scheduling was already fast on those platforms.
Optimizations to enable faster scheduling in thread pools. This resulted in a range of 5% to 45% speedup across multiple benchmarks including future_benchmark and pipeline_benchmark.
Fixed a performance bug in work stealing logic; now dispenso outperforms TBB in the pipeline_benchmark
Added a task set cancellation feature, with a relatively simple mechanism for submitted work to check if it's owning task set has been cancelled. When creating a task set, you can optionally opt into parent cancellation propagation as well. While this propagation is fairly efficient, it did create a noticeable impact on performance in some cases, and thus it was decided to allow this behavior, but not penalize performance for those who don't need the behavior.

Assets 2

10 Jan 19:37

graphicsMan

v1.0.0

22ff47d

Version 1.0 Release, initial public release

It is encouraged to work off of main, which should typically be possible since we have a philosophy of avoiding API breaks, but v1.0 is the first stable release for those who wish to work off of stable releases.

v1.0, corresponding to initial public release. There is a variety of functionality present, see the README.md.

Future releases will bump minor version for significant new functionality. Major versions will only be bumped when API changes are not backwards compatible, which will be very rare, and avoided if at all possible. If severe bugs are fixed that affect already tagged releases, those releases will be bumped with a patch version.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Efficiency improvements, bug and warning fixes

Contributors

Bug fixes, portability enhancements, and small functionality enhancements

Contributors

Releases: facebookincubator/dispenso

Version 1.4 Release

Efficiency improvements, bug and warning fixes

Contributors

Version 1.3 Release

Bug fixes, portability enhancements, and small functionality enhancements

Version 1.2 Release

Contributors

Version 1.1 Release

Version 1.0 Release, initial public release