Releases: ValeevGroup/tiledarray
Releases · ValeevGroup/tiledarray
v1.1.0
Major Changes
- Platforms
- Improved support for utilization of NVIDIA GPUs via CUDA
- New support for utilization of AMD GPUs via HIP
- Use https://github.com/victor-anisimov/Librett for tensor permutations on GPUs
- API:
einsum
for general binary tensor products- Reference support for
DistArray
s with nested (Tensors-of-Tensors, aka ToT) tiles - Efficient re-ranging (retiling, subarrays, etc.)
- External dependencies
- Use https://github.com/icl-utk-edu/blaspp/[https://github.com/icl-utk-edu/lapackpp](
lapackpp
) (akalinalgpp
) for 1-node C++ linear algebra - Use [https://github.com/wavefunction91/blacspp](
blacspp
)/https://github.com/wavefunction91/scalapackpp for multi-node C++ linear algebra - Use https://github.com/wavefunction91/linalg-cmake-modules for linear algebra discovery
- Can build modularized Boost from source
- Use https://github.com/icl-utk-edu/blaspp/[https://github.com/icl-utk-edu/lapackpp](
Detailed List of Changes
- Andrey's revamp of generic algebra by @evaleev in #226
- Generic solver interface by @evaleev in #222
- Fix eigen MD5 hash 05b1f7511c93980c385ebe11bd3c93fa --> b9e98a200d245… by @powellsr in #228
- support for arrays of tensors (aka nested tensors, or tensors-of-tensors) by @evaleev in #223
- Feature/gitlab ci by @asadchev in #231
- Asadchev/refactor/math by @asadchev in #230
- Asadchev/refactor/lapack by @asadchev in #232
- small Ranges on stack by @evaleev in #233
- default TA_ERROR to throw if building unit tests by @evaleev in #234
- Asadchev/refactor/unit tests by @asadchev in #239
- converted to C++ BLAS/LAPACK interface by @evaleev in #237
- scalapack usable with distarrays of btas (and other) Tiles. by @evaleev in #241
- blaspp_headers to btas by @evaleev in #242
- APPLE does not imply x86_64 on Apple ARM hardware by @evaleev in #244
- kmp5VT [Feature] Round Robin pmap by @kmp5VT in #235
- Fix CMake Boost discovery by @asadchev in #245
- send notification from travis builds to VG slack by @evaleev in #246
- build from-source-dependencies before building TA by @evaleev in #249
- Asadchev/feature/gitlab cuda build by @asadchev in #250
- Asadchev/feature/GitHub actions ci by @asadchev in #253
- Update README.md for TA::TiledRange1 by @bimalgaudel in #257
- Evaleev/update/btas by @evaleev in #260
- Refactor TA_ASSERT by @asadchev in #259
- numeric_type trait for Eigen matrices not needed since Eigen 3.3 by @evaleev in #264
- Fix inconsistencies in rank-local SVD wrapper by @wavefunction91 in #263
- Fixes #265 by @ryanmrichard in #266
- Evaleev/fix/nonintrusive solver adaptors by @evaleev in #270
- Evaleev/fix/make ta range by @evaleev in #273
- update travis clang to 11 by @evaleev in #274
- SparseShape ctor taking scaled norms zeroes out values below threshold by @evaleev in #275
- Kmp5/feature/btas update by @kmp5VT in #272
- [cmake] use wfn91's linear algebra discovery modules by @evaleev in #254
- Range avoids serializes rank only once by @evaleev in #276
- cuda callback is prebuf-aware by @evaleev in #277
- Updates for revised madness serialization by @evaleev in #279
- Evaleev/fix/tensorimpl distributed ctor by @evaleev in #278
- DistArray::lazy_deleter waits for delayed sets by @evaleev in #280
- set CUDA vars before calling enable_language(CUDA) + misc cleanup by @evaleev in #283
- DistArray::set can properly avoid copies (unless setting remote data)… by @evaleev in #284
- Bump ScaLAPACK++ by @wavefunction91 in #287
- removed residual uses of TA_DEFAULT_ERROR by @evaleev in #289
- Fix the path for MADNESS config.h by @keceli in #293
- Bug fix fill_random() method by @bimalgaudel in #295
- Functions to change taskq wait policy by @asadchev in #294
- Tensor fwddecl "moved" to fwd.h by @evaleev in #296
- introduced umpire host allocator by @evaleev in #297
- clang does not like vector_il/matrix_il/etc. ... by @evaleev in #298
- Bumped BTAS tag to use most recent linalgpp by @evaleev in #299
- Build deps, if not found, via FetchContent, NOT ExternalProject by @evaleev in #300
- follow-up to failed ExternalProject elimination by @evaleev in #302
- run unit tests with raised log_level by @evaleev in #303
- Asadchev/feature/einsum by @evaleev in #285
- [cmake] bump MADNESS tag by @evaleev in #304
- umpire: skip std::filesystem if using old gcc by @evaleev in #306
- installation fixes by @evaleev in #307
- [cmake] BTAS fixes by @evaleev in #308
- foreach works with ShareReductionMethod::Union by @evaleev in #310
- bump umpire to v6+ by @evaleev in #311
- Add QR Implementations by @wavefunction91 in #316
- PaRSEC in MADNESS CI by @therault in #301
- Change ExternalProject prefixes to always match the FetchContent location by @awild82 in #315
- eigen {matrix,vector}_to_array UT needs to replicate the data... by @evaleev in #319
- block tsr expression assignments fix by @evaleev in #318
- DistArray conversion to/fro Eigen::Tensor by @evaleev in #320
- TiledArray/tensor.h: must #include <TiledArray/tile_op/tile_interface.h> by @evaleev in #321
- SparseShape maintains its own sparse threshold by @evaleev in #322
- Bump VG cmake kit tag by @evaleev in #323
- moved FindOrFetchScaLAPACK to vg cmake kit + bump vg cmake kit and BT… by @evaleev in #326
- moar small_vector by @evaleev in #328
- wrong operator[] used in Index::indexof() by @evaleev in #329
- fix einsum permutes by @evaleev in #331
- bump VG's cmake kit tag to allow the use of LAPACK_CXX_COMPILE_OPTIONS by @evaleev in #334
- Evaleev/fix/dox by @evaleev in #336
- [ci] dox fixes + introduced VALEEVGROUP_UBUNTU_TAG envvar to control … by @evaleev in #337
- Fix type signature bug(?) by @bimalgaudel in #338
- bump BTAS tag + make imported Boost targets IMPORTED_GLOBAL ... regim… by @evaleev in #339
- Fix parallel compilation on Umpire by @wavefunction91 in #340
- std::result_of -> std::invoke_result by @evaleev in #341
- TiledArray_{UMPIRE,CUTT} targets usable from the build tree at configure time by @evaleev in #342
- allow extended character set in annotations by @evaleev in #343
- Asadchev/feature/eigen einsum by @asadchev in...
1.0.0
1.0.0-alpha.2
This release improved CMake handling of CUDA and MADNESS, and revises Pmaps to avoid storing O(N) data or O(N) initialization costs.
1.0.0-alpha.1
This is the first official release in the 1.0.0 family. Includes first-class support for CUDA.
0.6.0
v0.4.4-alpha Bug fix
This release includes several bug fixes from the development branch.
v0.4.3-alpha
Changes since 0.4.2:
- Fixed many compile issues with GCC 4.7.
- Fixed many compile issues with Intel compiler 15.
v0.4.2-alpha
Change since 0.4.1:
- Numerous bug fixes.