Releases · ValeevGroup/tiledarray

Major Changes

Platforms
- Improved support for utilization of NVIDIA GPUs via CUDA
- New support for utilization of AMD GPUs via HIP
- Use https://github.com/victor-anisimov/Librett for tensor permutations on GPUs
API
- einsum for general binary tensor products
- Reference support for DistArrays with nested (Tensors-of-Tensors, aka ToT) tiles
- Efficient re-ranging (retiling, subarrays, etc.)
External dependencies
- Use [blaspp](https://github.com/icl-utk-edu/blaspp)/[lapackpp](https://github.com/icl-utk-edu/lapackpp) (aka linalgpp) for 1-node C++ linear algebra
- Use [blacspp](https://github.com/wavefunction91/blacspp)/[scalapackpp](https://github.com/wavefunction91/scalapackpp) for multi-node C++ linear algebra
- Use https://github.com/wavefunction91/linalg-cmake-modules for linear algebra discovery
- Can build modularized Boost from source
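
To illustrate what "general binary tensor products" means in the einsum item above, here is a minimal plain-loop sketch of one such product, written as the index expression `bij,bjk->bik`. It does not use or reproduce TiledArray's einsum interface; the function name `batched_contract`, the dense row-major storage in `std::vector<double>`, and the extent parameters are all assumptions made for this example. The point is only that a single binary product can mix an index kept on both inputs and the result (`b`), an index summed over (`j`), and external indices (`i`, `k`).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Illustrative reference for the index expression "bij,bjk->bik".
// NOT TiledArray's API: names, layout, and signature are hypothetical.
//   b    : appears in both inputs and in the result (batch/Hadamard index)
//   j    : appears in both inputs but not in the result (contracted index)
//   i, k : external indices, each contributed by one input
// All tensors are dense and row-major in a flat std::vector<double>.
std::vector<double> batched_contract(const std::vector<double>& lhs,  // [nb, ni, nj]
                                     const std::vector<double>& rhs,  // [nb, nj, nk]
                                     std::size_t nb, std::size_t ni,
                                     std::size_t nj, std::size_t nk) {
  assert(lhs.size() == nb * ni * nj);
  assert(rhs.size() == nb * nj * nk);

  std::vector<double> result(nb * ni * nk, 0.0);  // [nb, ni, nk]
  for (std::size_t b = 0; b < nb; ++b) {
    for (std::size_t i = 0; i < ni; ++i) {
      for (std::size_t j = 0; j < nj; ++j) {
        const double l = lhs[(b * ni + i) * nj + j];
        for (std::size_t k = 0; k < nk; ++k) {
          // Sum over j; b is carried through unchanged.
          result[(b * ni + i) * nk + k] += l * rhs[(b * nj + j) * nk + k];
        }
      }
    }
  }
  return result;
}
```

With `nb = 1` this reduces to an ordinary matrix product (`ij,jk->ik`), and with `nj = 1` it reduces to a batch of outer products, which is why both appear as special cases of the same binary product.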

Detailed List of Changes

Andrey's revamp of generic algebra by @evaleev in #226
Generic solver interface by @evaleev in #222
Fix eigen MD5 hash 05b1f7511c93980c385ebe11bd3c93fa --> b9e98a200d245… by @powellsr in #228
support for arrays of tensors (aka nested tensors, or tensors-of-tensors) by @evaleev in #223
Feature/gitlab ci by @asadchev in #231
Asadchev/refactor/math by @asadchev in #230
Asadchev/refactor/lapack by @asadchev in #232
small Ranges on stack by @evaleev in #233
default TA_ERROR to throw if building unit tests by @evaleev in #234
Asadchev/refactor/unit tests by @asadchev in #239
converted to C++ BLAS/LAPACK interface by @evaleev in #237
scalapack usable with distarrays of btas (and other) Tiles. by @evaleev in #241
blaspp_headers to btas by @evaleev in #242
APPLE does not imply x86_64 on Apple ARM hardware by @evaleev in #244
kmp5VT [Feature] Round Robin pmap by @kmp5VT in #235
Fix CMake Boost discovery by @asadchev in #245
send notification from travis builds to VG slack by @evaleev in #246
build from-source-dependencies before building TA by @evaleev in #249
Asadchev/feature/gitlab cuda build by @asadchev in #250
Asadchev/feature/GitHub actions ci by @asadchev in #253
Update README.md for TA::TiledRange1 by @bimalgaudel in #257
Evaleev/update/btas by @evaleev in #260
Refactor TA_ASSERT by @asadchev in #259
numeric_type trait for Eigen matrices not needed since Eigen 3.3 by @evaleev in #264
Fix inconsistencies in rank-local SVD wrapper by @wavefunction91 in #263
Fixes #265 by @ryanmrichard in #266
Evaleev/fix/nonintrusive solver adaptors by @evaleev in #270
Evaleev/fix/make ta range by @evaleev in #273
update travis clang to 11 by @evaleev in #274
SparseShape ctor taking scaled norms zeroes out values below threshold by @evaleev in #275
Kmp5/feature/btas update by @kmp5VT in #272
[cmake] use wfn91's linear algebra discovery modules by @evaleev in #254
Range serializes rank only once by @evaleev in #276
cuda callback is prebuf-aware by @evaleev in #277
Updates for revised madness serialization by @evaleev in #279
Evaleev/fix/tensorimpl distributed ctor by @evaleev in #278
DistArray::lazy_deleter waits for delayed sets by @evaleev in #280
set CUDA vars before calling enable_language(CUDA) + misc cleanup by @evaleev in #283
DistArray::set can properly avoid copies (unless setting remote data)… by @evaleev in #284
Bump ScaLAPACK++ by @wavefunction91 in #287
removed residual uses of TA_DEFAULT_ERROR by @evaleev in #289
Fix the path for MADNESS config.h by @keceli in #293
Bug fix fill_random() method by @bimalgaudel in #295
Functions to change taskq wait policy by @asadchev in #294
Tensor fwddecl "moved" to fwd.h by @evaleev in #296
introduced umpire host allocator by @evaleev in #297
clang does not like vector_il/matrix_il/etc. ... by @evaleev in #298
Bumped BTAS tag to use most recent linalgpp by @evaleev in #299
Build deps, if not found, via FetchContent, NOT ExternalProject by @evaleev in #300
follow-up to failed ExternalProject elimination by @evaleev in #302
run unit tests with raised log_level by @evaleev in #303
Asadchev/feature/einsum by @evaleev in #285
[cmake] bump MADNESS tag by @evaleev in #304
umpire: skip std::filesystem if using old gcc by @evaleev in #306
installation fixes by @evaleev in #307
[cmake] BTAS fixes by @evaleev in #308
foreach works with ShareReductionMethod::Union by @evaleev in #310
bump umpire to v6+ by @evaleev in #311
Add QR Implementations by @wavefunction91 in #316
PaRSEC in MADNESS CI by @therault in #301
Change ExternalProject prefixes to always match the FetchContent location by @awild82 in #315
eigen {matrix,vector}_to_array UT needs to replicate the data... by @evaleev in #319
block tsr expression assignments fix by @evaleev in #318
DistArray conversion to/fro Eigen::Tensor by @evaleev in #320
TiledArray/tensor.h: must #include <TiledArray/tile_op/tile_interface.h> by @evaleev in #321
SparseShape maintains its own sparse threshold by @evaleev in #322
Bump VG cmake kit tag by @evaleev in #323
moved FindOrFetchScaLAPACK to vg cmake kit + bump vg cmake kit and BT… by @evaleev in #326
moar small_vector by @evaleev in #328
wrong operator[] used in Index::indexof() by @evaleev in #329
fix einsum permutes by @evaleev in #331
bump VG's cmake kit tag to allow the use of LAPACK_CXX_COMPILE_OPTIONS by @evaleev in #334
Evaleev/fix/dox by @evaleev in #336
[ci] dox fixes + introduced VALEEVGROUP_UBUNTU_TAG envvar to control … by @evaleev in #337
Fix type signature bug(?) by @bimalgaudel in #338
bump BTAS tag + make imported Boost targets IMPORTED_GLOBAL ... regim… by @evaleev in #339
Fix parallel compilation on Umpire by @wavefunction91 in #340
std::result_of -> std::invoke_result by @evaleev in #341
TiledArray_{UMPIRE,CUTT} targets usable from the build tree at configure time by @evaleev in #342
allow extended character set in annotations by @evaleev in #343
Asadchev/feature/eigen einsum by @asadchev in...