Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Echo Scale Test VDS Updates [VS-1095] [VS-1112] #8602

Merged
merged 56 commits into from
Dec 15, 2023
Merged

Conversation

mcovarr
Copy link
Collaborator

@mcovarr mcovarr commented Dec 6, 2023

Lessons learned in VDS creation during Echo Scale Testing.

Successful integration run here.

@mcovarr mcovarr changed the title Rsa vs 1095 Echo Scale Test VDS Update [VS-1095] [VS-1112] Dec 6, 2023
@mcovarr mcovarr changed the title Echo Scale Test VDS Update [VS-1095] [VS-1112] Echo Scale Test VDS Updates [VS-1095] [VS-1112] Dec 6, 2023
mcovarr and others added 9 commits December 6, 2023 12:33
)

This should work around a rare error in `_calculate_new_intervals` that
could generate invalid partitioners in a way that
`calculate_new_intervals` cannot.
If the combiner finishes, but import_gvs fails, and this script is run
again, the combiner output will be deleted and the finished plan will be
used causing no new output to be generated causing the script to fail.
@mcovarr mcovarr marked this pull request as ready for review December 14, 2023 23:09
Copy link
Collaborator

@gbggrant gbggrant left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link

@danking danking left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This all looks right to me. The big fixes are in:

  1. No rmtree before vds combiner.
  2. Print exit code rather than wait status.
  3. Make max age and max idle configurable with a default of infinity.
  4. Support restarting half-way through Avro importing by way of intermediate_resume_point.
  5. Copy Hail log off driver to well-known location.
  6. Use VDS's calculate_new_intervals which correctly constructs intervals for the variants data (key: locus, alleles) based on the density of reference data (key: locus).

@mcovarr mcovarr merged commit f8d5f2d into ah_var_store Dec 15, 2023
20 checks passed
@mcovarr mcovarr deleted the rsa_vs_1095 branch December 15, 2023 17:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants