Vs 1561 Merging in new monitoring script #9089

Open: wants to merge 4 commits into base: ah_var_store

koncheto-broad

I needed to create a new version of the Cromwell monitoring script that captures heap data, so we can identify where memory is being used in a previously problematic shard of our AoU extract. It has been uploaded to gs://gvs_quickstart_storage/cromwell_monitoring_script_with_heap.sh

If you want to generate these high-level GATK heap histograms for an extract, just change the definition of File monitoring_script in the ExtractTask within GvsExtractCallset.wdl to point to the new script.

Here's a successful AoU extract using the new monitoring script:
https://app.terra.bio/#workspaces/allofus-drc-wgs-dev/GVS%20AoU%20WGS%20Echo%20Callset%20v2/job_history/aa5bb1e4-b860-48f3-9c8c-53dd5e6ec206

Comment on lines +242 to +252
function getGATKheapDump() {
    # Get the PID of the Java process running gatk.jar
    GATK_PID=$(jps -l | grep 'gatk.jar' | awk '{print $1}')
    if [[ -z "$GATK_PID" ]]; then
        echo "GATK process not found in list of running java processes."
        jps -l
    else
        # We have the GATK PID, now get the heap dump
        jmap -histo $GATK_PID | head -n 50
    fi
}
Collaborator

This is gonna be a bad time on non-GATK images... perhaps document this at the top or name the script more explicitly or something?
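
One way to soften this, sketched here as an illustration rather than as the PR's actual code: check that `jps` and `jmap` are on the PATH before trying to use them, and skip quietly when they are not. The function name `getGATKheapHistogramIfAvailable` is hypothetical, and the sketch assumes the GATK JVM shows up in `jps -l` output with `gatk.jar` in its name, as the function under review does.

```shell
# Hypothetical guard around the heap histogram; not part of the PR as submitted.
getGATKheapHistogramIfAvailable() {
    # jps and jmap ship with a JDK, so they may be missing on non-GATK images.
    if ! command -v jps >/dev/null 2>&1 || ! command -v jmap >/dev/null 2>&1; then
        echo "jps/jmap not found; skipping GATK heap histogram."
        return 0
    fi

    local gatk_pid
    # Take only the first match so jmap always receives a single PID.
    gatk_pid=$(jps -l | awk '/gatk\.jar/ {print $1; exit}')

    if [ -z "$gatk_pid" ]; then
        echo "GATK process not found in list of running java processes."
        return 0
    fi

    # Class histogram of the live heap, trimmed to the top entries.
    jmap -histo "$gatk_pid" | head -n 50
}
```

With a guard like this the rest of the monitoring loop would keep running on images that have no JDK tools, at the cost of one extra log line per sample.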

Collaborator

Ooh, or maybe have the script accept optional args that allow this to be unified with the baseline script:

bash ~{monitoring_script} --monitor-gatk --sampling-rate-seconds 1800 > monitoring.log &
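
A rough sketch of how the script might parse those two flags, assuming the flag names from the invocation above. The `parseArgs` function, the variable names, and the default sampling rate of 10 seconds are all hypothetical placeholders; the baseline script's real defaults are not shown in this thread.

```shell
# Hypothetical defaults; the baseline script's actual values may differ.
MONITOR_GATK=false
SAMPLING_RATE_SECONDS=10

# Parse the optional flags suggested in the review comment.
parseArgs() {
    while [ "$#" -gt 0 ]; do
        case "$1" in
            --monitor-gatk)
                # Opt in to the GATK heap histogram on each sample.
                MONITOR_GATK=true
                shift
                ;;
            --sampling-rate-seconds)
                if [ "$#" -lt 2 ]; then
                    echo "--sampling-rate-seconds requires a value" >&2
                    return 1
                fi
                SAMPLING_RATE_SECONDS="$2"
                shift 2
                ;;
            *)
                echo "Unknown option: $1" >&2
                return 1
                ;;
        esac
    done
}
```

The script would then call `parseArgs "$@"` near the top and only invoke `getGATKheapDump` when `MONITOR_GATK` is `true`, so the same file could serve as both the baseline and the GATK-aware monitor.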
