Hello
I'm Ellie White (she/her) currently a data scientist with the Data Science Branch at US Geological Survey. In my role, I spend about 3/4 of my time working on data science projects and the other 1/4 on improving our data science best practices.
Examples of my USGS work available on GitHub include:
- ๐ฎ from workflows to streamflows: how we build shared understanding to predict streamflow droughts using deep learning. Poster I made for AGU Fall meeting 2023 on how we work. https://github.com/elliewhite-usgs/dissemination/blob/main/agu_2023_drought_final.pdf
- ๐ beyond-a-reasonable drought: Data visualization I made for #chartchallenge23, and the chart is meant to communicate the difference in the length of droughts in the Common Era. The hardest part was setting up the pipeline that processes the data. https://github.com/whiteellie/beyond-a-reasonable-drought
- ๐ fish-in-hot-water: Data visualization I made for #chartchallenge23, and the chart is meant to communicate the effects of climate change on fish spawning. https://github.com/whiteellie/fish-in-hot-water
Examples of my USGS work available on GitLab include++:
- ๐ national-geospatial-attributes-template: [in progress] Lead developer of an R and targets geospatial processing pipeline that takes attributes processed to one spatial scale and transforms them to another scale based on downstream modeling requirements. The forks from this pipeline will be used to develop and QC model inputs for three active projects in the Water Mission area and will be able to accommodate future modeling projects as needs arise โ driving down development time and labor costs. https://code.usgs.gov/wma/dsp/pipeline-templates/rescaling-attributes-template
- โ model-eval: [in progress] Lead developer of a model evaluation pipeline that implements drought pooling and matching for evaluation of observations against predictions. The pipeline evaluates how well a given ML model predicts streamflow droughts across several metrics important to decision makers including drought occurrence, onset, recovery, severity, and duration. https://code.usgs.gov/water/RegionalDroughtEarlyWarning/modeling/model-eval
Examples of my work under previous affiliations:
- ๐ geostatistics-tutorial: Took concepts and computations done manually in a geostatistics course in the Department of Hydrology at UC Davis and translated it to R. Other students have used this to guide their studies and graduate research. https://github.com/whiteellie/geostatistics-tutorial
- ๐ us-presidents: Made a classroom style poster of all U.S. presidents calling out some notable events in history. https://github.com/whiteellie/us-presidents
Writing samples:
- ๐ quality-assurance: [in progress] Quality assurance plans for predictive modeling and data visualizations that list best practices and the Data Science Branch's cultural norms. https://github.com/whiteellie/quality-assurance
- ๐ quirky-keras: blog post about asymmetric loss functions. https://towardsdatascience.com/quirky-keras-custom-and-asymmetric-loss-functions-for-keras-in-r-a8b5271171fe
++ You will only have access to these repos if you are a USGS employee on VPN