Move resource refreshing into plan #26270

jbardin · 2020-09-16T18:49:38Z

This removes the separate Refresh operation that happens during planning, having each resource instance refresh "just in time" during the plan walk. The refresh walk has numerous problems, mostly stemming from the graph being generated by a mix of the state for managed resources, the configuration for temporary values, and a combination of configuration and state for data sources. This can lead to incongruities during refresh that cannot be resolved, often requiring the use of plan -refresh=false in order to apply some configurations. We also skip the problem of data sources reading "stale" state that will be updated during apply, by having data sources evaluated during plan where we have all the change information available.

Merging of refresh into planning is primarily accomplished by moving the bulk of the refresh node eval tree from NodeRefreshableManagedResourceInstance into the managed eval tree for NodePlannableResourceInstance. This alone works for the majority of test cases, and proves out the concept for merging the refresh and plan walk.

The more difficult part of merging the two operations is that in order to have all the prior state information available to render a plan, the plan walk must simultaneously operate on 2 separate states. To do that we add a second "refresh state" to the global terraform Context, along with a RefreshState() getter method. In order to save instances to this state, there is now a phaseState flag for EvalWriteState to switch which state to target during different phases of each instance's planning. This is admittedly clunky, but we are concurrently refactoring the EvalNode pattern and can smooth that out in tandem with upcoming changes.

Another change around handling the refreshed state, is that because planning is responsible for generating the most current state, the plans.Plan structure now includes a states.State. Rather than needing to combine the refreshed state with the plan out of band to create the plan, the refreshed state is now managed directly by Context.Plan().

The ability to plan data sources introduced in 0.13 allows them to generally work correctly without the added refresh step. We can leave cleaning up data source planning for another PR to keep this one smaller.

The current Refresh operation will be left as-is for now, and will no longer be used in the normal operation workflow. Future plans are currently to remove the internals, and refactor the CLI command in terms of a plan and apply with no configuration changes.

mildwonkey

This looks really neat, it's a clever change! Tip for others: this was much easier to review commit-by-commit (thanks in part to good commit messages). I'll hold off approving till you finish fighting with tests 😝

🤔 we may want to chat with the folks who work on sentinel and see if they'd want/expect access to the refreshed state (that's now stored in the plan) as part of the json output, instead of only the current state (tbh I'm fuzzy; would the state in the plan output today include information that was gathered during a refresh?). That's not relevant to this PR, just a thoughtl.

mildwonkey · 2020-09-16T19:45:28Z

terraform/node_resource_plan_instance.go

@@ -107,8 +107,8 @@ func (n *NodePlannableResourceInstance) evalTreeManagedResource(addr addrs.AbsRe
 	var provider providers.Interface
 	var providerSchema *ProviderSchema
 	var change *plans.ResourceInstanceChange
-	var refreshState *states.ResourceInstanceObject
-	var planState *states.ResourceInstanceObject
+	var instanceRefreshState *states.ResourceInstanceObject


I quite appreciate this clearer variable, as you know I've been knee-deep in this code (and lost and confused) recently.

codecov · 2020-09-16T20:30:19Z

Codecov Report

Merging #26270 into master will increase coverage by 0.05%.
The diff coverage is 86.81%.

Impacted Files	Coverage Δ
backend/local/backend_apply.go	`41.49% <ø> (+2.14%)`	⬆️
helper/resource/testing_config.go	`0.00% <0.00%> (ø)`
plans/plan.go	`48.00% <ø> (ø)`
terraform/evaluate.go	`52.32% <ø> (-0.85%)`	⬇️
backend/local/backend_plan.go	`73.39% <100.00%> (-0.19%)`	⬇️
terraform/context.go	`87.18% <100.00%> (+0.72%)`	⬆️
terraform/eval_context_builtin.go	`77.63% <100.00%> (+0.28%)`	⬆️
terraform/eval_context_mock.go	`59.25% <100.00%> (+0.92%)`	⬆️
terraform/eval_count.go	`69.49% <100.00%> (+1.07%)`	⬆️
terraform/eval_state.go	`63.13% <100.00%> (+1.99%)`	⬆️
... and 9 more

mildwonkey

:danceparty:

jbardin · 2020-09-17T13:24:15Z

🤔 we may want to chat with the folks who work on sentinel and see if they'd want/expect access to the refreshed state (that's now stored in the plan) as part of the json output, instead of only the current state (tbh I'm fuzzy; would the state in the plan output today include information that was gathered during a refresh?). That's not relevant to this PR, just a thought.

Yes, the plan is generated from the refreshed state currently, it just happens in a single step with these changes. My wording above was not very good, and there shouldn't be any externally visible changes to the plan output from the previous release.

This change refreshes the instance state during plan, so a complete Refresh no longer needs to happen before Plan.

Since plan uses the state as a scratch space for evaluation, we need an entirely separate state to store the refreshed resources values during planning. Add a RefreshState method to the EvalContext to retrieve a state used only for refreshing resources.

All resources use EvalWriteState to store their state, so we need a way to switch the states when the resource is refreshing vs when it is planning. (this will likely change once we factor out the EvalNode pattern)

Since the refreshed state is now an artifact of the plan process, it makes sense to add it to the Plan type, rather than adding an additional return value to the Context.Plan method.

This breaks a bunch of tests, and we need to figure out why before moving on.

We need to do this for both states during plan

We need to build a new context go get at the modified state

The prior state recorded in the plans did not match the actual prior state. Make the plans and state match depending on whether there was existing state or not.

After apply, any refreshed state from a plan would be invalid. Normal usage doesn't ever see this, bu internal tests may re-use the context.

Update the old acceptance test framework just enough to make the tests pass.

Leaving plan with -refresh=false tests failing for now.

We still need to determine if `-refresh=false` is even useful with the new planning strategy.

This was never picked up by the tests until now

jbardin · 2020-09-17T13:58:24Z

I made the use of the plan.State more obvious by removing the extra variable indirection from the old version, and cleaned up some stale comments too, then rebased on master.

redbaron · 2020-09-30T10:41:48Z

Out of interest, does it mean, that providers now can be configured with datasource, which itself references another resource? Previously it required targeted apply to create dependencies of datasource first.

ghost · 2020-10-18T01:50:04Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

jbardin requested a review from a team September 16, 2020 18:49

mildwonkey reviewed Sep 16, 2020

View reviewed changes

mildwonkey approved these changes Sep 17, 2020

View reviewed changes

jbardin added 20 commits September 17, 2020 09:54

Refresh instances during plan

be757bd

This change refreshes the instance state during plan, so a complete Refresh no longer needs to happen before Plan.

Get the new RefreshState into the right contexts

ad22e13

add a way to selectively write to RefreshState

7b178b1

All resources use EvalWriteState to store their state, so we need a way to switch the states when the resource is refreshing vs when it is planning. (this will likely change once we factor out the EvalNode pattern)

add state to plans.Plan

8cef62e

Since the refreshed state is now an artifact of the plan process, it makes sense to add it to the Plan type, rather than adding an additional return value to the Context.Plan method.

return the refreshed state in the Plan result

5cf7e23

try refreshing during plan as the default

908217a

This breaks a bunch of tests, and we need to figure out why before moving on.

use plan state in contextOptsForPlanViaFile

7d6472d

fixup count transition for refresh state

ced7aed

We need to do this for both states during plan

update comments around evaluating 0 instances

a3c9d7a

contexts have a copy of the state

d19f440

We need to build a new context go get at the modified state

fixup last tests that need correct state

1fa3503

ReadResource is called during plan but not destroy

ad5899d

fix show -json tests

e54949f

The prior state recorded in the plans did not match the actual prior state. Make the plans and state match depending on whether there was existing state or not.

don't leave old refreshed state in the context

19d67b7

After apply, any refreshed state from a plan would be invalid. Normal usage doesn't ever see this, bu internal tests may re-use the context.

remove Refresh steps from legacy provider tests

88509de

Update the old acceptance test framework just enough to make the tests pass.

fix local backend tests to match new behavior

f52d836

Leaving plan with -refresh=false tests failing for now.

skip plan with no refresh test

8658424

We still need to determine if `-refresh=false` is even useful with the new planning strategy.

wrong instance key in test state

312317a

This was never picked up by the tests until now

data sources now show up in the initial plan

86dd893

jbardin force-pushed the jbardin/refresh-plan branch from 3dc2c5f to 86dd893 Compare September 17, 2020 13:55

jbardin merged commit 4295f1e into master Sep 17, 2020

jbardin deleted the jbardin/refresh-plan branch September 17, 2020 14:19

alisdair mentioned this pull request Sep 18, 2020

The "count" value depends on resource attributes that cannot be determined until apply, but resource attributes are already applied #26078

Closed

ghost locked as resolved and limited conversation to collaborators Oct 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move resource refreshing into plan #26270

Move resource refreshing into plan #26270

jbardin commented Sep 16, 2020 •

edited

Loading

mildwonkey left a comment

mildwonkey Sep 16, 2020

codecov bot commented Sep 16, 2020 •

edited

Loading

mildwonkey left a comment

jbardin commented Sep 17, 2020 •

edited

Loading

jbardin commented Sep 17, 2020

redbaron commented Sep 30, 2020

ghost commented Oct 18, 2020

Move resource refreshing into plan #26270

Move resource refreshing into plan #26270

Conversation

jbardin commented Sep 16, 2020 • edited Loading

mildwonkey left a comment

Choose a reason for hiding this comment

mildwonkey Sep 16, 2020

Choose a reason for hiding this comment

codecov bot commented Sep 16, 2020 • edited Loading

Codecov Report

mildwonkey left a comment

Choose a reason for hiding this comment

jbardin commented Sep 17, 2020 • edited Loading

jbardin commented Sep 17, 2020

redbaron commented Sep 30, 2020

ghost commented Oct 18, 2020

jbardin commented Sep 16, 2020 •

edited

Loading

codecov bot commented Sep 16, 2020 •

edited

Loading

jbardin commented Sep 17, 2020 •

edited

Loading