- Rollout Notifications
- Dynamic scaling of stable ReplicaSet (dynamicStableScale)
- Automated rollbacks without analysis (progressDeadlineAbort)
- Kustomize Open API Schema
- Rollout Dashboard as a Service
- Controlling Scaledown Behavior During Aborts (abortScaleDownDelaySeconds)
- Analysis: AWS CloudWatch Metric Provider
- AWS TargetGroup IP Verification
- Weighted Experiment Canary Steps
- Istio: Multicluster Support
- Istio: TLS Route Support
- Istio: Multiple VirtualServices
- AnalysisRun GC
- Analysis: Graphite Metric Provider
- feat: support dynamic scaling of stable ReplicaSet as inverse of canary weight (#1430)
- fix: promote nil pointer error when there are no steps (#1510)
- feat: support management of multiple Istio VirtualService objects (#1381)
- feat: verify AWS TargetGroup after updating active/stable services (#1348)
- feat: ALB TrafficRouting with experiment step
- feat: TrafficRouting SMI with Experiment Step in Canary (#1351)
- feat: ability to abort an update when exceeding progressDeadlineSeconds (#1397)
- feat: add support for Istio VirtualService spec.tls[] (#1380)
- feat: configurable and more aggressive cleanup of old AnalysisRuns and Experiments (#1342)
- feat: ability to auto-create Services for each template in an Experiment (#1158)
- feat: introduce abortScaleDownDelaySeconds to control scale down of preview/canary upon abort (#1160)
- feat: argo rollout compatibility with emissary and edge stack v2.0 (#1330)
- feat: Add support for Istio multicluster (#1274)
- feat: add workload-ref/generation to rollout (#1198)
- feat: support notifications on rollout events using notifications-engine (#1175)
- chore: add liveness and readiness probe to the install manifests (#1324)
- fix: Nginx ingressClassName passed to canary ingress (#1448)
- fix: canary scaledown event could violate maxUnavailable (#1429)
- fix: analysis runs to wait for all metrics to complete (#1407)
- fix: Promote full did not work against BlueGreen with previewReplicaCount (#1384)
- fix: retarget blue-green previewService before scaling up preview ReplicaSet (#1368)
- fix: zero-value abortScaleDownDelay was not honored with setCanaryScale (#1375)
- fix: abort scaledown stable RS for canary with traffic routing (#1331)
- feat: add support for Graphite metrics provider (#1406)
- feat: Support CloudWatch as a metric provider (#1338)
- fix: Analysis argument validation (#1412)
- feat: create windows version for CLI (#1517)
- feat: provide shell completion. Closes #619 (#1478)
- fix: create analysisrun cmd using template generated name (#1471)
- fix: nil pointer in create analysisrun cmd (#1399)
- fix: lint subcommand for workload ref rollout (#1328)
- fix: undo referenced object for workloadRef rollout (#1275)
- feat: allow selection of namespace in rollout dashboard (#1291)
- fix(ui): UI crashes on rollout view due to undefined status (#1287)
- feat: kustomize rollout: add openapi to doc and examples (#1371)
- feat: add rollout stat row to grafana dashboard (#1343)
The v1.1 abortScaleDownDelaySeconds
feature now allows users full control over the scaling
behavior of the canary/preview ReplicaSet during an abort. Previously in v1.0, it was not possible
to affect this behavior. As part of this feature, v1.1 also fixes some inconsistencies in behavior
with respect to abort scale down.
The most notable change is that upon an abort, the blue-green preview ReplicaSet in v1.1 will now
scale down 30 seconds after the abort, whereas in v1.0 the preview ReplicaSet was left running
indefinitely (without any option to scale it down). If you prefer the v1.0 behavior, you can set
abortScaleDownDelaySeconds: 0
, which will leave the preview ReplicaSet running indefinitely
on abort:
spec:
strategy:
blueGreen:
abortScaleDownDelaySeconds: 0
Please read the full documentation to understand the differences in canary/preview scaling behavior for aborted Rollouts from v1.0 to v1.1.
- fix: replica count for new deployment (#1449)
- fix: Nginx ingressClassName passed to canary ingress (#1448)
- fix: Analysis argument validation (#1412)
- fix: retarget blue-green previewService before scaling up preview ReplicaSet (#1368)
- fix: analysis runs to wait for all metrics to complete (#1407)
- fix: canary scaledown event could violate maxUnavailable (#1429)
- chore: release workflow docker build context should use local path and not git context (#1388)
- chore: github release action was using incorect docker cache (#1387)
- fix: Promote full did not work against BlueGreen with previewReplicaCount
- fix: nil pointer dereference when reconciling paused blue-green rollout (#1378)
- fix: Abort rollout doesn't remove all canary pods for setCanaryScale (#1352)
- fix: unsolicited rollout after upgrade from v0.10->v1.0 when pod was using service account (#1367)
- fix: default replica before resolving workloadRef (#1304)
- feat: allow VirtualService HTTPRoute to be inferred if there is single route (#1273)
- fix: rollout paused longer than progressDeadlineSeconds would briefly degrade (#1268)
- fix: controller would drop fields when updating DestinationRules (#1253)
- fix: the wrong panel title on the sample dashboard (#1260)
- fix: analysis with multiple metrics (#1261)
- fix: Mitigate the bug where items are re-added constantly to the workqueue. #1193 (#1243)
- fix: workload rollout spec is invalid template is not empty (#1224)
- fix: Fix error check in validation for AnalysisTemplates not found (#1249)
- fix: make function call consistent with otherRSs definition (#1171)
- fix: avoid using root user in plugin container (#1256)
- feat: WebMetric to support string body responses (#1212)
- fix: Modify validation to check Analysis args passed through RO spec (#1215)
- fix: AnalysisRun args could not be resolved from secret (#1213)
- New Argo Rollouts UI available in
kubectl argo rollouts dashboard
- Ability to reference existing Deployment workloads instead of inlining a PodTemplate at spec.template
- Richer prometheus stats and Kubernetes events
- Support for Ambassador as a canary traffic router
- Support canarying using Istio DestinationRule subsets
Installation manifests are now attached as GitHub Release artifacts (as opposed to raw files checked into git) and can be installed with the release download URL. e.g.:
kubectl apply -f https://github.com/argoproj/argo-rollouts/releases/download/v1.0.0/install.yaml
Argo Rollouts v1.0 now vends apiextensions.k8s.io/v1 CustomResourceDefinitions (previously apiextensions.k8s.io/v1beta1).
Kubernetes v1 CRDs no longer supports the preservation of unknown fields in objects, and rejects
attempts to set spec.preserveUnknownFields: true
(the previous default). In order to support a
smooth upgrade from Argo Rollouts v0.10 to v1.0, spec.preserveUnknownFields
is explicitly set to
false
in the manifests, despite false
being the default, and only option in v1 CRDs. However
this causes diffing tools (such as Argo CD) to report the manifest as OutOfSync (since K8s drops the
false field).
More information:
To avoid the Argo CD OutOfSync conditions, you can remove spec.preserveUnknownFields
from the manifests
entirely after upgrading to v1.0.
Alternatively, you can instruct Argo CD to ignore differences using ignoreDifferences in the Application spec:
spec:
ignoreDifferences:
- group: apiextensions.k8s.io
kind: CustomResourceDefinition
jsonPointers:
- /spec/preserveUnknownFields
The promote flag --skip-current-step
which skips the current running canary step has been
deprecated and will be removed in a future release. Its logic to skipping the current step has
been merged with the existing command:
kubectl argo rollouts promote ROLLOUT
The promote ROLLOUT
command can now be used to handle both the case where the rollout needs to be
unpaused, as well as to skip the currently running canary step (e.g. an analysis/experirment/pause
step).
- feat: support reference model for workloads (#676) (#1072)
- feat: Implement Ambassador to be used as traffic router for canary deployments (#1025)
- feat: support canarying using Istio DestinationRule subsets (#985)
- feat: istio virtualservice and rollout in different namespaces
- feat: add ability to verify canary weights before advancing steps (#957)
- feat: support scaleDownDelaySeconds in canary w/ traffic routing (#1056)
- feat: Add ability to restart maxUnavailable pods to BlueGreen strategy (#937)
- feat(controller): Add support for ephemeral metadata on BlueGreen rollouts. Fixes #973 (#974)
- feat: Allow user to handle NaN result in Analysis (#977)
- feat: Wait for canary RS to have ready replicas before shifting labels (#1022)
- feat: Create RolloutPaused condition (#1054)
- feat: Add RolloutCompleted condition (#1074)
- feat: add print version flag to rollouts-controller
- feat: calculate rollout phase & message controller side
- fix: Fixes the regression of dropping resources from argo-rollouts crds. Fixes #1043 (#1044)
- fix: Set Canary Strategy default maxUnavailable to 25% (#981)
- fix: blue-green rollouts could pause prematurely during prePromotionAnalysis (#1007)
- fix: Clear ProgressDeadlineExceeded Condition in paused BlueGreen Rollout (#1002)
- fix: analysis template arguments validate (#1038)
- fix: calculate scale down count. (#1047)
- fix: verify analysis arguments name with those in the rollout (#1071)
- fix: rollout status always in progressing if analysis fails (#1099)
- fix: Add edge case handling to traffic routing (#1190)
- fix: unhandled error patchVirtualService (#1168)
- fix: handling error on f.close (#1167)
- fix: rollouts in middle of restart should be considered Progressing
- feat: metric fields can be parameterized by analysis arguments (#901)
- feat: support a custom base URL for the new relic provider (#1053)
- feat: Allow Datadog API and APP keys to be consumed from env vars (#1073)
- fix: Improve validation for AnalysisTemplates referenced by RO (#1094)
- fix: wavefront queries would return no datapoints. surface evaluate errors
- fix: metrics which errored did not retry at error interval
- fix: Improve and refactor validation for AnalysisTemplates
- feat: Argo Rollouts api-server and UI (#1015)
- feat: Implement rollout status command. Fixes #596 (#1001)
- feat: lint supporting rollout in multiple doc
- fix: get rollout always return not found except default namespace (#961)
- fix: create command not support namespace in yaml file (#962)
- fix: kubectl argo create panic: runtime error: invalid memory address or nil pointer dereference
- chore: publish plugin image automatically. migrate to quay.io (#1102)
- feat: support ARM builds, remove unused components in Dockerfile (#889)
- chore: update k8s dependencies to v1.20. improve logging (#994)
- fix: add informational exposed ports to deployment (#1066)
- chore: Outsource reusable UI components to argo-ux npm package
- fix: use fixed size int32
- fix: switch pod restart to use evict API to honor PDBs
- fix: ephemeral metadata injection was dropping metadata injected by mutating webhooks
- fix: requiredForCompletion did not work for an experiment started by a rollout
- fix: Add missing RoleBinding file to namespace installation
- fix: Correct Istio VirtualService immediately (#874)
- fix: restart was restarting too many pods when available > spec.replicas (#856)
- fix: plugin incorrectly treated v0.9 rollout as v0.10 when it had numeric observedGeneration (#875)
- Ability to set canary vs. stable ephemeral metadata on rollout Pods during an update
- Support new metric providers: New Relic, Datadog
- Ability to control canary scale during an update
- Ability to restart up to maxUnavailable pods at a time for a canary rollout
- Ability to self reference rollout metadata as arguments to analysis
- Ability to fully promote blue-green and canary rollouts (skipping steps, analysis, pauses)
- kubectl-argo-rollouts plugin command to lint rollout
- kubectl-argo-rollouts plugin command to undo a rollout (same as kubectl rollout undo)
Rollouts v0.10 has switched to using Kubernetes CRD Status Subresources (PR #789). This feature allows the rollout controller to record the numeric metadata.generation
into status.observedGeneration
which provides a reliable indicator of a Rollout who's spec has (or has not yet) been observed by the controller (for example if the argo-rollouts controller was down or delayed).
A consequence of this change, is that the v0.10 rollout controller should be used with the v0.10 kubectl-argo-rollouts plugin in order to perform actions such as abort, pause, promote. Similarly, Argo CD v1.8 should be with the v0.10 rollout controller when performing those same actions. Both kubectl-argo-rollouts plugin v0.10 and Argo CD v1.8 are backwards compatible with v0.9 rollouts controller.
- feat: set canary/stable ephemeral metadata to pods during updates (#770)
- feat: add support for valueFrom in analysis arguments. (#797)
- feat: Adding rollout_info_replicas_desired metric. Fixes #748 (#749)
- feat: restart pods up to maxUnavailable at a time
- feat: add full rollout promotion (skip analysis, pause, steps)
- feat: use CRD status subresource (#789)
- feat: Allow setting canary weight without side-effects. Fixes #556 (#677)
- fix: namespaced scoped controller support (#818)
- fix: fetch secrets on-demand to fix controller boot for large clusters (#829)
- feat: Add New Relic metricprovider (#772)
- feat: Add Datadog metric provider. Fixes #702 (#705)
- feat: Implement kubectl argo rollouts lint
- feat: Add undo command in kubectl plugin. Fixes #575 (#812)
- fix: kubectl plugin should use dynamic client
- fix: rollout kustomize transform analysis ref should use templateName instead of name (#809)
- fix: add missing Service kustomize name reference in trafficRouting/alb/rootService (#699)
- fix: scaleDownDelayRevisionLimit was off by one (#816)
- fix: background analysis refs were not verified. requeue InvalidSpec rollouts (#814)
- fix(controller): fix unhandled panic from malformed rollout (#801)
- fix(controller): validation should not consider privileged security context (#802)
- fix(controller): controller did not honor maxUnavailable during rollback (#786)
- fix(controller): blue-green with analysis was broken (#780)
- fix(controller): blue-green fast-tracked rollbacks would still start analysis templates
- fix(controller): prePromotionAnalysis with previewReplicaCount would pause indefinitely w/o running analysis
- fix(controller): calculate available replicas from active ReplicaSet (#757)
- feat(plugin): indicate the stable ReplicaSet for blue-green rollouts in plugin
- feat(plugin): plugin now surfaces InvalidSpec errors and failed analysisrun messages (#729)
- fix(plugin): bluegreen scaleDownDelay was delaying Healthy status. Present errors in message field (#768)
- feat: writeback rollout updates to informer to prevent stale data (#726)
- fix: unavailable stable RS was not scaled down to make room for canary (#739)
- fix: make controllers tolerant to spec marshalling errors (#666)
- perf: Create IstioVirtualServiceLister (#656)
- fix: add missing log message when a controller's syncHandler returns error (#658)
- fix: support azure auth (#664)
- feat: web metrics preserve data types, allow insecure tls, and make jsonPath optional (#731)
- fix: analysis controller could get into a hotloop with terminated run (#724)
- fix: do not create analysisruns with initial deploy (#722)
- fix: add Failed AnalysisRun phase status to analysis_run_metric_phase and analysis_run_phase metrics. (#618)
- fix: Fix various panics #603
- feat: add security context to run as non-root #498
-
feat: Controller Validation #549
-
feat: Controller Validation for objects referenced by Rollout #600
-
feat: Add Rollout replicas metrics (#507) #581
-
feat: Add support for rootService within ALB traffic routing #634
-
fix: Populate .spec.template with default values before Rollout Validation #580
-
fix: Add Rollout/scale to aggregate roles #637
-
Fix: remove hash selector after switching from bg to canary #515
-
fix: Set the currentStepIndex to max after bg to canary #558
- feat: SMI TrafficSplit Support for Canary #520
- feat: add shortened option -A for --all-namespaces #615
-
feat: ClusterAnalysisTemplates (Cluster scoped AnalysisTemplates) #560
-
feat: Uplevel AnalysisRun status to Rollout status #578
-
fix: Modify arg verification to check ValueFrom #500
-
fix: Fix analysis validation to include Kayenta #545
- fix: Modify arg verification to check ValueFrom (#500)
- fix: remove hash selector after switching from bg to canary (#515)
- fix: Ensure ALB action with weight 0 marshalls correctly (#493)
- fix: Add missing clusterrole for deleting pods (#490)
- fix: Remove validation for limits and requests (#480)
- fix: Duplicate StableRS to canary.StableRS (#483)
- fix: Make kubectl plugin backwards compat with canary.stableRS (#482)
- The metric
rollout_created_time
is being removed. - The
.status.canary.stableRS
is being deprecated for.status.stableRS
. This release has the code to handle the migration, and the Rollout spec will updated to remove.status.stableRS
in a future release.
Thank you to the following contributors for their work in this release!
- cronik
- dthomson25
- duboisf
- jessesuen
- khhirani
- moensch
- nghialv
- feat: Improve Prometheus metrics (#461)
- feat: Add metrics on queues and go client http calls (#416)
- feat: Add patchMergeKey and patchStrategy struct tags and comments (#386)
- feat: Improve removing k8s 1.18 fields (#436)
- fix: Reduce log from error to warning (#394)
- chore: Download go deps explicitly in Dockerfile (#464)
- chore: Standardize controller-gen to v0.2.5 (#431)
- chore: Migrate from dep to go modules (#331)
- chore: Add auto generated sites/ to gitignore (#398)
- docs: Add remote name to 'make release-docs' (#435)
- docs: Documentation cleanup (#437)
- docs: Add Go mod download command to contributor docs (#425)
- docs: Corrected HPA doc (#396)
- docs: Remove extra comma in docs
- docs: Update README.md (#411)
- feat: Introduce Anti-Affinity option to rollout strategies (#445)
- feat: Add ability to restart Pods (#453)
- feat: Add ALB Ingress controller support (#444)
- feat: Add Nginx canary traffic management (#426)
- feat: Add BlueGreen Pre Promotion Analysis (#415)
- feat: Add BlueGreen Post Promotion Analysis (#442)
- feat: Allow Rollout to specify multiples templates (#409)
- feat: Make pause duration as string with time unit (#423)
- feat: Use managed-by annotation (#448)
- refactor: Refactor BlueGreen Strategy (#388)
- fix: Update Role/ClusterRole for Ingress access (#439)
- fix: rollout transformer for pod affinity. add new v0.7 name references and testing (#399)
- chore: Add StableRS to rollout status (#441)
- chore: Fix wrong comment about the formula of calculating the replica number (#447)
- feat: Improve wavefront provider (#465)
- feat: Allow AnalysisTemplates to reference secrets (#420)
- improvement: Surface failure reasons for Rollouts/AnalysisRuns (#434)
- refactor: Perform arg substitution in Analysis controller (#407)
- docs: Use correct podTemplateHashValue attribute for valueFrom (#417)
- docs: Update web metrics section (#381)
- docs: Use correct magic value in Analysis docs (#378)
- feat: Experiments passed duration succeed with running analysis (#392)
- feat: Allow ex to use availableAt and finishedAt as args (#400)
- refactor: Refactor Experiment handling of pod hashes (#385)
- feat: Show scale down time for Blue Green ReplicaSets (#370) (#382)
- feat: Add more command aliases in kubectl plugin (#414)
- chore: Set kubectl flags on root command (#456)
- docs: Generate kubectl plugin docs (#422)
- docs: Plugin command enhancements (#454)
- Update RS if RS's annotations need to be changed #413
- Adding ca-certificates to docker image (#393)
- Add patchMergeKey and patchStrategy struct tags and comments (#386)
- Reduce log from error to warning (#394)
- Allow ex to use availableAt and finishedAt as args (#400)
- Experiments passed duration succeed with running analysis (#392)
- Refactor Experiment handling of pod hashes (#385)
- Please upgrade to v0.6.x before upgrading to v0.7. Pre v0.6.0 has a different pausing logic, and v0.7.0 removes the depreciated PauseStartTime field. The v0.6.x versions have a migration script that is removed in v0.7.0.
- This release introduces an alpha implementation of Rollouts leveraging Istio resources for traffic shaping. Check out traffic management for more info.
- Support instance ids for rollout controller segregation #342
- Remove PauseStartTime #349
- Vendor mockery utility #347
- Remove loud log message #333
- Add stableService field #337
- Initial Istio implementation #341
- Implement watch for Istio resources #354
- Add validation to istio virtual services #355
- Introduce 'kubectl argo rollouts terminate' command #297
- Allow controller to delay analysis #350
- Create one background analysis per revision #309
- Allow AnalysisRun to complete an experiment #345
- Wavefront metric provider #338
- Web metric provider #318
- Refactor common logic in providers to library #368
- Allow web provider to be parameterized #368
- Fix premature scaledown (#365)
- Add namespace restriction to job informer (#362)
- Fix honoring autoPromotionSeconds (#360)
- Ensure podHash stays on stable-svc selector (#340)
- omitted revisionHistoryLimit was not defaulting to 10 (#330)
- Fix panic if rollout cannot create a new RS (#328)
- Enable controller to handle panics with crashing (#328)
- Create one background analysis per revision (#309)
- Fix Infinite loop with PreviewReplicaCount set (#308)
- Fix a delete by zero in get command (#310)
- Set StableRS hash to current if replicaset does not actually exist (#320)
- Bluegreen: allow preview service/replica sets to be replaced and fix sg fault in syncReplicasOnly (#314)
- The pause functionality was reworked in the v0.6 release. Previously, the
.spec.paused
field was used by the controller to pause rollouts. However, this was an issue for users who wanted to manually pause the rollout since the controller assumed it was the only entity that set the field. In v0.6, the controller will add a pause condition to the.status.pauseCondition
to pause a controller instead of settingspec.paused
. The pause condition has a start time and a reason explaining why it paused. This allows users to set thespec.paused
field manually and let the controller respect that pause. The v0.6 controller has a migration function to convert pre v0.6 rollouts to the new pause condition. The migration function will be removed in a future release. - In pre-v0.6 versions, the BlueGreen strategy would have the preview service point at no ReplicaSets if the new ReplicaSet was receiving traffic from the active service. V0.6 changes that behavior to make the preview service always point at the latest ReplicaSet
- Update k8s library dependencies to v1.16 (##192)
- Add Rollout Context to reconciliation loop (##205)
- Refactor pausing (##211)
- Allow User pause (##216)
- Stop progress while paused (##193)
- Add pause condition migration (##229)
- Add abort functionality (##224)
- Rollout analysis plumbing (##183)
- Add AnalysisStep for Rollouts (##188)
- Add background analysis runs for rollouts (##196)
- Clean up old Background AnalysisRuns (##246)
- Clean up Experiments and AnalysisRuns (##197)
- Add initial Experiment Step (##165)
- Make specifying replicas/duration optional in the experiment step (##241)
- Terminate experiments from previous steps (##280)
- Add Analysis to RolloutExperimentStep (##238)
- Fix TimeOut check to consider experiment/analysis steps (##278)
- Pause a rollout upon inconclusive experiment (##256)
- Abort a rollout upon a failed experiment (##256)
- Add create AnalysisRun action in clusterrole (##231)
- Fix nil ptr for newRS (##233)
- Fix Rollout transformer config (##247)
- Always point preview service at the newRS (##217)
- Make active service required (##235)
- Reset ProgressDeadline on retry (##282)
- Ignore old running rs for RolloutCompleted status (##218)
- Remove scale down annot after scaling down (##187)
- Renames golang field names for blueGreen/canary to eliminate two API violations (##206)
Check out the Experiment Docs for more information.
- Refactor experiments to use a context object (##208)
- Allow selectors to be overwritten when starting experiments (##249)
- Simplify experiment replicaset names (##274)
- Integrate Experiments with Analysis (##210)
- Fix experiment enqueue logic (##239)
- Annotate instead of label experiment names in replicasets (##262)
- Fix issue where a replicaset name collision could cause hot loop (##236)
Check out the Analysis Docs for more information.
- AnalysisRun AnalysisTemplate Spec (##166)
- Initial analysis controller implementation (##168)
- Integrate analysis controller with provider interfaces (##171)
- Add metric knob for maxInconclusive (##181)
- Simplify provider interfaces to set error messages (##189)
- Implement ResumeAt logic (##232)
- Define explicit args in AnalysisTemplates and simplify AnalysisRun spec (##283)
- Use a duration string instead of int to represent duration (##286)
- Truncate measurements when greater than default (10) (##191)
- Add counter for consecutiveError (##191)
- Add initial provider and Prometheus implementation (##170)
- Rename prometheus.server to address to better reflect API client interface (##207)
- Treat NaN as inconclusive on Prometheus provider (##275)
- Implement job-based metric provider (##186)
- Job metric argument substitution. Simplify metric provider interface (##268)
- Initialize check in for kayenta metric provider ##284
Check out the kubectl plugin docs for more information.
- Implement argo rollouts kubectl plugin (##195)
- Introduce
kubectl argo rollout list rollouts
command (##204) - Introduce
kubectl argo rollout list experiments
command (##267) - Introduce
kubectl argo rollout set image
command (##251) - Introduce
kubectl argo rollout get
command (##230) - Introduce
kubectl argo rollout promote
command (##277) - Add ability to
kubectl argo rollouts set image *=myrepo/myimage
(##290) - Add
get/retry experiment
commands. Support experiment retries (##263) - Show running jobs as part of analysis runs (##278)
- Surface experiment images to CLI (##274)
- Rollout deletionTimestamp are not honored (##109)
- status.availableReplicas should not count old stacks (##143)
- Fix Infinitely loop on controller loop (##146)
- Fast rollback in BlueGreen during scale down period ##127
- Attach independent scaleDownDelays to older ReplicaSets ##145
- Add scaleDownDelayRevisionLimit to limit the number of old ReplicaSets scaled up ##129
This release of Argo Rollouts introduces the experiment CRD. The experiment CRD allows users to define multiple PodSpec's to run for a specific duration of time. This will help enable the Kayenta use-case where a user will need to start two versions of their application at the same time. Otherwise, the users cannot have an apples-to-apples comparison of these two versions as one will skew as a result of running for a longer period.
- Honor MaxSurge and MaxUnavailable after last step (##141)
- Fix maxSurge maxUnavailable zero check (##135)
- Add .Spec.Replicas if not set in rollouts (##125)
- Workaround K8s inability to properly validate 'items' subfields ##114
-
For the BlueGreen strategy, Argo Rollouts will only pause rollouts that have the field
spec.strategy.blueGreen.autoPromotionEnabled
set to false. The default value ofautoPromotionEnabled
is true and causes the rollout to immediately promote the new version once it is available. This change was implemented to make the pausing behavior of rollouts more straight-forward and you can read more about it at ##80. Argo Rollouts v0.3.2 introduces theautoPromotionEnabled
flag without making any behavior changes, and those behavior changes are enforced starting at v0.4.0. In order to upgrade without any issues, the operator should first upgrade to v0.3.2 and add theautoPromotionEnabled
flag with the appropriate value. Afterward, they will be safe to upgrade to v0.4. -
For the Canary Strategy, the Argo Rollouts controller stores a hash of the canary steps in the rollout status to be able to detect changes in steps. If the canary steps change during a progressing canary update, the controller will change the hash and restart the steps. If the rollout is in a completed state, the controller will only update the hash. In v0.4.0, the controller changed how the hash of the steps was calculated, and you can read more about that at this issues: ##103. As a result, the operator should only upgrade Argo Rollouts to v0.4.0 when all the canary rollouts have executed all steps and have completed. Otherwise, the controller will restart the steps it has executed.
- Add Ability to specify canaryService Service object to reach only canary pods ##91
- Simplify unintuitive automatic pause behavior for blue green strategy ##80
- Add back service informer to handle Service recreations quicker ##71
- Use lister instead of kubernetes api call to load service ##98
- Switch to controller-gen to generate crd with complete openapi validation spec ##84
- Change step hashing function to derive hash from json representation ##103
- CRD validation needs to be removed for resource requests/limits ##101
- Possible to exceed revisionHistoryLimit with canary strategy ##93
- Change in pod template hash after upgrading k8s dependencies ##88
- Controller is missing patch event privileges bug ##86
- Rollouts unprotected from invalid specs ##84
- Fix logging fields ##97
In v0.4.0, Argo Rollouts will have a breaking change where we will only pause BlueGreen rollouts if they have a new field called autoPromotionEnabled
under the spec.strategy.blueGreen
set to false. If the field is not listed, the default value will be true, and the rollout will immediately promote the new Rollout. This change was introduced to address argoproj#80.
To prepare for v0.4.0, v0.3.2 will introduce the autoPromotionEnabled
field, but the controller will not act on the field. As a result, you can add the autoPromotionEnabled
field without breaking your existing rollouts.
- Add autoPromotionEnabled with no behavior change
- Fix controller crash caused by glog attempting to write to /tmp (##94)
Rename autoPromoteActiveServiceDelaySeconds to autoPromotionSeconds ##77
- Switch to Scratch final image ##67
- Enable fast Rollback in BlueGreen ##78
- Respect ScaleDownDelay during non-happy path ##79
- Scale down older RS on non-happy path ##76
- Fix issue where pod template hash could be computed inconsistently ##75
- Cleanup replicasets in canary deployment ##73
- Don't requeue 404 errors ##72
- HPA Support ##37
- Prometheus Metrics ##29 ##47
- Introduce ProgressDeadlineSeconds ##54
- Improved Scalability ##45
The status.verifyingPreview
field was depreciated and move to spec.pause
.
- Add previewReplicaCount ##64
- Add ability to auto-promote active service ##59
- Add ScaleDownDelaySeconds ##57
- HPA Support ##37
- Prometheus Metrics ##29
- Add previewReplicaCount ##64
- Add ProgressDeadlineSeconds ##54
- Add Invalid spec checks with regards to ProgressDeadlineSeconds ##62
- Improve eventing and metrics ##61
- Improve Available Condition ##60
- Convert Kustomize V1.0 to Kustomize v2.0 ##56
- Make Metrics port customizable ##55
- Replace gometalinter with golangci ##46
- Add support for gotestsum ##52
- Remove service informer ##45
- Replace verifying preview with Paused ##43
- Prevent early pause before svc change in BG ##51
- Fix aggregate roles naming collision with Argo Workflows ##44
- Use recreate strategy for controller ##44
Add missing events permissions to the clusterrole
Changes the following clusterroles to prevent name collision with Argo Workflows
argo-aggregate-to-admin
toargo-rollouts-aggregate-to-admin
argo-aggregate-to-edit
toargo-rollouts-aggregate-to-edit
argo-aggregate-to-view
toargo-rollouts-aggregate-to-view
- Implements the initial ReplicaSet-based Canary Strategy
- Cleans up Status fields
- Implicit understanding of rollback based on steps completion and pod hash for Blue Green and Canary
- Creates a controller that manages a rollout object that mimics a deployment object
- Declaratively offers a Blue Green Strategy by creating the replicaset from the spec and managing an active and preview service to point to the new replicaset