Releases: berops/claudie
v0.9.12
v0.9.12
What's Changed
- Retries were added to reading the output from OpenTofu, which could occasionally fail. #1824
- Increased concurrency limits to decrease the build time of larger clusters. This change also affects Claudie's memory requirements, which should fit within 8 GB. #1819
- For autoscaled events, Terraformer will now skip refreshing the LoadBalancers and DNS infrastructure, if present. #1830
v0.9.11
v0.9.11
What's Changed
READ ME: A lot of core changes are made in this release, before updating an already deployed Claudie instance, make sure you have working backups of your kuberentes clusters
- InputManifest was extended to also include a NoProxy list in the proxy settings to bypass the proxy for the listed endpoints, if used. #1745
kubernetes:
clusters:
- name: proxy-example
version: "1.30.0"
network: 192.168.2.0/24
installationProxy:
mode: "on"
noProxy: ".suse.com"
-
Update kubeone to 1.10 #1749
-
Migrate to OpenTofu
v1.6.2
from terraformv1.5.7
#1755READ ME: OpenTofu 1.6.2 is compatible with the previosly used Terraform version 1.5.7, while claudie will take care of the update, make sure you have working backups if you are updating an already deployed Claudie instance, in case of a disaster scenario
-
Add
sprig
to all templates used within claudie #1768 -
Builder will now support faster termination and wait only on the current task being processed instead of the whole workflow #1770
-
Claudie will now support proper HA DNS Loadbalancing #1777
This feature will be available with the latest claudie templates
v0.9.11
READ ME: for already deployed Claudie instances, if you used Cloudflare as a provider you will need to update your secret to also include the Accound ID the token was created for.
-
NGINX was replaced by Envoy on Loadbalancers. #1735
READ ME: If you update an already deployed Claudie instance, this is a one time update that will introduce a small downtime of the services while NGINX is being replaced with Envoy.
-
Upgraded all terraform providers to the latest possible version that still supports the claudie templates version
v0.9.8
#1782 -
Claudie will now perform a rollout restart for the NVIDIA GPU operator daemonset as part of the workflow, which overwrites the
/etc/containerd/config.yml
. #1790
Bug fixes
- Return partially updated state instead of always defaulting to current state after error in deletion #1793
- Restarting SSH session after updating environmnet variables, is now part of the ansible workflow, which previosly caused issue in which the updated environment variables were not reflected in a re-used SSH connection #1792
- Fixed a memory leak in the autoscaler service. #1787
v0.9.10
v0.9.10
What's Changed
- Decrease the amount of retries for cleanup of static nodes during deletion from 4 to 2 #1729
Bug fixes
- Fix panic when deleting clusters with static nodes for which DNS was not built correctly #1724
- Fix propagation of desired state from operator to manager service #1726
- Fix multiple HTTP proxy environment variables present in
/etc/environment
#1727 - Fix partial DNS apply, which would left part of the infrastructure untracked #1728
v0.9.9
v0.9.9
What's Changed
-
General maintenance release, updated dependencies used by Claudie #1709
-
Upgrading Longhorn from version 1.7.0 to version 1.8.1 #1709
After upgrading Longhorn to the newer version, some pods of the old and new versions will coexist if your cluster uses a PVC that uses the Longhorn storage class (which is the default), as they would reference the old v1.7.0.
To upgrade the volumes to the newer version, it's possible to use the Longhorn UI to set
Settings > Concurrent Automatic Engine Upgrade Per Node Limit
to a value greater than 0 to upgrade old volumes.
This is a setting that controls how Longhorn automatically upgrades volumes’ engines to the new default engine image after upgrading Longhorn manager. More on: https://longhorn.io/docs/1.8.1/deploy/upgrade/auto-upgrade-engine/Once the upgrade is complete, the old engine image pods and the instance manager will be terminated after ~60 minutes of non-use (after all volumes have been upgraded to use the latest Longhorn version) You can also follow the official Longhorn post on this: https://longhorn.io/kb/troubleshooting-some-old-instance-manager-pods-are-still-running-after-upgrade/
v0.9.8
v0.9.8
What's Changed
-
Added support for alternative names for load balancers #1693
dns: dnsZone: example.com provider: example hostname: main alternativeNames: - other
Templates that Claudie uses by default, will be updated separately to make use of the alternative names.
Bug fixes
- If the current state was not built and some of the nodes did not have an assigned IP address, Claudie would fail to correctly determine if the nodes were reachable. #1691
- Claudie will now increase the limits for
fs.inotify
to a higher number, as depending on the workload on each node, reaching the limits would result in an error from which Claudie would not recover. #1696 - Annotations for static nodepools will now be correctly propagated. #1696
Claudie v0.9.7
v0.9.7
What's Changed
-
Additional settings were added to roles for LoadBalancers. #1685.
It is now possible to configure adding/removing proxy protocol and sticky sessions.
stickySessions
will always forward traffic to the same node based on the IP hash.proxyProtocol
will turn on the proxy protocol. If used, the application to which the traffic is redirected must support this protocol.loadBalancers: roles: - name: example-role protocol: tcp port: 6443 targetPort: 6443 targetPools: - htz-kube-nodes # added settings: proxyProtocol: off (default will be on) stickySession: on. (default will be off)
-
Claudie will now ping nodes to check If any of the nodes became unreachable, Claudie will report the problem and will not work on any changes until the connectivity issue is resolved. #1658
For unreachable nodes within the kubernetes cluster, Claudie will give you the options of resolving the issue or removing the node from the InputManifest or via
kubectl
, Claudie will report the following issuefix the unreachable nodes by either: - fixing the connectivity issue - if the connectivity issue cannot be resolved, you can: - delete the whole nodepool from the kubernetes cluster in the InputManifest - delete the selected unreachable node/s manually from the cluster via 'kubectl' - if its a static node you will also need to remove it from the InputManifest - if its a dynamic node claudie will replace it. NOTE: if the unreachable node is the kube-apiserver, claudie will not be able to recover after the deletion.
For unreachable nodes within the loadbalancer cluster, Claudie will give you the options of resolving the issue or removing the nodepool or load balancer from the InputManifest, Claudie will report the following issue
fix the unreachable nodes by either: - fixing the connectivity issue - if the connectivity issue cannot be resolved, you can: - delete the whole nodepool from the loadbalancer cluster in the InputManifest - delete the whole loadbalancer cluster from the InputManifest
Bug fixes
-
It may be the case that the cluster-autoscaler image may not share the same version as the specified kubernetes version in the InputManifest. Claudie will now correctly recognize this and pick the latest available cluster-autoscaler image #1680
-
Claudie will now set the limits of max open file descriptors on each node to 65535 #1679
Claudie v0.9.6
v0.9.6
Bug fixes
- Fixed issue where failing to build a load balancer would cause Claudie to hang if the DNS part failed #1660.
Claudie will now recover from this scenario and it is possible for the user to specify the correct DNS settings in the InputManifest to fixed the reported issue.
Claudie v0.9.5
v0.9.5
Bug fixes
- Correctly assign CIDR to loadbalancer nodepools #1654.
This issue was prevalent mostly when working with loadbalancers from cloud providers that were not hetzner.
Claudie v0.9.4
v0.9.4
Bug fixes
- Fix backwards compatibility with changes introduced in Claudie version 0.9.3 for clusters build using older versions 0.9.x #1651.
If you built your cluster using the Claudie version 0.9.3, you can ignore this minor release.
Claudie v0.9.3
v0.9.3
Bug fixes
- Correctly turn HTTP proxy on/off #1636.
HTTP proxy feature introduced in v0.9.1 was not working correctly mostly when switching between the on/off mode.