Add testing for eventual consistency of timer logic #28844

@joestringer

Description

Cilium's internal business logic relies on a highly parallel combination of reactive handlers for incoming information, "triggers" that rate-limit requests for processing so that Cilium does not over-consume resources, and "controllers" that periodically perform updates or resiliency checks of configured state. While most things are "eventually consistent" in general, time-based triggers and controllers make it harder to evaluate how Cilium will behave once that eventual consistency is reached.

Issues have been introduced into the tree where timers do not fire during testing, yet the state that the agent eventually converges to causes connectivity disruption for users (for example #27210, fixed by #27327). It is quite difficult to systematically identify time-based errors across the entire agent by relying purely on per-package testing. The goal is to provide a more systematic safety net for timer-based issues.
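As a rough illustration of the kind of test this asks for, here is a minimal sketch of a rate-limited trigger driven by a manually advanced clock. The test has to move time forward before the deferred run happens. All names here (`fakeClock`, `rateLimitedTrigger`, `simulate`) are hypothetical and are not Cilium's actual trigger or controller APIs. The sketch is single-goroutine for simplicity, whereas the real code is concurrent.

```go
package main

import "time"

// fakeClock is a hypothetical, manually advanced clock for tests.
type fakeClock struct{ now time.Time }

func (c *fakeClock) Now() time.Time          { return c.now }
func (c *fakeClock) Advance(d time.Duration) { c.now = c.now.Add(d) }

// rateLimitedTrigger is an illustrative stand-in for a trigger that runs
// its work at most once per minInterval. It is not Cilium's implementation.
type rateLimitedTrigger struct {
	clock       *fakeClock
	minInterval time.Duration
	lastRun     time.Time
	pending     bool
	runs        int
}

// Trigger records a request and runs it right away if the rate limit allows.
func (t *rateLimitedTrigger) Trigger() {
	t.pending = true
	t.Tick()
}

// Tick runs the pending work if minInterval has elapsed since the last run.
// In real code a timer would call this; here the test calls it explicitly.
func (t *rateLimitedTrigger) Tick() {
	if !t.pending {
		return
	}
	if t.runs > 0 && t.clock.Now().Sub(t.lastRun) < t.minInterval {
		return // still rate-limited; the work stays pending
	}
	t.pending = false
	t.lastRun = t.clock.Now()
	t.runs++
}

// simulate fires a burst of triggers, then advances the clock past the
// rate limit and reports the run count before and after advancing.
func simulate() (before, after int) {
	clock := &fakeClock{now: time.Unix(0, 0)}
	t := &rateLimitedTrigger{clock: clock, minInterval: time.Second}

	t.Trigger() // first request runs immediately
	t.Trigger() // rate-limited, left pending
	t.Trigger() // folded into the same pending request
	before = t.runs

	clock.Advance(time.Second) // without this step the pending run never fires
	t.Tick()
	after = t.runs
	return before, after
}
```

A test that stops at `before` would only observe the immediate run and never the state that the deferred run produces. Advancing the clock explicitly is what brings the "eventual" state into scope for assertions.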

Metadata

Labels

area/CI: Continuous Integration testing issue or flake
kind/enhancement: This would improve or streamline existing functionality.
