zebra: fix stale NHG in kernel #18899

krishna-samy · 2025-05-28T11:57:14Z

Fixing stale NHG issue in kernel.

Issue1:

zebra creates an nhe and sets 'initial delay' flag for the nexthop received along with kernel/connected route and this routes is a v6 route.
Later zebra receives intf_address event for the interface that belongs to the same nhe created above. but this is v4 event. Then zebra iterates through the nhe set linked to this interface and eventually it will end up installing this nhe in kernel

So, we install the NHG in kernel for connected/kernel routes and that looks to be deviating from the expected behaviour. All this happens when we receive interface event, we attempt a reinstall for all the NHGs associated with that intf. But if the 'initial delay' is already set for an NHG, we can skip that.
Fixing the same.

Issue2:
During FRR restart nexthop-group entries are not getting cleaned up in
below scenario.

Let's say an NHG refcnt is getting decremented and it becomes zero. we
add a timer for this NHG before deleting it in zebra/kernel.
so this NHG will be intact in kernel until the timer expires.
Now, the timer is running and frr is getting restarted. All the
NHGs are getting cleaned up in kernel but the one that has timer
running is still installed in the kernel.

Check if any NHG has timer running during zebra shutdown and remove from
kernel.

zebra/zebra_nhg.c

krishna-samy · 2025-05-30T12:39:19Z

Adding another commit to address stale NHG during zebra shutdown.
Both the commits dealing with stale NHG entries in kernel in different scenarios

riw777 · 2025-06-03T12:23:56Z

Is this related to #18891 ???

krishna-samy · 2025-06-03T13:32:30Z

Is this related to #18891 ???

Both are different.
#18891 - This is about stale NHG while 2 different protocols install NHGs (same route with different nexthops)
#18899 - This is about stale NHG where we install them in kernel when 'initial delay' is set.

zebra/zebra_nhg.c

mjstapp

Looks good

krishna-samy · 2025-06-05T04:18:37Z

ci:rerun

krishna-samy · 2025-06-06T04:04:59Z

There is one failure in the CI test. That looks to be failing in other PRs also. It is not related to this code change.
Example: https://github.com/FRRouting/frr/pull/18905/checks?check_run_id=43505419217

ton31337 · 2025-06-11T07:35:22Z

@Mergifyio backport dev/10.4 stable/10.3 stable/10.2

mergify · 2025-06-11T07:35:33Z

backport dev/10.4 stable/10.3 stable/10.2

✅ Backports have been created

#19085 zebra: fix stale NHG in kernel (backport #18899) has been created for branch dev/10.4
#19086 zebra: fix stale NHG in kernel (backport #18899) has been created for branch stable/10.3
#19087 zebra: fix stale NHG in kernel (backport #18899) has been created for branch stable/10.2 but encountered conflicts

ton31337 · 2025-06-12T05:24:53Z

Looks like https://ci1.netdef.org/browse/FRR-PULLREQ3-9554/artifact/ASAN9D12AMD64/AddressSanitizerError/AddressSanitzer.txt are valid?

krishna-samy · 2025-06-12T08:01:32Z

Looks like https://ci1.netdef.org/browse/FRR-PULLREQ3-9554/artifact/ASAN9D12AMD64/AddressSanitizerError/AddressSanitzer.txt are valid?

yes. this call stack looks to be relevant. let me fix it.

krishna-samy · 2025-06-12T16:54:51Z

Looks like https://ci1.netdef.org/browse/FRR-PULLREQ3-9554/artifact/ASAN9D12AMD64/AddressSanitizerError/AddressSanitzer.txt are valid?

There is an issue with using hash_iterate improperly. The function hash_iterate stores hbnext = hb->next; before calling the callback function . Also, in the zebra_nhg_sweep_stale_entry function, the callback is not just deleting the current bucket - it's causing a cascade of deletions that can free other buckets in the chain as well including ones that the iterator hasn't reached yet. This leads to use-after-free when the iterator tries to access the freed memory.
So, modifying the code to use hash_walk similar to other/existing NHG clean-up.

krishna-samy · 2025-06-13T05:12:41Z

There is one test failure and that does not look to be relevant to this change. Same failure is seen in other PRs as well.

krishna-samy · 2025-06-13T07:22:59Z

ci:rerun

krishna-samy · 2025-06-16T04:38:19Z

@Mergifyio rebase

mergify · 2025-06-16T04:38:27Z

rebase

❌ Unable to rebase: user `krishna-samy` is unknown.

Please make sure krishna-samy has logged in Mergify dashboard.

krishna-samy · 2025-06-16T04:39:35Z

ci:rerun

krishna-samy · 2025-06-18T10:03:28Z

ci:rerun

krishna-samy · 2025-06-18T15:58:26Z

@ashred-lnx
I have made the changes as suggested. please check.

ashred-lnx · 2025-06-19T02:03:37Z

@ashred-lnx I have made the changes as suggested. please check.

LTGM

krishna-samy · 2025-06-19T04:05:05Z

ci:rerun

krishna-samy · 2025-06-19T09:41:33Z

The test failures are unrelated to this changes.

krishna-samy · 2025-06-23T13:45:53Z

ci:rerun

krishna-samy · 2025-06-24T04:56:10Z

https://github.com/Mergifyio rebase

I see this issue during below events sequencing 1. zebra creates an nhe and sets 'initial delay' flag for the nexthop received along with kernel/connected route and this routes is a v6 route. 2. Later zebra receives intf_address event for the interface that belongs to the same nhe created above. but this is v4 event. Then zebra iterates through the nhe set linked to this interface and eventually it will end up installing this nhe in kernel So, we install the NHG in kernel for connected/kernel routes and that looks to be deviating from the expected behaviour. All this happens when we receive interface event, we attempt a reinstall for all the NHGs associated with that intf. But if the 'initial delay' is already set for an NHG, we can skip that. Fixing the same. Signed-off-by: Krishnasamy <krishnasamyr@nvidia.com>

During FRR restart nexthop-group entries are not getting cleaned up in below scenario. 1. Let's say an NHG refcnt is getting decremented and it becomes zero. we add a timer for this NHG before deleting it in zebra/kernel. so this NHG will be intact in kernel until the timer expires. 2. Now, the timer is running and frr is getting restarted. All the NHGs are getting cleaned up in kernel but the one that has timer running is still installed in the kernel. Check if any NHG has timer running during zebra shutdown and remove from kernel. Signed-off-by: Krishnasamy <krishnasamyr@nvidia.com>

mergify · 2025-06-24T04:56:24Z

rebase

✅ Branch has been successfully rebased

krishna-samy · 2025-06-24T08:16:52Z

All the comments are addressed and the tests are passing

zebra: fix stale NHG in kernel (backport #18899)

* bgpd: correct no form commands (backport FRRouting#18911) * bgpd: fix to show exist/non-exist-map in 'show run' properly FRRouting#18853 * redhat: make FRR RPM build to work on RedHat 10 (backport FRRouting#18920) * build: check for libunwind.h, not unwind.h (backport FRRouting#18912) * bgpd: use AS4B format for BGP loc-rib messages. (backport FRRouting#18936) * bgpd: fix for the validity and the presence of prefixes in the BGP VPN table. (backport FRRouting#17370) * bgpd: Force adj-rib-out updates if MRAI is kicked in (backport FRRouting#18959) * zebra: Provide SID value when sending SRv6 SID release notify message (backport FRRouting#18971) * bgpd: Fix crash when fetching statistics for bgp instance (backport FRRouting#19003) * nhrpd: fix crash when accessing invalid memory zone (backport FRRouting#18994) * zebra: Initialize RB tree for router tables (backport FRRouting#19049) * zebra: fix null pointer dereference in zebra_evpn_sync_neigh_del (backport FRRouting#19054) * zebra: fix stale NHG in kernel (backport FRRouting#18899) * bgpd: Fix incorrect stripping of transitive extended communities (backport FRRouting#19065) * lib: Fix no on-match goto NUM command (backport FRRouting#19108) * bgpd: Fix extended community check for IP non-transitive type (backport FRRouting#19097) * bgpd: Fix DEREF_OF_NULL.EX.COND in bgp_updgrp_packet (backport FRRouting#19126) * lib: revert addition of vtysh_flush() call in vty_out() (backport FRRouting#19109) * bgpd: Extract link bandwidth value from extcommunity before using for WCMP (backport FRRouting#19165) * Use ipv4 class E addresses (240.0.0.0/4) as connected routes by default (backport FRRouting#18095) * bfdd: Set bfd.LocalDiag when transitioning to AdminDown (backport FRRouting#18592) * zebra: clean up a json object leak (backport FRRouting#19192) * bgpd: Do not try to reuse freed route-maps (backport FRRouting#19191) * lib: fix routemap crash (backport FRRouting#19127) * bgpd: initialize local variable (backport FRRouting#19233) * ospfd: Use after free cleanup of lsa (backport FRRouting#19224) * vtysh: copy config from file should actually apply (backport FRRouting#19242) * bgpd : Fix compilation error in bgpd module: Update TP_ARGS for bgp (backport FRRouting#19266) * bgpd: Ensure addpath does not withdraw selected route in some situations (backport FRRouting#19210) * lib, zebra: mark singleton nexthops inactive/active on link state changes for wecmp (backport FRRouting#18947) * eigrp: validate hello packets and tlvs better (backport FRRouting#19251) * bgpd: [GR] fixed selectionDeferralTimer to display select_defer_time val FRRouting#19283 Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>

frrbot bot added the zebra label May 28, 2025

github-actions bot added size/XS master labels May 28, 2025

ton31337 reviewed May 28, 2025

View reviewed changes

zebra/zebra_nhg.c Outdated Show resolved Hide resolved

mjstapp reviewed May 28, 2025

View reviewed changes

zebra/zebra_nhg.c Outdated Show resolved Hide resolved

krishna-samy force-pushed the krishna-samy/stale-nhg branch from 43aa593 to e3024da Compare May 29, 2025 13:49

frrbot bot added the bugfix label May 30, 2025

github-actions bot added size/M and removed size/XS labels May 30, 2025

krishna-samy changed the title ~~zebra: do not install the nhg for kernel/connected routes~~ zebra: fix stale NHG in kernel May 30, 2025

ashred-lnx reviewed Jun 3, 2025

View reviewed changes

zebra/zebra_nhg.c Outdated Show resolved Hide resolved

ashred-lnx reviewed Jun 3, 2025

View reviewed changes

zebra/zebra_nhg.c Show resolved Hide resolved

mjstapp reviewed Jun 4, 2025

View reviewed changes

zebra/zebra_nhg.c Outdated Show resolved Hide resolved

krishna-samy force-pushed the krishna-samy/stale-nhg branch from a3f1aae to 7fc84dc Compare June 4, 2025 14:39

github-actions bot added the rebase PR needs rebase label Jun 4, 2025

mjstapp approved these changes Jun 4, 2025

View reviewed changes

github-actions bot added the backport label Jun 11, 2025

krishna-samy force-pushed the krishna-samy/stale-nhg branch from 7fc84dc to 4ceb1bf Compare June 12, 2025 14:49

krishna-samy force-pushed the krishna-samy/stale-nhg branch 2 times, most recently from f68fa49 to 3e5c0da Compare June 18, 2025 05:13

krishna-samy added 2 commits June 24, 2025 04:56

krishna-samy force-pushed the krishna-samy/stale-nhg branch from 3e5c0da to 0743cca Compare June 24, 2025 04:56

Jafaral merged commit 034e716 into FRRouting:master Jun 24, 2025
13 checks passed

This was referenced Jun 24, 2025

zebra: fix stale NHG in kernel (backport #18899) #19085

Merged

zebra: fix stale NHG in kernel (backport #18899) #19086

Merged

zebra: fix stale NHG in kernel (backport #18899) #19087

Closed

Jafaral added a commit that referenced this pull request Jun 24, 2025

Merge pull request #19085 from FRRouting/mergify/bp/dev/10.4/pr-18899

5362da4

zebra: fix stale NHG in kernel (backport #18899)

Jafaral added a commit that referenced this pull request Jun 24, 2025

Merge pull request #19086 from FRRouting/mergify/bp/stable/10.3/pr-18899

960b834

zebra: fix stale NHG in kernel (backport #18899)

krishna-samy deleted the krishna-samy/stale-nhg branch June 25, 2025 06:24

zebra: fix stale NHG in kernel #18899

zebra: fix stale NHG in kernel #18899

Uh oh!

Conversation

krishna-samy commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

krishna-samy commented May 30, 2025

Uh oh!

riw777 commented Jun 3, 2025

Uh oh!

krishna-samy commented Jun 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mjstapp left a comment

Choose a reason for hiding this comment

Uh oh!

krishna-samy commented Jun 5, 2025

Uh oh!

krishna-samy commented Jun 6, 2025

Uh oh!

ton31337 commented Jun 11, 2025

Uh oh!

mergify bot commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Backports have been created

Uh oh!

ton31337 commented Jun 12, 2025

Uh oh!

krishna-samy commented Jun 12, 2025

Uh oh!

krishna-samy commented Jun 12, 2025

Uh oh!

krishna-samy commented Jun 13, 2025

Uh oh!

krishna-samy commented Jun 13, 2025

Uh oh!

krishna-samy commented Jun 16, 2025

Uh oh!

mergify bot commented Jun 16, 2025

❌ Unable to rebase: user krishna-samy is unknown.

Uh oh!

krishna-samy commented Jun 16, 2025

Uh oh!

krishna-samy commented Jun 18, 2025

Uh oh!

krishna-samy commented Jun 18, 2025

Uh oh!

ashred-lnx commented Jun 19, 2025

Uh oh!

krishna-samy commented Jun 19, 2025

Uh oh!

krishna-samy commented Jun 19, 2025

Uh oh!

krishna-samy commented Jun 23, 2025

Uh oh!

krishna-samy commented Jun 24, 2025

Uh oh!

mergify bot commented Jun 24, 2025

✅ Branch has been successfully rebased

Uh oh!

krishna-samy commented Jun 24, 2025

Uh oh!

Uh oh!

Uh oh!

krishna-samy commented May 28, 2025 •

edited

Loading

mergify bot commented Jun 11, 2025 •

edited

Loading

❌ Unable to rebase: user `krishna-samy` is unknown.