linux/net/batman-adv
Linus Lüssing 9b4aec647a batman-adv: fix rare race conditions on interface removal
In rare cases during shutdown the following general protection fault can
happen:

  general protection fault: 0000 [#1] SMP
  Modules linked in: batman_adv(O-) [...]
  CPU: 3 PID: 1714 Comm: rmmod Tainted: G           O    4.6.0-rc6+ #1
  [...]
  Call Trace:
   [<ffffffffa0363294>] batadv_hardif_disable_interface+0x29a/0x3a6 [batman_adv]
   [<ffffffffa0373db4>] batadv_softif_destroy_netlink+0x4b/0xa4 [batman_adv]
   [<ffffffff813b52f3>] __rtnl_link_unregister+0x48/0x92
   [<ffffffff813b9240>] rtnl_link_unregister+0xc1/0xdb
   [<ffffffff8108547c>] ? bit_waitqueue+0x87/0x87
   [<ffffffffa03850d2>] batadv_exit+0x1a/0xf48 [batman_adv]
   [<ffffffff810c26f9>] SyS_delete_module+0x136/0x1b0
   [<ffffffff8144dc65>] entry_SYSCALL_64_fastpath+0x18/0xa8
   [<ffffffff8108aaca>] ? trace_hardirqs_off_caller+0x37/0xa6
  Code: 89 f7 e8 21 bd 0d e1 4d 85 e4 75 0e 31 f6 48 c7 c7 50 d7 3b a0 e8 50 16 f2 e0 49 8b 9c 24 28 01 00 00 48 85 db 0f 84 b2 00 00 00 <48> 8b 03 4d 85 ed 48 89 45 c8 74 09 4c 39 ab f8 00 00 00 75 1c
  RIP  [<ffffffffa0371852>] batadv_purge_outstanding_packets+0x1c8/0x291 [batman_adv]
   RSP <ffff88001da5fd78>
  ---[ end trace 803b9bdc6a4a952b ]---
  Kernel panic - not syncing: Fatal exception in interrupt
  Kernel Offset: disabled
  ---[ end Kernel panic - not syncing: Fatal exception in interrupt

It does not happen often, but may potentially happen when frequently
shutting down and reinitializing an interface. With some carefully
placed msleep()s/mdelay()s it can be reproduced easily.

The issue is, that on interface removal, any still running worker thread
of a forwarding packet will race with the interface purging routine to
free a forwarding packet. Temporarily giving up a spin-lock to be able
to sleep in the purging routine is not safe.

Furthermore, there is a potential general protection fault not just for
the purging side shown above, but also on the worker side: Temporarily
removing a forw_packet from the according forw_{bcast,bat}_list will make
it impossible for the purging routine to catch and cancel it.

 # How this patch tries to fix it:

With this patch we split the queue purging into three steps: Step 1),
removing forward packets from the queue of an interface and by that
claim it as our responsibility to free.

Step 2), we are either lucky to cancel a pending worker before it starts
to run. Or if it is already running, we wait and let it do its thing,
except two things:

Through the claiming in step 1) we prevent workers from a) re-arming
themselves. And b) prevent workers from freeing packets which we still
hold in the interface purging routine.

Finally, step 3, we are sure that no forwarding packets are pending or
even running anymore on the interface to remove. We can then safely free
the claimed forwarding packets.

Signed-off-by: Linus Lüssing <linus.luessing@c0d3.blue>
Signed-off-by: Sven Eckelmann <sven@narfation.org>
Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
2016-11-08 19:02:39 +01:00
..
bat_algo.c batman-adv: Allow to disable debugfs support 2016-08-09 07:54:54 +02:00
bat_algo.h batman-adv: netlink: add routing_algo query 2016-08-09 07:54:36 +02:00
bat_iv_ogm.c batman-adv: fix rare race conditions on interface removal 2016-11-08 19:02:39 +01:00
bat_iv_ogm.h batman-adv: Fix bat_(iv|v) function declaration header 2016-06-30 10:29:43 +02:00
bat_v_elp.c batman-adv: retrieve B.A.T.M.A.N. V WiFi neighbor stats from real interface 2016-11-08 19:02:38 +01:00
bat_v_elp.h batman-adv: Keep includes ordered by filename 2016-06-30 10:29:43 +02:00
bat_v_ogm.c batman-adv: Consume skb in receive handlers 2016-11-08 19:02:34 +01:00
bat_v_ogm.h batman-adv: Include main.h in bat_v_ogm.h 2016-06-30 10:29:43 +02:00
bat_v.c batman-adv: Use proper name for gateway list head 2016-10-19 08:37:53 +02:00
bat_v.h batman-adv: Fix bat_(iv|v) function declaration header 2016-06-30 10:29:43 +02:00
bitarray.c batman-adv: Consolidate logging related functions 2016-06-30 10:29:43 +02:00
bitarray.h batman-adv: Use bool as return type for boolean functions 2016-05-10 18:28:54 +08:00
bridge_loop_avoidance.c batman-adv: Allow to disable debugfs support 2016-08-09 07:54:54 +02:00
bridge_loop_avoidance.h batman-adv: add backbone table netlink support 2016-08-09 07:54:43 +02:00
debugfs.c batman-adv: Use octal permissions instead of macros 2016-10-19 08:37:53 +02:00
debugfs.h treewide: remove redundant #include <linux/kconfig.h> 2016-10-11 15:06:33 -07:00
distributed-arp-table.c batman-adv: Add wrapper for ARP reply creation 2016-10-30 11:11:33 +01:00
distributed-arp-table.h batman-adv: update copyright years for 2016 2016-02-02 12:55:10 +08:00
fragmentation.c batman-adv: Consume skb in batadv_frag_send_packet 2016-10-30 11:11:37 +01:00
fragmentation.h batman-adv: Use proper name for fragments list head 2016-10-19 08:37:52 +02:00
gateway_client.c batman-adv: Use proper name for gateway list head 2016-10-19 08:37:53 +02:00
gateway_client.h batman-adv: netlink: add gateway table queries 2016-08-09 07:54:40 +02:00
gateway_common.c batman-adv: make GW election code protocol specific 2016-08-09 07:54:29 +02:00
gateway_common.h batman-adv: add throughput override attribute to hard_ifaces 2016-02-29 16:05:32 +08:00
hard-interface.c batman-adv: Update wifi flags on upper link change 2016-11-08 19:02:38 +01:00
hard-interface.h batman-adv: retrieve B.A.T.M.A.N. V WiFi neighbor stats from real interface 2016-11-08 19:02:38 +01:00
hash.c batman-adv: update copyright years for 2016 2016-02-02 12:55:10 +08:00
hash.h batman-adv: Remove unused function batadv_hash_delete 2016-10-17 16:28:45 +02:00
icmp_socket.c batman-adv: Use octal permissions instead of macros 2016-10-19 08:37:53 +02:00
icmp_socket.h batman-adv: Allow to disable debugfs support 2016-08-09 07:54:54 +02:00
Kconfig batman-adv: Allow selecting BATMAN V if CFG80211 is not built 2016-10-17 16:28:48 +02:00
log.c batman-adv: Use octal permissions instead of macros 2016-10-19 08:37:53 +02:00
log.h batman-adv: Avoid precedence issues in macros 2016-10-19 08:37:54 +02:00
main.c batman-adv: Add module alias for batadv netlink family 2016-11-08 19:02:39 +01:00
main.h batman-adv: Use own timer for multicast TT and TVLV updates 2016-10-30 11:11:35 +01:00
Makefile batman-adv: Allow to disable debugfs support 2016-08-09 07:54:54 +02:00
multicast.c batman-adv: Use own timer for multicast TT and TVLV updates 2016-10-30 11:11:35 +01:00
multicast.h batman-adv: Use own timer for multicast TT and TVLV updates 2016-10-30 11:11:35 +01:00
netlink.c batman-adv: Mark batadv_netlink_ops as const 2016-10-30 11:11:32 +01:00
netlink.h batman-adv: netlink: add translation table query 2016-08-09 07:54:37 +02:00
network-coding.c batman-adv: Consume skb in receive handlers 2016-11-08 19:02:34 +01:00
network-coding.h batman-adv: update copyright years for 2016 2016-02-02 12:55:10 +08:00
originator.c batman-adv: Simple (re)broadcast avoidance 2016-10-30 11:11:35 +01:00
originator.h batman-adv: netlink: add originator and neighbor table queries 2016-08-09 07:54:38 +02:00
packet.h batman-adv: Avoid precedence issues in macros 2016-10-19 08:37:54 +02:00
routing.c batman-adv: Reject unicast packet with zero/mcast dst address 2016-11-08 19:02:36 +01:00
routing.h batman-adv: Use bool as return type for boolean functions 2016-05-10 18:28:54 +08:00
send.c batman-adv: fix rare race conditions on interface removal 2016-11-08 19:02:39 +01:00
send.h batman-adv: fix rare race conditions on interface removal 2016-11-08 19:02:39 +01:00
soft-interface.c batman-adv: Count all non-success TX packets as dropped 2016-10-30 11:11:36 +01:00
soft-interface.h batman-adv: Use bool as return type for boolean functions 2016-05-10 18:28:54 +08:00
sysfs.c batman-adv: Use octal permissions instead of macros 2016-10-19 08:37:53 +02:00
sysfs.h batman-adv: update copyright years for 2016 2016-02-02 12:55:10 +08:00
tp_meter.c batman-adv: Consume skb in batadv_send_skb_to_orig 2016-10-30 11:11:37 +01:00
tp_meter.h batman-adv: throughput meter implementation 2016-07-04 12:37:18 +02:00
translation-table.c batman-adv: Cache the type of wifi device for each hardif 2016-11-08 19:02:37 +01:00
translation-table.h batman-adv: netlink: add translation table query 2016-08-09 07:54:37 +02:00
tvlv.c batman-adv: Consume skb in batadv_send_skb_to_orig 2016-10-30 11:11:37 +01:00
tvlv.h batman-adv: split tvlv into a separate file 2016-06-30 10:29:43 +02:00
types.h batman-adv: fix rare race conditions on interface removal 2016-11-08 19:02:39 +01:00