There is a possible deadlock in `device.Close()` when the device is
closed very soon after it is started. The problem is that two different
methods acquire the same locks in opposite order:
1. device.Close()
   - device.ipcMutex.Lock()
   - device.state.Lock()
2. device.changeState(deviceState)
   - device.state.Lock()
   - device.ipcMutex.Lock()
Reproducer:
func TestDevice_deadlock(t *testing.T) {
    d := randDevice(t)
    d.Close()
}
Problem:
$ go clean -testcache && go test -race -timeout 3s -run TestDevice_deadlock ./device | grep -A 10 sync.runtime_SemacquireMutex
sync.runtime_SemacquireMutex(0xc000117d20?, 0x94?, 0x0?)
/usr/local/opt/go/libexec/src/runtime/sema.go:77 +0x25
sync.(*Mutex).lockSlow(0xc000130518)
/usr/local/opt/go/libexec/src/sync/mutex.go:171 +0x213
sync.(*Mutex).Lock(0xc000130518)
/usr/local/opt/go/libexec/src/sync/mutex.go:90 +0x55
golang.zx2c4.com/wireguard/device.(*Device).Close(0xc000130500)
/Users/martin.basovnik/git/basovnik/wireguard-go/device/device.go:373 +0xb6
golang.zx2c4.com/wireguard/device.TestDevice_deadlock(0x0?)
/Users/martin.basovnik/git/basovnik/wireguard-go/device/device_test.go:480 +0x2c
testing.tRunner(0xc00014c000, 0x131d7b0)
--
sync.runtime_SemacquireMutex(0xc000130564?, 0x60?, 0xc000130548?)
/usr/local/opt/go/libexec/src/runtime/sema.go:77 +0x25
sync.(*Mutex).lockSlow(0xc000130750)
/usr/local/opt/go/libexec/src/sync/mutex.go:171 +0x213
sync.(*Mutex).Lock(0xc000130750)
/usr/local/opt/go/libexec/src/sync/mutex.go:90 +0x55
sync.(*RWMutex).Lock(0xc000130750)
/usr/local/opt/go/libexec/src/sync/rwmutex.go:147 +0x45
golang.zx2c4.com/wireguard/device.(*Device).upLocked(0xc000130500)
/Users/martin.basovnik/git/basovnik/wireguard-go/device/device.go:179 +0x72
golang.zx2c4.com/wireguard/device.(*Device).changeState(0xc000130500, 0x1)
Signed-off-by: Martin Basovnik <martin.basovnik@gmail.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Access to Peer.endpoint was previously synchronized by Peer.RWMutex.
This has now moved to Peer.endpoint.Mutex. Peer.SendBuffers() is now the
sole caller of Endpoint.ClearSrc(), which is signaled via a new bool,
Peer.endpoint.clearSrcOnTx. Previous callers of Endpoint.ClearSrc() now
set this bool instead, primarily via peer.markEndpointSrcForClearing().
Peer.SetEndpointFromPacket() clears Peer.endpoint.clearSrcOnTx when an
updated conn.Endpoint is stored. This maintains the same event order as
before, i.e. a conn.Endpoint received after peer.endpoint.clearSrcOnTx
is set, but before the next Peer.SendBuffers() call, results in the
latest conn.Endpoint source being used for the next packet transmission.
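For illustration, a minimal sketch of the new synchronization shape; the
field layout here is assumed from this message, not copied verbatim from
the change:

type Peer struct {
    // ...
    endpoint struct {
        sync.Mutex   // protects val and clearSrcOnTx
        val          conn.Endpoint
        clearSrcOnTx bool // tells SendBuffers to ClearSrc before sending
    }
}

func (peer *Peer) markEndpointSrcForClearing() {
    peer.endpoint.Lock()
    defer peer.endpoint.Unlock()
    if peer.endpoint.val == nil {
        return
    }
    peer.endpoint.clearSrcOnTx = true
}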
These changes result in throughput improvements for single flow,
parallel (-P n) flow, and bidirectional (--bidir) flow iperf3 TCP/UDP
tests as measured on both Linux and Windows. Latency under load
improves, especially in high-throughput Linux scenarios. These
improvements are likely realized on all platforms to some degree, as the
changes are not platform-specific.
Co-authored-by: James Tucker <james@tailscale.com>
Signed-off-by: James Tucker <james@tailscale.com>
Signed-off-by: Jordan Whited <jordan@tailscale.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Queue{In,Out}boundElement locking can contribute to significant
overhead via sync.Mutex.lockSlow() in some environments. These types
are passed throughout the device package as elements in a slice, so
move the per-element Mutex to a container around the slice.
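As a sketch of the container shape (names assumed from this message and
abbreviated), locking then happens once per batch rather than once per
element:

type QueueInboundElementsContainer struct {
    sync.Mutex
    elems []*QueueInboundElement
}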
Reviewed-by: Maisem Ali <maisem@tailscale.com>
Signed-off-by: Jordan Whited <jordan@tailscale.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
If an IPC operation is in flight while close starts, it is possible for
the two code paths to deadlock. Prevent this by taking the IPC lock at
the start of close and holding it for the duration.
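A minimal sketch of the ordering fix, assuming the lock names used
elsewhere in this log:

func (device *Device) Close() {
    device.ipcMutex.Lock() // taken first and held for all of Close
    defer device.ipcMutex.Unlock()
    device.state.Lock()
    defer device.state.Unlock()
    // ... teardown proceeds with both locks held ...
}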
Signed-off-by: James Tucker <jftucker@gmail.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Accept packet vectors for reading and writing in the tun.Device and
conn.Bind interfaces, so that the internal plumbing between these
interfaces now passes a vector of packets. Vectors move untouched
between these interfaces, i.e. if 128 packets are received from
conn.Bind.Read(), 128 packets are passed to tun.Device.Write(). There is
no internal buffering.
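For illustration, the vectorized shape looks roughly like this
(signatures abbreviated; offset handling and batch sizing elided):

type Device interface {
    // Read one or more packets from the device. On success it
    // reports how many packets were read and records each packet's
    // length in sizes.
    Read(bufs [][]byte, sizes []int, offset int) (n int, err error)
    // Write one or more packets to the device.
    Write(bufs [][]byte, offset int) (int, error)
    // ...
}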
Currently, existing implementations are only adjusted to have vectors
of length one. Subsequent patches will improve that.
Also, as a related fixup, use the unix and windows packages rather than
the syscall package when possible.
Co-authored-by: James Tucker <james@tailscale.com>
Signed-off-by: James Tucker <james@tailscale.com>
Signed-off-by: Jordan Whited <jordan@tailscale.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
For some reason, this was omitted for response messages.
Reported-by: z <dzm@unexpl0.red>
Fixes: 8c34c4c ("First set of code review patches")
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Heavier network extensions might require the wireguard-go component to
use less RAM, so let users of this package reduce these as needed.
At some point we'll put this behind a configuration method of sorts, but
for now, just expose the consts as vars.
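A sketch of the change, with illustrative names and values rather than
the package's full list of tunables:

// Previously consts; now vars, so embedders can shrink them.
var (
    QueueOutboundSize          = 1024
    QueueInboundSize           = 1024
    QueueHandshakeSize         = 1024
    PreallocatedBuffersPerPool = 0
)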
Requested-by: Josh Bleecher Snyder <josh@tailscale.com>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Instead of hard-coding exactly two sources from which
to receive packets (an IPv4 source and an IPv6 source),
allow the conn.Bind to specify a set of sources.
Beneficial consequences:
* If there's no IPv6 support on a system,
conn.Bind.Open can choose not to return a receive function for it,
which is simpler than tracking that state in the bind.
This simplification removes existing data races from both
conn.StdNetBind and bindtest.ChannelBind.
* If there are more than two sources on a system,
the conn.Bind no longer needs to add a separate muxing layer.
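For illustration, the resulting shape at the time of this change (a
single-packet signature; vectorization came later):

// ReceiveFunc receives a single inbound packet from the network.
type ReceiveFunc func(b []byte) (n int, ep Endpoint, err error)

type Bind interface {
    // Open puts the Bind into a listening state and returns one
    // ReceiveFunc per packet source, rather than exactly two.
    Open(port uint16) (fns []ReceiveFunc, actualPort uint16, err error)
    Close() error
    // ...
}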
Signed-off-by: Josh Bleecher Snyder <josharian@gmail.com>
There's no way for len(peers) to be 0 when a current peer has
isRunning == false.
This requires some struct reshuffling so that the uint64 pointer is
aligned.
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Since RoutineHandshake calls peer.SendKeepalive(), it is potentially a
writer into the encryption queue, so we need to bump the wg count.
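A sketch of the accounting, assuming the waitgroup-refcounted queue
pattern used elsewhere in this log:

// Each RoutineHandshake may push keepalives into the encryption
// queue, so it holds a writer reference for its lifetime.
device.queue.encryption.wg.Add(1)
go device.RoutineHandshake()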
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
RoutineReadFromTUN can trigger a call to SendStagedPackets.
SendStagedPackets attempts to protect against sending
on the encryption queue by checking peer.isRunning and device.isClosed.
However, those checks are subject to TOCTOU bugs.
If that happens, we get this:
goroutine 1254 [running]:
golang.zx2c4.com/wireguard/device.(*Peer).SendStagedPackets(0xc000798300)
.../wireguard-go/device/send.go:321 +0x125
golang.zx2c4.com/wireguard/device.(*Device).RoutineReadFromTUN(0xc000014780)
.../wireguard-go/device/send.go:271 +0x21c
created by golang.zx2c4.com/wireguard/device.NewDevice
.../wireguard-go/device/device.go:315 +0x298
Fix this with a simple, big hammer: Keep the encryption queue
alive as long as it might be written to.
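A sketch of that hammer, assuming a refcounted queue: the channel only
closes once the last potential writer drops its reference.

type outboundQueue struct {
    c  chan *QueueOutboundElement
    wg sync.WaitGroup
}

func newOutboundQueue() *outboundQueue {
    q := &outboundQueue{
        c: make(chan *QueueOutboundElement, QueueOutboundSize),
    }
    q.wg.Add(1)
    go func() {
        q.wg.Wait() // every writer has called Done
        close(q.c)  // only now is closing safe
    }()
    return q
}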
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
It's never used and we won't have a use for it. Also, move to running
stringer via `go run`, for those without GOPATHs.
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
The test previously checked the offset within a substruct, not the
offset within the allocated struct, so this adds the two together.
This fixes an alignment crash on 32-bit machines.
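For illustration, a sketch of the corrected check; the helper and field
names here are assumed, not necessarily those in the test:

func checkAlignment(t *testing.T, name string, offset uintptr) {
    t.Helper()
    if offset%8 != 0 {
        t.Errorf("%s is not 8-byte aligned: offset %d", name, offset)
    }
}

func TestAlignment(t *testing.T) {
    var device Device
    // The offset relative to the allocated struct is the substruct's
    // offset plus the field's offset within that substruct.
    checkAlignment(t, "Device.rate.underLoadUntil",
        unsafe.Offsetof(device.rate)+unsafe.Offsetof(device.rate.underLoadUntil))
}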
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Part of being actually idempotent is that we shouldn't penalize code
that takes advantage of this property with a log splat.
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
We have a bunch of stupid channel tricks, and I'm about to add more.
Give them their own file. This commit is 100% code movement.
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
The TUN event reader does three things: change the MTU, bring the device
up, and take the device down.
Changing the MTU after the device is closed does no harm.
Device up and device down don't make sense after the device is closed,
but we can check for that condition before proceeding with changeState.
There's thus no reason to block device.Close on RoutineTUNEventReader
exiting.
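A sketch of the guard; updateMTU is a hypothetical helper standing in
for the real MTU-handling code:

for event := range device.tun.device.Events() {
    if event&tun.EventMTUUpdate != 0 {
        device.updateMTU() // harmless even after close
    }
    if event&tun.EventUp != 0 && !device.isClosed() {
        device.Up()
    }
    if event&tun.EventDown != 0 && !device.isClosed() {
        device.Down()
    }
}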
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
This commit simplifies device state management.
It creates a single unified state variable and documents its semantics.
It also makes state changes more atomic.
As an example of the sort of bug that occurred due to non-atomic state changes,
the following sequence of events used to occur approximately every 2.5 million test runs:
* RoutineTUNEventReader received an EventDown event.
* It called device.Down, which called device.setUpDown.
* That set device.state.changing, but did not yet attempt to lock device.state.Mutex.
* Test completion called device.Close.
* device.Close locked device.state.Mutex.
* device.Close blocked on a call to device.state.stopping.Wait.
* device.setUpDown then attempted to lock device.state.Mutex and blocked.
Deadlock results. setUpDown cannot progress because device.state.Mutex is locked.
Until setUpDown returns, RoutineTUNEventReader cannot call device.state.stopping.Done.
Until device.state.stopping.Done gets called, device.state.stopping.Wait is blocked.
As long as device.state.stopping.Wait is blocked, device.state.Mutex cannot be unlocked.
This commit fixes that deadlock by holding device.state.Mutex
when checking that the device is not closed.
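A sketch of the unified state variable and its accessor, with values
assumed from this message rather than copied from the change:

// deviceState is the unified life-cycle state of a Device.
type deviceState uint32

const (
    deviceStateDown deviceState = iota
    deviceStateUp
    deviceStateClosed
)

// isClosed reports whether the device is permanently shut down.
// Code about to change state must recheck this while holding
// device.state.Mutex, which closes the TOCTOU window described above.
func (device *Device) isClosed() bool {
    // device.state.state is an atomic integer holding a deviceState.
    return deviceState(atomic.LoadUint32(&device.state.state)) == deviceStateClosed
}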
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
It is no longer necessary, as of 454de6f3e64abd2a7bf9201579cd92eea5280996
(device: use channel close to shut down and drain decryption channel).
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
This makes the IpcGet method much faster.
We also refactor the traversal API to use a callback so that we don't
need to allocate at all. As part of avoiding allocations, we do
self-masking on insertion, which in turn means that split intermediate
nodes require a copy of the bits.
benchmark             old ns/op     new ns/op     delta
BenchmarkUAPIGet-16   3243          2659          -18.01%

benchmark             old allocs    new allocs    delta
BenchmarkUAPIGet-16   35            30            -14.29%

benchmark             old bytes     new bytes     delta
BenchmarkUAPIGet-16   1218          737           -39.49%
This benchmark is good, though it's only for a pair of peers, each with
only one allowed IP. As that number grows, the delta expands
considerably.
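For illustration, the callback-style traversal as invoked from the UAPI
code; the exact signature is assumed here, and sendf stands for the UAPI
line writer:

// Stream entries straight into the UAPI response without building
// an intermediate slice.
device.allowedips.EntriesForPeer(peer, func(ip net.IP, cidr uint8) bool {
    sendf("allowed_ip=%s/%d", ip.String(), cidr)
    return true // keep walking
})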
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
There are very few cases, if any, in which a user only wants one of
these levels, so combine them into a single level.
While we're at it, reduce indirection on the loggers by using an empty
function rather than a nil function pointer. It's not like we have
retpolines anyway, and we were already calling through a function with a
branch beforehand, so this seems like a net gain.
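For illustration, with names assumed from the resulting logger:

// An empty function instead of a nil pointer: call sites can always
// invoke the logger without a nil check.
func DiscardLogf(format string, args ...interface{}) {}

// Debug and info collapse into a single verbose level.
type Logger struct {
    Verbosef func(format string, args ...interface{})
    Errorf   func(format string, args ...interface{})
}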
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
This commit overhauls wireguard-go's logging.
The primary, motivating change is to use a function instead
of a *log.Logger as the basic unit of logging.
Using functions provides a lot more flexibility for
people to bring their own logging system.
It also introduces logging helper methods on Device.
These reduce line noise at the call site.
They also allow for log functions to be nil;
when nil, instead of generating a log line and throwing it away,
we don't bother generating it at all.
This spares allocation and pointless work.
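A sketch of the function-based unit of logging and a nil-tolerant
helper; the names are assumed, not verbatim:

// The basic unit of logging is a printf-style function.
type LogFn func(format string, args ...interface{})

type Logger struct {
    Debugf LogFn
    Errorf LogFn
}

// debugf is one of the helper methods on Device: when the log
// function is nil, the message is never even formatted.
func (device *Device) debugf(format string, args ...interface{}) {
    if device.log.Debugf != nil {
        device.log.Debugf(format, args...)
    }
}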
This is a breaking change, although the fix required
of clients is fairly straightforward.
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
This is similar to commit e1fa1cc556,
but for the decryption channel.
It is an alternative fix to f9f655567930a4cd78d40fa4ba0d58503335ae6a.
Signed-off-by: Josh Bleecher Snyder <josh@tailscale.com>
It's possible for RoutineSequentialReceiver to try to lock an elem after
RoutineDecryption has exited. Before this change, that meant we never
unlocked the elem, so the whole program deadlocked.
As well, it looks like the flush code (which is now potentially
unnecessary?) wasn't properly dropping the buffers in the
not-already-dropped case.
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>