bcachefs: Work around deadlock to btree node rewrites in journal replay - bcachefs.git


author	Kent Overstreet <kent.overstreet@linux.dev>	2025-07-01 13:36:51 -0400
committer	Kent Overstreet <kent.overstreet@linux.dev>	2025-07-01 13:41:13 -0400
commit	d321d5c84188b9b43d53d5b3ab822dbe4f754456 (patch)
tree	23bd1c32c62fd4d7c9b4f5d820d0599dc801109e /tools/perf/builtin-script.c
parent	63b14f899c4603c5220769b1bb6ec8ada36023a2 (diff)

bcachefs: Work around deadlock to btree node rewrites in journal replay (bcachefs-testing)

Don't mark btree nodes for rewrite when they are, or would be, degraded and journal replay hasn't finished, to avoid a deadlock.

This is because btree node rewrites generate more updates for the interior updates (alloc, backpointers), and those updates can touch new nodes and generate more rewrites: we can only have so many interior btree updates in flight before we deadlock on open_buckets.

The biggest cause is that we don't use the btree write buffer for the backpointer updates - this needs some real thought on locking in order to fix.

The problem with this workaround (not doing the rewrite for degraded nodes in journal replay) is that those degraded nodes persist, and we don't want that. This is a real bug when a btree node write completes with fewer replicas than we wanted and leaves a degraded node due to device _removal_, i.e. the device went away mid write.

It's less of a bug here, but still a problem because we don't yet have a way of tracking degraded data - we need another index (all extents/btree nodes, by replicas entry) in order to fix this properly (re-replicate degraded data at the earliest possible time).

Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
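The check described in the commit message can be sketched as a small standalone model. This is only an illustration of the decision, not the actual bcachefs code: every type, field, and function name below (`fs_model`, `btree_node_model`, `node_is_degraded`, `should_mark_for_rewrite`) is hypothetical, and the model only covers nodes that are already degraded, not ones that "would be" degraded.

```c
/*
 * Illustrative model only. All names here are hypothetical and do not
 * correspond to real bcachefs types or functions.
 */
#include <assert.h>
#include <stdbool.h>

/* Minimal stand-in for filesystem state: has journal replay finished? */
struct fs_model {
	bool journal_replay_done;
};

/* Minimal stand-in for a btree node's replication state. */
struct btree_node_model {
	unsigned replicas_have;	/* replicas currently available */
	unsigned replicas_want;	/* replicas the node is supposed to have */
};

/* A node is degraded when it has fewer replicas than wanted. */
static bool node_is_degraded(const struct btree_node_model *b)
{
	return b->replicas_have < b->replicas_want;
}

/*
 * Decide whether a node should be marked for rewrite because it is
 * degraded. While journal replay is still running, degraded nodes are
 * skipped: each rewrite would generate further interior updates, and
 * too many of those in flight can deadlock.
 */
static bool should_mark_for_rewrite(const struct fs_model *fs,
				    const struct btree_node_model *b)
{
	assert(fs && b);

	if (!node_is_degraded(b))
		return false;	/* nothing to repair in this model */

	if (!fs->journal_replay_done)
		return false;	/* the workaround: defer during replay */

	return true;
}
```

In this model a degraded node seen during replay is simply left alone, which mirrors the drawback the message points out: nothing records that the node still needs re-replication later.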

Diffstat (limited to 'tools/perf/builtin-script.c')

0 files changed, 0 insertions, 0 deletions

