July 26th, 2024

Bcachefs, an Introduction/Exploration

Bcachefs is a next-generation copy-on-write filesystem offering features like checksums, multi-device support, and RAID. It aims to improve performance and stability compared to existing filesystems like ext4 and Btrfs.

Read original articleLink Icon
CuriositySkepticismHope
Bcachefs, an Introduction/Exploration

Bcachefs is a next-generation copy-on-write (COW) filesystem developed by Kent Overstreet, designed to offer features similar to Btrfs and ZFS while improving performance. It includes full checksums, multi-device support, replication, RAID, caching, compression, encryption, sub-volumes, and snapshots, and has been tested for scalability up to 50+ TB. The need for bcachefs arises from the limitations of existing filesystems like ext4, XFS, Btrfs, and ZFS, which either lack modern features or have complex implementations. Bcachefs aims for stability, speed, and a compact codebase, making it accessible for developers.

The filesystem supports various RAID configurations, though some features are still experimental. It allows for degraded mode operation, enabling users to access data even when some devices are unavailable. Bcachefs also provides automation capabilities through user space tools and interfaces like sysfs and debugfs, although a high-level API is still in development.

Documentation and user support are noted to be lacking, with some error messages being cryptic. Users can create and mount filesystems easily, and the filesystem can handle different physical sector sizes. Bcachefs is positioned as a robust alternative for users seeking a modern filesystem with strong data integrity features, particularly in environments where data corruption is a concern. Overall, bcachefs represents a significant advancement in filesystem technology, addressing many shortcomings of its predecessors.

Related

Show HN: EtchaOS – Secure, immutable, in-memory remixes of popular Linux distros

Show HN: EtchaOS – Secure, immutable, in-memory remixes of popular Linux distros

EtchaOS is a secure, minimal, and immutable Linux distribution for cloud, on-premise, and embedded systems. It prioritizes security, consistency, automatic upgrades, flexibility, and customization. Developed by Candid Development LLC.

Linux 6.10 Released

Linux 6.10 Released

The Linux 6.10 release by Linus Torvalds featured increased activity in filesystems, driver updates, and miscellaneous changes. No extra release candidates were needed. Version 6.11's merge window will open soon, coinciding with Europe's summer vacation. Various subsystems received fixes and improvements, including ARM64, USB, networking, SPI, MMC, USB, ALSA, and more. The bcachefs filesystem underwent corrections and optimizations.

Rust for Filesystems

Rust for Filesystems

At the 2024 Linux Summit, Wedson Almeida Filho and Kent Overstreet explored Rust for Linux filesystems. Rust's safety features offer benefits for kernel development, despite concerns about compatibility and adoption challenges.

Bcachefs for Linux 6.11 Lands Accounting Rewrite and Self-Healing on Read Error

Bcachefs for Linux 6.11 Lands Accounting Rewrite and Self-Healing on Read Error

The Bcachefs update for Linux 6.11 introduces disk accounting code rewrite, self-healing for read and checksum errors, and stripe sectors accounting. Kent Overstreet leads these enhancements for improved file-system performance and reliability.

Arch-based CachyOS promises speed but trips over its laces

Arch-based CachyOS promises speed but trips over its laces

CachyOS, a performance-focused Arch Linux distribution, offers speed and customization but faces installation challenges and reliability issues. It supports various desktop environments but may not suit older hardware users.

AI: What people are saying
The discussion around Bcachefs reveals various perspectives on its features and reliability compared to other filesystems like Btrfs and ZFS.
  • Users express concerns about the maturity and reliability of Bcachefs, given its recent integration into the kernel.
  • There are mixed feelings about RAID functionality, with some users highlighting issues with disk management and others seeking improvements.
  • Comments reflect skepticism about Bcachefs being more reliable than established filesystems like Btrfs, which has extensive real-world testing.
  • Some users are eager for Bcachefs to reach production readiness, citing its promising features like caching and encryption.
  • Discussions also touch on the limitations of existing filesystems, such as ZFS's deduplication capabilities and Btrfs's performance with databases and VMs.
Link Icon 18 comments
By @amluto - 6 months
Here’s my pet peeve regarding RAID: no RAID system I’ve ever used gracefully handles disks that come and go. Concretely: start with two disks in RAID1. Remove one. Mount in degraded mode. Write to a file. Unmount. Reconnect the removed disk. Mount again with both disks.

The results vary between annoying (need to restore / “resilver” and have no redundancy until it’s done; massively increased risk of data loss while doing so due to heavy IO load without redundancy and pointless loss of the redundancy that already exists) to catastrophic (outright corruption). The corollary is that RAID invariably works poorly with disks connected over using an interface that enumerates slowly or unreliably.

Yet most competent active-active database systems have no problems with this scenario!

I would love to see a RAID system that thinks of disks as nodes, properly elects leaders, and can efficiently fast-forward a disk that’s behind. A pile of USB-connected drives would work perfectly, would come up when a quorum was reached, and would behave correctly when only a varying subset of disks is available. Bonus points for also being able to run an array that spans multiple computers efficiently, but that would just be icing on the cake.

By @LeoPanthera - 6 months
The "why not btrfs" line boils down to "it took a long time to be stable".

That's a weird argument. Even if it's true, it is now stable, and has been for a long time. btrfs has long been my default, and I'd be wary of switching to something newer just because someone was mad that development took a long time.

By @Tobu - 6 months
> Error handling on CRC read error > 2 or more copies of file, CRC on error, read other copy, data returned to userspace, does not correct bad copy

That's been implemented; in Linux 6.11 bcachefs will correct errors on read. See

> - Self healing on read IO/checksum error

in https://lore.kernel.org/linux-bcachefs/73rweeabpoypzqwyxa7hl...

Making it possible to scrub from userspace by walking and reading everything (tar -c /mnt/bcachefs >/dev/null).

By @ajb - 6 months
Bcachefs was merged into the kernel only months ago, and had an immediate flurry of bug fixes due to the additional testing this brought. (It was in development for some years before that out of tree). That's the level of maturity that it is at. I think there's a hope that it will become more trustworthy than btrfs due to the developers success with bcache.
By @guenthert - 6 months
Hmmh, under "Why bcachefs?" we find

- Stability but also

- Constant refactorings

and later

"Disclaimer, my personal data is stored on ZFS"

A bit troubling, I find

"RAID0 behavior is default when using multiple disks" never have I ever had the need for RAID0 or have I seen a customer using it. I think it was at one time popular with gamers before SSDs became popular and cheap.

"RAID 5/6 (experimental)

    This is referred to as erasure coding and is listed as “DO NOT USE YET”, "
Well, you got to start somewhere, but a comparison with btrfs and ZFS seems premature.
By @ysleepy - 6 months
I wonder why ZFS is marked as not having de-dupe (deduplication).

AFAIK ZFS has had deduplication support for a very long time (2009) and now even does opportunistic block cloning with much less overhead.

By @frankjr - 6 months
> btrfs Encryption Y

btrfs doesn't have a built-in encryption.

> ZFS Encryption Y

I cannot find the discussion right now but I remember reading that they were considering a warning when enabling encryption because it was not really stable and people were running into crashes.

https://github.com/openzfs/zfs/issues?q=is%3Aissue+label%3A%...

By @linsomniac - 6 months
I had really high hopes for HAMMER2, including that it would one day be ported to Linux, but it seems to have remained firmly planted in Dragonfly BSD and it's not really clear what the status is. https://en.wikipedia.org/wiki/HAMMER2
By @xxmarkuski - 6 months
I'm running bcache, with lvm/luks and xfs on top, since >5 years on my desktop and it has been stable and partition manipulations, like resizes, worked without problems, albeit the tooling is not so well supported.

I bought new a new ssd and hdd for my desktop this year and looked into running bcachefs because it offers caching as well as native encryption and cow. I also determined that it is not production ready yet for my use case, my file system is the last thing I want to beta tester of. Investigated using bcache again, but opted to use lvm caching, as it offers better tooling and saves on one layer of block devices (with luks and btrfs on top). Performance is great and partition manipulations also worked flawless.

Hopefully bcachefs gains more traction and will be ready for production use, as it combines several useful features. My current setup still feels like making compromises.

By @gigatexal - 6 months
> ZFS, pioneering COW filesystem, ... commendable, its block-based design diverges from modern extent-based systems due to complexities in implementing extents with snapshots.

why is this a bad thing?

By @curt15 - 6 months
How well does bcachefs handle databases and VMs? Those workloads are well-known to be btrfs' kryptonite whereas ZFS seems to tolerate them pretty well as long as one sets the correct recordsize (example: https://www.enterprisedb.com/blog/postgres-vs-file-systems-p...).
By @Liftyee - 6 months
An interesting analysis. I can't stop my brain from parsing the title as "B C A Chefs".
By @sevg - 6 months
I recently tried btrfs on a new USB thumb drive. I immediately got hard freezes of my main (Linux) OS while working with the USB stick.

Never again.

I eagerly await bcachefs reaching maturity!

By @commandersaki - 6 months
I’m pretty keen to try bcachefs. Has anyone successfully set it up as root filesystem on a raspberry pi?
By @whalesalad - 6 months
Had to do a double-take on the UI of this blog. It looks identical to my notetaking app, Trilium.
By @tripdout - 6 months
Does it allow both shrinking and growing the FS? Really wish ZFS allowed shrinking.
By @jcalvinowens - 6 months
The idea that a brand new filesystem might be more reliable than good 'ol BTRFS, which Facebook runs on basically their entire infrastructure, is downright laughable to me.

Btrfs is also far more reliable than ZFS in my view, because it has far far more real world testing, and is also much more actively developed.

Magical perfect elegant code isn't what makes a good filesystem: real world testing, iteration, and bugfixing is. BTRFS has more of that right now than anything else ever has.

By @NKosmatos - 6 months
Mandatory xkcd comic: https://xkcd.com/927 (replace "standards" with "FS")