Show HN: Walrus – a Kafka alternative written in Rust

65 points by janicerk 3 days ago

Barathkanna 33 minutes ago

Walrus isn’t trying to replace Kafka, but it does beat Kafka in a few narrow areas. It’s a lightweight Rust-based distributed log with a fast WAL engine and modern I/O (io_uring), so the operational overhead is much lower than running a full Kafka stack. If you just want a simple, fast log without JVM tuning, controllers, or the entire Kafka ecosystem, Walrus is a lot easier to run. Kafka still wins on ecosystem, connectors, and massive scale, but Walrus is appealing for teams that want the core idea without the complexity. Really impressed by the direction here, great work!!.

lionkor 3 hours ago

Fun anecdote; a couple years ago I started writing a Kafka alternative in C++ with a friend. I got pretty far, but abandoned the project.

We called it `tuberculosis`, or `tube` for short; of course, that is what killed Kafka.

sgt 2 hours ago

Imagine talking to your clients about tech stacks and "we're running tuberculosis" comes up... while people are dying from it.
- lionkor 2 hours ago
  
  You just say "well, the alternative was Kafka" and they'd surely get it. Or not. Either way we imagined it to be hilarious.
  - ramses0 an hour ago
    
    t10s, pronounced "tíos" or a stuttering "t- tents" on your geo. :-D

k_bx 4 hours ago

There's also Iggy https://github.com/apache/iggy

Never tried it, but looks promising

tormeh 2 hours ago

Looks like it has a solid amount of contributors. Exciting! Some other attempts like Fluvio seem to have lost momentum.

gethly 42 minutes ago

I never understood the popularity of Kafka. It's just a queue with persistent storage(ie. not in-memory queu with ram-size limited capacity) after all.

ertucetin 11 minutes ago

We need Rust alternative not written in Rust

roncohen 4 hours ago

As someone who myself worked on a hobby-level Rust based Kafka alternative that used Raft for metadata coordination for ~8 months: nice work!

Wasn't immediately clear to me if the data-plane level replication also happens through Raft or something home-rolled? Getting consistency and reliability right with something home-rolled is challenging.

Notes:

- Would love to see it in an S3-backed mode, either entirely diskless like WarpStream or as tiered storage.

- Love the simplified API. If possible, adding a Kafka compatible API interface is probably worth it to connect to the broader ecosystem.

Best of luck!

nubskr 3 hours ago

Hi, the creator here, I think its a good idea to have S3 backed storage mode, its kinda tricky to do it for the 'active' block which we are currently writing to, but totally doable for historical data.
Also about the kafka API, I tried to implement that earlier, I had a sort of `translation` layer for that earlier, but it gets pretty complicated to maintain that because kafka is offset based, while walrus is message based.
- EdwardDiego 2 hours ago
  
  TBH I don't think anyone can utilise S3 for the active segment, I didn't dig into Warpstream too much, but I vaguely recall they only offloaded to S3 once the segment was rolled.
  - zellyn 3 minutes ago
    
    The Developer Voices interview where Kris Jenkins talks to Ryan Worl is one of the best, and goes into a surprising amount of detail: https://www.youtube.com/watch?v=xgzmxe6cj6A
    tl;dr they write to s3 once every 250ms to save costs. IIRC, they contend that when you keep things organized by writing to different files for each topic, it's the Linux disk cache being clever that turns the tangle of disk block arrangement into a clean view per file. They wrote their own version of that, so they can cheaply checkpoint heavily interleaved chunks of data while their in-memory cache provides a clean per-topic view. I think maybe they clean up later async, but my memory fails me.
    I don't know how BufStream works.
    The thing that really stuck with me from that interview is the 10x cost reduction you can get if you're willing and able to tolerate higher latency and increased complexity and use S3. Apparently they implemented that inside Datadog ("Labrador" I think?), and then did it again with WarpStream.
    I highly recommend the whole episode (and the whole podcast, really).
seanhunter 4 hours ago
It says on the github page
```
   " It provides fault-tolerant streaming with automatic leadership rotation, segment-based partitioning, and Raft consensus for metadata coordination."
```
So I guess that's a "yes" to raft?
- zbentley 3 hours ago
  
  GP asked about data plane consensus, not metadata/control plane.
  - EdwardDiego 2 hours ago
    
    They asked about data plane replication - e.g., leader -> followers. Unless I misunderstood them.

teleforce 3 hours ago

For Kafka alternative written in C++ there's Redpanda [1],[2].

Redpanda claim of better performance but benchmarks showed no clear winner [3].

It will be interesting to test them together on the performance benchmarks.

I've got the feeling it's not due to programming language implementation of Scala/Java (Kafka), C++ (Redpanda) and Rust (Walrus).

It's the very architecture of Kafka itself due to the notorious head of line problem (check the top most comments [4].

[1] Redpanda – A Kafka-compatible streaming platform for mission-critical workloads (120 comments):

https://news.ycombinator.com/item?id=25075739

[2] Redpanda website:

https://www.redpanda.com/

[3] Kafka vs. Redpanda performance – do the claims add up? (141 comments):

https://news.ycombinator.com/item?id=35949771

[4] What If We Could Rebuild Kafka from Scratch? (220 comments):

https://news.ycombinator.com/item?id=43790420

drob518 12 minutes ago

Or it’s I/O-bound.
nubskr 3 hours ago

In the current benchmarks, I only have Kafka and rocksdb wal, will surely try to add redpanda there as well, curious how walrus would hold up against seastar based systems.
- chaotic-good 2 hours ago
  
  I don't see any mentions of p99 latency in the benchmark results. Pushing gigabytes per second is not that difficult on modern hardware. Doing so with reasonable latency is what's challenging. Also, instead of using custom benchmarks it's better to just use the OMB (open-messaging benchmark).
EdwardDiego 2 hours ago

> It's the very architecture of Kafka itself due to the notorious head of line problem
Except a consumer can discard an unprocessable record? I'm not certain I understand how HOL applies to Kafka, but keen to learn more :)

fareesh 34 minutes ago

coo coo ca choo

oulipo2 3 hours ago

Nice! How does it compare to Redpanda, NATS, etc?

throwfaraway135 27 minutes ago

[dead]

YouAreWRONGtoo an hour ago

[dead]

arschficknigger 2 hours ago

Sounds more like an Rosie O'Donnell alternative.