More

zepearl · 2026-06-03T21:19:19 1780521559

It's the AI. Everything gets answered (somehow), no direct questions need to be asked anymore.

(you're right - I was wondering the same thing 1h ago :o) )

zepearl · 2026-06-03T18:13:33 1780510413

This...

Right now the biggest threat to their IPO's is that people realize that local models are good enough for whatever they're peddling...

...plus the recent price increases by AI companies, made me actually think the opposite: that there might be another additional "run" for memory and/or GPUs.

Therefore, yesterday I decided to order an additional RTX 5060 with 16 GiB VRAM for the ~500$ that I saved during the last months (to be added to the RTX 5070 12 GiB that I bought last year to play games in 4k + my old RTX 3060 12 GiB which I recycled a few months ago after noticing how nice it is to run llama.cpp locally without having to worry about subscription costs).

The original 24 GiB VRAM were actually quite enough for some of the stuff that I do (e.g. transcribe text of image scans of old magazines, coding with Aider, etc - I usually use Q5_K_M quantizations of Qwen & Gemma by Bartowski as lower ones delivered sometimes weird results and/or looped forever in "thinking"-mode), but I guess that with 40 GiB I should be bullet-proof for my pessimistic view of our future :o)

zepearl · 2026-05-07T20:36:09 1778186169

So if I understand correctly 3 modules are involved:

- esp4 (kernel config "CONFIG_AF_RXRPC")

- esp6 (kernel config "CONFIG_INET_ESP")

- rxrpc (kernel config "CONFIG_INET6_ESP")

Is this correct?

eqvinox · 2026-05-07T20:55:03 1778187303

You mixed up the names vs. config options but yes killing those 3 options should make you "safe". No warranty.

zepearl · 2026-05-07T21:58:04 1778191084

damn you're right, thx

zepearl · 2026-04-30T18:50:30 1777575030

Thanks a lot!!!

I was running in Gentoo "6.18.18" (amd64) and the exploit worked (and all other shells which I PREVIOUSLY opened could then just execute "su -" without password to become "root") -> doing temporarily a "modprobe -r algif_aead" on-the-fly did not fix it as I was still able to swap to "root" from the unprivileged user by executing just "su -".

"6.18.25" fixed it (module "algif_aead" still running).

- Maybe older Kernel versions that don't contain the fix should be blacklisted?

- FYI in Gentoo I had to recompile "sys-fs/zfs-kmod" after the minor kernel upgrade (I initially skipped it, but after rebooting with the new kernel I could not mount my raidz1) -> the same might be needed for other external modules.

bombcar · 2026-05-01T00:11:56 1777594316

Yeah in theory genkernel should handle zfs but since I’m zfs_on_root because I like living dangerously I have a one liner that genkernels and then re-emerges zfs and then rebuilds the initramfs.

zepearl · 2026-02-28T23:47:52 1772322472

Using X (at least in this context?) is weird.

teyopi · 2026-03-01T01:21:57 1772328117

https://xcancel.com/OpenAI/status/2027846016423321831

zepearl · 2026-02-28T09:44:41 1772271881

I downloaded Ollama ( https://github.com/ollama/ollama/releases ) and experimented with a few Qwen models ( https://huggingface.co/Qwen/collections ).

My performance when using an RTX 5070 12GiB VRAM, Ryzen 7 9700X 8 cores CPU, 32GiB DDR5 6000MT (2 sticks):

  - "qwen2.5:7b": ~128 tokens/second (this model fits 100% in the VRAM).
  - "qwen2.5:32b": ~4.6 tokens/second.
  - "qwen3:30b-a3b": ~42 tokens/second (this is a MoE model with multiple specialized "brains") (this uses all 12GiB VRAM + 9GiB system RAM, but the GPU usage during tests is only ~25%).
  - qwen3.5:35b-a3b: ~17 tokens/second, but it's highly unstable and crashes -> currently not usable for me.

So currently my sweet spot is "qwen3:30b-a3b" - even if the model doesn't completely fit on the GPU it's still fast enough. "qwen3.5" was disappointing so far, but maybe things will change in the future (maybe Ollama needs some special optimizations for the 3.5-series?).

I would therefore deduce that the most important thing is the amount of VRAM and that performance would be similar even when using an older GPU (e.g. an RTX 3060 with as well 12GiB RAM)?

Performance without a GPU, tested by using a Ryzen 9 5950X 16 cores CPU, 128GiB DDR4 3200 MT:

  - "qwen2.5:7b": ~9 tokens/second
  - "qwen3:32b": ~2 tokens/second
  - "qwen3:30b-a3b": ~16 tokens/second

zepearl · 2026-02-01T01:11:13 1769908273

What about pre-December_2022? I cannot imagine that just a handful were imported.

zepearl · 2025-12-29T23:54:20 1767052460

> The main reason for this is lack of competition for DB in Germany

Cannot be - there is no competition in Switzerland, but things run pretty smoothly -> in the case of Germany I'd rather say: "lack of oversight, controls, 'konsequent zu sein'" -> in the case of Germany's DB I think that nobody at all levels gives a *hit about its problems.

tormeh · 2025-12-30T00:58:51 1767056331

Things work well in Switzerland because the Swiss spend a lot more money on rail. That's unfortunately the secret.

zepearl · 2025-12-29T22:35:33 1767047733

I interpreted your post like what "krupan" posted in the separate sub-thread ("This is a much tighter circle than any of us should be comfortable with"), but maybe others interpreted it differently (the words of your post are quite generic...). Cheers :o)

zepearl · 2025-10-29T12:29:24 1761740964

To fix stuttering I had to disable compositing in the window manager (Xfce on Linux Mint, nVidia proprietary with AMD CPU).

nmz · 2025-10-29T23:25:56 1761780356

And there's also pulseaudio, which I had to run in some games with PULSE_LATENCY_MSEC=90 %command%, other games you can only run in lutris, other games you can't even minimize it or it will mess up with the screen entirely.