

  • Many, many, many subnets, so many subnets:

    - different subnets for VMs, for jailed services, for guest wifi, ‘secure’ wifi, and ‘normal’ wifi (i.e. phones and shit)
    - a routed subnet on my workstation for its LXC containers
    - remote subnets for my wifi routers over VPN when I travel (with restrictions similar to home access and the same 3 SSIDs)
    - an unrouted subnet for stuff like BMCs, switches and infrastructure
    - a subnet in my DMZ with statics, the backside of that subnet, and the subnet that subnet uses for upstream access

    I have a lot of subnets.


  • Have a video dataset with a 1M recordsize, primarycache=metadata, secondarycache=metadata, and a general dataset as its parent with a 128K recordsize, primarycache=secondarycache=all (the default), and compression=lz4 or zstd or something.
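
    In zfs terms that's just two datasets with per-dataset properties. A minimal sketch of the layout (hypothetical pool/dataset names; needs root and an existing pool):

    ```python
    import subprocess

    def zfs_create(name, **props):
        # Build "zfs create -o key=value ... name" and run it.
        opts = [arg for k, v in props.items() for arg in ("-o", f"{k}={v}")]
        subprocess.run(["zfs", "create", *opts, name], check=True)

    # Hypothetical pool/dataset names.
    zfs_create("tank/media", recordsize="128K", compression="lz4")   # general parent
    zfs_create("tank/media/video", recordsize="1M",
               primarycache="metadata", secondarycache="metadata")   # video child
    ```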

    Works like a monster; I don’t worry about things like .srt files and such, though your symlinks idea looks interesting.

    I’m reworking my entire system to get off the filesystem structure anyway and use Python and some other DB, possibly reading from Sonarr for metadata seeding, but I haven’t got to it yet.

    Actually, you make a good point: it would be nice if Sonarr put NFOs in a different structure, but since I’m going to read Sonarr metadata I can just delete them anyway.
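
    For the metadata-seeding part, a minimal sketch of what I mean, assuming Sonarr’s v3 REST API and a hypothetical host/key:

    ```python
    import requests

    SONARR = "http://localhost:8989"   # hypothetical host
    API_KEY = "your-api-key"           # hypothetical key

    # Pull every series Sonarr knows about, with its on-disk path.
    series = requests.get(f"{SONARR}/api/v3/series",
                          headers={"X-Api-Key": API_KEY}).json()
    for s in series:
        print(s["title"], "->", s["path"])
    ```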



  • That’s interesting. I spent a decade doing HPC and other optimizations for large software on 2-socket systems; there are degenerate cases, which can be fixed, I just doubt they’re here.

    FreeCAD sounds like it was poorly written, with a lot of hopping around RAM and poor cache locality, which happens but is pretty ugly.

    ML tends to be better behaved; it’s actually very close to DSP code, and the compilers try to enforce locality. More importantly, a lot of the modules are hand-coded for extreme performance.

    I’m not trying to be discouraging; I’m saying this as someone who originally looked for performance in the OS, and often found it there, but later found more in the loops themselves or the compiler. Basically, Linux is a lot smarter than it used to be, and many applications are too.

    Just my 2c: there are performance tools (perf, for example) that can tell you how much of your time goes to the OS versus everything else, and in ML you shouldn’t be swapping so much that it hurts you a lot.
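
    To make that concrete, the cheapest version of that measurement on Linux is one stdlib call; a rough sketch:

    ```python
    import resource

    # Rough split of where the time went: your own code (user) vs the kernel (system).
    u = resource.getrusage(resource.RUSAGE_SELF)
    print(f"user time:    {u.ru_utime:.2f}s")
    print(f"system time:  {u.ru_stime:.2f}s")
    print(f"major faults: {u.ru_majflt}")   # consistently nonzero here usually means you're paging
    ```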


  • K, for that, look at a kernel feature called CPU isolation (isolcpus / nohz_full); a friend of mine implemented/upstreamed it. Basically you take cores half out of Linux and can use them for heavy workloads.

    But I doubt you’d see more than a 1% improvement; Linux doesn’t do that much without you asking.

    You can try setting RT priority, but I’ve never found that to matter much.
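
    If you do go down that road, the user-space half is tiny; a sketch, assuming cores 2–3 were isolated at boot (isolcpus=2,3 nohz_full=2,3) and you’re on Linux:

    ```python
    import os

    # Pin this process onto the (hypothetically) isolated cores.
    os.sched_setaffinity(0, {2, 3})

    # Optionally give it FIFO real-time priority (needs root or CAP_SYS_NICE).
    os.sched_setscheduler(0, os.SCHED_FIFO, os.sched_param(50))
    ```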

    Listen, this is the kind of thing I would have tried a decade ago, but the thing to remember is: time spent improving the algorithm is generally more effective than time spent trying to optimize kernel overhead that millions of people have been optimizing for decades.


  • Stop. Go back. This is the wrong way.

    If you’re running Python you basically need a full OS.

    There are projects that run on an RTOS, and in fact I worked on an ML SoC that ran Linux, but there are 2 levels here:

    1. The ML processing itself, i.e. the math. This is simple in software and very complex otherwise. The software just says “copy this block and start running a matrix multiply”; the hard logic is in moving data around efficiently.

    2. The stack. This is high level, Python or so, and has graph-processing overhead too. This needs a lot of “overhead” by its nature.

    In either case, don’t worry about any of this; the overhead won’t be very noticeable and you’ll be CPU-gated hard. The main thing is finding an optimized PyTorch library.

    If you have an AMD CPU, or somehow have an NVIDIA GPU in your laptop, you might be able to use their PyTorch library, which would improve performance by roughly 1.5–2 orders of magnitude.
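
    Whatever backend you end up with, the selection logic on your side is tiny; a minimal sketch (the heavy math happens in hand-optimized kernels either way):

    ```python
    import torch

    # Use the GPU build if one is actually available, otherwise fall back to CPU.
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

    x = torch.randn(1024, 1024, device=device)
    y = x @ x   # dispatches to cuBLAS / oneDNN / etc., not Python loops
    ```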

    Unfortunately there isn’t a PyTorch implementation for Intel iGPUs, but there is an OpenCL backend for PyTorch, and apparently this madlad got it working on an Intel iGPU: https://dev-discuss.pytorch.org/t/implementing-opencl-backend-for-pytorch/283/9

    But don’t worry about overhead; it’s fractions of a percent in these kinds of tasks, and there are ways to bypass it completely.




  • Lots of good choices:

    One of the 4-port Atom PCs on Amazon, or even one of the ARM ones; the key is Ethernet ports, and remember you’ll need to handle your wifi. Put Debian, pfSense, OpenWrt, whatever you like on it, and it’ll be great.

    One of the OpenWrt systems; a high-end GL.iNet isn’t bad, just pick any of the better ones.

    Had a FreeBSD server that ran a VNET jail for routing; it was glorious, no notes, just perfect.

    Running a UniFi Dream Machine SE right now, mostly because I want someone else to handle security (I know it’s not much, I just don’t have any bandwidth for that now). Works fine, but I’m using UniFi wifi, so there’s a tie-in there.

    If you want a retail system, go either OpenWrt or UniFi. I know why people have issues with Ubiquiti, but it’s probably the best prosumer hardware and software you can get without rolling your own. I haven’t used pfSense much; maybe that would change my mind.