rutrum

@ rutrum @lm.paradisus.day

Posts

16
Comments

168
Joined

3 yr. ago

1y ago

karolherbst 🐧 🦀 (@karolherbst@chaos.social) "MAINTAINERS: Remove myself"
Jump
rutrum @lm.paradisus.day 1y ago
For people who didnt understand the phrase like myself: https://en.wikipedia.org/wiki/Thin_blue_line?wprov=sfla1

1y ago

Codeberg down, berg down

Codeberg keeps calling the group the far right. Is there any political motivation or something else here? To me, it just looks like troll behavior. Is there more details about the attacks that I missed?

1y ago

The best Cloud backup in 2025?

Jump

rutrum @lm.paradisus.day 1y ago

I use borgbackup to create backups. I point backups to another home computer and borgbase.com. Borg itself is an amazing tool. I think you should learn how it works even if it doesnt end up being the best fit for you.

1y ago

What do you host on your backup servers?

Jump

rutrum @lm.paradisus.day 1y ago

I've got a subset of my files encrypted and backed up using borg. It gets backed up to another computer in my home and then cloud storage via borgbase.com.

Cooking @lemmy.world

rutrum @lm.paradisus.day

1y ago

What's your favorite kitchen utilities?

1y ago

Suggesting Matrix as a channel for silly "keep-in-touch" group chats after occasional meet-ups and outings

Jump

rutrum @lm.paradisus.day 1y ago

Have you tried this?

1y ago

Do you use virtual credit cards?

Jump

rutrum @lm.paradisus.day 1y ago

I use my exist credit card company, now. I still get my x% cash back. And the credit card company arent the people Im trying to hide from in this case. Thanks for letting me know.

1y ago

Pinepods 0.7.2 - The rust based self-hosted podcast platform, complete with Podcasting 2.0 features!

Jump

rutrum @lm.paradisus.day 1y ago

And how do you like yew? Long ago I used seed.rs, which was more like ELM than react. But I think that project has since gone unmaintained. I also tried yew when I think they were in the middle of a huge API transition. Do you think its easier to write in yew than it would be in react/vue/svelte?

1y ago

Pinepods 0.7.2 - The rust based self-hosted podcast platform, complete with Podcasting 2.0 features!

Jump

rutrum @lm.paradisus.day 1y ago

Cursed tech stack and image. Project looks cool. Can you elaborate more on why you used rust for front end and python for backend? I would assume rust would have been more applicable for back end work. Then again, Im not familiar with fastAPI.

1y ago

(probably not) Microsoft bought "codeburg.org" and redirects it to github.

Jump

rutrum @lm.paradisus.day 1y ago

No I was being accusatory unfairly. I've updated my post.

Programming @programming.dev

rutrum @lm.paradisus.day

1y ago

(probably not) Microsoft bought "codeburg.org" and redirects it to github.

1y ago

What's the deal with ONLYOFFICE?

Jump

rutrum @lm.paradisus.day 1y ago

Glancing through zettlr's website and docs, Im not sure I understand it. Is it just notetaking software, that utilizes pandoc to build professional documents (via pdflatex)? Whats an example use case?

1y ago

Introducing SystemD Pilot, GUI app for managing systemd services

Jump

rutrum @lm.paradisus.day 1y ago

Wow, all in 600 lines of python. Looks great!

1y ago

Microsoft Recall screenshots credit cards and Social Security numbers, even with the "sensitive information" filter enabled

Jump

rutrum @lm.paradisus.day 1y ago

How was jumping from windows to NixOS?

1y ago

Microsoft Recall screenshots credit cards and Social Security numbers, even with the "sensitive information" filter enabled

Jump

rutrum @lm.paradisus.day 1y ago

It might take a screenshot and keep in memory, and only save to disk after some image processing that detects if there is sensitive data.

1y ago

Proton 9.0-4 released

Jump

rutrum @lm.paradisus.day 1y ago

Hey OP, how do you follow all these updates? Is it RSS feeds on these projects? You're on top of it this morning.

1y ago

Any data scientists out there? What's your go to programming language and tools for your work?

Jump

rutrum @lm.paradisus.day 1y ago

First off, understanding the different data structure from a high level is mandatory. I would understand the difference between a dataframe, series, and index are. Further, learn how numpy's ndarrays play a role.

From there, unfortunately, I had to learn by doing...or rather struggling. It was one question at a time to stack overflow, like "how to filter on a column in pandas". Maybe in the modern era of LLMs, this part might be easier. And eventually, I learned some patterns and internalized the data structures.

1y ago

Any data scientists out there? What's your go to programming language and tools for your work?

Jump

rutrum @lm.paradisus.day 1y ago

You are correct. For some data sources like parquet it includes some metadata that helps with this, but it's not as robust at databases I dont think. And of course, cvs have no metadata (I guess a header row.)

The actually specification for how to efficiently store tabular data in memory that also permits quick execution of filtering, pivoting, i.e. all the transformations you need...is called apache arrow. It is the backend of polars and is also a non-default backend of pandas. The complexity of the format I'm unfamiliar with.

1y ago

Any data scientists out there? What's your go to programming language and tools for your work?

Jump

rutrum @lm.paradisus.day 1y ago

I learned SQL before pandas. It's still tabular data, but the mechanisms to mutate/modify/filter the data are different methodologies. It took a long time to get comfy with pandas. It wasnt until I understood that the way you interact with a database table and a dataframe are very different, that I started to finally get a grasp on pandas.

1y ago

Any data scientists out there? What's your go to programming language and tools for your work?

Jump

rutrum @lm.paradisus.day 1y ago

If it works, don't fix it!

1y ago

Any data scientists out there? What's your go to programming language and tools for your work?

Jump

rutrum @lm.paradisus.day 1y ago

A big feature of polars is only loading applicable data from disk. But during exporatory data analysis (EDA) you often have the whole dataset in memory. In this case, filters wont help much there. Polars has a good page in their docs about all the possible optimizations it is capable of. https://docs.pola.rs/user-guide/lazy/optimizations/

One I see off the top is projection pushdown, which only selects relevant columns for a final transformations. In pandas, if you perform a group by with aggregation, then only look at a few columns, you still perform aggregation across all the data. In polars lazy API, you would define the entire process upfront, and it would know not to aggregate certain columns, for instance.