Skip Navigation

Posts
16
Comments
168
Joined
3 yr. ago

  • Codeberg keeps calling the group the far right. Is there any political motivation or something else here? To me, it just looks like troll behavior. Is there more details about the attacks that I missed?

  • I use borgbackup to create backups. I point backups to another home computer and borgbase.com. Borg itself is an amazing tool. I think you should learn how it works even if it doesnt end up being the best fit for you.

  • I've got a subset of my files encrypted and backed up using borg. It gets backed up to another computer in my home and then cloud storage via borgbase.com.

  • Cooking @lemmy.world

    What's your favorite kitchen utilities?

  • Have you tried this?

  • I use my exist credit card company, now. I still get my x% cash back. And the credit card company arent the people Im trying to hide from in this case. Thanks for letting me know.

  • And how do you like yew? Long ago I used seed.rs, which was more like ELM than react. But I think that project has since gone unmaintained. I also tried yew when I think they were in the middle of a huge API transition. Do you think its easier to write in yew than it would be in react/vue/svelte?

  • Cursed tech stack and image. Project looks cool. Can you elaborate more on why you used rust for front end and python for backend? I would assume rust would have been more applicable for back end work. Then again, Im not familiar with fastAPI.

  • Jump
  • No I was being accusatory unfairly. I've updated my post.

  • Programming @programming.dev

  • Glancing through zettlr's website and docs, Im not sure I understand it. Is it just notetaking software, that utilizes pandoc to build professional documents (via pdflatex)? Whats an example use case?

  • Wow, all in 600 lines of python. Looks great!

  • How was jumping from windows to NixOS?

  • It might take a screenshot and keep in memory, and only save to disk after some image processing that detects if there is sensitive data.

  • Hey OP, how do you follow all these updates? Is it RSS feeds on these projects? You're on top of it this morning.

  • First off, understanding the different data structure from a high level is mandatory. I would understand the difference between a dataframe, series, and index are. Further, learn how numpy's ndarrays play a role.

    From there, unfortunately, I had to learn by doing...or rather struggling. It was one question at a time to stack overflow, like "how to filter on a column in pandas". Maybe in the modern era of LLMs, this part might be easier. And eventually, I learned some patterns and internalized the data structures.

  • You are correct. For some data sources like parquet it includes some metadata that helps with this, but it's not as robust at databases I dont think. And of course, cvs have no metadata (I guess a header row.)

    The actually specification for how to efficiently store tabular data in memory that also permits quick execution of filtering, pivoting, i.e. all the transformations you need...is called apache arrow. It is the backend of polars and is also a non-default backend of pandas. The complexity of the format I'm unfamiliar with.

  • I learned SQL before pandas. It's still tabular data, but the mechanisms to mutate/modify/filter the data are different methodologies. It took a long time to get comfy with pandas. It wasnt until I understood that the way you interact with a database table and a dataframe are very different, that I started to finally get a grasp on pandas.

  • If it works, don't fix it!

  • A big feature of polars is only loading applicable data from disk. But during exporatory data analysis (EDA) you often have the whole dataset in memory. In this case, filters wont help much there. Polars has a good page in their docs about all the possible optimizations it is capable of. https://docs.pola.rs/user-guide/lazy/optimizations/

    One I see off the top is projection pushdown, which only selects relevant columns for a final transformations. In pandas, if you perform a group by with aggregation, then only look at a few columns, you still perform aggregation across all the data. In polars lazy API, you would define the entire process upfront, and it would know not to aggregate certain columns, for instance.

  • Imo Rust already has the perfect book. I would make a resource for C developers. Especially since you know C already.

  • Programming @programming.dev

    Any data scientists out there? What's your go to programming language and tools for your work?

  • Linux @lemmy.ml

    Best resources for learning systemd?

  • Open Source @lemmy.ml

    What's your favorite FOSS tools for image editing?

  • Self Hosted - Self-hosting your services. @lemmy.ml

    What to add to your SHTF document? What do you put in the paper document that contains instructions for disaster recovery, such as drive failure, cloud service loss, death, etc?

  • Linux @lemmy.ml

    Recommended whiteboarding programs?

  • Selfhosted @lemmy.world

    How do you manage your encryption keys?

  • Linux @lemmy.ml

    Unable to run TabbyML with GPU on NixOS or Docker (solved on docker!)

  • Privacy Guides @lemmy.one

    Best service for filing taxes?

  • Privacy Guides @lemmy.one

    What's the best strategy for changing to GrapheneOS?

  • datahoarder @lemmy.ml

    What guides, wikis, or megathreads are available for those new to archiving and storing data?

  • Linux @lemmy.ml

    What's with all these hip filesystems and how are they different?

  • Linux @lemmy.ml

    Thanks to dust I deleted a 70 gig file on my drive

  • Privacy @lemmy.ml

    Do you use virtual credit cards?

  • Self Hosted - Self-hosting your services. @lemmy.ml

    What Nextcloud apps do you use the most?