floquant

@ floquant @lemmy.dbzer0.com

Posts

3
Comments

1389
Joined

2 yr. ago

I am sorry to break the bubble but that is a baseless assumption, if not in marketing. GPT models have been sold as having "PhD-" or "MD-" "level intelligence" since GPT3. Anectodally, recent models have been improving in some areas but regressing in others. "Frontier models" have incredibly opaque performance and safety benchmarks, and as time goes on more and more training data is LLM-generated, less and less comes from humans, and models start breaking down.

In response to your point: I am mainly interested in probabilistic reliability - if it gives the correct answer 99.9% of the time, it is clearly superior to the vast majority of human beings

Again, nowhere near the actual accuracy of current models. It is a big jump from 85% (wrong >1/10 of the time) to 99.9% (wrong 1 in 1000 times). At best it would barely break 90%, which is still 1 in 10.

Interestingly, my question "What was India like before the British arrived?" produces consistently biased and misleading answers. Though I haven't asked it for the new model.

An LLM's knowledge, its "intelligence", is its training data, nothing more, nothing less. Its scope, or "purpose" is its context/prompt, nothing more, nothing less. That means answering the question though the lens of British colonialism, based on a corpus of mostly "white history". I bet that if you ask the same question using a timeframe (i.e. "before the 14th century") and don't use the word "British" you'll get a slightly less, but still biased answer.

5d ago

‘This Is a War’: OpenAI and Anthropic are spending massive amounts of money to influence US midterms

Jump

floquant @lemmy.dbzer0.com 5d ago

It's literal fake money. When you read news of Anthropic investing $400B in Google, do you think that's "money" in the same sense it is for real people?

6d ago

Dumb Ways for an Open Source Project to Die

Jump

floquant @lemmy.dbzer0.com 6d ago

Fuck now the song is stuck in my head

6d ago

Nato ready to defend ‘every inch’ of territory as Russian drone hits Romania

Jump

floquant @lemmy.dbzer0.com 6d ago

Huh, I thought VAT was also pronounced as a word

6d ago

Netanyahu orders Israeli army to seize ‘70% of Gaza Strip’, violating ceasefire deal

Jump

floquant @lemmy.dbzer0.com 6d ago

Throughout the eight months of the ceasefire, Israeli forces have continued to open fire

Then maybe ceasefire is not an appropriate word?

6d ago

Mystery company accidentally blew $500 million on Claude in a single month — failed to put usage limit on licenses for employees

Jump

floquant @lemmy.dbzer0.com 6d ago

The more recent report says corporate AI adoption has found several issues with AI, with human workers turning to automating dreary and mundane tasks they don't like doing, rather than valuable or meaningful work.

Thank god we have consulting companies to tell us what humans like!

7d ago

'I Don't F*ck With Trump': Artists Listed for US 250th Anniversary Celebration Drop Out

Jump

floquant @lemmy.dbzer0.com 7d ago

I don’t fuck with Trump. I don’t give a fuck about Trump. I know the type of fucking anarchy he creates.

Ah yes, government overreach and fascism, aka anarchy

1w ago

Do you stay on vpn 24/7 or turn it on whenever you need it?

Jump

floquant @lemmy.dbzer0.com 1w ago

I default to on, and turn it off if I need something that is blocking me and there's no workaround.

Although the traffic itself is protected, I find the signal of «I am doing something "secret" right now» to be best avoided both for passive metadata collection as well as active correlation attacks reasons; as well as to avoid leaks by clicking some link or just loading an image from some server that might be more "revealing" than I'd like. I'm not talking about anything super spicy, for example just by being a Lemmy user the image hosts you see more can depend on the comms/instances you use the most often - a pretty small segment.

1w ago

What profession do y'all hate with deepest of passion

Jump

floquant @lemmy.dbzer0.com 1w ago

1w ago

What is the deal with IPv6?

Jump

floquant @lemmy.dbzer0.com 1w ago

On the LAN side sure, but I don't think many people would make a public website/webapp "true single stack". If there's a network appliance "terminating" the IPv6 connection and "NATting" it over IPv4 that's a terrible hack that is even worse than not having it at all imho

Unless you're talking about the link-local fe80 addresses, but those are basically sparkly MAC addresses

2w ago

How is NoSteam pernounced is it No Steam or Nos Team?

Jump

floquant @lemmy.dbzer0.com 2w ago

I actually think the group is called NoS

2w ago

Me irl

Jump

floquant @lemmy.dbzer0.com 2w ago

Astroturfing in action

2w ago

Netanyahu scolds Israeli security minister for videos taunting flotilla activists

Jump

floquant @lemmy.dbzer0.com 2w ago

2w ago

Stop the datacenters

Jump

floquant @lemmy.dbzer0.com 2w ago

Sorry your feelings were hurt but the discussion is about AI datacenters with supposedly hundreds of thousands of Nvidia cards and very supposedly an electrical capacity of up to 9GW, no one cares about regular datacenters

2w ago

Or any neurodiversity

Jump

floquant @lemmy.dbzer0.com 2w ago

Singing and dancing. Bopping really helps

2w ago

Anyone using zram and similar memory management with 32gb of ram or more?

Jump

floquant @lemmy.dbzer0.com 2w ago

It's not the opinion itself, it's just the attitude. Your comment is a perfect example of what I consider a good reply as you brought both hard data and some nuance in expressing how you formed your opinion