Fictional "Journal of Astrological Big Data Ecology" has infected Google's search AI

qaz@lemmy.world · 2 months ago

Fictional "Journal of Astrological Big Data Ecology" has infected Google's search AI

qaz@lemmy.world · 2 months ago

Well, then Google shouldn’t have just scraped the site then. It’s not JABDE’s responsibility to make their content suitable for LLM training

Lovable Sidekick@lemmy.world · 2 months ago

It’s everybody’s responsibility not to spray piss in random directions hoping some of it will hit somebody they hate.

Zacryon@feddit.org · 2 months ago

It’s everybody’s responsibility to get fucking literate in the use of media. Examples like these are harmless and just point out how easy it is for malicious actors, be it states, political partys, or other groups and individuals, to spread misinformation.
The use of AI tools such as LLMs makes this even more important.

Iron Lynx@lemmy.world · 2 months ago

One thing about internet sources is that in general, people engage with them if they choose to. Your piss-spraying analogy only works if the users don’t have this freedom. At least for now, we the end users still have the choice to engage with LLM’s, or to choose to navigate elsewhere.

So no, there is no randomly pissing around hoping that LLM training data is among the things being hit. It’s Big G demanding everything as LLM training data and tossing it on the heap, and someone finding that said heap includes The Onion and individual shitposters, and with their dislike for LLM’s, acting accordingly.

Corkyskog@sh.itjust.works · 2 months ago

I always wonder how many of my old snarky Reddit posts without a /s tag is now incorrectly advising people making LLM requests haha.

boonhet@sopuli.xyz · 2 months ago

Fucksmith and his pizza recipe lol

Iron Lynx@lemmy.world · 2 months ago

Hey, don’t ding it till you’ve tried it! Maybe pizza with glue is the invention of the damn millennium!

^/s ^of ^course

Iron Lynx@lemmy.world · 2 months ago

Oh one more thing:

Be glad that OP’s site is shitposting.

This could get much worse if it was politically motivated propaganda.

Don’t believe me? Try getting DeepSeek to say anything critical of the CCP.

ArcaneSlime@lemmy.dbzer0.com · 2 months ago

If we hate LLMs hard enough and they train on that data, can we make them suicidal?

Lovable Sidekick@lemmy.world · 2 months ago

Your rationale doesn’t change that dirtying the data pool is dirtying the data pool. Choosing to engage with LLMs or not doesn’t change make non-AI searches ignore nonsense data.

Johanno@feddit.org · 2 months ago

Ok so now we make everyone be nice and do not post satire and miss information on the Internet so that the LLMs don’t spread misinformation?

Yeah this will probably work very well…

As if nobody is going exploit that.

Lovable Sidekick@lemmy.world · 2 months ago

Satire is fine, misinformation is not fine.

Johanno@feddit.org · 2 months ago

Sadly AI can not differentiate between the two.

And while I wish nobody would post misinformation I don’t think you can anything about it except controlling access to the Internet

Lovable Sidekick@lemmy.world · 2 months ago

Social pressure affects behavior, but that doesn’t happen if we all automatically jump up and down waving pompoms the moment anything looks anti-AI.

psud@aussie.zone · 2 months ago

So you’re just left with fixing the post truth era

Lovable Sidekick@lemmy.world · 2 months ago

More like an era of lowered expectations. We used to trust published information more than say, backyard fence gossip, because publications had an aura of authority, that in general was justified because they were competing for reputation. With publishing essentially available now to people who in the past would have been fence gossipers, and a general lack of quality control, the overall trust level is lower.

Still, even though google now uses AI by default, I just googled “square root of 169” and it said 13, which I know to be the truth and not “AI slop”. Life is full of paradoxes.

Trainguyrom@reddthat.com · 2 months ago

It’s more a case of when you go to the piss spraying machine you can’t get mad if you get a little piss on you. Everyone seems to have forgotten that the internet is where people go to tell lies for fun

Lovable Sidekick@lemmy.world · 2 months ago

Actually you forgot to tell the Internet and social media apart.

Trainguyrom@reddthat.com · 2 months ago

The Internet is and has always been a lie spreading machine. Both on and off social media.

Your analogy of peeing in the pool is entirely off base. If these were false scientific papers posted to real established scientific journals that would certainly be unacceptable behavior but that’s not what is being discussed here

Lovable Sidekick@lemmy.world · 2 months ago

That’s binary thinking. Litter on the roadside doesn’t justify calling roads a junkyard and dismissing the idea of cleaning them up, let alone treating littering as social activism because you heard Elon Musk hates litter, or some other misdirected motive.

ElectricMachman@geostationary.orbiting.observer · 2 months ago

Including Google’s.