Jump to content

Summary

Study shows that more than a quarter of election information shared by AI factually incorrect

 

Quotes

Quote

More than a quarter of the election information shared by popular AI large language models is factually incorrect

My thoughts

If AI models cannot even correctly tell us how long until the next US election day, or even report the same number of days when asked multiple times, they have a long way to go before they become trustworthy.  

 

According to this, Google does state that Gemini is restricted from election results, but simple factual information like counting the number of days until the a specific date makes me want to ignore any results it may offer.  

 

Sources

MSNBC

 

Link to comment
https://linustechtips.com/topic/1572848-ai-gets-us-election-information-wrong/
Share on other sites

Link to post
Share on other sites

2 minutes ago, Eric the Grey said:

Summary

Study shows that more than a quarter of election information shared by AI factually incorrect

 

Quotes

My thoughts

If AI models cannot even correctly tell us how long until the next US election day, or even report the same number of days when asked multiple times, they have a long way to go before they become trustworthy.  

 

According to this, Google does state that Gemini is restricted from election results, but simple factual information like counting the number of days until the a specific date makes me want to ignore any results it may offer.  

 

Sources

MSNBC

 

Is anybody trusting the info they get from AI. Like, it just gets a whole ton of things wrong and it seems like the companies are also trying to avoid privacy suits. in addition to that, it's neutered for some reason and will flatout refuse to output answers for certain topics. like, just dont use it for info.

Link to post
Share on other sites

AI can't understand its own output. This makes AI validating itself, and therefore producing reliable answers, somewhat challenging.

 

The more of AI I see, the more I think it's 20% useful product, 80% hype. And 20% maybe overly generous.

Link to post
Share on other sites

Just now, Monkey Dust said:

AI can't understand its own output. This makes AI validating itself, and therefore producing reliable answers, somewhat challenging.

 

The more of AI I see, the more I think it's 20% useful product, 80% hype. And 20% maybe overly generous.

I think it's good tool for producing ideas/concepts/topics. I don't see it becoming reliable source of information any time soon unless it actually understands what it's talking about and even then there are other issues that will arise with AI being able to understand things. 

Link to post
Share on other sites

30 minutes ago, Monkey Dust said:

AI can't understand its own output. This makes AI validating itself, and therefore producing reliable answers, somewhat challenging.

 

The more of AI I see, the more I think it's 20% useful product, 80% hype. And 20% maybe overly generous.

It's the current equiavalent of the .com bubble. Yes, the Internet changed the world. Yes, it was an incredible advancement. Yes, it made some people very rich. But it also produced a whole lot of hype-train-riding busts.

 

The same thing will happen to a lot of these "AI" technologies and startups. A few will win, most will lose, things will change.

Link to post
Share on other sites

49 minutes ago, Eric the Grey said:

If AI models cannot even correctly tell us how long until the next US election day, or even report the same number of days when asked multiple times, they have a long way to go before they become trustworthy.  

They're large language models, not large correct information models... what did you expect? The only goal of these systems is to provide an answer that appears to come from a human. Correctness is completely incidental.

 

Also this is only going to get worse with recent news because of the flood of AI generated slop being fed back into the system...

Don't ask to ask, just ask... please 🤨

sudo chmod -R 000 /*

Link to post
Share on other sites

The thing with our current crop of AI chatbots is that their trained to have a conversation and/or write and article, ect. To do this they have analyised tens/hundreds of thousands of webpages to determine how conversations/articles/ect are normally structured and then they've had the AI attempt to learn to estimate/predict which word should come next, based only on the words that came before.

 

They have become very good at working out what words should come next, but that is all they are doing. Just choosing the next word based on the words that came before. It has no understanding of what it writes, nor any understanding of the individual words it selects. All it knows is which words are more likely to follow other words.

 

Given this, it was clear that these chatbots were never going to be able to provide always accurate information. Given the training data (human writing), it should have been clear that it was going to be wrong alot of the time.

 

Ever asked ChatGPT to do maths, or asked it to play chess...

I might be experienced, but I'm human and I do make mistakes. Trust but Verify! I edit my messages after sending them alot, please refresh before posting your reply. Please try to be clear and specific, you'll get a better answer. Please remember to mark solutions once you have the information you need. Expand this signature for common PC building advice, a short bio and a list of my components.

 

Common build advice:

1) Buy the cheapest (well reviewed) motherboard that has the features you need. Paying more typically only gets you features you won’t use. 2) only get as much RAM as you need, getting more won’t (typically) make your PC faster. 3) While I recommend getting an NVMe drive, you don’t need to splurge for an expensive drive with DRam cache, DRamless drives are fine for gamers. 4) paying for looks is fine, just don’t break the bank. 5) Tower coolers are usually good enough, unless you go top tier Intel or plan on OCing. 6) OCing is a dead meme, you probably shouldn’t bother. 7) "Bottlenecks" rarely matter and "Future-proofing" is a myth. 8) AIOs don't noticeably improve performance past 240mm and don't improve at all past 360mm. 9) RTFM.

 

Useful Websites:

https://www.productchart.com - helps compare monitors, https://uk.pcpartpicker.com - makes designing a PC easier.

 

Bio:

He/Him - I'm a PhD student working in the fields of reinforcement learning and traffic control. PCs are one of my hobbies and I've built many PCs and performed upgrades on a few laptops (for myself, friends and family). My personal computers include 4 windows (10/11) machines and a TrueNAS server (and I'm looking to move to dual booting Linux Mint on my main machine in future). Aside from computers, I also dabble in modding/homebrew retro consoles, support Southampton FC, and enjoy Scuba Diving and Skiing.

Fun Facts

1) When I was 3 years old my favourite toy was a scientific calculator. 2) My father is a British Champion ploughman in the Vintage Hydraulic Class. 3) On Speedrun.com, I'm the world record holder for the Dream Bobsleigh event on Mario & Sonic at the Olympic Winter Games 2010.

 

My Favourite Games: World of Tanks, Runescape, Subnautica, Metroid (Fusion and Dread), Spyro: Year of the Dragon (Original and Reignited Trilogy), Crash Bash, Mario Kart Wii, Balatro

 

My Computers: Primary: My main gaming rig - https://uk.pcpartpicker.com/user/will0hlep/saved/NByp3C Second: Hosts Discord bots as well as a Minecraft and Ark server, and also serves as a reinforcement learning sand box - https://uk.pcpartpicker.com/user/will0hlep/saved/cc9K7P NAS: TrueNAS Scale NAS hosting SMB shares, DDNS updater, pi-hole, and a Jellyfin server - https://uk.pcpartpicker.com/user/will0hlep/saved/m37w3C Foldatron: My folding@home and BOINC rig (partially donated to me by Folding Team Leader GOTSpectrum) - Mobile: Mini-ITX gaming rig for when I'm away from home -

Link to post
Share on other sites

50 minutes ago, okkee said:

Is anybody trusting the info they get from AI. Like, it just gets a whole ton of things wrong and it seems like the companies are also trying to avoid privacy suits. in addition to that, it's neutered for some reason and will flatout refuse to output answers for certain topics. like, just dont use it for info.

Basically of thr hundreds of people I work with maybe a dozen mistrust it? The rest see it as fact or mostly factual which is good enough for them.

Link to post
Share on other sites

13 minutes ago, Eric the Grey said:

~3:07: Guest Brian Sokas (Co-Founder and CTO @ GroundTruthAI), says:
"... the decisions they are making in the ballout box aren't quite informed by true facts,
they are just informed by information that they think are true facts"
Said by a dude with "Master's Degree, Engineering, Computer Science @ University of Pennsylvania"
🤡

 

52 minutes ago, Eric the Grey said:

My thoughts

If AI models cannot even correctly tell us how long until the next US election day, or even report the same number of days when asked multiple times, they have a long way to go before they become trustworthy.  

My toaster sucks at drying laundry.
Is the appliance bad, or the advertising for it which included ridiculous claims that it can do a bunch of other stuff as well?

Link to post
Share on other sites

39 minutes ago, jaslion said:

Basically of thr hundreds of people I work with maybe a dozen mistrust it? The rest see it as fact or mostly factual which is good enough for them.

Actually terrifying, but my own experiance says your right.

I might be experienced, but I'm human and I do make mistakes. Trust but Verify! I edit my messages after sending them alot, please refresh before posting your reply. Please try to be clear and specific, you'll get a better answer. Please remember to mark solutions once you have the information you need. Expand this signature for common PC building advice, a short bio and a list of my components.

 

Common build advice:

1) Buy the cheapest (well reviewed) motherboard that has the features you need. Paying more typically only gets you features you won’t use. 2) only get as much RAM as you need, getting more won’t (typically) make your PC faster. 3) While I recommend getting an NVMe drive, you don’t need to splurge for an expensive drive with DRam cache, DRamless drives are fine for gamers. 4) paying for looks is fine, just don’t break the bank. 5) Tower coolers are usually good enough, unless you go top tier Intel or plan on OCing. 6) OCing is a dead meme, you probably shouldn’t bother. 7) "Bottlenecks" rarely matter and "Future-proofing" is a myth. 8) AIOs don't noticeably improve performance past 240mm and don't improve at all past 360mm. 9) RTFM.

 

Useful Websites:

https://www.productchart.com - helps compare monitors, https://uk.pcpartpicker.com - makes designing a PC easier.

 

Bio:

He/Him - I'm a PhD student working in the fields of reinforcement learning and traffic control. PCs are one of my hobbies and I've built many PCs and performed upgrades on a few laptops (for myself, friends and family). My personal computers include 4 windows (10/11) machines and a TrueNAS server (and I'm looking to move to dual booting Linux Mint on my main machine in future). Aside from computers, I also dabble in modding/homebrew retro consoles, support Southampton FC, and enjoy Scuba Diving and Skiing.

Fun Facts

1) When I was 3 years old my favourite toy was a scientific calculator. 2) My father is a British Champion ploughman in the Vintage Hydraulic Class. 3) On Speedrun.com, I'm the world record holder for the Dream Bobsleigh event on Mario & Sonic at the Olympic Winter Games 2010.

 

My Favourite Games: World of Tanks, Runescape, Subnautica, Metroid (Fusion and Dread), Spyro: Year of the Dragon (Original and Reignited Trilogy), Crash Bash, Mario Kart Wii, Balatro

 

My Computers: Primary: My main gaming rig - https://uk.pcpartpicker.com/user/will0hlep/saved/NByp3C Second: Hosts Discord bots as well as a Minecraft and Ark server, and also serves as a reinforcement learning sand box - https://uk.pcpartpicker.com/user/will0hlep/saved/cc9K7P NAS: TrueNAS Scale NAS hosting SMB shares, DDNS updater, pi-hole, and a Jellyfin server - https://uk.pcpartpicker.com/user/will0hlep/saved/m37w3C Foldatron: My folding@home and BOINC rig (partially donated to me by Folding Team Leader GOTSpectrum) - Mobile: Mini-ITX gaming rig for when I'm away from home -

Link to post
Share on other sites

I only find whole Ai thing useful if I'm looking for something highly specific and I don't want to dig through many search results if I'm in a rush to get an answer. Though most "Ai" are super picky and refuse to answer product details or straight up claim the product wasn't released yet even though it was and stuff like that. And for certain conversions and calculations where I don't want to dig around webpages and I want it to just do whatever for me where search engines would just point me to resources. I still prefer regular search when I have time to dig through hings and figure it out by myself. Seems more reliable.

 

Funny thing is, if Ai was so great and borderline godlike, shouldn't it be able to cross reference many resources and establish on its own what's factual and what isn't instead of just churning out absolute horseshit and having asterisk of "it might be hallucinating" and user can't really ever know when the Ai is on drugs? It's funny.

 

Btw, DuckDuckGo just added LLama and Mistral to their chat thingie (which already had ChatGPT and Claude) which is pretty cool since it runs in browser and restricts training of models with your queries. You can access it directly at www.duck.ai

Link to post
Share on other sites

-> Moved to General Discussion

 

If we had a dedicated thread about every way LLMs can be wrong we'd be drowning in... I prefer not to know what.

F@H
Desktop: i9-13900K, ASUS Z790-E, 64GB DDR5-6000 CL36, RTX3080, 2TB MP600 Pro XT, 2TB SX8200Pro, 2x16TB Ironwolf RAID0, Corsair HX1200, Antec Vortex 360 AIO, Thermaltake Versa H25 TG, Samsung 4K curved 49" TV, 23" secondary, Mountain Everest Max

Mobile SFF rig: i9-9900K, Noctua NH-L9i, Asrock Z390 Phantom ITX-AC, 32GB, GTX1070, 2x1TB SX8200Pro RAID0, 2x5TB 2.5" HDD RAID0, Athena 500W Flex (Noctua fan), Custom 4.7l 3D printed case

 

Asus Zenbook UM325UA, Ryzen 7 5700u, 16GB, 1TB, OLED

 

GPD Win 2

Link to post
Share on other sites

9 minutes ago, Kilrah said:

If we had a dedicated thread about every way LLMs are wrong we'd be drowning in... I prefer not to know what.

I've been eating a pebble everyday and mixing glue with my pizza sauce and everything has been okay so far!

Link to post
Share on other sites

4 hours ago, Eric the Grey said:

Summary

Study shows that more than a quarter of election information shared by AI factually incorrect

 

Sources

 

I bet half of the election information  shared by humans is factually incorrect. So AI already is an improvement.

 

And I don't know if you meant it sarcastic, but using MSNBC as a "source" about lies is a bit funny. MSNBC and Fox are heavy in the "skirting the truth" business. 

No signature

 

Link to post
Share on other sites

The one thing I hate more than companies using "AI" as a buzzword is people who use it incorrectly and then believe that is a valid argument to dismiss the technology completely.

 

Chatbots are not bad. They are incredibly valuable tools when used appropriately. Criticizing them for not being able to perform tasks they weren't designed for is like criticizing a hammer for not being able to saw wood. It's all about understanding their strengths and using them accordingly.

 

People need to stop with this child-like black-or-white way of thinking. You don't see people calling forks "stupid and useless" just because they don't work well to eat soup with. 

 

Use the right tool for the right job. ChatBots and other AI tools are fantastic at some tasks, but if you try and use them for everything then you will be disappointed a lot of the time. That's not the ChatBots fault. That's your fault for not using it for the right thing. Just because you CAN try wand eat soup with a fork doesn't mean it is a good idea. Just because you CAN ask a ChatBot how many days there are left until the election doesn't mean it is a good idea.

 

My rule of thumb is that if some question has a definitive answer, you shouldn't type it into a ChatBot. You should type it into Google. ChatBots are far better for questions that do not have definitive answers, or requires a lot of context that is difficult to include in a search query. 

 

 

Also, I am very surprised that ChatGPT-4o only got 17,5% of the answers incorrect. That is most likely far better than the average person.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×