mastodon.ie is one of the many independent Mastodon servers you can use to participate in the fediverse.
Irish Mastodon - run from Ireland, we welcome all who respect the community rules and members.

Administered by:

Server stats:

1.6K
active users

#cognition

2 posts2 participants0 posts today

#AI #misinformation #cognition

'Both the people and the LLMs tended to be overconfident about how they would hypothetically perform. Interestingly, they also answered questions or identified images with relatively similar success rates.

However, when the participants and LLMs were asked retroactively how well they thought they did, only the humans appeared able to adjust expectations, according to a study published today in the journal Memory & Cognition.'

cmu.edu/dietrich/news/news-sto

www.cmu.eduAI Chatbots Remain Overconfident — Even When They’re Wrong - Dietrich College of Humanities and Social Sciences - Carnegie Mellon UniversityLarge Language Models appear to be unaware of their own mistakes, prompting concerns about common uses for AI chatbots.

Harvard Gazette: Does AI understand? . “As artificial intelligence firms release ever-more-advanced models that reason, research, create, and analyze, the meanings behind those verbs get slippery fast. What does it really mean to think, to understand, to know? The answer has big implications for how we use AI, and yet those who study intelligence are still reckoning with it.”

https://rbfirehose.com/2025/07/20/harvard-gazette-does-ai-understand/

ResearchBuzz: Firehose | Individual posts from ResearchBuzz · Harvard Gazette: Does AI understand? | ResearchBuzz: Firehose
More from ResearchBuzz: Firehose

I just discovered the ARC-AGI initiative and the associated test to estimate how close "AI" models are from #AGI

arcprize.org/arc-agi

While I found the initiative interesting, I'm not sure I understand what in this test really guarantees that the model is capable of some form of generalization and problem-solving.
Wouldn't it be possible for specialized pattern-matching/discovering algorithms to solve such problems?
I imagine some computer scientists, mathematicians or computational neuroscientists have already had a look at this, so would anyone knows of some articles/blogs on the topic?

Maybe @wim_v12e? Is this something you already looked at?

ARC PrizeARC Prize - What is ARC-AGI?Learn more about the only AI benchmark that measures AGI progress.

ARC-3, a sneak peek at the next-gen, interactive reasoning benchmark designed to illuminate the capability gap between today's AI and tomorrow's AGI.

Play First 3 Games
three.arcprize.org/

As with previous ARC tests, the actual games used for testing AI are kept secret. AI algorithms must learn the games on the spot.

There are no instructions. You must play the game to discover controls, rules, and goal.

Interactive Reasoning Benchmarks (IRBs) test for a broad scope of capabilities:

• Exploration
• Percept -> Plan → Action
• Memory
• Goal Acquisition
• Alignment

Game Design Constraints

• Easy for humans (can pick it up in <1 min of game play)
• Core Knowledge Priors (no language, trivia, cultural symbols)
• Should require no instructions to play
• Should be fun for humans and playable in 5-10 minutes
• Innovative and novel game mechanics encouraged (Hidden state, theory of mind, long term planning, navigating other agents, etc.)

ARC-AGI-3ARC-AGI-3 PreviewThe first interactive reasoning benchmark for AI agents.

ChatGPT May Be Eroding Critical Thinking Skills, According to a New MIT Study

Over the course of several months, ChatGPT users got lazier with each subsequent essay, often resorting to copy-and-paste by the end of the study.

#chatgpt #openai #artificialintellignce #AI #MIT #thinking #cognition #technology #tech

time.com/7295195/ai-chatgpt-go

Time · ChatGPT May Be Eroding Critical Thinking Skills, According to a New MIT StudyBy Andrew R. Chow

CRC 1718 is hiring! It's a diverse research center focused broadly on common ground in #linguistics (variation, unification, and testing). The three main areas of research are #Cognition, #Grammar, and #Communication. Great benefits, no teaching.

🧑‍🔬 2 Postdocs (TV-L E13, 100%)
🧑‍🔬 13 PhDs (TV-L E13, 75%)
📅 Deadlines: now! Next Sunday to mid August 2025
🌍 Uni Tübingen
📄 tinyurl.com/ptx2e8a7

tinyurl.comJob advertisements | University of Tübingen
Replied in thread

@sashag Meaning Crisis - Vervaeke:

Creating Solutions to The Meaning Crisis

There is a reason you feel lost in a toxic culture.

Introducing Dr. John Vervaeke:

John Vervaeke, PhD is an award-winning lecturer at the University of Toronto in the departments of psychology, cognitive science and Buddhist psychology.

Watch THIS Series --> videos.trom.tf/w/p/7iqkxh8CVh3
#vervaeke #meaning #meaningcrisis #psychology #cognition