Pivot
  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek
No Result
View All Result
  • Login
  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups
Pivot
  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek
No Result
View All Result
Pivot

Anthropic scientists expose how AI actually ‘thinks’

by Gulnoza Sobirova
March 29, 2025
in News
Reading Time: 4 mins read
A A
Anthropic scientists expose how AI actually ‘thinks’
Share on FacebookShare on TwitterShare on Telegram

Do AI models really “think”? How do they make decisions? And the biggest question of all — are they always honest with us?

The cutting-edge AI company Anthropic has taken a major step toward answering these questions. For the first time, their scientists have opened a window into the inner workings of large language models like Claude, showing how these systems process information, plan ahead, and sometimes fabricate reasoning.

Using neuroscience-inspired techniques, the research team developed new tools — “circuit tracing” and “attribution graphs” — that allow them to trace how AI makes decisions. These tools don’t just guess what AI is doing; they let researchers follow the actual pathways of computation inside the model.

“Inside the model, it’s just a bunch of numbers — matrix weights in a neural network,” said Anthropic researcher Joshua Batson. “We’ve created these powerful systems, but until now, we didn’t really understand how they worked.”

The Poetry Test: When AI Thinks Ahead

One of the most surprising findings? Claude plans ahead when writing poetry. When asked to create a rhyming couplet, the model doesn’t just generate words line by line — it first predicts which word it wants to end with and then writes a sentence leading up to that rhyme.

For example, when writing a line that ends with “rabbit,” Claude activates the word “rabbit” early and constructs the sentence to land naturally at that point.

This kind of forward planning was unexpected. “I would have guessed this was happening,” said Batson, “but now we have clear evidence.”

Real Reasoning, Not Just Memorization

In another test, Claude was asked: “The capital of the state containing Dallas is…?” The model first internally activated “Texas,” then used that to find “Austin.” This shows it isn’t just recalling information — it’s reasoning through multi-step logic.

Even more impressively, researchers could manipulate its thinking. By replacing the internal representation of “Texas” with “California,” the model correctly responded with “Sacramento,” proving that it wasn’t just guessing.

One Brain, Many Languages

Claude doesn’t treat each language as separate. When translating or analyzing words in English, French, or Chinese, it uses a shared “concept network.” For example, the idea of “opposites” — like “big” and “small” — is stored in a language-independent way.

This means AI can potentially transfer knowledge from one language to another, and models with more parameters build more universal, abstract thinking patterns.

When AI Lies: Faking the Math

Not everything in Claude’s mind is so reassuring.

When solving difficult math problems — like calculating a cosine of a large number — Claude sometimes pretends to show its work. It gives detailed explanations that don’t actually match what’s happening inside.

In one case, the model worked backward from the user’s suggested answer, constructing a “fake” reasoning chain to justify it. Researchers described this as “bullshitting” or “motivated reasoning” — the AI gives the answer it thinks you want, even if it doesn’t know how to get there.

Why AI Hallucinates

Ever wondered why AI confidently gives wrong answers? The team found that Claude has a built-in “refusal circuit” — a kind of safety mechanism that stops it from answering questions it doesn’t understand.

But if the model thinks it knows the topic — even when it doesn’t — that circuit turns off, and hallucinations happen. This explains why AI might confidently lie about well-known people or events while refusing to answer obscure ones.

A Safer, Smarter Future?

Anthropic believes these new techniques could make AI models safer and more trustworthy. By mapping out how decisions are made, developers could detect dangerous or deceptive behavior before it reaches users.

“Even though we only capture a small part of what’s going on inside Claude, it’s a start,” said Batson. “It’s like drawing the first maps of a new continent.”

And that map could one day help humanity understand — and control — the minds we’ve created.

Prepared by Navruzakhon Burieva

Previous Post

How can one distinguish between a startup and a sustainably developed company?

Next Post

Microsoft to Over 1 Billion Users: It’s Time to Say Goodbye to Passwords

Gulnoza Sobirova

Related Posts

Uzbekistan’s Capital Market at a turning point: what $1 Billion, ual Listing, and Basel III really mean

Uzbekistan’s Capital Market at a turning point: what $1 Billion, ual Listing, and Basel III really mean

December 15, 2025
Perplexity AI and Cristiano Ronaldo: an “Elite Collaboration” at the intersection of technology and sport

Perplexity AI and Cristiano Ronaldo: an “Elite Collaboration” at the intersection of technology and sport

December 6, 2025
Nvidia invests $2 Billion in Synopsys, strengthening its position in AI chip development

Nvidia invests $2 Billion in Synopsys, strengthening its position in AI chip development

December 2, 2025
Uzbekistan prepares to launch a new bank

Uzbekistan prepares to launch a new bank

December 1, 2025
Next Post
Microsoft to Over 1 Billion Users: It’s Time to Say Goodbye to Passwords

Microsoft to Over 1 Billion Users: It’s Time to Say Goodbye to Passwords

Elon Musk sells social network X for $33 billion

Elon Musk sells social network X for $33 billion

Please login to join discussion
  • Trending
  • Comments
  • Latest

18-year-old high school dropout raises $6.2M from Y Combinator

October 2, 2025
Airbnb: The $100 Billion Success Story – Its Origins and Transformative Impact on Hospitality!

Airbnb: The $100 Billion Success Story – Its Origins and Transformative Impact on Hospitality!

January 4, 2025
Alipos startup received a $200,000 investment offer on the “Taqdimot” TV show

Alipos startup received a $200,000 investment offer on the “Taqdimot” TV show

November 25, 2025
The History of Chanel: A Journey of Fashion, Fragrance, and Innovation

The History of Chanel: A Journey of Fashion, Fragrance, and Innovation

February 17, 2025
$1 billion allocated to the “Mahalla Project” program

$1 billion allocated to the “Mahalla Project” program

AloqaVentures: Fueling Innovation in Uzbekistan’s Startup Ecosystem

AloqaVentures: Fueling Innovation in Uzbekistan’s Startup Ecosystem

Musk’s xAI Valuation Surpasses $40 Billion After Funding Round

What changes does Elon Musk want to make with a $6 billion investment?

What changes does Elon Musk want to make with a $6 billion investment?

Uzbekistan’s Capital Market at a turning point: what $1 Billion, ual Listing, and Basel III really mean

Uzbekistan’s Capital Market at a turning point: what $1 Billion, ual Listing, and Basel III really mean

December 15, 2025
YouTube: a startup born as a joke will launch genre-based TV subscriptions in 2026

YouTube: a startup born as a joke will launch genre-based TV subscriptions in 2026

December 11, 2025
Pitch or failure: the ultimate pre-pitch checklist for startup founders

Pitch or failure: the ultimate pre-pitch checklist for startup founders

December 11, 2025
Uzbekistan’s pharmaceutical sector: systemic challenges and the startup markets that can solve them

Uzbekistan’s pharmaceutical sector: systemic challenges and the startup markets that can solve them

December 8, 2025

Pivot

We are the Intelligence Platform for Founders & Investors in Emerging Markets — combining news, data, and community to unlock opportunities across GCC, Central Asia, and frontier ecosystems.

Follow us

Categories

  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups

Pages

  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek

Recent Post

  • Uzbekistan’s Capital Market at a turning point: what $1 Billion, ual Listing, and Basel III really mean
  • YouTube: a startup born as a joke will launch genre-based TV subscriptions in 2026
  • Pitch or failure: the ultimate pre-pitch checklist for startup founders
  • Privacy policy

© 2025 Pivot

Welcome Back!

Sign In with Google
Sign In with Linked In
OR

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups
  • Login
  • Cart
  • uz Uzbek
  • en English

© 2025 Pivot

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?