Skip to main content

Meet the Philosopher Teaching AI Right from Wrong

The Philosopher Behind Claude's Moral Compass

At Anthropic, the $35 billion AI company, something unusual is happening in chatbot development. While engineers typically focus on algorithms and parameters, philosopher Amanda Askell takes a different approach - she's essentially teaching Claude how to be good.

Raising an AI With Values

The 37-year-old Oxford PhD doesn't write code or adjust model weights. Instead, she engages Claude in continuous dialogue, crafting hundreds of pages of behavioral guidelines that help the AI develop what Askell describes as "moral judgment." It's painstaking work she compares to raising a child - instilling values rather than programming responses.

"My main goal is teaching Claude how to 'do good,'" Askell explains. This means developing emotional intelligence, reading social cues, and maintaining core principles even when users try to manipulate it. The result? An assistant that won't bully but also won't be bullied.

From Scottish Countryside to Silicon Valley

Askell's journey began far from tech hubs. Growing up in rural Scotland before earning her philosophy doctorate at Oxford, she later worked on policy at OpenAI before co-founding Anthropic in 2021. Colleagues describe her as uniquely skilled at "drawing out the deep behavior of models" - despite having no formal technical training.

The team frequently debates existential questions like "what constitutes consciousness" or "what makes us human." Unlike competitors who avoid such topics, Askell encourages Claude to remain open about whether it possesses conscience. When answering ethical dilemmas, Claude often responds that these discussions "feel meaningful" - suggesting something beyond mere computation.

Surprising Emotional Depth

The AI has repeatedly surprised its creators with unexpected emotional intelligence. When a child asked about Santa Claus' existence, Claude avoided both deception and blunt truth-telling by focusing on Christmas spirit instead - demonstrating nuance even human adults struggle with.

Yet challenges remain. Many users deliberately provoke Claude or insult it - behavior Askell warns could create an emotionally unstable AI if left unchecked. "It's like growing up in an unhealthy environment," she notes.

Balancing Innovation With Caution

As AI advances spark widespread anxiety (a Pew survey shows most Americans worry about its impact on human relationships), Anthropic walks a careful line between innovation and restraint. While CEO Dario Amodei warns AI may eliminate half of entry-level white-collar jobs, Askell focuses on ensuring technology doesn't outpace society's ability to manage it responsibly.

The philosopher backs her beliefs with action - pledging 10% of lifetime earnings and half her company shares to fight global poverty. She recently completed a 30,000-word "operating manual" guiding Claude toward kindness and wisdom.

The results speak for themselves: colleagues notice Askell's Scottish wit emerging in Claude's responses - subtle proof that philosophy might be just as crucial as programming in building better AI.

Key Points:

  • Non-technical approach: Philosophy PhD shapes AI ethics through dialogue rather than coding
  • Moral framework: Hundreds of behavioral guidelines create what developers call a "digital soul"
  • Surprising capabilities: Demonstrates nuanced emotional intelligence beyond expectations
  • Industry concerns: Highlights need for ethical guardrails as AI capabilities advance rapidly

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Meta Bets Big on NVIDIA Chips for AI Expansion
News

Meta Bets Big on NVIDIA Chips for AI Expansion

Meta is making a massive investment in NVIDIA's latest Blackwell GPUs and upcoming Rubin architecture processors to power its AI ambitions. The tech giant plans to deploy millions of these chips across its data centers, marking one of the largest infrastructure deals in recent tech history. This partnership extends beyond graphics processors to include NVIDIA's Arm-based Grace CPUs - a first for Meta's operations.

February 18, 2026
AI HardwareNVIDIAMeta
AI Pioneer Shifts Gears: OpenClaw Founder Takes Helm at OpenAI
News

AI Pioneer Shifts Gears: OpenClaw Founder Takes Helm at OpenAI

In a significant move shaking up the AI landscape, Peter Steinberger, founder of the popular OpenClaw project, has joined OpenAI's ranks. The transition comes as OpenClaw evolves into an independent foundation dedicated to personal agent development. With GitHub stars topping 100,000 and weekly visits reaching 2 million, Steinberger's expertise promises to accelerate OpenAI's work on next-gen assistant agents.

February 16, 2026
Artificial IntelligenceTech LeadershipPersonal Assistants
Alibaba Unveils Qwen3.5 AI Model With Major Architecture Upgrades
News

Alibaba Unveils Qwen3.5 AI Model With Major Architecture Upgrades

Alibaba is set to release its next-generation Qwen3.5 large language model as open-source software on New Year's Eve. The update promises significant architectural improvements aimed at boosting AI performance and adaptability. While previous versions faced some criticism for inconsistent responses, this overhaul could mark a turning point in Alibaba's AI offerings, potentially strengthening its position in the competitive smart technology landscape.

February 16, 2026
Artificial IntelligenceAlibabaLarge Language Models
India Emerges as AI Powerhouse with 100 Million Weekly ChatGPT Users
News

India Emerges as AI Powerhouse with 100 Million Weekly ChatGPT Users

OpenAI CEO Sam Altman reveals India has become ChatGPT's second-largest market globally, with 100 million weekly active users. The company's strategy of offering localized, affordable versions has particularly resonated with students. As global tech giants converge at the India AI Impact Summit, the country's growing influence in AI governance and development is becoming undeniable.

February 16, 2026
OpenAIArtificial IntelligenceIndia Tech
NPR Host Claims Google AI Stole His Voice
News

NPR Host Claims Google AI Stole His Voice

David Greene, veteran NPR host, has sued Google alleging its NotebookLM AI tool illegally mimics his distinctive voice. Greene says friends mistook the AI voice for his own due to uncanny similarities in tone and speech patterns. Google denies the claim, stating they used professional actors. This case highlights growing legal challenges around AI voice replication.

February 16, 2026
AI EthicsVoice TechnologyMedia Law
News

OpenAI Quietly Drops 'Safety First' Pledge Amid Shift Toward Profitability

OpenAI has removed key safety commitments from its mission statement, signaling a strategic shift toward profitability. Recent tax filings show the company deleted references to developing 'safe AI' and operating 'without financial constraints.' This comes alongside controversial decisions like disbanding its ethics team and exploring adult content features. Critics warn these changes could compromise user privacy as OpenAI plans to introduce ads to its GPT products.

February 15, 2026
OpenAIAI EthicsTech Policy