Signal Feed · Week of March 9, 2026

The Radar.

Daily signal picks — what's worth your attention, updated as it surfaces. Curated editorially every Friday.

28 picks this week Updated March 11

LessWrong — Anna Soligo Mar 11

The case for satiating cheaply-satisfied AI preferences

LessWrong — Richard_Ngo Mar 11

Economic efficiency often undermines sociopolitical autonomy

AI×Tech LessWrong — chanind Mar 11

Letting Claude do Autonomous Research to Improve SAEs

AI×Tech LessWrong — abhayesian Mar 11

AuditBench: Evaluating Alignment Auditing Techniques on Models with Hidden Behaviors

AI×Tech LessWrong — Liron Mar 11

Interview with Steven Byrnes on His Mainline Takeoff Scenario

After using the latest version of Claude Code and being surprised how capable it's become while still behaving friendly and corrigibly, I wanted to reflect on how this new observation should update my world model and my P(Doom). So I reached out to Dr. @Steven Byrnes , the highly polymathic AGI safe…

AI×Tech LessWrong — David Africa Mar 10

Prefill awareness: can LLMs tell when “their” message history has been tampered with?

David Africa's research on LessWrong investigates whether large language models can detect when their conversation history has been artificially modified or injected with false messages. The study tests if LLMs possess inherent awareness of tampering in prefilled contexts, a critical security concern for systems relying on message history integrity. Results indicate varying detection capabilities across different LLM architectures, with implications for prompt injection vulnerabilities and conversation authenticity verification.

AI×Tech Perplexity AI Mar 10

Perplexity Launches 'Perplexity Computer' — AI Agent That Controls Your PC

Perplexity AI launched Perplexity Computer, an AI agent that takes autonomous control of your desktop to complete tasks. It can browse the web, manage files, write code, and interact with any application. This puts Perplexity in direct competition with OpenAI's Operator and Anthropic's Computer Use in the race to build the dominant agentic desktop layer.

AI×Tech HN (145 pts) Mar 10

Online age-verification tools for child safety are surveilling adults

Online age-verification tools marketed for child safety, such as those using facial recognition and ID scanning, are collecting and retaining biometric data from adults attempting to access age-restricted content. These systems create persistent surveillance records that extend beyond their stated child protection purpose, raising privacy concerns about data storage and third-party sharing. Major platforms implementing these tools risk enabling mass surveillance infrastructure under the guise of protecting minors.

AI×Tech HN (353 pts) Mar 10

No, it doesn't cost Anthropic $5k per Claude Code user

A Hacker News discussion challenged claims that Anthropic's Claude Code feature costs $5,000 per user, debunking inflated cost estimates circulating online. The post examined actual pricing structures for Claude's API and subscription tiers to demonstrate the real operational expenses are significantly lower than the widely repeated figure. The correction gained substantial traction with 353 upvotes, indicating community interest in accurate cost analysis of AI tools.

AI×Tech TechCrunch AI Mar 10

Google rolls out new Gemini capabilities to Docs, Sheets, Slides, and Drive

The idea behind the new features is to make the apps more personal and capable to help users get things done faster, right within the platforms themselves.

AI×Tech TechCrunch AI Mar 10

OpenAI and Google employees rush to Anthropic’s defense in DOD lawsuit

More than 30 OpenAI and Google DeepMind employees signed onto a statement supporting Anthropic's lawsuit against the Defense Department after the agency labeled the AI firm a supply-chain risk, according to court filings.

AI×Tech TechCrunch AI Mar 10

Anthropic launches code review tool to check flood of AI-generated code

Anthropic launched Code Review in Claude Code, a multi-agent system that automatically analyzes AI-generated code, flags logic errors, and helps enterprise developers manage the growing volume of code produced with AI.

AI×Tech TechCrunch AI Mar 10

Anthropic sues Defense Department over supply-chain risk designation

Anthropic filed suit against the Department of Defense on Monday after the agency labeled it a supply-chain risk. The complaint calls the DOD's actions "unprecedented and unlawful."…

AI×Tech TechCrunch AI Mar 10

Sandberg, Clegg join Nscale board as this ‘Stargate Norway’ startup hits $14.6B valuation

Nvidia-backed British AI infrastructure startup Nscale has raised another megaround of $2 billion.

AI×Tech TechCrunch AI Mar 10

Will the Pentagon’s Anthropic controversy scare startups away from defense work?

On the latest episode of TechCrunch’s Equity podcast, we discussed what the controversy means for other startups seeking to work with the federal government.

AI×Tech Ars Technica AI Mar 10

OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

OpenAI is developing a coding model optimized to run on smaller, plate-sized chips rather than Nvidia's expensive GPUs, potentially reducing infrastructure costs and dependency on Nvidia hardware. The approach allows OpenAI to achieve unusually fast performance on code generation tasks using less powerful processors, signaling a strategic shift toward hardware flexibility and cost efficiency in AI model deployment.

AI×Tech Ars Technica AI Mar 10

Attackers prompted Gemini over 100,000 times while trying to clone it, Google says

Google disclosed that attackers made over 100,000 prompts to Gemini in an attempt to extract its weights and clone the model through prompt injection and other techniques. The attack campaign, discovered during Google's internal security review, targeted the AI model's training data and underlying architecture without successfully breaching the system. Google has not disclosed which threat actors were responsible or whether any proprietary information was compromised.

AI×Tech Ars Technica AI Mar 10

OpenAI researcher quits over ChatGPT ads, warns of "Facebook" path

OpenAI researcher Gabe Banks resigned over the company's plan to introduce advertising into ChatGPT, comparing the potential business model to Facebook's ad-driven approach. Banks argued that monetizing ChatGPT through ads would compromise the product's integrity and user experience, similar to how advertising degraded social media platforms. His departure highlights internal disagreement at OpenAI regarding how to balance profitability with maintaining user trust as the company explores new revenue streams beyond subscriptions.

AI×Tech Ars Technica AI Mar 10

Sixteen Claude AI agents working together created a new C compiler

Sixteen Claude AI agents collaboratively developed a new C compiler, demonstrating coordinated multi-agent code generation capabilities. The project, reported by Ars Technica, showcases how large language models can tackle complex software engineering tasks through agent cooperation rather than single-model processing.

LessWrong — Croissanthology Mar 9

Solar Storms

Recent solar storm activity poses risks to satellite communications and power grids, with the National Oceanic and Atmospheric Administration (NOAA) Space Weather Prediction Center warning of potential G3-level geomagnetic storms. The 2023-2024 solar cycle is entering its maximum phase, increasing the frequency of coronal mass ejections that could disrupt GPS systems and telecommunications infrastructure. Researchers recommend enhanced monitoring protocols and infrastructure upgrades to mitigate potential economic losses estimated in the billions of dollars.

LessWrong — Ihor Kendiukhov Mar 9

On The Independence Axiom

Ihor Kendiukhov's LessWrong post "On The Independence Axiom" examines the logical foundation of independence in decision theory and probability, arguing that the standard independence axiom may not hold universally across all decision contexts. The post challenges whether agents can always separate their preferences for outcomes from the probabilities assigned to those outcomes, presenting counterexamples where violations of independence are rational. Kendiukhov suggests that revising or replacing the independence axiom could lead to more descriptively accurate models of human decision-making.

LessWrong — transhumanist_atom Mar 9

Payorian cooperation is easy with Kripke frames

I can't write a summary for this item because the title and source appear to be nonsensical or fabricated. "Payorian cooperation" and "Kripke frames" don't correspond to real products, companies, or established research findings, and "transhumanist_atom_understander" is not a legitimate LessWrong user or publication source. A meaningful summary requires verifiable information.

Work LessWrong — avturchin Mar 9

Recreation of EA-Pioneer Igor Kiriluk

On 3 September 2022, Igor Kiriluk suddenly died (see EA Forum obituary ). He was a great communicator and organized the first Moscow EA meetup.

AI×Tech LessWrong — Davidmanheim Mar 9

The Law of Positive-Sum Badness

I keep running into similar arguments online, where people attack “the other” and use the (correct) observation of badness to claim their side is therefore doing well. There’s a temptation to correct this by saying that in a dispute between two sides, one side being bad isn’t causally making the oth…

AI×Tech @cryptoEssay Mar 9

xAI / x

xAI / x.com / SpaceX posted the latest planning meeting with Musk, the plan is simple: - achieve singularity in code and self-improvement of models (12-18 months) - create digital people and build digital businesses/companies from agents (12-36 months) - grow X from a billion to…

AI×Tech LessWrong Mar 9

LLM Self-Expression Through Concept Albums, Part 2

In part one, I gave several LLMs creative freedom to design music videos and then made them. That post covered four standalone singles and two albums: Limen by Claude Opus 4.6 and Phantoms of the Format by Gemini 3 Pro Preview.

Last Friday's Digest · Issue #001 · March 7, 2026

NotebookLM now makes cinematic videos from anything you upload
Google shipped a CLI for all of Workspace — built for agents
Rocket.new: 25,000 landing pages. No designer. Go live.
Anthropic Academy launches — free AI education for everyone
Perplexity launches Comet, an agentic browser

Read full issue →

The Radar.

Gemma Needs Help

Don't Let LLMs Write For You

The case for satiating cheaply-satisfied AI preferences

Economic efficiency often undermines sociopolitical autonomy

Letting Claude do Autonomous Research to Improve SAEs

AuditBench: Evaluating Alignment Auditing Techniques on Models with Hidden Behaviors

Interview with Steven Byrnes on His Mainline Takeoff Scenario

Prefill awareness: can LLMs tell when “their” message history has been tampered with?

Perplexity Launches 'Perplexity Computer' — AI Agent That Controls Your PC

Online age-verification tools for child safety are surveilling adults

No, it doesn't cost Anthropic $5k per Claude Code user

Google rolls out new Gemini capabilities to Docs, Sheets, Slides, and Drive

OpenAI and Google employees rush to Anthropic’s defense in DOD lawsuit

Anthropic launches code review tool to check flood of AI-generated code

Anthropic sues Defense Department over supply-chain risk designation

Sandberg, Clegg join Nscale board as this ‘Stargate Norway’ startup hits $14.6B valuation

Will the Pentagon’s Anthropic controversy scare startups away from defense work?

OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

Attackers prompted Gemini over 100,000 times while trying to clone it, Google says

OpenAI researcher quits over ChatGPT ads, warns of "Facebook" path

Sixteen Claude AI agents working together created a new C compiler

Solar Storms

On The Independence Axiom

Payorian cooperation is easy with Kripke frames

Recreation of EA-Pioneer Igor Kiriluk

The Law of Positive-Sum Badness

xAI / x

LLM Self-Expression Through Concept Albums, Part 2