Nicholas Carlini

Are large language models worth it?

Are the harms that LLMs have caused, and will soon cause, worth the benefits they may bring? This article (a written version of a keynote talk I gave at CoLM) tries to explore this question.

Published Wed, Nov 19, 2025
Gate-level emulation of an Intel 4004 in 4004 bytes of C

A feature-complete gate-level microcoded Intel 4004 in 4004 bytes of C, capable of emulating the original Busicom calculator ROM for which the chip was originally designed.

Published Mon, Aug 4, 2025
miniHDL: A Python Hardware Description Language DSL

A small hardware description language implemented as a DSL on Python, with a small 170 LoC 32-bit RISC CPU.

Published Thu, Jul 31, 2025
Machines of Ruthless Efficiency

Future LLMs have the potential to cause significant harm due to their ruthless efficiency. I'm worried this will happen, and discuss the ways in which it might.

Published Mon, Mar 17, 2025
My Thoughts on the Future of "AI"

I have very wide error bars on the potential future of large language models, and I think you should too. It's possible LLMs basically lead to AGI, and it's also possible they plateau.

Published Thu, Mar 13, 2025
What my privacy papers (don't) have to say about copyright and generative AI

My work on privacy-preserving machine learning is often cited by lawyers arguing for or against how generative AI models violate copyright. This maybe isn't the right work to be citing.

Published Tue, Mar 11, 2025
Career Update: Google DeepMind -> Anthropic

I have decided to leave Google, and will be joining Anthropic to continue my work on machine learning security.

Published Wed, Mar 5, 2025
AI forecasting retrospective: you're (probably) over-confident

A one-year review of people's predictions on an AI-forecasting survey I made last year. Most people were over-confident in their predictions.

Published Sun, Feb 9, 2025
A 2-ply minimax chess engine in 84,688 regular expressions

I wrote a (list of) regular expressions that will play a (not very good) chess game by running a 2-ply minimax search.

Published Sun, Jan 5, 2025
Letting Language Models Write my Website

I let a language model write my bio. It went about as well as you might expect.

Published Wed, Dec 25, 2024
You should forecast the future of AI

You should forecast the future of AI in this quiz, so that you can see just how right or wrong you are.

Published Mon, Nov 25, 2024
How I Use "AI"

I don't think that AI models (by which I mean: large language models) are over-hyped. In this post I will list 50 ways I've used them.

Published Thu, Aug 1, 2024
Why I attack

Yesterday I was forwarded a bunch of messages that Prof. Ben Zhao (a computer science professor [a] at the University of Chicago) wrote about me on a public Discord server with 15,000 members, including this gem:

[a] A full professor with tenure, so I feel entirely within my rights to call him out here.

Published Mon, Jun 24, 2024
(yet another) Broken Adversarial Example Defense at IEEE S&P 2024

IEEE S&P 2024 (one of the top computer security conferences) has, again, accepted an adversarial example defense paper that is broken with simple attacks. It contains claims that are mathematically impossible, does not follow recommended guidance on evaluating adversarial robustness, and its own figures…

Published Mon, May 6, 2024
My benchmark for large language models

A benchmark of ~100 tests for language models, collected from actual questions I've asked of language models in the last year.

Published Mon, Feb 19, 2024
My research idea logfile, 2016-2019

How do I pick what research problems I want to solve? I get asked this question often, most recently in December at NeurIPS, and so on my flight back I decided to describe the only piece of my incredibly rudimentary system that's at all a process. I maintain a single file called ideas.txt, where I just…

Published Sun, Jan 21, 2024
Reading Data off an Apple ProFile Hard Drive with an Arduino

So let's suppose you had a 1980s Apple ProFile Hard Drive, and you wanted to recover the data.

Published Sun, Dec 3, 2023
Playing chess with large language models

Building a chess bot that queries GPT-3.5-turbo-instruct to play chess at the level of a skilled human player.

Published Fri, Sep 22, 2023
Little Bobby |endoftext|

TODO

Published Thu, Aug 3, 2023
A ChatGPT clone, in 3000 bytes of C, backed by GPT-2

This program is a dependency-free implementation of GPT-2, including

Published Sun, Apr 2, 2023
Reflecting on Towards Evaluating the Robustness of Neural Networks

I recently got back from attending USENIX Security 2022, and someone pointed out to me that it's been five years since I wrote Towards Evaluating the Robustness of Neural Networks (with my at-the-time advisor) and they asked if I had any thoughts on this paper. I didn't respond with that great an answer…

Published Wed, Aug 17, 2022
Rapid Iteration in Machine Learning Research

A brief discussion about a tool I use to make rapid iteration in ML research possible.

Published Sun, Jun 19, 2022
A Case of Plagarism in Machine Learning Research

A recent paper ('A Roadmap for Big Model') has copied a bunch of text from over a dozen prior papers. This is bad.

Published Fri, Apr 8, 2022
Multiplexing Circuits on the Game of Life - Part 5

Abstract: Improving digital logic gates on Conway's game of life by allowing 8-bit logic gates instead of boolean logic gates.

Published Sun, Feb 27, 2022
Research Paper Release Checklist

This page contains a few checklists that help prevent embarrassing issues when releasing research papers online (e.g., via arXiv or a conference publication).

Published Sun, Jan 30, 2022
A Simple CPU on the Game of Life - Part 4

Abstract: An implementation of a minimal CPU (an 'unlimited register machine') on Conway's Game of Life that runs at ~10Hz.

Published Thu, Dec 30, 2021
Improved Logic Gates on Conway's Game of Life - Part 3

Abstract: This post describes improvements made to my prior digital logic gate constructions (e.g., AND/OR/NOT) built on top of Conway's Game of Life, resulting in 100x faster simulations.

Published Tue, Mar 23, 2021
Yet Another Space Game (In 13kb of JavaScript)

This year I entered JS13K 2020, a game jam for JavaScript games in under 13KB (total size). I wrote a 3rd-person space shooter game, building on top of the game engine I built last year for a Doom clone.

Published Sat, Dec 19, 2020
InstaHide Disappointingly Wins Bell Labs Prize, 2nd Place

InstaHide (a recent method that claims to give a way to train neural networks while preserving training data privacy) was just awarded the 2nd place Bell Labs Prize (an award for finding solutions to some of the greatest challenges facing the information and telecommunications industry). This is a grave…

Published Sat, Dec 5, 2020
Yet Another MOBA (In 13kb of JavaScript)

For the third year in a row, I participated in JS13k 2021, where you're tasked with making a game in 13kB of JavaScript. Each year I participate, I try to learn something new I didn't know how to do before. This year's motivation: I wanted to make a multiplayer game with some nontrivial networking…

Published Sat, Nov 21, 2020
Realtime Screen Recording of Breaking a Defense to Adversarial Examples

I recently broke a defense to be published at CCS 2020, and this time I recorded my screen the entire time---all two hours of it. Typically when I break defenses, I'll write a short paper, stick it on arXiv, and then move on. Pedagogically, this isn't very useful. [a] (Don't you worry, I did that again…

Published Tue, Sep 15, 2020
An Introduction to Circuit Design on Conway's Game of Life - Part 2

Abstract: Using AND/OR/NOT gates built on top of Conway's Game of Life, this post walks through how to construct actual circuits, for example a 7-segment display.

Published Mon, Jun 1, 2020
Digital Logic Gates on Conway's Game of Life - Part 1

Abstract: This post walks through how to construct digital logic gates (AND/OR/NOT) on top of Conway's Game of Life, demonstrating its Turing completeness.

Published Wed, Apr 1, 2020
Are adversarial example defenses improving?

Abstract: We (again) broke a large collection of published defenses to adversarial examples. Here's how and why.

Published Thu, Feb 20, 2020
Yet Another Doom Clone (In 13kb of JavaScript)

This year I entered in JS13K 2019, which asks people to develop games in under 13K of JavaScript. I entered a Doom Clone called ... Yet Another Doom Clone.

Published Fri, Sep 13, 2019
3D Shadow Mapping Renderer in JavaScript

Late last year I decided it would be fun to build a 3D renderer in JavaScript. Recently it got into some sort of finished state and I decided to put it here. This isn't so much of a tutorial on how to get there, but rather more of a here's a fun thing to do with nice pictures. But it was interesting to…

Published Mon, Aug 12, 2019
A Complete List of All (arXiv) Adversarial Example Papers

Abstract: A continuously-updating list of all 1000+ papers posted to arXiv about adversarial examples.

Published Sat, Jun 15, 2019
Adversarial Machine Learning Reading List

Abstract: This reading list provides an introduction to the field of adversarial examples for machine learning models.

Published Sun, Jul 15, 2018
Recommendations for Evaluating Adversarial Example Defenses

Abstract: This document contains a collection of advice for performing adversarial example defense evaluations.

Published Sat, May 26, 2018
