Memory Is Becoming First-Class Agent Infrastructure
Agent memory is moving beyond chat history into structured state, experiential traces, working context, filesystems, and trustworthy retrieval.
Technical writing for people turning generative AI into systems.
A deliberately quiet library on reasoning, retrieval, evaluation, multimodal behavior, compute, and deployment. The tone is practical: fewer announcements, more reusable operating knowledge.
As agents browse, retrieve, and act, prompt injection increasingly looks like social engineering against a bounded digital worker.
How fast pattern matching and slower deliberative reasoning show up in modern AI systems, from AlphaGo to reasoning-oriented language models.
Why more capable systems increasingly depend on inference-time search, routing, and verification rather than pretraining scale alone.
A look at reasoning strategies that improve model accuracy when tasks require more than immediate pattern completion.
How retrieval systems change when agents can plan, inspect, execute, and adapt around enterprise knowledge.
The causes, impact, and mitigation patterns behind AI-generated fabrications in production language systems.
Optimization techniques, hardware considerations, and architecture patterns for retrieval systems with long context windows.
Why softmax probabilities are a fragile signal for detecting out-of-distribution samples and unexpected reasoning failures.
How multimodal language models combine text, images, and other signals into richer system behavior.
A practical look at prompt-based model vulnerabilities and what they imply for evaluation and governance.
The trade-offs between sparse expert routing and dense model architectures in large-scale language systems.
How language models can be used to evaluate complex systems at scale while still preserving review discipline.
Methods that reduce the cost and infrastructure requirements of adapting models to specific domains.
How lower-precision representations reduce model size and improve inference speed without giving up too much quality.
A method for identifying unstable model answers by measuring meaning-level variation across generations.
Why parallel computation matters for modern model training, inference, and real-time AI workloads.
The foundations of generative AI and the applications reshaping software, media, and knowledge work.
How retrieval-augmented generation improves model grounding and decision support across knowledge-heavy workflows.
The distributed systems concerns behind large-scale model training across vast GPU clusters.
Why real-world AI systems combine models with tools, retrieval, execution environments, and feedback loops.