joe

rough thoughts on language models, overlanding, computer science

Blog post thumbnail

The case for human-agent-agent co-operation

Why an agentic future requires supervision on-the-fly

Blog post thumbnail

We need a new paradigm for agentic evals

How can we evaluate agentic systems and LLM pipelines being used for complex tasks?

Blog post thumbnail

Technologies for an agent-based world

Adoption of AI agents in the real world will be limited by infrastructure initially. What technologies will we need to facilitate the early adoption of agents?

Blog post thumbnail

A reading list for evaluators

A list of posts relating to LLM evals, test time computer and more

Blog post thumbnail

An opinionated overview of a modern production Kubernetes system in 2024

A high-level guide to setting up a modern production system with Kubernetes, Terraform, and GitOps practices.

Blog post thumbnail

Be enhanced or be absorbed

Which systems will be improved by increases in machine intelligence, and which will disintegrate?

Blog post thumbnail

The Future of Knowledge Management

How and why our relationship with knowledge must change

Blog post thumbnail

Non-technical LLM Resources

Resources for keeping up to date with the latest in LLM theory and practice that are not overly technical.

Blog post thumbnail

LM evals reading list

I'm doing a little bit of studying on the eval landscape. Here are the papers I'm reading and my notes.

Blog post thumbnail

Breaking the import cycle in Go

A quick summary of techniques to keep in mind when reasoning about import cycles

Blog post thumbnail

Are LLMs capped at human intelligence?

Thoughts on the claim that transformers doing next token prediction cannot surpass human performance

Blog post thumbnail

Can GPT-N crack SHA256?

Could it be done in theory?

Blog post thumbnail

Talks I have given

I want to keep a log of talks and lectures that I give, for posterity.

Blog post thumbnail

The Scaling Hypothesis made simple

Explaining the scaling hypothesis to those without prior context

Blog post thumbnail

Auto-merging Renovate chore branches

A bash script to make Renovate chores less time consuming

Blog post thumbnail

Migrating your database with Go Migrate

Tips for database migrations using Migrate

Blog post thumbnail

Inverse Scaling

What is inverse scaling and what does it tell us? An exploration of ARC's Model-Written Evals paper.

Blog post thumbnail

Ideas

Things I would like to write about.

Blog post thumbnail

AI & Wealth Inequality

AI could exacerbate an increasingly polarised job market making it hard to cross the gap.

Blog post thumbnail

Barriers to entry in AI

Why does the average person not engage deeply with Artificial Intelligence?

Blog post thumbnail

AI & Democracy

Notes on how AI might degrade and destabilise the democratic society.

Blog post thumbnail

Notes on rationality in machines

A brain dump of some of my thoughts on rationality in machines.

Blog post thumbnail

Utility Indifference

In Pursuit of Corrigible Artificial General Intelligence

Blog post thumbnail

A Level Computing Notes

A Level CS Notes