Hobbyist trains Victorian chatbot from scratch on 28,000 public domain books
Trip Venturella, a writer and MFA graduate, has built a small language model called Mr. Chatterbox, trained entirely on Victorian-era literature from the British Library. The model draws on 28,035 books published between 1837 and 1899, totaling roughly 2.93 billion tokens of training data. Venturella used Andrej Karpathy’s nanochat framework and Claude Code, an AI …