Colin's Notes

AI Coding Brain Rot

Posted on June 3, 2025

AI code assistants will cause great harm to business in the next ten years or so. Why? Because nobody will understand how their software works, what it’s doing exactly, if it’s actually secure, or in what ways it may fail to scale and so many more particulars. To some approximation AI could answer these questions, but good luck with the maintenance. Over years, widespread and deep employment of AI coding will devastate the software development ecosystem companies rely on for healthy cyber-infrastructure – not just application code but operations and security. [Read More]

Tags: AI Software Development

AI for Editing Novels

Posted on May 15, 2025

I’ve tried out most of the AI LLM products at this point. For the most part I haven’t found a use for them beyond short term entertainment. Recent releases of Claude and Gemini have gotten good enough to perform some busy-work coding for me, but anything complicated needs too much review still. But today I stumbled on an actually valuable use-case for writers. [Read More]

Tags: AI Writing

Apple Pizza Recipe

Posted on January 2, 2025

This pizza uses almost none of the standard pizza toppings and it tastes great. [Read More]

Minimum Useful PC Uptime

Posted on June 26, 2024

With the still only partial return to offices, I expect lots of people are running into an unpleseant side of modern computing. Frequent software updates, logins and operating system patches will interrupt your work or at best, restart your computer between work sessions, potentially leaving you some wreckage with your morning coffee. [Read More]

Tags: Software Development Business

Little 'Big Ideas' in Programming Language Design

Posted on June 17, 2024

What follows are a few sort of random observations on topics I’ve pondered while evaluating new languages and thinking about building my own language projects. They aren’t radical design choices or anything groundbreaking but they lend a language its feel for better or worse. [Read More]

Tags: Software Development Programming Languages

The Software Peter Principle

Posted on May 5, 2024

“In a Hierarchy Every Employee Tends to Rise to His Level of Incompetence” [Read More]

Tags: Software Development business

Save Arrow Record Batches Fast to Parquet With Custom Metadata During Incremental Writes

Adding custom metadata is easy and documented when saving an entire table, but adding to batched output is different.

Posted on April 11, 2024

Saving custom metadata – “schema metadata” or “file metadata” – to Parquet could be really useful. You can put versions of an application’s data format, release notes or many other things right into the data files. The documentation is somewhat lacking on how to accomplish it with PyArrow – but you totally can. Last time I reviewed the docs for Polars and DuckDB they didn’t allow for adding your own metadata to Parquet output at all. [Read More]

Tags: Software Development Parquet Python

Notes on simplifying complex Parquet data

Not all tools can read nested logical Map or List type data (often made by Spark.) Here are some tips to make the data more accessible by more tools.

Posted on February 26, 2024

The Parquet columnar data format typically has columns of simple types: int32, int64, string and a few others. However, columns can have logical types of “List”, “Map” as well, and their members may be more “List” or “Map” structures or primitive types. [Read More]

Tags: Software Development Parquet

Investigating Mojo 🔥

I spent the afternoon learning about Mojo. Here are my notes.

Posted on November 3, 2023

Mojo aims to be a super-set of Python by supporting the Python syntax and adding in new keywords for more performant and safe code. Mojo is a compiler that produces extremely high-performance executable binary files. It offers interop with existing Python libraries and a limited set of Python types [Read More]

Tags: Software Development Programming Languages Python

Effectively Avoid Problems When Consuming Legacy Character Encodings in Rust

Posted on October 21, 2023

There’s still a lot of old “extended” ASCII out there and you may need to deal with it. One source can be the old-fashioned “fixed-width” data formats, but it may be found in any old files like spreadsheets in Windows. Whatever the case you can’t just wish it away, sadly. [Read More]

Tags: Rust Software Engineering