Thoughts about the ethics of Large Language Models

05 Mar, 2023

drawing by me after numerous explorations using midjourney into retro mainframe zombie, computer flower head, ink drawing brushwork, junji ito, moebius, horror manga, cyborg flower cyclops

Two weeks after ChatGPT came out and I had my "lol this is fun" time, I decided to use it as fully and honestly as I could for the entire week, as a tool like any other. Tools are often unintuitive; a good tool often requires you to practice a lot before its quality comes to shine—many tools can seriously hurt you if you don't know how to use them. It definitely feels the same with ChatGPT. 3 months later and I still think I am only scratching at the surface at what concretely large language models offer for my field of work: programming.

Tool building and personal ethics

I love tools, I have used and built many in my life—in fact, I think my "goal" in life is to build tools for other people. There is nothing more fulfilling than seeing someone use something I built to build something that absolutely floors me. There is nothing more fulfilling than seeing someone thoroughly enjoy and cherish a tool I built because it makes them feel like they are doing better work.

While I didn't think of formulating my personal ethics before picking up writing (I do now because I discovered that writing is itself a tool to sharpen ideas), I think I had a clear direction for most of my life. Save for one freelance job, I always rejected working for companies that weren't building tools for users (vs say, tools for companies, or just plain nonsense, like ad-tech). Discovering Ivan Illich and his "Tools for Conviviality" last year allowed me to realize that these are concepts that you can articulate and communicate.

While I haven't even made it halfway through Illich's book, the first couple of chapters nail what my ethics are: tools and machines should be built and used to empower humans to live richer, more creative, more free lives. Tools should be built to augment human craft, not replace it.

My standpoint on large language models

I think that large language models (I will never be able to say "AI" because it is such a ridiculous, misleading, polarizing, disingenuous term) have an incredible potential for augmenting human craft. I genuinely didn't think this way of transforming natural language would appear during my lifetime. These models are a paradigm shift in how I work and write software (both professionally and in open-source). I am building tools that more than anything I've built so far, would allow me to share the joy and importance of knowledge and intellectual work, and allow others to do the same. I will write more about why I think that and how I use them to that effect (here's two examples in the meantime: Refactoring with ChatGPT and Documenting a shell script with Copilot )

It also means that I find how these tools are released (especially ChatGPT and Bing Chat) and the overall discourse of "replacing customer support", "a search engine that answers your questions", "replacing artists" absolutely abhorrent. These things don't make art, they don't answer questions, and they absolutely don't replace humans providing meaningful customer support. They can certainly help people do these things, but by themselves, they will just fool people into thinking there was meaning where there is none, and allow grifters to pretend the same. There is a complete 180 between a human using a large language model powered tool to provide better support because they now have more agency, and something taking agency from a human (to the point of replacing them entirely) and packaging it into a sterile chatbot.

As subtle as the difference between these two use cases might seem, to me, they are two entirely incompatible sides of the same medal. One side is building tools to empower humans, the other is building tools to disenfranchise humans, both workers and consumers. That subtlety makes talking about it hard, especially in my heavily "anti-capitalist" circles. The assumption is that LLMs are only there to replace workers and enrich techno robber barons, so any mention of a productive use of LLMs immediately leads to angry callouts and mob dogpiling (I am putting anti-capitalist in quotes because I certainly don't appreciate being called a tech-bro by someone who is a principal engineer at Microsoft, of all companies. Yeah, that stuck...).

What I am doing about it

As an individual, as a tool-builder, as a techno-optimist, I think the biggest impact I can make in order to make the world a slightly better place is to share how I use these tools to enrich my life, my work, creatively and intellectually, because it is not something written about much, and it is not an easy tool to steer.

I also am building opensource tooling and making it not just nice enough to use, but come packaged with a strong ethical stance, so that you can't just take it and then build a chatbot with it without coming into contact with some material that will hopefully make you think twice about what you are doing. What that looks like is still a bit unclear to me, but it's proper dada (see GO GO GOLEMS ).

It also means that I will consistently call out bullshit in the AI-pilled circles I hang out in (because part of being serious about a field that is overhyped is that you come in contact with a lot of BS and grift and exploitation). People are enthusiastic about these technologies for a lot of reasons, and their worldview is heavily shaped by the framing of the companies behind these models—most of them are already victims of the future the companies building these tools wish to unleash upon us, so it is often easy to start an earnest conversation. It is infinitely more productive than telling people that they are gullible fools, or harbor evil intentions.

If I can change the minds of 20 people in an AI-hype discord by dropping a few spicy links, then I'll definitely hang out on AI-hype discords. Conversely, if I can show people that think LLMs are just random words strung together that these tools can help grassroots organizations build more accessible documentation, websites, more secure software; that they can free you from sacrificing your cognitive potential on the altar of capitalist bullshit work, then that's where you will find me too.

(This article was written entirely without LLMs because I just want to get it out. Otherwise, I would have spent quite a bit more time revising and editing it with wordtune and chatGPT, because I think they do make my writing better. The drawing is an ink sketch by me after numerous explorations using midjourney into retro mainframe zombie, computer flower head, ink drawing brushwork, junji ito, moebius, horror manga, cyborg flower cyclops)