- Why hex?
- MVPL (Minimum Viable Programming Language)
- Artificial Inception
- #DecemberAdventure
- Minimalism, distilled
I have a page on my personal wiki system (which is written in min with a h3 frontend, and still closed-source, unfortunately) titled Use cases for a new programming language. The first line I wrote is the following:
A new programming language should only be created if it helps addressing a specific, practical use case
Right. That was meant to dissuade me to embark in endless quests for creating yet another pointless little language that doesn’t really solve any practical purpose. I have a bunch of those lying around or even half-designed in my personal wiki, and of course min has been my go-to personal project for many years. I still enjoy tinkering with it from time to time.
But…
I still wanted to create a new language. The problem with min is that by emphasizing practicality more and more, it became more of a single-file, batteries-included programming language that can be used to create web servers, make HTTPS requests, process XML files, …“big” things like that.
The syntax also evolved quite a bit to support complex data types, it has sigils, and other oddities (bang-postfixed auto-popping symbols, anyone?). It is also not that small anymore: well over 1MB on some platforms (it statically links OpenSSL for Heaven’s sake).
And it is written in Nim. Because I forgot how to program in C after uni. Sometimes I wish I had invested the same amount of time in re-learning C rather than having picked a new, non-mainstream language to build my ecosystem with. But it’s fine, everything helps shaping up the future, I suppose.
Why hex? ⇈
I always wanted to create a programming language that:
- had a really minimalist syntax
- was small in size
- was cross-platform (compiling to WebAssembly and also αcτµαlly pδrταblε εxεcµταblε)
- was implemented in C (that was more of a personal thing than anything else, but C ain’t going anywhere, at least)
- was concatenative (I am still fascinated by the concatenative programming paradigm)
- was able to do basic operations on strings, numbers, I/O, files and delegate everything else to external programs
- could be used to create shell scripts, used as glue code, etc.
- had some quirks. Not many, but some, to make it interesting and fun to play with
- was easy to understand and useful to learn more about concatenative programming even for non-programmers
Then the name. It took me weeks to come up with a good name. I then finally went with hex because:
- It’s short and memorable
- It has not really been used for a popular programming language yet (go try find 1-3 letter words that are not already taken)
- It has something magical about it (besides thinking about hexadecimal numbers, it also means spell or curse)
MVPL (Minimum Viable Programming Language) ⇈
Another page of my personal wiki aims at defining what constitutes a minimum viable programming language.
Here’s a list of things that I consider mandatory:
- integers
- strings
- arrays
- execute external processes
- variables
- define functions
- flow control with conditionals and loops
- error handling
- standard input and output
- file input and output
…and here’s a list of things that I consider nice to have:
- floats
- objects
- booleans
- lexical scoping
- serialization/deserialization
- regex
- networking
- xml
- json
I hope that lexical scoping in the nice to have made you chuckle. I know it would be bad, but technically you wouldn’t really need lexical scoping, really, especially in case of a concatenative programming language where you don’t necessarily need variables.
I decided to put this into practice when designing hex. And that’s why, hex:
- supports only integers, strings, and lists (quotations). It doesn’t have floats or dictionaries, even.
- comes with 64 native symbol for basic stuff, and lets you define your own (global) symbols, to store data and also define the equivalent of functions.
- uses symbols applied on the stack for everything and I mean, literally everything from flow control to error handling and I/O.
And that’s it. It doesn’t implement any of the nice-to-haves (ok, maybe serialization/deserialization, but that comes for free in homoiconic languages). I am also planning of keeping the number of native symbols fixed to 64, and maybe implement a sort of standard library in hex itself.
Artificial Inception ⇈
But but but… last time I checked, I didn’t know how to program in C anymore, right? It’s not that I can magically start writing decent C code from a day to the next, certainly not enough to write a parser, and interpreter, and things like that.
Or, maybe…
OK, I cheated a little bit. Everyone is always talking about AI, how it is changing the world, how it can do X and how there won’t be a need for Y anymore. There is definitely an element of hype in all that, but undeniably Large Language Models have been quite a breakthrough. Now, it’s not that ChatGPT is really intelligent or 100% reliable, but I hadn’t been using it for a few weeks and I was surprised to notice the improvements.
I started off innocently, expecting just some pointers and some pseudocode:
Implement a simple parser for a really simple stack-based concatenative programming language an able to process integer numbers and common arithmetic operations, for example: 1 3 4 * + 2 - Should result in 11 being pushed on the stack.
It blurted out a Python implementation (no matter what, ChatGPT will always go to Python if you ask it to code something without specifying the language).
Provide a C implementation
And it did… I didn’t try the code, but continued asking it to add support for strings. And kept asking for more extensions. I was surprised that it could keep the context for as long as it did. Then it started hallucinating more frequently, but at the end of the day, I was left with a good enough implementation of a basic interpreted concatenative programming language. Which actually compiled and worked, for the most part!
→ hex’s artificial inception (ChatGPT chat)
I kept tinkering with it for a while, and then decided to create a new repo for it.
After a few days, I found myself reading a lot of C code. Timidly, I started writing more and more myself… when I didn’t remember something (memory allocation, pointer arithmetic), I decided to ask ChatGPT for explanations, summaries, cheat sheets… I was re-learning C!
Now, earlier I said that I cheated a little bit. Using AI to write code still feels a little bit like cheating, sure, but it is important to remember that, after all, it is nothing more than a tool.
There are plenty of issues with it, a lot of ethical ones as well, but at the end of the day, AI is here to stay, and we may as well get the most out of it.
I think getting my little project bootstrapped with AI was an interesting experience, and it was very useful for two things, mainly:
- Taking care of the boring, mechanical parts of the project
- Acting as a personal tutor for learning more about C programming
Again, not an infallible tutor, but still better than figuring stuff on my own or with just the Internet. And definitely faster and more efficient than skimming through a programming book or endless Stack Overflow questions.
Using AI for programming is not cheating, no more than using Stack Overflow or… Visual Studio Code is.
Anyhow, overall positive experience. The best thing I got out of it was a better knowledge of the C programming language: I am now more confident and I can actually program in C a bit, and understand it. Better than I did in a long time, anyway.
#DecemberAdventure ⇈
It took me about 15 days of sporadic ChatGPT interactions alternated with even more sporadic (and short) personal coding sessions to get the basics of the language implemented. At the beginning of December, I then decided to take part in #DecemberAdventure and work on hex of course! You can read more about it here.
The funny thing is that most of the design (and implementation) of the language was kinda there already, but I managed to create more of an ecosystem around it.
One thing I actually enjoy about creating new open source projects is that it is never only about the code: I love writing support material for my creations, creating web sites, things like that.
Today, hex:
- has its own web site at https://hex.2c.fyi, with its own browser-based playground, a tutorial, and even a specification.
- runs on different platforms, and can be compiled to WebAssembly to an αcτµαlly pδrταblε εxεcµταblε
- has a virtual machine, comprised of a simple but fully-functional bytecode compiler and interpreter
The thing is… I have manage to achieve this levels of ecosystem completeness with min, never mind doing it in a really short time. But with hex I started (and remained) small, and that was the key.
Minimalism, distilled ⇈
Syntactically speaking, the language is very similar to min. I always quite liked min’s distinctive lispy flavor without actually being a lisp, its simple way of doing things without many special forms.
Take this program for example:
; Print a list of squares!
(0x1 0x2 0x3)
(
"i" :
"The square of " i str " is " i i * str
cat cat cat puts
"i" #
)
each
Sure, it is a bit verbose, but if i tell you that:
- ; is used to start line comments
- Things are pushed on the stack from left to right
- Things within parenthesis are quotations (lists) of other things
- Strings are delimited by double quotes
- Integers are prefixed with 0x
- Everything else is a symbol, and unlike other things, when they are not inside a quotation they disappear from the stack and manipulate it in some way
…you basically already know the language syntax and how it works.
You have literals that get pushed on the stack, or symbols that manipulate it, and that’s pretty much it.
You may want to know that the :
symbol is used to define other symbols and that the #
symbol destroys them, and that the cat
symbol pops two strings from the stack and pushes back a concatenation of the two… sure, you can learn all of hex 64 native symbols, but as I mentioned earlier, there are no statements, or expressions, or special forms: everything is either a literal that gets pushed on the stack, or a symbol that manipulates it.
Well, there you have it. Yet another programming language that you’ll never use. Everyone is creating programming language these days, and now with AI even people that normally wouldn’t will be able to, imagine that.
Still, this has been, and it will continue to be, a very rewarding experience. And maybe, just maybe, this new little thing called hex can teach someone something new (concatenative programming) or even be used to write some very cryptic shell script.