Sometimes it takes me 22 years (+ one evening) to write a blog post. Here are my thoughts on "homoiconicity" and, as an alternative, "bicameral syntax". (Warning: 4000 words.) parentheticallyspeaking.org/articles/bic... - ThreadSky

shriram.bsky.social • 174 days ago

Sometimes it takes me 22 years (+ one evening) to write a blog post. Here are my thoughts on "homoiconicity" and, as an alternative, "bicameral syntax". (Warning: 4000 words.)
https://parentheticallyspeaking.org/articles/bicameral-not-homoiconic/

Comments

alephnaught2tog.bsky.social•173 days ago

An absolutely *gorgeous* read, thank you! Always a delight

acowley.bsky.social•173 days ago

This is a really nice piece of exposition! I think it's hard to accept that the space you've somewhat carefully carved out for the reader could have such ramifications. The wisdom to be found in the description of nascent language development efforts floundering with syntax is deceptively important.

shriram.bsky.social•172 days ago

Ah, thank you kindly. Passing your muster is a high bar!

johnericson.me•173 days ago

Yay! Are the parsing / formal language people ever going to investigate this? I feel like it's so embarrassing to them that there is no theory for reader/parser composition (arbitrary long pipeline stages).

shriram.bsky.social•173 days ago

I'm not sure what there is to investigate beyond what I wrote? I mean, there might be some low-level complexity issues, of academic interest, about the *precise* context-free class, but the pragmatics would likely far outweigh them.

mvsamuel.bsky.social•173 days ago

fyi, I did an experiment a while back to show that a syntactically complete {C.like;} language can be parsed with an OPP parser.

And that there are a number of ways to add quasis.

https://web.archive.org/web/20220524085614/https://temperlang.dev/design-sketches/parsing-program-structure.html

mvsamuel.bsky.social•173 days ago

tldr:
- a cover grammar conflates forms like `if` w/ block-receiving macro calls
- a bespoke operator precedence parser builds lispy trees
- operator trees can flatten to pseudo token lists and parser combinators can recover special forms from them
- recovery can be redone after macros munge trees

johnericson.me•173 days ago

Basically, what is the structure of the tree-to-tree parser? Is it some sort of (homo)morphism?

OK to say "who cares, not needed for practical purposes", but formal language people get sucked into studying other things far in excess of what is needed for practical purposes.

johnericson.me•173 days ago

So much modern abstract math is oriented around homomorphisms, the fact that the parsing people seem not to have explored those is mysterious to me. Feels almost like a monocameral syntax conspiracy ;)

shriram.bsky.social•173 days ago

But there is no "the" tree-to-tree parser.

Also, there's an extensive theory of transducers…

johnericson.me•173 days ago

The transducers you refer to are the finite state ones, right? Yes for regular languages we have a great theory of morphisms. But I never found anything (that was any good like that for fancier formal languages. Seems like a big gap!

johnericson.me•173 days ago

There is https://en.m.wikipedia.org/wiki/Chomsky%E2%80%93Sch%C3%BCtzenberger_representation_theorem which is cute "every CFG can be contorted to be bicameral parens+regex!" and tantalizing, but I couldn't find much follow-up research, and it's unclear if s-expr for lisp is more than a passing resemblance to the Dyck language involved there.

micahcantor.bsky.social•172 days ago

I've been waiting for this since @lexi-lambda.bsky.social launched her own campaign against homoiconic a while ago

shriram.bsky.social•172 days ago

I've been campaigning for a LOONG time! Every time it would be on social media and I'd get a half-dozen half-baked "well how about this definition then?"s instead. Sooner or later I needed to sidestep all that and just write this down…

gregorkiczales.bsky.social•166 days ago

This is excellent, thanks for writing it. XML and S-expressions are the same in just the ways you say. And thanks for debunking homiconicity.

I agree with all you are saying up to “just one of many views onto the core abstract syntax”.

gregorkiczales.bsky.social•166 days ago

Interlisp-D and Smalltalk tried a world in which an internal representation of code was rendered each time before it was edited. But text file-based code representation won out. I THINK because the affordances of lightweight code and comment formatting are pretty great.

gregorkiczales.bsky.social•166 days ago

In Interlisp you could put comments before any expression, but the comment itself was an S-expression – something like (comment here is something interesting about the next expression.) Interlisp was case preserving by default so it worked out alright. But only alright.

gregorkiczales.bsky.social•166 days ago

Any kind of formatting of the comment was impossible. If you popped up a narrow window to look at a function then it appeared quite differently than it looked in a wide window.

That was a long time ago, things might be different now, but I’m still doubtful that’s the “direction” to go.

gregorkiczales.bsky.social•166 days ago

In fact, that’s the thing I was try to say in my too-unclear OOPSLA 2007 talk. That we should get flexibility in how we see and edit code by going the other direction - take text-based files based on one standard syntax as primary, and then be able to

gregorkiczales.bsky.social•166 days ago

register (not quite parsing) that, and see and edit it in different ways. That would get more flexibility in view, and preserves the affordances of text based representation and comments for the primary source.

marutks.bsky.social•166 days ago

Bicameral? I thought Lisp is homoiconic.

shriram.bsky.social•166 days ago

Helps to read before responding!

samth.bsky.social•174 days ago

It would be nice to have dates on your posts.

shriram.bsky.social•174 days ago

I'm trying to anti-blog. Blogs are overly temporal, so I guess I'm trying to be overly-atemporal.

samth.bsky.social•174 days ago

I think this is exactly backwards -- if someone reads this 10 years from now then knowing when it's from is more important than for the conversation today. It's good to know when the EWD notes are from, for example.

shriram.bsky.social•174 days ago

I keep updating my articles. They're supposed to be as current as I can keep them.

samth.bsky.social•174 days ago

Then that one should mention Rhombus ;)

shriram.bsky.social•174 days ago

I'm still trying to figure out how to work Honu and Rhombus in, and for that matter, also Enso. I think they're all related.

k4rtik.bsky.social•174 days ago

“Last updated date” is the standard solution (or if too much hassle, “originally published“). Even books need some timestamping when they are as anti-blog as anything can be. :)

shriram.bsky.social•174 days ago

Unfortunately I've spent a LOT of time thinking about timestamps and haven't come up with a solution I like.The actual timestamps are all in the git repo. So someday I'll probably turn them into something human-readable that I'm comfortable with. I'm in no hurry. (-:

wingolog.org•174 days ago

how can one atemporally end a post with forward-looking comments? :)

shriram.bsky.social•173 days ago

You got me! The problem is that for some ideas, the future always remains in the future.

wingolog.org•174 days ago

(lovely article tho!)

josephhgarvin.bsky.social•173 days ago

FYI there is something slightly off about the rendering on mobile, the paragraphs are the right width to fit on the mobile but there's a huge amount of margin on either side as if it is still trying to make the overall page width right for desktop so you have to scroll horizontally to center it

shriram.bsky.social•173 days ago

The mobile rendering is an endless bugbear for me. I keep hoping some LLM thing will solve it but they never do. (-: I hate CSS. Sorry. I'll try to fix it over winter break.

shriram.bsky.social•173 days ago

The mobile rendering is an endless bugbear for me. I keep hoping some LLM thing will solve it but they never do. (-: I hate CSS. Sorry. I'll try to fix it over winter break.

alephnaught2tog.bsky.social•173 days ago

(holler if you want a hand with the CSS)

shriram.bsky.social•172 days ago

Boy, do I ever need it… I'll just have to bite that bullet at some point!

stefcoetzee.com•173 days ago

Tailwind CSS works well enough for most of these kinds of things, in my experience.

shriram.bsky.social•172 days ago

Ah, thanks! I'll have to give that a try.

noteed.bsky.social•163 days ago

I think it's possible to have a syntax completely described in terms of distfix expressions: https://github.com/noteed/syntactical. This is enough to have a "nice" syntax for those that don't like parenthesis, and easily equivalent to s-expressions so we can focus on semantics.

shriram.bsky.social•163 days ago

1. The idea that it can be non-parenthesized is well-known, see eg the pioneering work by Honu and more recently Shrubbery/Rhombus.

2. This library offers a very limited syntax. It is not "equivalent" to what you get from bicameral languages, which are inherently extensible.

psteckler.bsky.social•166 days ago

Nobody is really sure if you're from the House of Lords.

markguzdial.bsky.social•173 days ago

Thanks for the interesting blog post! I'm trying to use this language to understand Smalltalk. I think: Smalltalk is homoiconic, but definitely not bicameral, and it has a Scanner and Reader (e.g. for defining blocks) but no Parser. Am I close?

shriram.bsky.social•173 days ago

I don't know what "homoiconic" is (that's the whole point), so I can't answer the question wrt Smalltalk. I thought Smalltalk blocks were lambdas. I have never quite grokked Smalltalk syntax well enough to know whether it has a parser or not, but I'm sure it does?

markguzdial.bsky.social•173 days ago

Smalltalk has a syntax (see the famous Smalltalk syntax postcard), but everything is a message. Each object receives a message (with syntax structures like blocks or arrays already mapped to objects) then decides what to do with it (executes a method). Message-sending instead of parsing.

markguzdial.bsky.social•173 days ago

So I asked UM-GPT "Does the Smalltalk Parser do the same thing as a Lisp Reader?" It gave me a good answer that correlates well with what you're saying about Lispy language advantages.

liamoc.net•173 days ago

"syntax structures like blocks or arrays already mapped to objects" such mapping is a parser, no?

markguzdial.bsky.social•172 days ago

Or a Reader? I’m trying to understand the distinction Shriram is making

shriram.bsky.social•172 days ago

I think we first need to get some terminology straight. "Block" can have two different meanings. "Block" in the (rough) sense of "scope, which can be nested", à la Algol, Pascal, and their descendants (including Scheme). "Block" as a run-time language construct that is aka a "closure". ↵

markguzdial.bsky.social•173 days ago

And Forth has only a Scanner, no Reader or Parser.

And the cost is that Forth and Smalltalk can't be analyzed in the same way as modern PLs -- do I have that tradeoff right?

shriram.bsky.social•173 days ago

I think that's right re. Forth.

I think Forth's analysis challenges are peculiar because you can modify things on the fly. Can a Smalltalk program modify itself?

markguzdial.bsky.social•173 days ago

Yes. Since Smalltalk is entirely written in itself (including the compiler), a method could redefine itself. A method could even manipulate its own bytecode.

shriram.bsky.social•173 days ago

Right. But that's entirely a question of the *runtime* semantics. Doesn't say anything about how the syntactic side of it is organized.

markguzdial.bsky.social•173 days ago

You know more about Smalltalk than I know about Lispy macros, which is what I think you're referencing.

The original Smalltalk-72 could change the syntax side of things. A message send included the rest of the code, untokenized and unparsed. Later Smalltalks were more locked down.

jamesbrock.bsky.social•173 days ago

For years I've been trying to convince people to use monadic parsers instead of regex for everything.

https://hackage.haskell.org/package/replace-megaparsec-1.5.0.1

Your essay about bicameral syntax is the first time I've read a principled reason (other than speed) about why a regular parsing pass is a good idea.

codyroux.bsky.social•173 days ago

It's 2024, even languages can be bi(cameral)

etorreborre.blog•174 days ago

Excellent piece, thanks!

tealeg.bsky.social•174 days ago

“Lisp..; it’s the sound of a cricket ball on a bat.” - if I saw that quote, unattributed, I’d guess its source in a moment :-)

shriram.bsky.social•173 days ago

This intersection is shockingly small and Something Must Be Done About It.

tealeg.bsky.social•173 days ago

If only they could all appreciate the sound of willow on lambda!

shriram.bsky.social•173 days ago

If you flip this in a mirror, you know what this looks like? 🏏

tealeg.bsky.social•173 days ago

Alonzo Church at silly mid off?

tealeg.bsky.social•173 days ago

Joe Root putting a bit extra on a closure as he “lifts” it into the crowd? (Ok, ok, I know…)

tealeg.bsky.social•174 days ago

I knew there was a reason I took a day off work.

shriram.bsky.social•173 days ago

Hope you're not squandering it and are getting a good walk and coffee and cake and whatever else you hipster Europeans do in your artisanal cities.

tealeg.bsky.social•173 days ago

Walking, yes. There were artisanal fibre optics being laid in the street around my house (by artisans, naturally). Neither quaint e-auto nor charming e-bike were accessible to me, so my wife used our typical European bus and tram network to reach the dentist, whilst I did some DIY and cooking.

justinhj.bsky.social•170 days ago

Fascinating post thank you

jonathanaldrich.bsky.social•173 days ago

Awesome blog post, thanks for writing it!

One small quibble: I think your consideration of Homoiconicity is a bit of a straw man. There are different, much better (i.e. more well thought out) definitions than the ones you compare against. My favorite:

https://www.expressionsofchange.org/homoiconicity-revisited/

jonathanaldrich.bsky.social•173 days ago

That said, I agree that Bicamerality is a super valuable property of Lispy languages and it's awesome to see it named and highlighted. But I also think Homoiconicity has real value (for human-factor reasons, primarily) and to be Lispy, a language should have both.

jonathanaldrich.bsky.social•173 days ago

Specifically, I suspect (speculation, for now) that Homoiconicity provides human factor value for making macro-like things easier to read and write. Python/JavaScript fail van Schelven's definition (linked above) and don't provide this value.

ltratt.bsky.social•173 days ago

Amongst the challenges with "macros" is that there are several different concepts which either use the term "macro", or should, or are very closely related -- it makes "homoiconicity" look like a well-defined term! Time has taught me, personally, that Lisp is surprisingly unspecial w.r.t. macros.

jonathanaldrich.bsky.social•173 days ago

I agree that there are several different concepts that are related to macros. But how does this make "homoiconicity" less well-defined? van Schelven does not refer to macros in his post, for one! I only referred to macros as a way of illustrating where one sees the human benefits of homoiconicity.

ltratt.bsky.social•173 days ago

I guess what I mean is: I don't think homoiconicity -- at least, under the most of the definitions I know of it -- matters for macros. Paraphrasing something I wish I'd said and avoiding "homoiconicity" entirely: I think macros are necessary for Lisp; I don't think Lisp is necessary for macros.

jonathanaldrich.bsky.social•173 days ago

BTW I agree that some of the off-putting (for some people) factor about Lisp syntax comes from bicamerality. But I also think some of it is just the particular choice of syntax for those trees. Parentheses are the simplest thing you could do--but they are actually kind of unreadable for data too!

jonathanaldrich.bsky.social•173 days ago

I love what Matthew Flatt is doing with Shrubbery/Rhombus in this space. We had ambitions of doing something similar with Wyvern--used it in our type-specific languages work, and planned to extend to macros, but didn't sustain that push. Kudos to Matt's team for their brilliant design!

gordon.bsky.social•170 days ago

Wonderful article. The parallel to JSON is enlightening. Shared with my team.

sbrunthaler.bsky.social•174 days ago

Glad you found the time to write it down, given you suspected obstacles. We should have a Pulitzer prize for science articles, this one would qualify! (If only there'd be a collection of such articles somewhere...)

shriram.bsky.social•173 days ago

Haha, thanks, that's very kind of you. I was obviously in massive work-avoidance. (-: (I'm not even entirely sure what work I was avoiding, it helped me forget about it entirely.)

sriku.org•174 days ago

There is an intermediate thing in Julia that seems to be a somewhat unique factorization of the Str->Toks->Tree->AST->Val chain -- generated functions. Haven't actually used them yet since functions have usually sufficed, but it looks a tad more accessible than a full on AST->AST in Julia's context.

sriku.org•174 days ago

Link in case someone (not Shriram) reading this isn't aware of them - https://docs.julialang.org/en/v1/manual/metaprogramming/#Generated-functions

shriram.bsky.social•173 days ago

Neat! Also unsurprising, given Julia's *very* strong Lisp heritage.

tov.bsky.social•174 days ago

This is great!

(typo: “Scaner --> Reader --> Parser”)

shriram.bsky.social•173 days ago

So much for running a spell-checker…

I was going to turn those into images anyway once I can remind myself of the details of the pict library, so I'll fix it then. Thanks!

lucach.bsky.social•173 days ago

Classic Shriram. Pretends to pass as an AI by using "delve"; fails by inserting a typo.

shriram.bsky.social•172 days ago

You know, I wrote that "delve" — which came completely naturally to me — and thought, "Wait, is someone going to…" Grazie, Luca!

sp.degabrielle.name•170 days ago

Thanks for this. Opened my eyes on things, gave me language to describe them and cleared some stuff up I didn’t realise I didn’t understand well.

Only complaint is your choice of term. It’s good but doesn’t sound sciency enough to feel clever saying it😅

Fun fact: your post was deleted by the mods🤣

gfrison.com•173 days ago

I didn’t thought that it would be different from that

Comments

Posting Rules

Reply