Bulk Transclusion as Reified UI

TLDR: Some common UI patterns — such as the timeline in microblogging, or the thread view in forums — could be natively supported in hypermedia systems with sufficiently powerful bulk transclusion operators. Interpreting bulk transclusion as a reification of particular UI patterns opens an interesting design space for building (collaborative) information infrastructures without implementing special-purpose application logic.

And now to give enough context for the above to start making sense...

The web11For simplicity, I will only consider static web pages in this post. Dynamically created pages turn the web into an ad-hoc anything-goes fest with barely any meaningful structural properties to analyse. is a hypertext system. Individual webpages (like the blog post you are reading right now) are syntactically self-contained documents, which can be displayed by a suitable user interface (the browser). Hyperlinks (like this one) allow for easy navigation between pages. Additionally, iframes provide transclusion: the ability to directly embed a document inside another. Think embedded videos from a video hosting platform, or embedded interactive maps from a map provider.

At its essence, the web displays pages in isolation. Browsers typically support tabs, but those could also be simulated by opening multiple browsers; each rendered page is independent from every other rendered page. There is no native support on the web for, say, getting an overview of all pages hosted on a particular domain. Such sitemaps must explicitly be created by authors in the form of another webpage, with no support from the underlying platform (the web) itself.

This observation is the starting point for this blog post; I will explore (a particular slice of) the design space around hypertext systems which include notions of collections of documents, and user interface concerns which arise from those notions.

To give a better idea of what I mean by UIs for collections of documents, I will start with a few examples:

A profile timeline on a microblogging service displays all “tweets” by the same author sequentially, sorted by creation time.
A thread list in a forum displays the titles of threads, typically sorted by most recent activity.
A file manager displays the names of files (including directories), typically sorted by name. A good file manager allows for plenty of configuration (sorting by most recent activity or by file size, previewing file contents, etc).
Some platforms with comment functionality that allows for nested comments displays a tree of comment, often with platform-specific sorting criteria.

There are some direct similarities between all of these examples; we can derive a general pattern of which these are instances:

Each example aggregates several objects and linearises them according to some sorting criteria. For microblogging and comments, the aggregated objects are self-contained, as are the “normal” files in a file manager. Forum threads and directories in a file manager are themselves again linearised collections of objects.
Objects have metadata; in order to unify these examples we need each object to have a name (for example, a thread name, or a file name), and a timestamp denoting the most recent update. The view aggregating the objects may or may not display that metadata explicitly. Note that sorting in order of original creation (but not rearranging when editing) can be accomplished by using the creation time of an object as its name, and then sorting by name. This is how the microblogging timeline fits into this framework.
When dealing with nested objects, there is a choice of how much nested structure to display. A forum shows typically only the name of a thread and hides the posts behind a link, file managers typically have the same behaviour, comment sections might show a full heirarchy of comments, or they might require explicit user interaction to show deeply nested comments.

This categorisation allows us to specify these UI patterns declaratively, instead of through ad-hoc logic such as the javascript powering a microblogging timeline. A rough list of declarative parametersThis list is far from exhaustive, other parameters could include filter options, rate limits, a flag to control live updating, etc. I am exploring here, not trying to nail down conclusive lists. includes the order by which to sort (name or most recent update), which metadata to display, how much of the content to display, and how deeply to display nested hierarchies. Finally, we would need to specify which documents to aggregate in the view. How this could be done depends on the organisation of the underlying hypertext system. For HTML, the simplemost option (to which I will restrict this exposition for now) would be to specify a directory on some webserver, for example, everything in https://aljoscha-meyer.de/posts/.

It is quite easy to envision web support for this kind of UI. You open a new tab, get a mask for selecting a directory to aggregate, how to sort, which metadata and data to display, how to deal with nesting, hit ok, and the browser issues a corresponding request to the server. The server replies with the set of pages which should be included. The browser then fetches those pages and their metadata, and renders the UI.

What would this give us? For starters, effortless microbloggingI am simplifying. You would not get the ability to react to posts, write replies, etc. I am sticking to the basics for exploring a design space here, not proposing a fully-featured system for decentralised social media. to anyone with a website. No need for centralised microblogging-as-a-service platforms, nor for custom federated protocols. Instead, this would simply be a part of what the web is.

A more abstract but noteworthy property is a certain inversion of control: authors provide data, but the aggregation techniques are left to the viewers. On today’s web, authors implicitly select the one true way in which their documents are aggregated: when using a publishing platform they defer to whichever algorithm and UI patterns the platform provides, and for self-hosted websites the default choice is to not have any aggregation at all. A web with declarative aggregating UIs would lift this burden of choice from authors, and let viewers choose the most appropriate view for whatever it is they are trying to accomplish.

This alone is already pretty cool in my book. But the system I have sketched so far is fairly cumbersome: either viewers would need to re-create the same UI configurations again and again, or browsers would need to develop a whole set of functionality around the persistence of UI choices, and, ideally, their sharing and remixing. This is where we come to the core idea of this text: what if aggregating UIs could be embedded inside the documents of a hypertext system themselves?

The web already has the ability to transclude individual documents via iframes. What if you could point an iframe not only at webpages, but also at directories, together with a declarative description of the UI it should use for aggregating the contained pages? Suddenly, we have reified the bulk view UI as part of the hypertext system itself, through the (conceptually) simple act of bulk transclusion!

An immediate advantage is that the problem of managing UIs almost disappears. You can save UIs as regular webpages, for which there is already a vast hosting infrastructure, plus browser functionality such as bookmarks. Making a particular view available to others turns into the simple act of hosting some static HTML.

This harkens back to the notion of bulk UIs providing an inversion of control for how to access and discover documents. Bulk-UI-as-a-browser-feature would make this inversion of control cumbersome to the viewers, but with reified bulk UI, the burden could be distributed arbitrarily. Authors could easily specify a recommended way of aggregating their documents. But viewers would not be restricted to those recommendations, they might prefer their own UI choices — or those offered by a third party.

A fun observation: you can easly create (mutually) recursive views, which would lead to infinitely deep nesting. UIs which do not unfold nested hierarchies indefinitely have no trouble dealing with this at all (think of how a file viewer would deal with cyclicly symlinked directories). Other UIs would need to detect this case, similar to how browsers must already detect iframes which contain themselves. Whereas today’s browsers simply stop rendering at the first repitition of an iframe, I would argue that for most hypermedia systems it would make more sense to allow unfolding the recursion one level at a time based on user interaction.Another beneficial consequence is the arbitrary nesting of these bulk aggregations. You can aggregate documents which themselves contain aggregations of other documents, possibly with completely different UI parameters. The distinct presentations in forums for listing threads versus listing the posts inside a single thread immediately comes to mind. If bulk UIs were merely a browser feature, switching rendering needs based on levels of nesting would significantly complicate the declarative UI declaration. By reifying bulk UIs through bulk transclusion, this becomes trivial and highly flexible instead.

In fact, one possible design choice here would be to not have any built-in notion of displaying nested hierarchies at all, instead fully relying on the fact that the documents you display can themselves display nested contents again. This reduces the complexity of the bulk-transclusion operator, but at the cost of reducing the inversion of control: rendering of nested hierarchies would rely on the documents which “contain” other documents, not on the (potentially) user-controlled transclusion operator.

Another consideration would be the option of adjusting bulk transclusion operators on the fly. A general-purpose UI could offer options for filtering arbitrary bulk transclusions, for searching within them, for controlling how much of the full contents is displayed, or perhaps simply to change the criteria by which the transcluded documents are linearised (i.e., sorted). All of these could be expressed in interactive widgets — but all of these should correspond to options of the bulk transclusion operator. In fact, it should be simple to persist such adjustments by writing the source document to disk, but with the dynamic changes persistent as options to the modified transclusion operator.

Today’s popular hypermedia systems (i.e., primarily, the web) either refrain from aggregation, or enable it exclusively through non-declarative ad-hoc specification of special-purpose logic. You could use the latter (i.e., javascript) to prototype flexible bulk transclusion operators on top of iframes and the fetch API. But such a prototype would never implement the core underlying idea: that bulk transclusion can be part of the document model itself.

If we extend all these ideas beyond hypermedia systems and follow them to their logical extremes, we reach a design paradigm where there is no difference between bulk transclusion operators and UI logic. Aside from browsers, window managers are another fairly obvious candidate for this paradigm. But it might be worthwhile to consider where else the paradigm could be applied. What would reified UI look like in a video editor, or in a game engine? Do these questions even make sense (and if not, then how can we characterise the class of applications for which they do make sense)?

The final area of this design space I want to briefly highlight concerns multi-author aggregation. For example, we could imagine a bulk transclusion operator specifying a list of domains from which to aggregate the contents. For example, aggregating all pages in https://aljoscha-meyer.de/posts/ and in https://blog.example.org/ to obtain a timeline view. Which locations (or authors) to aggregate could become part of the options of the bulk transclusion operator for sorting. A microblogging timeline would sort by timestamp, and could break ties by authorship, whereas other usecases might call for sorting by author first, and by timestamp second.

Optionally, it might be possible to even merge documents with different authors (i.e., sourced form different domains) but with equal identifiers. A collaborative wiki, for example, could work this way. While it might be difficult or impossible to perform a meaningful semantic merge of arbitrarily many documents by arbitrarily many separate authors, a simplistic solution could be to only display the document with the newest timestamp. Or perhaps to render a list of all “conflicting” documentsYet another option here would be to let documents declare a list of other documents versions that they want to explicitly replace. This requires trust between authors, but would reduce irreconcilable updates down to only the truly concurrent updates, where neither update indicates awareness of the other update., letting a user expand any of them, or perhaps even render diffs on demand. All the different options can and should be reifiable options of the bulk transclusion operator.

Once we enter a multi-author setting, another interesting concern is access control. Should everyone be able to transclude arbitrary documents? Or should it be possible to restrict visibility (and, synonymously, transcludability)? If so, by which criteria? Would transclusion operators need to be (self-) authenticating, or should authentication live higher up in the stack?

A final remark: while I have presented things in light of the web as the most widely-used hypermedia system of the present, my personal interest does not actually lie with the web. Most of these ideas arose around thinking about and working with Willow. In Willow, Entries (the atomic unit of data) have a notion of hierarchical organisation (the path), a timestamp, and a notion of authorship (the subspace id). Further, we have Areas as a natural candidate for how to specify the sets of documents to transclude in bulk. And finally, we have a robust capability system in Meadowcap for incorporating access control from the very start. I am not terribly eager to prototype these thoughts of reified UI on the web. But defining a markdown dialect with an Area-powered bulk transclusion operator and using such markdown-ish documents as the payloads of Willow entries is one of the first applications I want to build on Willow! See here for more information.

Home