Nota: A Document Language for the Browser

moab · on May 12, 2022

Nice work as usual, Will. How do you plan to convince users that this is not just a research prototype that will be abandoned after being published, and won't suffer from bit-rot 5 to 10 years from now? You talk about this a bit at the end of the longer paper on Nota, but I'd be curious if you have additional thoughts. Seems like we need something like Talos/immutable infrastructure but for the web (does that exist already?).

wcrichton · on May 12, 2022

Thanks! Currently, I have no plans to convince users of Nota's longevity. It is a research prototype, and I will pursue it as long as I have the time and still think it's a good idea.

My hope is that in the near-term, I (or other early-adopters) will create a gallery of documents that use Nota's features to convey information in a compelling way. Those examples will convince more people to adopt Nota for their blog or website. Maybe a forward-looking academic workshop convinces its authors to write their papers in Nota. And eventually some Nota will gather some institutional momentum to fund its development. Once there are enough Nota documents worth archiving, then people/orgs will invest resources into archival stability.

So right now, I'm more interested in convincing people that using Nota can enable them to write documents that are significantly better for readers without significantly more effort on behalf of the authors.

moab · on May 12, 2022

The plan makes sense. A likely issue for folks currently preparing papers in LaTeX is that they will be put off by the extra work of having to port/write another copy of their document in Nota, but maybe this is intended. You shouldn't just be dumping your paper to the web.

Re documents that would be useful, you could dogfood Nota with 242 to show how powerful it could be to teach students PL theory. A course like 15-312 would be much less intimidating for students if materials were encoded both informally/formally in Nota. You could imagine porting one chapter from PFPL as a small example.

I'm teaching a new course on algorithms in the spring and I plan to try it out to develop some online course materials. Will get in touch with you with any questions / to share what we prototype. You might also want to share this with Umut who has been developing (what I believe is) a KaTeX-based system called Diderot that a lot of CMU courses are using now.

wcrichton · on May 12, 2022

Yes, I think course notes or textbooks are a great candidate for Notafication. I sadly don't teach 242 anymore since I just graduated, but I'll certainly use it for my next course.

I would be happy to support y'all in using Nota for the new course. And I hadn't heard about Diderot, I will reach out to Umut for sure.

bachmeier · on May 12, 2022

After reading this comment, once important use that comes to mind is an empirical paper where you click a button to see the data and code that produced each of the results.

wolverine876 · on May 12, 2022

I don't understand what it does or is. Is it a sophisticated markup language with embedded JavaScript? What can it do that makes it worth the complexity compared with a markup language - an example would be great. Also, what is the output, a single file?

Also, what are the differences between it and a CMS, such as a wiki?

wcrichton · on May 12, 2022

"Sophisticated markup language with embedded JavaScript" is a fine model of Nota. You can also think about it as another syntax for a React program, like an alternative to JSX that's oriented at writing documents (prose-first) rather than writing websites (code-first).

The complexity of Javascript and React gives you two main benefits:

1. You can use Javascript as a metalanguage to manipulate the document, e.g. to make and reference variables, perform computations, and so on.

2. You can use React to provide semantic structure and interactivity to your document, e.g. to mark definitions of vocabulary and link references back to the definition.

Nota consists of a syntax, a compiler that goes from Nota document to React program, and a set of React components that are commonly used in Nota documents. You could integrate it into a website or CMS, but it is not inherently any kind of information management system.

wolverine876 · on May 12, 2022

Thank you. Would Jupyter be a good comparision (though that isn't prose-oriented)? Maybe even Emacs? What users are you targeting? Developers usually don't need to write a lot of prose.

I'm always interested in systems that are willing to make the tradeoff of more power and sophistication at the cost of a greater learning curve, and I strongly agree that we need something beyond PDFs and webpages (and Word documents). Nota interests me but I'm having a hard time seeing how the power and sophistication will significantly improve my writing or documents (and I don't develop React applications). A use case would be great; the hyperlinks and tooltips on the page I see can be done much more simply.

wcrichton · on May 12, 2022

I'm targeting people who are writing documents: bloggers, academics, textbook writers, that sort of thing. Certainly Nota requires a level of technical expertise to use right now, but I have a WYSIWYG editor as a distant-future TODO.

If you read my original paper about Nota, you can find a more interesting set of applications: https://willcrichton.net/nota/

wolverine876 · on May 13, 2022

Ah, thank you. That paper makes a lot more sense to me, describing a clear problem. For others:

The principal question underlying this work is: how much effort does it take to understand a PL paper?

and

Papers about programming languages involve complex notations, systems, and proofs. Static PDFs offer little support in understanding such concepts. I describe Nota, a framework for academic papers that uses the browser's interactive capabilities to support comprehension in context. Nota uses hover effects, tooltips, expandable sections, toggleable explanations, and other interactions to help readers understand a language's syntax and semantics.

account-5 · on May 12, 2022

Please excuse my ignorance. I don't get why you wouldn't just use HTML and/or JavaScript. Apologies I know I'm missing something.

wcrichton · on May 12, 2022

Nota is really two things:

1. An alternative syntax (vs. JSX) for writing React programs.

2. A set of React components for structuring a web document

As a syntax, Nota is meant to make writing prose-heavy React easier. As a set of components, Nota is meant to help readers use the document's structure for understanding the document's content, e.g. by showing the definition of a term when you click on it. It's not a replacement for HTML or Javascript, but an alternative syntax for them with a rich standard library.

anentropic · on May 12, 2022

Thanks, that's a much clearer explanation than the page itself managed to give

account-5 · on May 12, 2022

Thanks, that makes more sense. I'm not a full time developer just an amateur programmer, I'm currently teaching myself web development (I decided to go vanilla JavaScript no frameworks) but I make extensive use of pandoc and markdown/Zim-Wiki for academic writing, I thought this was something like that.

wcrichton · on May 12, 2022

Academic writing is definitely a target application area for Nota (since I am an academic). Philosophically, Nota combines together all the ideas I think are needed to make a better platform for writing documents like academic papers that are web-native.

Semiapies · on May 12, 2022

This seems a lot like Pollen (https://docs.racket-lang.org/pollen/index.html), but aimed more at JS than HTML and PDF.

avivo · on May 12, 2022

This is fascinating; I (well, with some support from my advisor IIRC!) implemented some things like this in latex for my thesis, but it was very limited. It still did help me keep track of all the variables and letters.

I'm curious why not do this in MDX instead? Would that have been possible? It seems like it might be a good base that can then interact with other things (e.g. I'd love to just use this in https://motif.land).

wcrichton · on May 12, 2022

I originally tried using MDX. But in my experience, JSX syntax is better suited for writing websites than writing documents. The <Tag></Tag> syntax is more verbose than @Tag{}. I ended up having to do lots of raw string embedding too, like <Component>{r`content...`}</Component> which is even worse. Basically you either had a really concise syntax for special Markdown-supported operations, or you had a really annoying syntax for non-special operations. Nota instead provides a uniform somewhat-nice syntax for both.

Additionally, I found the composition semantics between Markdown and JSX somewhat confusing. Random whitespace and newlines would sometimes have a very significant effect on the resulting document. It wasn't really clear when you would be in Markdown mode vs. JSX mode.

FWIW the Nota React components (e.g. @Definition and @Ref) could be used with MDX. The Nota syntax and Nota runtime are separate.

practal · on May 12, 2022

This makes a lot of sense. I have been working on something similar, along the lines of mixing Latex and Markdown, and christened it "Recursive TeXt". Currently it only exists as a macOS app with me as the only user (it is still early stage with lots of changes ahead, but all articles on https://practal.com are done with it).

Making React part of this makes a lot of sense on one hand, but on the other hand it feels to me that documents should be more long-lived than React might be. So a compromise might be to use React under the hood for components, but to abstract it away in a document DSL. But that might be just overthinking it.

cookiengineer · on May 12, 2022

Latex has its purpose for high quality and fully controllable templating. Does nota try to achieve the same?

Latex, however cluttered its syntax is, benefits a lot from strong tooling that has matured over the decades.

So I'm wondering what the plans are for nota, given that it is a subset of HTML and a subset of latex (judging from reading the examples).

Why not use commonmark? The dynamic argument is kind of invalid, given that markdown.js and other parsers exist and are used heavily on social media web apps and sites.

If the feature gap can be identified as bringing latex to the web: why not implement something like latex.js, which could be embedded in web browsers similar to how pdf.js changed the web?

wcrichton · on May 12, 2022

Nota can express any HTML document. It uses KaTeX for math, so it is as expressive as KaTeX.

The goal isn't really to compete with lightweight markup languages like Markdown. There are plenty of those. The goal is to provide a language for writing complex, interactive documents like academic papers.

It isn't as simple as bringing LaTeX to the web. There are many things that LaTeX cannot do or express (e.g. like a tooltip popping up when you click on a reference to a definition). The goal of Nota is to make things like that possible.

qwerty456127 · on May 12, 2022

I once felt like I'd like to learn LaTeX (unrelated to math/science, just for casual typesetting). I gave up in half an hour as it became vividly obvious to me it will take a life to master. To me it seems the most alien language I ever seen.

xigoi · on May 12, 2022

It's not that bad if you ignore all the stuff you don't need.

tonyg · on May 12, 2022

Will, how do you think of Nota in relation to Racket's Scribble [1]?

[1] https://docs.racket-lang.org/scribble/index.html

wcrichton · on May 12, 2022

Nota's syntax is heavily inspired by Scribble. You can think about Nota as "web-native Scribble", or "what if Scribble was written in Javascript".

soegaard · on May 12, 2022

I like it. It looks very much like the notation I used in Urlang to produce React components.

I really need to put some more examples online...

This is an all-in-one-file example:

https://gist.github.com/soegaard/1a12a9c3eccd295f1d91e030f9f...

For the use of Scribble notation to produce React components see line 99.

wcrichton · on May 12, 2022

Very cool! Yeah that's about the closest thing to Nota I've seen so far. Haven't seen Urlang, I'll have to check that out.

soegaard · on May 13, 2022

FWIW here is an overview:

https://racket.discourse.group/t/urlang-is-javascript-with-a...

The Github repo is here:

https://github.com/soegaard/urlang

danmur · on May 12, 2022

Cool language, I wish it wasn't react though. I guess there's more interactivity possible than shown in the demos; the only thing I've seen are popups for terms. Don't really need react for that.

wcrichton · on May 12, 2022

My original paper on the system shows some more interesting examples, such as logical inference rules that can toggle between natural language and mathematical symbols: https://willcrichton.net/nota/

I'll be adding an "Examples" page with more interesting content soon.

The reason for using React is that it provides a nice model for components that encapsulate state and interactivity, so a user can just write "@Component{contents}" and the component author can deal with all the tricky UI issues. A design goal for Nota is to be as declarative as possible, while still integrating nicely with Javascript.

danmur · on May 12, 2022

Thanks, it's very cool :)

awinter-py · on May 12, 2022

file formats are really important -- every saas app in 2017 would have been a file format in 1996, and this is an underappreciated reason that computers generally worked better in 96 vs now

file formats, unlike apps, can be declarative. (or pretend to be, but even 'pretend declarative' leads to good tool integrations)

of course saas apps have collaboration and messaging, and file formats don't; whoever figures out how to do messaging in semi-declarative formats will be a player in the 'future of web' space

notreallyserio · on May 12, 2022

FWIW: every time you double tap a Nota link something is added to the browser history, at least in mobile Safari, which impacts back button usage by "trapping" you on the page.

wcrichton · on May 12, 2022

Thanks for the bug report, I will fix that.

sm1ch · on May 12, 2022

The aim seems similar to Quarto (https://quarto.org/) - an "open-source scientific and technical publishing system built on Pandoc" which "create[s] dynamic content with Python, R, Julia, and Observable". @wcrichton how does Nota relate to Quarto and similar prior efforts?

wcrichton · on May 12, 2022

Quarto is a good example where their design requires users to understand many different DSLs to achieve the end result. Just in their code sample you have:

* YAML for the page metadata (title: ...)

* Markdown (```{python}...```)

* Custom in-text notations (@fig-polar)

* Custom markdown notations (#| label: fig-polar)

Whereas in Nota, this would look like:

    %(export let metadata = {
      title: "matplotlib demo",
      format: {html: {codeFold: true}},
      jupyter: "python3"
    })

    For a demonstration of a line plot on a polar axis, see @Ref{fig-polar}.

    @Code[language=Python][label="fig-polar"][figCap="A line plot on a polar axis"]{
      import numpy as np
      import matplotlib.pyplot as plt

      r = np.arange(0, 2, 0.01)
      theta = 2 * np.pi * r
      fig, ax = plt.subplots(
        subplot_kw = {'projection': 'polar'} 
      )
      ax.plot(theta, r)
      ax.set_rticks([0.5, 1, 1.5, 2])
      ax.grid(True)
      plt.show()
    }

When code or structured data is involved, Nota is "just Javascript" (e.g. exporting metadata, specifying code literals). When text is involved, Nota provides a small core set of sigils like @Component[attr=value]{content}.

Note that Quarto's "#| fig-cap" annotation probably doesn't allow for rich text captions, since it only allows strings. But Nota allows deep composition, so you could do:

    @Code[figCap=@{A @strong{line plot} on a @em{polar axis}}]{
      ...
    }

sm1ch · on May 13, 2022

Thanks! Your answer also suggests a tradeoff: the academics around me would fairly easily get used to YAML/md/etc but would struggle to internalize `let`, `export` and similar notions needed just to declare metadata in your example `%(export let metadata = {`. So it looks like Quarto caters to this population too, at the cost of being less streamlined for those wanting to include code and structured instructions, whereas Nota is more natural for coders at the cost of potentially losing the non-coding academics. Fair?

jjallr · on May 12, 2022

FWIW Quarto does allow you to include arbitrary markdown in fig-cap. So this is valid:

    #| fig-cap: A **line plot** on a _polar axis_

martopix · on May 12, 2022

Cool idea but the syntax is as cluttered as Latex's. I wish it was based on markdown with added features.

jrm4 · on May 12, 2022

Came for this.

There's probably a phrase for what I'm describing but that markdown does well -- something like, keep the "source" as clean as possible and only invoke "code" as absolutely needed. Moreover, strive to be human friendly when you do that.

WolfOliver · on May 12, 2022

Is there a toolchain to render PDFs from a Nota document?

I'm thinking if I could make use of this in MonsterWriter. The export in MonsterWriter is based on LaTex which is a little tricky to handle.

Props for building this project, and props for making it a MIT license!

wcrichton · on May 12, 2022

Not yet, but that is a high priority TODO. The simplest toolchain is just "Page Print > Save to PDF", but I need to first improve the printer-oriented CSS.

There's a broader issue of layout, like if you want to print a Nota document and a figure is split across two pages. I'm not sure how to handle that, or if it even makes sense to. The goal of Nota is to move away from the paper document model, although people of course still like to print things and scribble on them.

destedexplan · on May 12, 2022

Would be nice to think about how it compares to something like Pandoc which -- as far as I can tell from the Tutorial -- has all the expressiveness Nota has without being tied down to a particular output stack.

Also Nota doesn't absolve you of having to learn React, JS, etc. which is fine. Lightweight markup often winds up being a DSL over some more powerful underlying stack. But the goal is to separate authoring from styling. I.e. at the authoring stage you don't need to specify how a sidenote or a definition list would be implemented. Nota seems to mix these up.

wcrichton · on May 12, 2022

AFAIK Pandoc doesn't really have a story for interaction. Like, "show a tooltip when someone clicks on this text". That's the main value add for Nota.

Nota doesn't require users to specify the implementations of things like footnotes. The goal is the user just says "@Footnote{content}" and the document renderer takes care of all the layout. And you can swap in different implementations of the Footnote component.

eitland · on May 12, 2022

What this needs IMNSHO is guarantees that once the document is loaded it won't do anything funny.

My preferred approach is a 1. small subset of modern HTML and css 2. (optional) that can link to a common package of JS to do things like partial updates ans autocomplete.

Main point is: Once the page has loaded nothing goes on in the background unless the user is explicitly changing the document, i.e. regardless of how many such pages you open, your CPU quickly goes to 0 and stay there.

So this could work too.

wcrichton · on May 12, 2022

A goal for Nota is to gracefully degrade in the absence of Javascript. This already mostly works, since we do server-side rendering to fully generate the HTML contents of the document before providing it to the client. When you first visit nota-lang.org or willcrichton.net you are seeing the pre-rendered document.

yodon · on May 12, 2022

Is there a security model for preventing injection attacks? I'm thinking of content-based injection attacks like those that try to break out of language parser enclosures in HTML or Liquid or SQL or etc.

Presumably the responsibility would largely fall on the component authors but I was curious if there were any language-level threat model or mitigation approach in mind here.

wcrichton · on May 12, 2022

There isn't, and I would say Nota is not suitable for e.g. a commenting syntax on a forum. It's intended for blog posts, papers, or other documents coming from more trusted sources.

teg4n_ · on May 12, 2022

It’s certainly interesting but I am having a difficult time seeing the benefit over simple html and pre made web components

foobarandlmj · on May 12, 2022

Thank you for taking the time to build this.

ssl232 · on May 12, 2022

I imagine it will be useful to be able to embed Jupyter widgets to provide the viewer with interactive graphics. Is this possible?

gloosx · on May 13, 2022

Isn't HTML a standard document lanuguage for the Browser? what's the point in changing <ul> to @ul{}?

oxff · on May 12, 2022

> You need Node

garfield_walking_away.jpeg

kseistrup · on May 12, 2022

I get associations to the AmigaDoc document format.

kats · on May 12, 2022

We already have that and it's called HTML... goodness gracious...