This is dangerous. Instead of acquiring understanding, a tool like this lets the programmer just write code that passes the tests. Without a deep understanding of the algorithm, you can't write sufficiently thorough tests to assess the program's correctness. TDD wouldn't save you here, because instead of being a tool of discipline it becomes a list of instructions for codegen.
Would anyone want to ride in a car or fly in a plane whose software was written from a set of sporadic tests? I wouldn't risk it.
Of course nobody should write production software with this, that's why the project loudly says that this is a PROTOTYPE. It is experimental software.
These kinds of tools are the future IMO. It's a higher layer of abstraction. Most programmers have no idea how their programs are actually executed: we write in a high-level language and rely on voodoo to execute it, without a "deep understanding" of how it's really run, and without writing instruction-level tests. And it works great. I don't see how this is theoretically different; it's just a higher level of abstraction. Of course, its utility greatly depends on the practicality of its implementation (which includes performance)...
> it becomes a list of instructions for codegen
That describes exactly the everyday programming languages that we all use.
Fun fact: when you board a plane, you have no idea how good its hardware, software, and test suite are beyond the common "it didn't crash n thousand times before" knowledge.
Given how large those systems are, how many layers of code (OS included) there are, and the turnover in software companies, I am pretty confident that no single engineer knows everything about them.
"engineers" - it's been decades since anyone on the planet could know everything about a whole system, but each engineer knows everything about their subsystem
Which is why I won't fly on certain airlines :-).
As a Software Engineer who worked at an avionics company, I know who our big customers were and what bugs are in the products they bought, so I avoid them whenever possible.
most hardware, and plenty of software for that hardware, is sourced from all around the world... the engineers involved make assumptions about how that stuff should behave, perhaps even run it through tests of their own... but they hardly know _everything_ about it
You might be surprised by how much safety critical software is built by code generation tools like SCADE. It's not entirely the same thing, but the programmer is pretty removed from the final product and it is very spec driven.
As a TDD practitioner, I really don't care too much about the implementation as long as my tests pass. Performance, however, is another issue, but that's something that can be optimized and rewritten into something like this depending on the language.
I've felt for a while that it's a matter of time before we end up with a development environment that - given a collection of tests - will automatically generate a program which passes all the tests. Barliman looks like a really cool step in that direction, even with its stated limitations.
For some reason the first thing that popped into my head when I read this was fuzzers like American Fuzzy Lop. I wonder if you could use the same sort of stochastic exploration to explore the space of possible programs satisfying a set of tests, rather than the space of possible inputs given a program?
That's called program synthesis, which is a hot research area. Specifically, what you mention is called "programming by example". I can't remember many examples off the top of my head, but some people from Microsoft Research are working on it, for example: https://microsoft.github.io/prose/
I can't Google the name right now, but this does exist as an optimization technique. Just start with any program, create random permutations, score them for correctness and performance, keep the best ones, repeat.
The problem is that this is incredibly computationally intensive, since the number of possible programs is huge. Right now it's viable for improving small pieces of code with a known-correct starting point. Maybe some day computers will become fast enough to make it more viable.
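Just to make the loop concrete, here's a toy sketch of the idea (a hypothetical illustration, not any named tool; it assumes Chez Scheme for random, filter, and eval, and names like random-prog and search are made up): programs are s-expressions in one variable x, scored by how many input/output test pairs they satisfy.

    ;; Toy stochastic program search: mutate, score, keep the best. Illustrative only.
    (define tests '((0 . 1) (1 . 3) (2 . 5)))          ; pairs (input . expected), i.e. 2x+1

    (define (run-prog prog x)                          ; evaluate prog with x bound
      (eval `(let ((x ,x)) ,prog) (interaction-environment)))

    (define (score prog)                               ; number of passing tests
      (length (filter (lambda (t) (equal? (run-prog prog (car t)) (cdr t)))
                      tests)))

    (define (pick lst) (list-ref lst (random (length lst))))

    (define (random-prog depth)                        ; random expression tree over + - *
      (if (or (zero? depth) (zero? (random 2)))
          (pick '(x 0 1 2))
          (list (pick '(+ - *)) (random-prog (- depth 1)) (random-prog (- depth 1)))))

    (define (mutate prog)                              ; random "permutation": swap in a fresh subtree
      (if (or (not (pair? prog)) (zero? (random 3)))
          (random-prog 2)
          (cons (car prog)
                (map (lambda (s) (if (zero? (random 2)) (mutate s) s))
                     (cdr prog)))))

    (define (search iterations)                        ; hill-climb, keeping the best seen
      (let loop ((best (random-prog 3)) (i 0))
        (if (or (= (score best) (length tests)) (= i iterations))
            best
            (let ((cand (mutate best)))
              (loop (if (>= (score cand) (score best)) cand best)
                    (+ i 1))))))

(search 10000) may stumble onto something equivalent to (+ (* 2 x) 1) for this toy target, but the candidate space blows up immediately for anything realistic, which is exactly the computational cost described above.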
Genetic Programming is a generalization of that. The "collection of tests" is effectively the fitness function.
The challenge is that 1) doing it for non-trivial functions, and getting results faster than much simpler generative methods that depend on heuristics, is a really hard problem (though it can work better when we don't have reasonable heuristics); 2) writing exhaustive tests for a lot of the problems we care about is likely to take more effort than writing the code in the first place.
I think if you want something like this the effort is best expended on tools that help you create a consistent, concise and exhaustive model / test-suite rather than code to implement it, with a focus on making it possible for a human to read and sanity check the generated model.
In this case, the tool is basically trying to create a model that matches the tests, it's just that it never makes the model explicit other than in the form of finished code, which prevents us from verifying that the model is correct other than by inspecting the code and/or expanding the tests.
For small functions that might be helpful, but for larger pieces of code, I think generating the code directly is likely to lead to code that is near impenetrable and impossible to validate beyond expanding the test suite. E.g. a recurring problem in genetic programming research has been that a lot of the resulting solutions are hard to understand even for very small/simple algorithms. And for bigger problems it's not unusual to end up exploiting weaknesses of the fitness function rather than solving the intended problem.
It's still an interesting project. I just think we're really far from having something with wider appeal.
More generally it's just an optimisation (the mathematical kind) where your parameters are essentially a function body. In that article the function body is a neural network, but there's no reason it has to be.
Prolog can (sometimes) "reverse execute" a specification, not generalize from examples. It's like saying "a cat has a round face and pointy ears" versus "look at these cat pictures". (Versus "draw an oval and then draw two triangles at the top...")
I wonder how far they went with Prolog. I remember examples in books about solving a problem and outputting a plan. But it wasn't a program, not in the sense of nested/modular systems; more like a linear walk.
That said, webyrd has shown a miniKanren-embedded lambda calculus (the evalo relation) used to find which programs reduce to a given value...
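For a flavor of that, a hedged sketch of the classic queries (this assumes an evalo relation, i.e. a relational interpreter for a Scheme subset, is already defined, as in Byrd's demos):

    ;; Find a program that evaluates to the list (I love you):
    (run 1 (q) (evalo q '(I love you)))
    ;; Find a quine: a program that evaluates to itself.
    (run 1 (q) (evalo q q))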
No, Prolog is nothing like that. Prolog is perhaps the sort of paradigm that could be used as one part of an implementation of something like that (as we see here with miniKanren), but Prolog itself isn't even close.
Barliman is written in miniKanren[0], a logic/relational programming system built by Daniel Friedman[1], Will Byrd[2] & Oleg Kiselyov[3]. There are implementations of miniKanren in languages other than Scheme, one of the most prominent being the one for Clojure[4].
To oversimplify, in the miniKanren world programs are written using relational logic: there are "variables" and certain "relationships" between the variables. That is the program specification. Now we can run the specification and let miniKanren generate one or more values for the variables that satisfy the relations; thus a miniKanren program can have more than one answer. One interesting side effect of this kind of abstraction is that programs can also be run backwards to generate more programs that satisfy certain relations. That's pretty much what's happening with Barliman.
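A minimal sketch of running a relation backwards, using the standard appendo relation from The Reasoned Schemer (assuming a Scheme miniKanren where run*, fresh, and == are available):

    ;; Forward: concatenate two known lists.
    (run* (q) (appendo '(1 2) '(3 4) q))
    ;; => ((1 2 3 4))

    ;; Backward: enumerate every pair of lists that concatenates to (1 2 3 4).
    (run* (q)
      (fresh (x y)
        (== q (list x y))
        (appendo x y '(1 2 3 4))))
    ;; => ((() (1 2 3 4)) ((1) (2 3 4)) ((1 2) (3 4))
    ;;     ((1 2 3) (4)) ((1 2 3 4) ()))

Barliman pushes the same trick further: the relation is an interpreter, and the "unknown" is the program itself.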
Some of you might recognize Daniel Friedman as the author of The Little Schemer. If you liked that book, you might check out The Reasoned Schemer. Short, accessible, and a bit mindbending, it offers a compelling introduction to logic programming that culminates in the "invention" of a Prolog-like DSL from basic Scheme primitives. Terrific little book. https://mitpress.mit.edu/books/reasoned-schemer
ILP + IFP (inductive logic/functional programming) are nice subjects to read about for this kind of thing. At university, a number of people believed ILP and/or neural networks and/or genetic programming would replace programmers shortly. That didn't happen (this was around 25 years ago), but it's still interesting material and great to learn from.
This combined with a robust type system should be nice. I thought it was a solid direction that would take off, but it didn't. One of my uni courses was on ILP (a long time ago indeed), and it seemed to have merit given faster computers, more advanced algorithms, and better heuristics. After that I saw Hoogle when I learned Haskell; it seems combining these things could make something powerful. I was going to post MagicHaskeller here as well, but gergoerdi did that already.
This looks pretty cool! However, I think it would be annoying to have the tests automatically running as you're typing. What happens if you accidentally create an infinite loop or run something dangerous?
Fair enough; indeed I would advise using it with caution around data that needs to be kept safe from deletion. I think of this as a nice idea for experimental forms of coding like spikes, but I wouldn't run something like this against a production database or my root file system.