The Algorithms Behind Probabilistic Programming

sn6uv · on Jan 30, 2017

Nit

> The distribution must be smooth (this method doesn’t work for discrete parameters), but it need not be analytically differentiable

'Smooth' and 'analytic' both have basic technical meanings in the context of differentiability [1], [2].

Glossing over the assumptions in this way is quite misleading

[1] https://en.m.wikipedia.org/wiki/Smoothness [2] https://en.m.wikipedia.org/wiki/Analytic_function

mr_pink · on Jan 30, 2017

Analytically differentiable (being able to derive a formula for the derivative) is not the same thing at all as analytic functions of complex analysis (which are infinitely differentiable, etc). If anything, the term "analytic function" seems more of a misnomer to me, but "analytically differentiable" is used in its proper sense in the article.

joe_the_user · on Jan 30, 2017

Nice,

The algorithms the article define seem to justify seeing probabilistic programming as an advance beyond twenty line programs that call 20K sized libraries - basically, these algorithm inherently auto-tune a lot of higher level operations so these can be less-leaky abstractions.

3pt14159 · on Jan 31, 2017

It's taking a long time to come out, but the probabilistic approach I think is the most interesting is BayesDB:

http://probcomp.csail.mit.edu/bayesdb/

Almost always what I want is closer to SQL-like questions that I later want to manipulate with languages like Ruby or Python, rather than having to load all the data into a giant array and then have to use probabilistic programming on the whole damn thing.

adamnemecek · on Jan 30, 2017

You should check out http://webppl.org/, you implement your own probabilistic programming language in javascript.

riazrizvi · on Jan 30, 2017

Interesting marketing. It seems to be an API for statistical functions and data. Though the term Probabilistic Programming sugggests a new programming paradigm, which it is not. If I'm not missing something, the term makes a dry subject sound cool.

Cybiote · on Jan 31, 2017

It is a different paradigm in the same sense that Logic programming is a different paradigm. It's not just some API or library you can start calling into, it's a tool which deeply changes how you approach your problem. You need to be comfortable and able to reason with probabilistic concepts^.

Another way to look at it is, similar to how you can have the concept of a program as a differentiable function, you can have probabilistic programs as defining a distribution. Their utility is the ease with which they allow you to model complex probability distributions and then condition on observations. With that, you can do inference.

If you've heard of bayesian networks or graphical models, PPs can generalize these. It's just a few lines to do hidden markov models, bayesian regression, gaussian processes, gaussian mixture models and LDA topic models. Their downside is that performing inference has been less efficient than writing these models out by hand. We know that the ideas are powerful as they produce generative models which work off less data and naturally incorporate uncertainty.

The dream is, just as a more expressive language should have less patterns, probabilistic programming will one day make writing complex generative models so routine people won't bother specially naming them as this or that machine learning algorithm.

^ The kind you find in a book like this: http://www.mbmlbook.com/toc.html

williamsmj · on Jan 30, 2017

There's much more to it than you suggest, but if marketing bothers you then check out the MIT Probabilistic Programming Project's talk at Strange Loop 2015: https://www.youtube.com/watch?v=-8QMqSWU76Q.

Probabilistic programming is an area of huge academic activity, with deep connections to statistical inference, compiler design and, yes, UX.

cscurmudgeon · on Jan 31, 2017

Academia is not exempt from marketing. And not all marketing is bad. :)

Remember the origin of the term "dynamic programming"?

> "I spent the Fall quarter (of 1950) at RAND. My first task was to find a name for multistage decision processes. An interesting question is, Where did the name, dynamic programming, come from? The 1950s were not good years for mathematical research. We had a very interesting gentleman in Washington named Wilson. He was Secretary of Defense, and he actually had a pathological fear and hatred of the word research. I’m not using the term lightly; I’m using it precisely. His face would suffuse, he would turn red, and he would get violent if people used the term research in his presence. You can imagine how he felt, then, about the term mathematical. The RAND Corporation was employed by the Air Force, and the Air Force had Wilson as its boss, essentially. Hence, I felt I had to do something to shield Wilson and the Air Force from the fact that I was really doing mathematics inside the RAND Corporation. What title, what name, could I choose? In the first place I was interested in planning, in decision making, in thinking. But planning, is not a good word for various reasons. I decided therefore to use the word “programming”. I wanted to get across the idea that this was dynamic, this was multistage, this was time-varying. I thought, let's kill two birds with one stone. Let's take a word that has an absolutely precise meaning, namely dynamic, in the classical physical sense. It also has a very interesting property as an adjective, and that it's impossible to use the word dynamic in a pejorative sense. Try thinking of some combination that will possibly give it a pejorative meaning. It's impossible. Thus, I thought dynamic programming was a good name. It was something not even a Congressman could object to. So I used it as an umbrella for my activities."

https://en.wikipedia.org/wiki/Dynamic_programming#History

riazrizvi · on Jan 30, 2017

Oh I see. It is now a recognized application field in CS like Big Data, due to maturity of a suite of different tools from APIs to data engines.

marmaduke · on Jan 31, 2017

> Probabilistic Programming sugggests a new programming paradigm

PP itself is not just an API for stat functions and data. It lets you write a generative, forward program, and then you run the program on outputs (data), and it generates probable inputs (parameters). It's similar to constraint programming paradigm, but based on probabilities.

chiph · on Jan 31, 2017

> Probabilistic Programming

Reminds me of myself, early in my career. "Well, it probably works..."

mr_pink · on Jan 31, 2017

> sugggests a new programming paradigm, which it is not

No, you're just dismissing it without understanding it first. Which seems super common on this forum.

davidw · on Jan 30, 2017

Why is this flagged?! Cursory glance at the article looked like perfect HN material.

laxatives · on Jan 31, 2017

The article consists of a list of algorithms with almost no other information. Its posted because it has a link to the OP's own blog.

platz · on Jan 31, 2017

It was helpful to me, it put these algorithms in context and explained why there are useful over other methods. Were you expecting a graduate level course embedded in the post?

misiti3780 · on Jan 30, 2017

nice article - please change your font - it makes reading those blog post very difficult

hmottestad · on Jan 30, 2017

I also did not like the font, hard to say why. It definitely is not designed with being legible in mind. Feels very unbalanced. I think the biggest issue is with the distracting curves, and varying ways of drawing the various curves.

Kerning is also messed up, the distance between the letters is too large in comparison to the distance between the words.

Thirdegree · on Jan 30, 2017

It feel like a normal font that squished each character into a slightly-too-small box.

aswanson · on Jan 30, 2017

Hilarious, I was just going to comment on how cool/80s retro I thought the fonts were.

pliftkl · on Jan 31, 2017

Glad I'm not the only one. It's not often that I run into fonts that slow me down, but that hurt to read.

ntumlin · on Jan 30, 2017

What did you dislike about the font? I had no trouble reading it.

sp332 · on Jan 30, 2017

It has a lot of straight lines where my eyes expect curves. It really slowed me down reading, so I switched to readability mode.