Hacker News new | past | comments | ask | show | jobs | submit login
Cryptanalysis with Reasoning Systems (chriskohlhepp.wordpress.com)
85 points by saycheese on Jan 8, 2017 | hide | past | favorite | 20 comments



If you want to try solving the puzzle on your own, here's how to get started.

___

(1) Download the single image for the puzzle: https://www.gchq.gov.uk/sites/default/files/grid-shading-puz...

(2) Read: In this type of grid-shading puzzle, each square is either black or white. Some of the black squares have already been filled in for you. Each row or column is labelled with a string of numbers. The numbers indicate the length of all consecutive runs of black squares, and are displayed in the order that the runs appear in that line. For example, a label "2 1 6" indicates sets of two, one and six black squares, each of which will have at least one white square separating them.

___

Source: "A Christmas card with a cryptographic twist for charity" (2015)

https://www.gchq.gov.uk/news-article/christmas-card-cryptogr...

___

SPOILER-1: The above puzzle links to other puzzles, which maybe found here:

https://www.gchq.gov.uk/puzz

SPOILER-2: GCHQ's solutions to all of the puzzles is here: https://www.gchq.gov.uk/directors-christmas-puzzle-answers


This is somewhat interesting, but some of the terminology is idiosyncratic ("lambda closures") to the point of being downright confusing ("attack vector").

The pre-analysis in the derive-known-plaintext is particularly interesting because it's the key to making this tractable. But pieces are missing... Does anyone understand this well enough to explain what known-plaintext and matrix-match would be? I can't even work out their types with confidence.


OK, I took what I understood the idea of this preprocessing to be and implemented it in Prolog (257 lines including data and comments, maybe 100 lines of "real" code): http://pastebin.com/fLXKRKQ6

Some findings:

- I was wondering why the linked blog post didn't show the results of derive-known-plaintext (what I call "preprocessing"). It turns out, this may be because this forward propagation solves the puzzle completely. There is no need for backtracking search, so there is no need for the whole fancy Screamer framework.

- Since preprocessing solves the problem completely, applying the constraints to its result to "finalize" the solution is instantaneous in Prolog. With Screamer this takes 3 seconds. Screamer doesn't seem to be all that impressive... At least not for this particular problem.

- In the case where there are no initial marks on the board, after preprocessing having marked almost everything, Prolog finds the four possible solutions in 0.037 seconds on my 2015 laptop. Preprocessing takes a second.

Let me know if you have questions!


Author here again: and thank you Tom for your feedback. I'm not the author of Screamer so am not going to defend it's runtime properties. These were not of interest to me and yes I could recode much of what I did in C++ which I am more familiar with than Prolog. I work in low latency trading, so I am happy to take any challenge from Prolog in C++. Bummer would be that I could not publish my solution. The reason Screamer is in there is to allow the problem to be expressed declaratively in terms of universal quantification. See also

https://chriskohlhepp.wordpress.com/reasoning-systems/specif...


> The reason Screamer is in there is to allow the problem to be expressed declaratively in terms of universal quantification.

Sure, though this particular example doesn't show its strengths very well. Your pre-analysis is so powerful that even in the case where you start with the empty board, any hand-written primitive brute force enumeration technique should be blazingly fast.

> I work in low latency trading, so I am happy to take any challenge from Prolog in C++.

I'm not saying Prolog is always faster than C++ or Common Lisp :-) I guess I'm saying that every embedding of a Prolog-like system in some other language I have seen so far was less powerful, less elegant, and much less performant than Prolog. This is only partly gloating from a Prolog fan. Another part of me would very much like to have better such systems integrated with other languages.

One drawback of Prolog, of course, is that Prolog can be a bit painful to interface with the rest of your application. Unless, of course, you write everything in Prolog...



My suggestion would just be to email the author; his email in the text of the link below and right above "Licensed under the GNU General Public License", but after the word email:

https://github.com/chriskmanx/demystifycpp/blob/master/READM...


I was poking around for a GitHub page, a mail address, or a comment feature, but gave up more quickly than you did :-)

In the end I now left a comment at the overview post for this series of blog posts at https://chriskohlhepp.wordpress.com/2016/12/06/cryptanalysis... because comments seem to be disabled on the featured post itself.


Author here: Comments should be enabled. I have replied to Tom on the blog - right after his comment there.


Author here: am happy to put the code up on Github if people are interested.


Author here: Ok thank you Tom for your feedback. "Lambda closures" is a term that is ubiquitous in C++, chiefly owing to how closures are implemented in that language. Sorry if that confused non developers. An "attack vector" is any method, approach or means by which a analyst gains an advantage against the security of a system. A more formal definition maybe found here. http://searchsecurity.techtarget.com/definition/attack-vecto... I broadly used the term to denote building up known plaintext ahead of the application of the solver.


Most interesting read I had on HN this week. I suggest reading it end to end!


that's a great IDEA


That's a great example of using a problem solver system to solve a constraint satisfaction problem! It was a fun read.

One nit to pick with the terminology though: solving a cryptogram puzzle is _not the same thing_ as doing cryptanalysis, eg to determine what constitutes a weak DES key. Not even the same ballpark. ;)


Author here: Hello 60654, I fully understand the difference between cracking a run length encoding problem and finding weaknesses in DES. There are many tools at the disposal of a security analyst. Another would be CSP and FDR: https://www.google.com.au/url?sa=t&source=web&rct=j&url=http.... However, that article, like mine, goes to show how logic programming can very well be used to mount cryptanalytic attacks. It differs from quantitative statistical methods as these exploit different weaknesses. Qualitative vs quantitative. Most of the people with skills in the latter work in quantitative trading and hedge funds - including yours truly.


Does anyone know what the name of the color scheme the author is using in their terminal?


That's the default emacs color scheme. The terminal is just SLIME.

https://common-lisp.net/project/slime/


thank you!


Author here: Colour theme is "solarized dark" using Emacs.


Correction, for that article I disabled the theme. The other poster is correct in that syntax highlighting is from SLIME.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: