RegEx Crossword

reificator · on March 12, 2021

So many of these start or end with an `x*`, `(x|y)`, `[xy]` or `[^xy]` that I only see two characters I can fill in before I need to start looking at multiple constraints for the same cell and either doing combinatorics or guess-and-check.

This might be fine for hard mode, but as someone who considers themselves a regexpert it's not very approachable as a first puzzle IMO.

A more gradual introduction to the format would be to give a few clues that give you confidence on specific characters, that then let you lock in some other characters in other hints, and so on.

For instance, replacing `.*H.*V.*G.*` with `.{3}H.*V.*G.*` would go a long way because you could confidently place an `H`. And say that intersected with `(DI|NS|TH|OM)*` on the `H`, you could then place a `T` from the second clue because of what you learned from the first clue.
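The point about `.{3}H` pinning a cell can be checked by brute force. The sketch below is only an illustration: the helper name `forced_cells`, the line length of 7, and the reduced alphabet `HVGA` are assumptions made here, not details taken from the puzzle.

```python
import re
from itertools import product

def forced_cells(pattern, length, alphabet):
    """Return {index: char} for positions where every full match agrees."""
    compiled = re.compile(pattern)
    seen = [set() for _ in range(length)]  # characters seen at each index
    for chars in product(alphabet, repeat=length):
        text = "".join(chars)
        if compiled.fullmatch(text):
            for i, ch in enumerate(text):
                seen[i].add(ch)
    # A cell is "forced" when exactly one character ever appears there.
    return {i: next(iter(s)) for i, s in enumerate(seen) if len(s) == 1}

# Hypothetical 7-cell line over a reduced alphabet.
print(forced_cells(r".*H.*V.*G.*", 7, "HVGA"))
print(forced_cells(r".{3}H.*V.*G.*", 7, "HVGA"))
```

With these assumptions the first clue forces no cell at all, while the second forces only the `H` at index 3, which is the kind of anchor point the comment asks for.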

It could just be that I'm missing something or not as good at regex as I thought, and please let me know if that's the case. Either way though, when I'm trying a new kind of puzzle I'd like to feel like I made some sort of progress after trying for 5-10 minutes, and here 2 chars does not feel like progress.

afranchuk · on March 12, 2021

I just completed it, and can say with certainty that it is solvable by only taking into consideration two constraints at any given time, with the exception of 3 at just one point early on (and they were the easier constraints in the puzzle). That being said, the nature of regex means you kind of need to jump around as far as which constraints you combine.

reificator · on March 12, 2021

I don't doubt that, and I don't doubt it's a good puzzle, especially if you're already familiar with the format.

But since this is my first introduction to this kind of puzzle, I need some anchor points at the beginning so I feel like I have something to work off of.

I'm not even asking for a whole row, just an easier set of known chars at the start of the round so I have a hint at which of the 39 constraints I should start with.

To be honest I'm not even saying this puzzle should change so much as I am looking for a different puzzle to dip my toes.

I don't have any interest in starting the puzzle if I don't feel like I can put a foot down somewhere. It's like trying your first Minesweeper game, making two random clicks, and getting two `7`s. Where do you go from there? Or learning Sudoku from the hardest difficulty level, without having built up a library of patterns from the easier difficulties.

vcxy · on March 12, 2021

I did complete and enjoy it, but I'm both very competent with regex, like you, and big into sudoku variants (as in the YouTube channel Cracking the Cryptic [1]), so this felt like it was designed for me to enjoy. Considering it took me about half an hour, I'd expect someone who isn't already into these kinds of odd puzzles to have basically exactly your reaction.

[1] https://www.youtube.com/c/CrackingTheCryptic

reificator · on March 12, 2021

I don't watch regularly (and I would consider myself a sudoku dabbler) but I've seen some Cracking the Cryptic videos in the past and enjoyed them greatly.

afranchuk · on March 13, 2021

Yeah I definitely get that. I've never done a puzzle of this type before, but I have done a hell of a lot of logic puzzles (thanks to the Simon Tatham collection among others) so I was able to figure out a good attack vector. I agree, it wasn't easy to find where to start nor where to make progress in the beginning. Lots of data to ingest.

teawrecks · on March 12, 2021

All the ones that start/end with a constant can be filled in immediately. This forces some others that can only be certain strings, ex. (RR|HHHH)*.?

EGreg · on March 12, 2021

Is the complexity basically NP-hard, equivalent to a SAT solver or even harder?

wbl · on March 12, 2021

It's clearly in NP. One way to solve it is to order the squares in some order and combine all the NFAs in some nasty wreath product construction. Then we seek an accepting string. While this has an exponential state size blowup you may be able to construct lazily in the BFS and perhaps that keeps the complexity down.
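For a sense of what the search space looks like, here is a much simpler approach than the automaton construction described above: plain brute force over a small rectangular grid. It is a sketch only. The function `solve`, the 2x2 clues, and the alphabet `ABCD` are all hypothetical, and the running time grows as (alphabet size) to the power of (number of cells).

```python
import re
from itertools import product

def solve(row_clues, col_clues, alphabet):
    """Return every grid (as a tuple of row strings) satisfying all clues."""
    n_rows, n_cols = len(row_clues), len(col_clues)
    row_res = [re.compile(c) for c in row_clues]
    col_res = [re.compile(c) for c in col_clues]
    solutions = []
    for cells in product(alphabet, repeat=n_rows * n_cols):
        rows = ["".join(cells[r * n_cols:(r + 1) * n_cols]) for r in range(n_rows)]
        cols = ["".join(row[c] for row in rows) for c in range(n_cols)]
        # Every clue must match its whole line, not just a substring.
        if all(p.fullmatch(s) for p, s in zip(row_res, rows)) and \
           all(p.fullmatch(s) for p, s in zip(col_res, cols)):
            solutions.append(tuple(rows))
    return solutions

# Hypothetical 2x2 puzzle: two row clues, two column clues.
print(solve([r"AB|CD", r"[BC]A"], [r"A[^AB]", r"(B|D)A"], "ABCD"))
```

Checking a proposed grid is cheap, but this enumeration would be hopeless on the full hexagonal puzzle, which is why the lazy construction mentioned above, or human-style deduction, matters.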

cammil · on March 13, 2021

Depends what you think N is doesn't it? What would be a variable parameter in this puzzle? The number of letters in the alphabet? The size of the regex expressions? The size of the board?

dandanua · on March 12, 2021

Nonograms are NP-complete, so this type of puzzle is also NP-complete.

ladberg · on March 12, 2021

Nope, it's basic regex so not NP-hard.

teawrecks · on March 12, 2021

But it's a satisfiability problem. Seems NP-complete in the general case to me.

ladberg · on March 12, 2021

Huh yeah that could be right... I'll have to think about it more.

xg15 · on March 13, 2021

I fully agree. I think it's a really interesting concept but it's missing a bit of game design skill so far.

I had the same experience when I tried it yesterday. There doesn't seem to be any good "starting point", like there is on a traditional crossword puzzle or sudoku.

I think from a game design perspective it is really interesting though: Like sudoku, this kind of puzzle gives you a wide range of options to achieve different player experiences and difficulty levels: You can make easy levels by mostly using constant-width regexes and non-conditional letters and you can slowly increase the difficulty level by making regexes less constrained and more ambiguous. A designer could even craft specific "paths" through the puzzle by combining easy and hard regexes.

Finally, a designer could gradually introduce more complex regex features (or other patterns, like multiple constraints) over successive levels.

ilikepi · on March 13, 2021

There are actually four spaces that can be filled by accounting for only a single pattern. Two of those are at the end instead of the beginning.

I agree it definitely feels imposing when you first look at it, but stick with it. Look for spaces that have a very small set of possibilities, and then try to map out what neighboring spaces could have for each possibility. If you really feel stuck, take a screenshot and mark it up.

andrelaszlo · on March 13, 2021

https://regexcrossword.com/ has a lot of great puzzles with a gentler learning curve, but I liked the challenge of this one, so I think it would have spoiled the fun a bit for me.

ryangittins · on March 12, 2021

Agreed. Some of them can be simplified quite a bit, which leads to a lot of clutter. For instance,

    (XHH|[^XH])*

is equivalent to just

    .*

EDIT: It looks like I read the regex wrong. I guess I need to do the puzzle after all!

rav · on March 12, 2021

I found other patterns that I thought were unsimplified, but (spoiler alert) in the end it turned out that all of the seemingly-unnecessary details were important for the solution! E.g.

    [^C]*[^R]*

LOOKS unsimplified, but it actually means that if the string contains an R, the preceding characters CANNOT be C.

justusthane · on March 13, 2021

Or if the string contains a C, a subsequent character can't be an R?

Doesn't it really just mean a string can't contain a C followed by an R?

DavidSJ · on March 13, 2021

Yes, those are three ways of saying the same thing. :)
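The claim that `[^C]*[^R]*` means "no C anywhere before a later R" can be tested exhaustively on short strings. This is a sketch under assumptions: the helper names, the reduced alphabet `CRA`, and the length cap of 6 are chosen here for illustration.

```python
import re
from itertools import product

PATTERN = re.compile(r"[^C]*[^R]*")

def has_c_before_r(text):
    """True if some C is followed, anywhere later in the string, by an R."""
    first_c = text.find("C")
    return first_c != -1 and "R" in text[first_c + 1:]

def find_counterexample(alphabet="CRA", max_len=6):
    """Return a string where the regex and the plain-English rule disagree."""
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            text = "".join(chars)
            matches = PATTERN.fullmatch(text) is not None
            if matches == has_c_before_r(text):
                return text
    return None

print(find_counterexample())
```

No counterexample turns up in that range, which agrees with the reading that all three descriptions describe the same set of strings.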

justusthane · on March 13, 2021

Not really. Maybe I'm being overly pedantic, but my point was that the first two ways of describing it are overly specific and don't include all the strings that can be matched. Only the third option really describes all the possibilities.

DavidSJ · on March 13, 2021

What is an example of a string that can be matched which does not meet the first or second description?

justusthane · on March 13, 2021

Sorry, you're right. I'm not sure what I was thinking.

Guvante · on March 12, 2021

("The sequence XHH" or "any character but X or H") repeated 0 or more times is not the same thing.

It is "If any of X or H appear they appear in a sequence exactly matching XHH".

ladberg · on March 12, 2021

Those are not equivalent, I think you're thinking of [XHH] instead of XHH.

ryangittins · on March 12, 2021

Thanks, that's exactly how I read it.

anderskaseorg · on March 12, 2021

No, (X|H|H|[^XH])* or ([XHH]|[^XH])* would be equivalent to .*, but (XHH|[^XH])* requires that XHH appear consecutively in that order whenever they appear at all.
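A quick way to see the difference between the literal sequence and the character class is to run both. This is a sketch: the constant and helper names and the reduced alphabet `XHA` are assumptions made here, and full-line matching is assumed as in the puzzle.

```python
import re
from itertools import product

SEQUENCE = re.compile(r"(XHH|[^XH])*")      # literal sequence X, H, H
CHAR_CLASS = re.compile(r"([XHH]|[^XH])*")  # [XHH] is just the class [XH]
ANYTHING = re.compile(r".*")

def strings_up_to(alphabet, max_len):
    """Yield every string over the alphabet with length 0..max_len."""
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            yield "".join(chars)

def class_version_equals_dot_star(alphabet="XHA", max_len=5):
    """Check that the character-class version accepts exactly what .* accepts."""
    return all(
        (CHAR_CLASS.fullmatch(s) is None) == (ANYTHING.fullmatch(s) is None)
        for s in strings_up_to(alphabet, max_len)
    )

print(class_version_equals_dot_star())
print(bool(SEQUENCE.fullmatch("AXHHA")), bool(SEQUENCE.fullmatch("AXHA")))
```

The sequence version accepts `AXHHA` but rejects `AXHA`, because an `X` may only appear as the start of `XHH`.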

ehsankia · on March 12, 2021

Small feature request for this implementation: Highlight/bold the three impacted regex when selecting a given hexagon, trying to trace it manually by eye is a bit of a pain.

(I guess I could PR on github when I get some time later today)

ehsankia · on March 13, 2021

https://github.com/Jimbly/regex-crossword/pull/10

wutbrodo · on March 13, 2021

Thanks for the PR! I noticed that little detail and probably wouldn't have completed the puzzle without it.

beaconstudios · on March 13, 2021

I was just finishing my implementation of this in the console when I saw your comment!

Jimbly · on March 13, 2021

Merged, thanks!

TheCycoONE · on March 13, 2021

Unfortunately they mixed up the coordinates a little. I submitted a fix.

adrianmonk · on March 13, 2021

Your fix is appreciated. It's much nicer not to have to ignore the label highlighting across the top when clicking in rows below the middle.

anderskaseorg · on March 12, 2021

This puzzle was originally written by Dan Gulotta for the 2013 MIT Mystery Hunt.

https://web.mit.edu/puzzle/www/2013/coinheist.com/rubik/a_re...

anderskaseorg · on March 13, 2021

(I got this attribution fixed upstream: https://github.com/Jimbly/regex-crossword/pull/7.)

Jimbly · on March 13, 2021

Huh, well, this explains why my 8-year old repo is suddenly getting a bunch of comments and PRs...

andrelaszlo · on March 13, 2021

I was so happy when I found it again! I was looking through my pictures folder and found this thing that I made the first time I saw your implementation https://helvetet.com/_filedump/crossword.gif

The ones on https://regexcrossword.com/ are nice but this is the original, for me at least!

de_nied · on March 13, 2021

Recycling isn't limited to plastics.

de6u99er · on March 13, 2021

Great idea. Love it!

hk__2 · on March 12, 2021

See also https://regexcrossword.com/, which has been discussed a lot on HN.

carstenhag · on March 13, 2021

Also, my android version of it:

https://play.google.com/store/apps/details?id=de.chagemann.r...

https://gitlab.com/carstenhag/regex-crossword-kotlin

mvolfik · on March 13, 2021

I hate you. I've just spent way more time there than I'd want to admit

jl6 · on March 12, 2021

Brilliant. I wonder how it was made?

I solved this in about 1.5 hours by starting at the top left, entering any string that satisfied at least one condition, then moving on to the next condition and “fixing up” any previous entries. I was fearful that I might arrive at a nearly correct solution that I would have to massively backtrack from, but it didn’t happen - I only needed a few short backtracks. I think the large number of constraints helps a lot.

schoen · on March 12, 2021

See this comment

https://news.ycombinator.com/item?id=26439598

for proper credit to the original author (and place of publication).

mypalmike · on March 13, 2021

Spent a similar amount of time... Ended up with a "solution" that breaks 1 constraint. Must have erred somewhere, but backtracking is only leading to other 1-broken constraint outcomes. Sigh.

schoen · on March 13, 2021

Official solution from the original author: https://web.mit.edu/puzzle/www/2013/coinheist.com/rubik/a_re...

lkbm · on March 13, 2021

Oooh, I was stuck because I was insisting on `.(1)(2)(3)(4)\4\3\2\1.` having the middle eight characters mirror each other, which results in a clear contradiction with a few of the crosses.

Thanks for posting the answers. I think I'm done working on it for now, but that one was really bothering me.

darkerside · on March 13, 2021

This is how I interpret it, too. What am I missing?

Jimbly · on March 13, 2021

Because of the ".*" on either end, it's just having a sequence of 8 somewhere that are mirrored, could be the first 8 or last 8, not strictly the middle 8.
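The effect of the leading and trailing `.*` can be seen by comparing the two readings on made-up lines. This is a sketch: the 10-character test strings and the names `LOOSE` and `STRICT` are invented here and are not taken from the puzzle grid.

```python
import re

# With .* on both ends, the mirrored run of 8 may sit anywhere in the line.
LOOSE = re.compile(r".*(.)(.)(.)(.)\4\3\2\1.*")
# With single dots, the mirrored run is pinned to the middle eight of ten.
STRICT = re.compile(r".(.)(.)(.)(.)\4\3\2\1.")

early = "ABCDDCBAXY"   # mirrored run occupies the first eight characters
middle = "XABCDDCBAY"  # mirrored run occupies the middle eight characters

for text in (early, middle):
    print(text, bool(LOOSE.fullmatch(text)), bool(STRICT.fullmatch(text)))
```

Both strings satisfy the loose pattern, but only `middle` satisfies the strict one, which is the over-constrained reading that led to the contradiction described above.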

lkbm · on March 14, 2021

Yeah, sorry for the bad formatting. I probably should've taken the time to make it not markdownify the asterisks. Without them, it is mirroring the middle eight.

darkerside · on March 14, 2021

I misread the parent post and I was interpreting the clue correctly after all. Thanks for helping clarify. It was a fun puzzle.

mypalmike · on March 13, 2021

What's the fun in that? :-P

Thanks though. I tried again on my laptop and got it - was a bit easier with a larger display than on my phone.

tzury · on March 12, 2021

Regex Golf used to be my game

https://alf.nu/RegexGolf

walnut_eater · on March 20, 2021

Hmm, I can't quite figure out the syntax. It seems to do partial matching and it ignores my use of ".*".

ineptech · on March 12, 2021

This is an absolute classic! It is not just an exercise in tedium or repetition, it has an internal progression that makes it very satisfying to solve.

cammil · on March 13, 2021

Just the right amount of difficulty before you get rewarded. Like pistachios.

Minor49er · on March 12, 2021

This is awesome. I wish there was a way to rotate the cells so you could see them straightened out. Otherwise, this is like the final boss to every Regex Golf game I've seen

Algol · on March 13, 2021

That was a lot of fun. It can help solving this puzzle if you are familiar with techniques for solving picross (AKA nonogram) puzzles.

adrianmonk · on March 13, 2021

Thanks! I did some nonograms years ago, and I saw the similarity but couldn't remember the name of that type of puzzle.

I think I saw at least 3 places where that technique could be used, probably more.

SCLeo · on March 12, 2021

It took me 1 hour 30 minutes. Have to say, this is so well designed. Love it.

cammil · on March 13, 2021

about the same for me too.

Naac · on March 12, 2021

Any reason why this is implemented as a hex instead of the traditional crossword style?

There isn't a technical reason why the traditional crossword format can't have clues in the shape of regex right?

kjhughes · on March 12, 2021

The hexagonal layout allows three regex constraints to be imposed on each cell rather than two for a rectangular layout, providing additional challenge.
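The "three constraints per cell" geometry can be sketched with cube coordinates, where each cell is a triple (x, y, z) with x + y + z = 0 and each axis value picks out one line. The side length of 7 is an assumption here, chosen because it is consistent with the 39 constraints mentioned earlier in the thread; the function names are hypothetical.

```python
from collections import Counter

def hex_cells(side):
    """Cube coordinates of every cell in a hexagon with `side` cells per edge."""
    radius = side - 1
    return [
        (x, y, -x - y)
        for x in range(-radius, radius + 1)
        for y in range(-radius, radius + 1)
        if abs(x + y) <= radius
    ]

def line_lengths(cells, axis):
    """Lengths of the parallel lines obtained by fixing one coordinate."""
    counts = Counter(cell[axis] for cell in cells)
    return [counts[k] for k in sorted(counts)]

cells = hex_cells(7)
print(len(cells))
print([line_lengths(cells, axis) for axis in range(3)])
```

Each cell has three coordinates, so it lies on exactly one line per axis. With side 7 there are 13 lines per axis, or 39 lines in total, each of which would carry one regex.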

ZeroGravitas · on March 12, 2021

Don't think there's a technical reason. I'd previously seen this site linked from here that does it slightly more traditionally:

https://regexcrossword.com/

minitoar · on March 12, 2021

IIRC the more advanced puzzles here are also hexagonal.

freeopinion · on March 12, 2021

Lost interest on 3rd beginner puzzle when validation failed for multiple valid solutions.

vitus · on March 12, 2021

What solutions were you trying?

As far as I can tell, the puzzle is fully constrained.

From the start, the bottom-left cell as well as the right column can be solved just from the starting hints, and then you can derive the top-left cell per the backreference.

In particular, you can reframe the right column's hint as (AB|OO|OR), and only one of those satisfies the bottom row's hint.

freeopinion · on March 12, 2021

Not so.

Something as simple as "OO\nDO" should be valid. But also "OO\nDD" and myriad other solutions.

vitus · on March 12, 2021

Ah. The difference is that you're trying partial matches, as opposed to full matches from the specified patterns.

edit: and by full match (since this seems to be the source of some confusion in another subthread), I explicitly mean anchored with your \A...\Z or whatever you want to use.

lesquivemeau · on March 12, 2021

"Hexagons are the bestagons"

neolog · on March 12, 2021

Does the hex mean each cell has more adjacencies?

ehsankia · on March 12, 2021

There are plenty of normal regex crosswords out there. What makes this one special is the fact that it's in a hex format. If anything, regex is actually what allows people to even make hexagonal crosswords, I don't think it would be possible to get any reasonably sized one with normal words.

They just went the extra mile to make a special puzzle.

superfamicom · on March 12, 2021

These sort of games are great as an educational tool, a simpler one would be a nice intro before going to hard mode though.

gotostatement · on March 12, 2021

Why doesn't the empty string satisfy `(DI|NS|TH|OM)*`?

freeopinion · on March 12, 2021

Empty string does satisfy that. This puzzle gets it wrong.

vitus · on March 12, 2021

That's because it's not empty string in the puzzle, it's 8 spaces.

(and on top of that, we're doing a full match.)

freeopinion · on March 12, 2021

8 spaces also matches. I don't know what you mean by a full match. Do you mean that you are using a different RegEx than the one displayed?

mypalmike · on March 13, 2021

8 spaces does not in any way match the language described by that regex. Not a partial match, not a full match.

Plugging the first space into a DFA described by the regex is an immediate failure - there is no exit from the initial state initiated by the space character. It's a non match.

Regex engines will say they do match because they are by default checking for substrings that match the language (such as the initial empty string of each line of grep), not for strings that match the language.

*edit: added last paragraph.
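The DFA argument can be made concrete with a small hand-built automaton for `(DI|NS|TH|OM)*`. This is a sketch: the state numbering and the names `TRANSITIONS` and `accepts` are choices made here, and any missing transition is treated as the dead state.

```python
# State 0 is both the start state and the only accepting state.
# States 1-4 mean "just read D / N / T / O and waiting for the partner letter".
TRANSITIONS = {
    (0, "D"): 1, (1, "I"): 0,
    (0, "N"): 2, (2, "S"): 0,
    (0, "T"): 3, (3, "H"): 0,
    (0, "O"): 4, (4, "M"): 0,
}

def accepts(text):
    """Run the DFA over the whole string; reject on any missing transition."""
    state = 0
    for ch in text:
        state = TRANSITIONS.get((state, ch))
        if state is None:  # no edge for this character: dead state
            return False
    return state == 0

print(accepts(""), accepts("THOM"), accepts(" " * 8))
```

The empty string and `THOM` are accepted, while eight spaces fail on the very first character because state 0 has no edge for a space.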

nxrabl · on March 12, 2021

I assume by 'full match' they mean:

- the pattern is found in the input

- the start of the match is the start of the input

- the end of the match is the end of the input

By this definition, 8 spaces does not match the pattern.

freeopinion · on March 12, 2021

I don't know what any of your three statements actually mean.

R*D*M* does not specify anything that has to be found in the string. Nor does any pattern of ()* or []* no matter what you put between the parens or brackets.

In all of those cases, any possible string matches the regex starting at the beginning of the string and ending at the end of the string.

Your clarification doesn't clarify anything for me.

jcranmer · on March 12, 2021

The regex crossword is working on the basis of matching the entire input string (of " "), not on finding a substring that matches the regex. You can prefer to think of it as having all regexes have an implicit ^ and $, i.e., you're attempting to find a substring that matches "^R*D*M*$".

dec0dedab0de · on March 12, 2021

> You can prefer to think of it as having all regexes have an implicit ^ and $

That is literally what is happening: https://github.com/Jimbly/regex-crossword/blob/master/crossw...

ladberg · on March 13, 2021

Not anymore, hah: https://github.com/Jimbly/regex-crossword/commit/0e603975f21...

dec0dedab0de · on March 13, 2021

Haha! Fantastic commit message

Jimbly · on March 13, 2021

Glad someone appreciated it :D

schoen · on March 13, 2021

Similar to "egrep -x".

adrianmonk · on March 13, 2021

The issue is two different types of regex notation.

Some regex notations include "^" and "$", and some don't. A lot of software (the grep command, for example) uses the kind that does support "^" and "$". This puzzle uses the other kind.

Essentially, when a notation includes "^" and "$", it allows writing cleaner, more concise patterns. These notations add an implicit ".*" at the beginning and end of every pattern unless you use "^" or "$" to turn that off.

As for how you're supposed to know this, the puzzle tells you, but there is also a very strong clue, which is that many of the patterns have a leading/trailing ".*". This would be totally superfluous in one type of notation, so it must be the other kind.

Here are some patterns from the puzzle's notation:

    .*H.*H.*
    (DI|NS|TH|OM)*
    F.*[AO].*[AO].*

and here are how they'd look in a notation that uses "^" and "$":

    H.*H
    ^(DI|NS|TH|OM)*$
    ^F.*[AO].*[AO]
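The translation between the two notations can be checked mechanically: a whole-line match of the first form should agree with a substring search of the second form. This is a sketch that uses Python's `re.fullmatch` and `re.search` as stand-ins for the two notations; the reduced alphabet `HFAOTM`, the length cap, and the function name are assumptions made here.

```python
import re
from itertools import product

# (puzzle-style pattern, grep-style pattern) pairs taken from the lists above.
PAIRS = [
    (r".*H.*H.*", r"H.*H"),
    (r"(DI|NS|TH|OM)*", r"^(DI|NS|TH|OM)*$"),
    (r"F.*[AO].*[AO].*", r"^F.*[AO].*[AO]"),
]

def disagreements(alphabet="HFAOTM", max_len=5):
    """List (pattern, string) cases where the two notations differ."""
    bad = []
    for full, partial in PAIRS:
        full_re, partial_re = re.compile(full), re.compile(partial)
        for n in range(max_len + 1):
            for chars in product(alphabet, repeat=n):
                text = "".join(chars)  # no newlines, so "." and "$" behave simply
                whole = full_re.fullmatch(text) is not None
                found = partial_re.search(text) is not None
                if whole != found:
                    bad.append((full, text))
    return bad

print(disagreements())
```

On this small sample the two forms agree for all three pairs.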

Jimbly · on March 13, 2021

To be fair, the puzzle didn't tell them at the time of their post, I added the "must be a full match" line after seeing confusion here, and it was not in the original puzzle (people doing puzzle hunts are expected to deduce more than random people on the internet, I guess!).

freeopinion · on March 12, 2021

R*D*M* should match the empty string, a string of 8 spaces, or any combination of characters. It should be impossible not to match.

Edit: Fix HN oddity

utborin · on March 12, 2021

A full match has to exhaust all of the characters in the string. R*D*M* is indeed a match for a string of eight spaces, but it isn't a full match, because there are still eight spaces left over after matching.

    % python3
    >>> import re
    >>> re.search(r'R*D*M*', '        ')
    <re.Match object; span=(0, 0), match=''>
    >>> re.fullmatch(r'R*D*M*', '        ')
    >>>

zermelo · on March 12, 2021

nope, puzzle is fine.

ziml77 · on March 13, 2021

I think that was between 1 and 1.5 hours to solve. Was a fun puzzle! Though for some reason my brain was expecting the NYT Crossword completion music when I filled that last hexagon...

chaorace · on March 12, 2021

It took me just under 2.5 hours. How are you folks faring?

Tip: Ctrl-Z is your friend ;)

zermelo · on March 12, 2021

Yep, 1h5 hours for me!

dandanua · on March 12, 2021

Awesome puzzle! Can be completed without guesses.

Any ideas how it was invented?

schoen · on March 12, 2021

Look at the original MIT Mystery Hunt page (and solution). It was part of a puzzle hunt. (I was one of the people who successfully solved it on my team during that very event back in 2013...)

See this comment for the link to the original, including the author's name and the puzzle in context with its official solution:

https://news.ycombinator.com/item?id=26439598

(I wish that other comment would get upvoted to the top -- this was written by an identifiable person for a specific identifiable puzzle event, so it's not like mysterious anonymous Internet folklore or something.)

frob · on March 13, 2021

Hah! I was beginning to suspect there existed some pattern or answer in the result and was looking for an answer. The better part of 2 decades of mystery hunt has conditioned me. I see a four-letter snack in there that jumps out as a possible solution. Should we call hq?

schoen · on March 13, 2021

I'm afraid HQ closed around 8 years and 56 days ago. :-)

jedberg · on March 12, 2021

Oh man. This broke my brain just trying to get one of them without breaking all of the ones that crossed it. I love this.

onion2k · on March 12, 2021

If you fancy making an online crossword, I once made a nice traditional crossword layout using CSS Grid that makes a decent basis - https://codepen.io/onion2k/full/KRQeqm

carols10cents · on March 12, 2021

NY Times Mon < Tue < Wed < Thu < Fri < Sat < Sun < British Cryptic < RegEx

eludwig · on March 12, 2021

I would put Sunday between Thursday & Friday, just way more annoying due to grid size

gfaure · on March 13, 2021

Solved it! I kept tilting my head because it was hard to see the clue and interpret it at the same time. The ability to highlight all three diagonals would have been perfect.

The final clue I had to complete was that tricky .*(.)(.)(.)(.)\4\3\2\1.*

ilikepi · on March 13, 2021

> I kept tilting my head because it was hard to see the clue and interpret it at the same time.

Agreed, I definitely found myself doing that.

> The final clue I had to complete was that tricky .*(.)(.)(.)(.)\4\3\2\1.*

Interesting...I think I had the main guts of that one worked out when I was about 1/3 of the way through, but admittedly I did take a screenshot and mark it up in order to keep track of the various possibilities. It was (...?)\1* that tripped me up because I had an overly broad interpretation of its mechanics.

I wonder if it would be possible to record a bunch of 0-100% runs and then to make some sort of visualization to demonstrate different approaches...

brianpan · on March 13, 2021

OMG, I was having the hardest time figuring out what 3 regexes applied to each hex. Then I reloaded partway through and the HIGHLIGHTING started working.

Reload if you don't see the regexes turn bold when you click in a hex!

aasasd · on March 13, 2021

This confirms my long-standing annoyance with regexes: the dot is poorly visible among other characters and jumps out at me when I'm already thinking of a match.

amelius · on March 13, 2021

I think it would be more satisfying if the result was some human-readable text rather than a random selection of characters. Otherwise, interesting concept!

siraben · on March 13, 2021

What would writing a solver for regex crosswords entail? Is it just a matter of constraint solving, and what would the running time be?

acdw · on March 13, 2021

I thought this would be a true crossword -- i.e., it'd have /words/ to solve for ... :/

Really neat puzzle though!

Jimbly · on March 13, 2021

In the context of the original MIT Puzzle Hunt, there is a single word or phrase that comes out of solving the entire thing, but, yeah, would be interesting if it were all words!

kubanczyk · on March 13, 2021

Having the words everywhere would take away some fun, because the solution space would become severely constrained. Like with those DI|NS|OM expressions or that (...?)* line.

kjrose · on March 13, 2021

This was awesome. I loved it so much.

Personally I'd even pay good money for a series of these style of puzzles.

beaconstudios · on March 13, 2021

nice, solved it in about half an hour. At first it looked impossible past the first maybe 5 free characters. I feel like there were a lot of characters that had to be solved sudoku-style by holding up to 4 constraints in mind at the same time, which was tricky.

stevage · on March 13, 2021

Is there something wrong? Why isn't the clue `(O|RHH|MM)*` satisfied by blankness?

ufo · on March 13, 2021

You can assume that all regexes are "anchored". They match the entire word, not just a substring.

phonebucket · on March 12, 2021

I thought I could handle RegEx, but this has given me a legitimate inferiority complex.

ufo · on March 12, 2021

Is it supposed to have a unique solution or are multiple solutions possible?

ladberg · on March 12, 2021

Just did it, there's a unique solution and you can get to it just by considering a few rules at a time (i.e. no long backtracking).

sevencolors · on March 12, 2021

The solution: https://www.i-programmer.info/images/stories/News/2014/Dec/B...

Aren't crosswords supposed to spell out a word? This feels like random characters. Unless I'm supposed to rotate it or something?

mypalmike · on March 13, 2021

Think sudoku rather than crossword.

schoen · on March 13, 2021

There is an answer extraction, though, because it's a Mystery Hunt puzzle (where all puzzles extract a word or phrase as an answer at the end). It's just that the individual diagonals in the grid as a whole are not themselves words.

mypalmike · on March 13, 2021

Oh wow I didn't realize there was a phrase embedded in there. Amazing.

cammil · on March 13, 2021

I think each entry I put was forced, so only one solution unless i made a mistake.

hateful · on March 12, 2021

Also, are the answers real words, or random strings?

ufo · on March 12, 2021

It has to be fake words. It's possible to see this because some of the words have to contain things like "cdd", "rrp" and "rxo", which aren't part of any real words.

hateful · on March 12, 2021

Plus, now that I think about it, having a grid without the gaps usual crosswords have would be very complex to create with actual words. Even more so when using hexagons.

torgard · on March 12, 2021

What a game! Thanks for this.

It took me close to three hours to solve this, hahaha

WarOnPrivacy · on March 12, 2021

Well, there went my weekend.

nfoz · on March 12, 2021

This was fun :)

heroHACK17 · on March 12, 2021

glsdfgkjsklfj · on March 12, 2021

99pct of the challenge is counting the positions for things.

Would be nice if they had a couple handmade puzzles instead of random ones.

smrq · on March 13, 2021

There is no way a puzzle with such a narrow solution path is anything but handmade.

freeopinion · on March 12, 2021

Right away the red and green hints are wrong. Either that or this puzzle used RegEx rules that contradict those I am familiar with. Either way, they lost my attention.

ladberg · on March 12, 2021

They seem to be accurate under all normal regex rules. Keep in mind that empty values are spaces and a full match is needed!

freeopinion · on March 12, 2021

.(C|HH)* should match against " " or against "A " or against "B" or against "ZZZZZZZ".

But this puzzle doesn't recognize any of those as valid.

ladberg · on March 12, 2021

That's not the case... .(C|HH)* matches a single character followed by any number of C's or HH's. So " " or "B" would work, but in the puzzle the full line has to match, so those aren't possible.

freeopinion · on March 12, 2021

Can you explain what you are seeing about "B" that does not match? Or about "_______"?

chaorace · on March 12, 2021

As you can see[1], the regex checker is based on the JS RegExp.match method. Each cell is always 1 character long (so, either a letter of some kind or an empty space). The str parameter for the check function is assembled by concatenating these cells together[2], so the input string for an empty line will look something like this: "_______"

Assuming an empty 7-cell row, the regex in question will match 7 times, once for each character. A result of multiple partial matches is not equivalent to getting just one perfect match, which is what the puzzle requires. Internally, the puzzle enforces this requirement by making each Regex rule require a full line match (see line 118: '^' + rule + '$').

Personally speaking, I felt like that was the most intuitive way to interpret the mechanics of the game, so I'm willing to give the programmer a pass when they slightly modify the regex rule prior to evaluation.

[1] https://github.com/Jimbly/regex-crossword/blob/8b178f32eba37...

[2] https://i.imgur.com/PS0BmJ6.png
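The check described in this comment can be sketched in Python. The real puzzle is written in JavaScript, so this is only an approximation of the described behavior: the function name `line_is_satisfied` and the convention that an unfilled cell is an empty string rendered as a space are assumptions made here.

```python
import re

def line_is_satisfied(rule, cells):
    """Join one line of cells into a string and require a whole-line match."""
    # Each cell contributes exactly one character; blanks become spaces.
    text = "".join(cell if cell else " " for cell in cells)
    # Anchoring mirrors the '^' + rule + '$' wrapping described above.
    return re.search("^" + rule + "$", text) is not None

print(line_is_satisfied(r".(C|HH)*", [""] * 7))        # empty 7-cell line
print(line_is_satisfied(r".(C|HH)*", list("ZHHC")))
print(line_is_satisfied(r"R?(CR)*MC[MA]*", list("RMCMMMMMMMMM")))
```

One caveat with plain concatenation: a rule with a top-level alternation such as `A|B` would become `^A|B$`, which anchors each branch on one side only, so wrapping the rule in a group (or using `re.fullmatch` in Python) is the safer general form.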

ladberg · on March 12, 2021

I agree that "B" does match. However, the puzzle needs to match the full line and not just a substring, so "ZZZZZZ" or "B " don't.

freeopinion · on March 12, 2021

You have not explained how those are not matching the full line. I assert that they are. Show me actual correct code that says otherwise.

anamexis · on March 12, 2021

Even with the full line match, I can't figure this out.

R?(CR)*MC[MA]* is not matching for the full line RMCMMMMMMMMM

Why?

ladberg · on March 13, 2021

That line matches for me if I add it to the grid, though it breaks other lines.

Are you sure you're going the right direction? It should be going from the bottom left to the right (in the order of the regex text).

anamexis · on March 13, 2021

Ah, thank you, that was it. I was assuming they started from the label.

mypalmike · on March 13, 2021

Yeah it took some testing to figure out the direction.

freeopinion · on March 12, 2021

What does "full match" mean? Either it matches or it doesn't.

ladberg · on March 12, 2021

I think some people may have thought you just have to match any substring within it, so I was clarifying that the whole line has to be a match.

freeopinion · on March 12, 2021

Some people would be correct to think that. That is what those regexes mean. If you are secretly prepending a ^ and appending a $, then you are not using the regex displayed.

utborin · on March 12, 2021

Sorry, but you're wrong. There's nothing about regular expressions that means you have to use them to search for matching substrings. That's just one particular operation that uses regular expressions. It's not a quality inherent to regular expressions themselves. There are many different operations you can perform with a regular expression besides a substring search.

Python, for example, has a fullmatch method.[0]

libicu's matches() function returns true "if the pattern matches the entire string, from the start through to the last character."[1]

PCRE has various flags that change what it means for a regular expression to match, including PCRE2_ANCHORED and PCRE2_ENDANCHORED. Used together, these options would require a full match with no change to the regular expression itself.[2]

0. https://docs.python.org/3/library/re.html

1. https://unicode-org.github.io/icu/userguide/strings/regexp.h...

2. http://www.pcre.org/current/doc/html/pcre2api.html#SEC27

schoen · on March 13, 2021

Another way to describe this could be that the meaning here is inherently ambiguous between "is-a" and "has-a", but the puzzle only makes sense (and only has a solution) if you interpret as "is-a".

In Mystery Hunt puzzles, which this originally was, "we have to interpret this in a way that would allow there to be a meaningful and unique solution" is not only a perfectly legitimate form of reasoning, but often necessary!

It's not really exactly the same kind of reasoning, but in a puzzle I wrote a year before this for the same event

https://www.mit.edu/~puzzle/2012/puzzles/into_the_woodstock/...

you could look at it and say "hiragana is only ever allowed to be used to write Japanese!!!" but insisting on that rule (much as it applies in most situations) wouldn't give the puzzle a meaningful solution. :-)

Maybe a closer equivalent would be that in this year's Mystery Hunt, there was a puzzle using a set of variant Hashiwokakeru (Bridges) logic puzzles. There were hints about which rules were changed but it wasn't stated whether the rule changes applied individually (one puzzle each) or cumulatively (when rules get changed, they don't change back afterward), or some other way. So, it was necessary to make assumptions about what was meant and see whether they allowed a solution. That's typically considered fair and appropriate throughout Mystery Hunt-land.