Hacker News new | past | comments | ask | show | jobs | submit login

Maybe I've just been away from C++ for too long (~2 years) but I wasn't able to follow what was going on there. What does `sizeof` have to do with anything (doesn't work without it, though)? I would have enjoyed an article that talked me through this in a bit more detail.



The sizeof() forces instantiation of the template.

The trick is that the template declares a friend function with a body. Being a "friend" declaration, it is not scoped within the template. However, the body is defined within the template and is allowed to use the template's parameters.

The result is that the friend function's definition depends on how the template was instantiated, even though the friend function is not scoped within the template. If you instantiate the template two different ways, the compiler actually complains that the friend function has two different definitions.

What this all means is that you can write some code that instantiates a template, and then later on, you can observe how the template was instantiated. Instantiating a template has a programmatically-observable side effect.

And you can use that to trick the compiler into giving you more information about other declarations. SFINAE on steroids basically.


I hate you need to trick the compiler. Reflection can't come soon enough.


You're not the only one; I suggest reading the linked Reddit thread.

https://www.reddit.com/r/cpp/comments/6ydxya/the_great_type_...


For arctually reading anything, old.reddit.com links tend to be better:

https://old.reddit.com/r/cpp/comments/6ydxya/the_great_type_...


Let's break the program down into pieces, although I'm not going to go in program order because it makes the exposition a little easier if I don't.

The first point of interest is this line:

   auto loophole(tag<0>);
C++ parsing is a complex thing, and this is actually one of those cases where I had to stare at it for a while to figure out what it actually classifies as. At first glance, it can be a variable declaration or a function declaration, but Most Vexing Parse kicks in and this is a function declaration.

What threw me here was this is a trailing return type declaration of a function where the trailing return type is omitted, and I wasn't sure at first if this is legal. But it is a function declaration, which means we can't validly use it until we have a definition.

Now the next thing you might be interested is that the struct tag is a template. But, no, that doesn't matter one iota [0]. So we'll ignore it. Instead, let's move on to the template definition:

  template<typename T, int N>
  struct loophole_t {
    friend auto loophole(tag<N>) { return T{}; };
  };
There's two separate things to talk about here. Again eschewing the textual order, let's focus on the the friend line first. We have a definition of a function that is a friend--this causes the friend definition to be added as a member of the enclosing namespace of the class, as if it had been defined outside of the class, so it's as if we had this code:

  auto loophole(tag<N>) { return T{}; }
  template <typename T, int N>
  struct loophole_t {
    friend auto loophole(tag<N>);
  };
But wait, you notice. N and T are undefined when we pull out of the template! Well, this is the second point to bring up. Templates in C++ are really macros of a fashion. When you declare, or even define, a template, nothing happens. At least, not yet. Internally, the compiler basically saves the state of the template AST internally. When the template is instantiated, then it creates an entire copy of the template body as if it were defined, with the values of the template parameters substituted in the body of the template.

I'll use the tag<0> bit to give an example of what the code looks like at the point of our actual auto loophole declaration:

  // internal compiler reference to a templatable tag struct

  // internal compiler reference to a templatable loophole_t struct

  // this is autogenerated template instantiation
  struct tag<0> {};

  auto loophole(tag<0>);
Now we get to the kicker line, the sizeof, reproduced here:

  sizeof( loophole_t<std::string, 0> );
What does sizeof do? Well, nothing. It's a constant expression--even if you drop an expression in the argument of sizeof, that argument is not evaluated. But that's an expression context. Instead, however, we passed in a type name. This type name is a templated type--which means we instantiate that template. So let's add that to our running AST example:

  // Autogenerate the template expansion
  auto loophole(tag<0>) { return std::string{}; }
  struct loophole_t<std::string, 0> {
    friend auto loophole(tag<0>);
  };
  sizeof( loophole_t<std::string, 0> );
Because of the friend body definition, by instantiating the template, we created the function definition for the auto declaration that we provided. It is as if we had defined the function specifically at the point of the instantiation of the template, the first use of the template with specific arguments [1]. So the instantiation of loophole_t<std::string, 0> provides the body for auto loophole(tag<0>) that lets us deduce that its return type is actually std::string.

The final line is this:

  std::is_same< std::string, decltype( loophole(tag<0>{}) ) >::value;
So decltype looks up what the return type of calling the function named loophole with a prvalue of type tag<0> would be. There is only one such candidate function--the one we have a declaration and instantiation-generated definition for--and so we the proper type is the return type of said function, which is deduced from its body (in the instantiation-generated definition) as std::string.

If we omit the sizeof line, there is no instantation of loophole_t anywhere. And without that instantation, there is no definition of the loophole itself provided.

[0] Not for explaining what it does. It's for what it's for--you can easily generate several numbers with the template parameter and associate each number with a particular type.

[1] Okay, I'm shortcutting a lot of complexity here. [temp.inject]¶1 provides that "When a [class] template is instantiated, the names of its friends are treated as if the specialization had been explicitly declared at its point of instantiation."--in other words, precisely as I described it here. But this is specifically only true for friends. For member functions or regular templated functions, a lot of the name lookup actually happens at the point of declaration, not instantiation. But some of it doesn't, based on whether or not it's based (directly or indirectly) on a template parameter. It's complicated!


For those who are looking to understand what is going on, I would recommend the blog series I wrote -almost- more than half a decade ago at https://b.atch.se

Compiler intrinstics havet changed a lot since then, but the wording in the standard remains quite stable in this department

Disclaimer; I did not read your post in full (on set, working atm).

Edit: time flies, it has been more than half a decade.


I think it just causes the type loophole_t<std::string, 0> to be defined (and thus the loophole friend function).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: