Brew commands send data to Google Analytics

pvinis · on Nov 25, 2016

Yes we know. brew tells you that. And you can disable it. Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

Some of the times it's to make the product better, or better targeted. Some other times it's just for spying on the users.

Let's stop complaining about stuff that someone does and tell you they do it. There are many more that do the same things without telling you.

Furthermore, now that you know, for brew specifically, will you opt out? Is your brew command history so secret that you care more about noone looking at it than helping making brew better? Chances are you use brew quite often. Chances are brew is not perfect. Will you choose to make its progress slower because of no real security reason?

How do we expect open source to become better if everyone is being a crybaby because brew got a history of how often they do brew update?

blub · on Nov 25, 2016

The US approach to privacy is "let the market sort it out". It has sorted it out by transforming the internet into a giant spying machine sucking up contacts, photos, documents, videos, metadata, what packages you use, when where and how you exercise, everything

The EU approach to privacy (eroded by lobbying and lack of control over US companies) is that citizens have a right over their information. They can ask what personal information a company has on them, ask that it be corrected or deleted. This has resulted in companies that are more careful with data.

Whenever I read a post like yours I immediately think of so-called "useful innocents", to put it euphemistically [1]. Fighting for a non-goal of better open source through analytics (get real), corporate dominance and fewer rights for individuals.

Companies don't need analytics to improve software. They certainly don't need analytics from Google, the leading spyware-as-a-service company in the world.

I expect open source to become better through writing quality software, engaging with users and if need be doing some surveys or organsising other reach-out initiatives in the open. Not analytics turned on by default but it's-ok-cause-we-tell-you-we're-fucking-you.

[1]: https://en.m.wikipedia.org/wiki/Useful_idiot

pvinis · on Nov 25, 2016

Do you use brew? Have you contributed code to it? Have you installed something using brew, before updating brew for a while?

For me, my answers are yes no yes. A while ago, you did that, and you get an old version of the thing you just installed. With analytics, they saw that many people forget to update before installing after a while, so they added an update step in that case.

I call that progress, no matter how small. I didn't change that and to you didn't change that. The brew devs changed that. Maybe they would have changed it anyway later.. Who knows. But the fact that brew because a tiny tiny bit better because of analytics, in part from me, makes me a happier user. Now I don't need to remember to update before installing.

Of course it's not the best way to "contribute", but it's something. Definitely better than "contributing" your data to companies, like you said.

majewsky · on Nov 25, 2016

I can totally see the benefits of telemetry. But it would be way less phishy if it were opt-in, or at least opt-out with a very visible information message.

CJefferson · on Nov 25, 2016

The problem with this is, do you want every program you ever use, to start prompting you with a series of questions about various opt-in / opt-out questions? I get annoyed enough that gnu parallel keeps asking me about citing it, and that's one program.

    Bash would like to record analyitics. y/n/more information
    > y
    > ls *.c
    ls would like to record information about how you use it.
    y/n/more information.
    > y
    file1.c file2.c file3.c
    > grep 'string' *.c
    grep would like to record some information about your
    usage. y/n/more information.
    > y
    file1.c:8: string
    > vim file1.c
    vim would like to record information usage, to help
    improve vim in future. y/n/more information.
    > y
    The 'C' package in vim would like to ...

Nursie · on Nov 25, 2016

No, I want every program I ever use not to leak data about me all over the internet.

I'm amazed we're even having this discussion

TeMPOraL · on Nov 25, 2016

If I saw something like that, I'd format my drive and install another OS.

The very idea of your shell or file listing program sending analytics to the mothership is ridiculous. They shouldn't be even talking to a network directly. They have a set of well-defined tasks, and spying on you is not one of them.

tunap · on Nov 25, 2016

"The problem with this is, do you want every program you ever use, to start prompting you with a series of questions about various opt-in / opt-out questions?"

Simple answer, Yes.

It would lead to conversations about the elephant in the room. It also alleviates the assumption I trust all providers(comprehend the EULA/ToS) and puts their actions under scrutiny. If more people realized what data is collected and by whom for whom they could make an informed, cost/benefit choice. Other useful results are, what is the applicability of that data to the project, what characteristics define the 'Trusted Partners' that data is shared with and, possibly, inject some restraint into the collectors' decisions of what to collect.

edit: removed unintended paste fragment

sangnoir · on Nov 25, 2016

Lol, reminds me of what macOS did[1] to me after a recent upgrade:

  $ git pull

  (long wait)

  ^c

  $ git pull

  (got distracted and had to leave keyboard for a bit)

  Can't execute program until you accept xcode license

I couldn't believe it.

1. paraphrasing the words

codedokode · on Nov 25, 2016

Opt-in can be implemented other way without questions. For example, user could set an environment variable or type a command.

CJefferson · on Nov 25, 2016

Then we hit the problem that if we don't push that request at users, probably a tiny fraction will turn it on.

Worse, that fraction will be the statistically unusual people who bother reading and finding such options, meaning we can't derive any statistically useful results about the user base from them!

TeMPOraL · on Nov 25, 2016

Frankly, that's your problem. I don't see even a true desire to make the software better as a legitimate reason to exfiltrate data from people without asking.

codedokode · on Nov 25, 2016

Maybe you should ask them better. For example you could ask for help on the home page of your project or in the beginning of a tutorial. Or maybe they do not want to paticipate. Is it right to ignore this and turn analytics on by default?

lewiseason · on Nov 25, 2016

The Debian installer gets this right: it prompts you during the install whether it can collect information about installed packages for popcon[1]

[1] http://popcon.debian.org/

majewsky · on Nov 26, 2016

I never understood why popcon is necessary. Couldn't you achieve about the same result by just counting downloads on the package mirrors?

smonff · on Nov 25, 2016

Oh, no!

codedokode · on Nov 25, 2016

Opt-out is what commecrial companies do because they do not respect their users' privacy. Free open source software should do the opposite.

By the way I think that browser history should be disabled by default too.

TeMPOraL · on Nov 25, 2016

Browser history is useful to user (but maybe the access to it could be restricted by means of authentication). I'm fine with off-line browser history that's accessible only to the user (and not JS running on a website). But sending any of it over the wire should definitely be opt-in.

codedokode · on Nov 25, 2016

An unexperienced user might not know that the history is recorded and later can be accidently seen for example by other member of the family. A good software would not allow that.

I remember some IM app for linux had history turned off by default. I was really surprised then.

hyperbovine · on Nov 25, 2016

Come now, a bajillion other software projects have "figured out" that people forget to update everything, always, without resorting to this sort of behavior. There's even a whole library, Sparkle, the exists just to solve this problem. Your nonexample of the necessity of analytics merely serves to reinforce the point you are trying to rebut.

_b8r0 · on Nov 25, 2016

Why do they need to use Google Analytics and hand information over to a third party instead of using something hosted?

I get that there may be money involved, but presumably self-hosting also gives them more of the information to build out what they need.

emartinelli · on Nov 25, 2016

In another comment in this thread by @mikemcquaid (Homebrew lead maintainer):

> (...)and have been trying to find people who will provide us with non-Google hosting but: we're chronically understaffed and underfunded[1]

[1]https://news.ycombinator.com/item?id=13035438

karmacoda · on Nov 25, 2016

What are the estimated costs for (something like) this?

pvinis · on Nov 25, 2016

I agree completely with you and majewsky below.

Chris2048 · on Nov 25, 2016

The issue here is not contribution, but forced/unnanounced contribution. Would it be ok if I steal from your wallet to contribute to Homebrew?

wfunction · on Nov 25, 2016

> The EU approach to privacy (eroded by lobyying and lack of control over US companies) is that citizens have a right over their information. They can ask what personal information a company has on them, ask that it be corrected or deleted. This has resulted in companies that are more careful with data.

More like, this has resulted in things like nonsensical "Cookie warnings" that only waste the user's time.

blub · on Nov 25, 2016

That cookie warning is just a friendly reminder that you are being tracked everywhere.

Of course, websites that don't track their users are exempt: http://ec.europa.eu/ipg/basics/legal/cookies/index_en.htm

I guess the EU didn't expect websites to cling to tracking cookies like their lives depended on them.

Isinlor · on Nov 25, 2016

Cookie law is one of the stupidest laws that EU forced. I had to install additional rules to adblocker to get rid of this nonsense and it still doesn't catch everything and keeps irritating me... I don't care, damn it! http://nocookielaw.com/

kuschku · on Nov 25, 2016

Is it truly?

The law is pretty simple: If you track users, or transmit information to third parties which could track the user, you have to get the user to opt-in.

The original intention was to get rid of Facebook’s shadow profiles, which it did – in the EU, websites don’t embed Facebook’s like button anymore, but have a two-click solution, where you have to click it, then it’s actually loaded, then you can like.

That alone is worth such a law.

Isinlor · on Nov 25, 2016

It is not worth it. I would really want a good way to protect myself as an Internet user from this law that the EU is enforcing on me. To protect myself not from Facebook, not from Google, but from the EU. It also gets in my way as a web developer. The law is not simple. 100 words sentences of law jargon are not simple. Many, many pages of such sentences is the opposite of simple. To be honest I have never really understood the law and I don't think I ever will. Even keeping track of it is far from simple.

But according to this website: http://ec.europa.eu/ipg/basics/legal/cookies/index_en.htm Cookies clearly exempt from consent according to the EU advisory body on data protection- WP29pdf include: (...) third‑party social plug‑in content‑sharing cookies, for logged‑in members of a social network.

So, I think what you say about Facebook is not true.

kuschku · on Nov 25, 2016

> for logged‑in members of a social network.

I was talking about non-logged in users. The shadow profiles.

> The law is not simple. 100 words sentences of law jargon are not simple. Many, many pages of such sentences is the opposite of simple.

The German version of the law is just as simple to read as any newspaper or book you read in German high school, so at least I haven't had an issue before.

DonHopkins · on Nov 25, 2016

Not entirely. I've learned to read the word "cookie" in many different languages.

Semaphor · on Nov 25, 2016

It has resulted in a lot of good things. Cookie consent is most certainly not one of them.

EvilTerran · on Nov 25, 2016

IMO, the intent was good... but yeah, the implementation is a grand demonstration of what you get when legislators don't understand the technology they're regulating. It would be far better if they'd required browsers, not websites, to show the notifications - much like they already do when a site wants to access your webcam/mic/location/etc[0]. That would mean much less implementation work (once per browser instead of per site), no way for underhanded sites to use cookies without the user being notified or despite the user declining them, consistent UI across all sites...

Such a thing could provide significantly more useful information, too - I envisage a notification with "This site wants to use a cookie on your computer" at the top, "allow/deny, now/always" buttons and a "What are cookies?" link at the bottom, and a user-friendly breakdown of this particular case in between, things like:

• "only visible to this site" vs "visible to ad.doubleclick.net" etc - maybe including, say, the Organization Name from the cookie domain's SSL cert, at least in the case of cookies set to "Secure" (maybe only if the cert's EV)

• "until you close your browser" vs "for a week" etc - perhaps with a way for the user to force session-only if desired

• possibly some kind of warning about snooping risk if the cookie's not marked secure, or not HTTP-only & 3rd-party scripts are on the page, etc

• for the case of 3rd-party cookies, it'd be possible to list which other sites have used the same cookie in the past

And so forth. The most importantant point being that you could actually trust this information - your browser has no motivation to lie to you about it, but any random site might.

[0] eg, https://i.imgur.com/NcxWz8zh.jpg

kuschku · on Nov 25, 2016

The issue is: What about technical cookies?

The law allows storing session info, config options, etc without prompting. The browser couldn’t reliably test that.

Unless you’d also let login to sites be handled by the browser, akin to Mozilla’s Persona.

Damn, that’d actually be a much nicer web.

EvilTerran · on Nov 25, 2016

Yeah, that could work.

Or, to look at it another way... so what if the browser pops up notifications more often than the law requires? Might encourage people to make less use of cookies for functionality that doesn't really need them, which would be no tragedy. And if your site really can't do without them, you could pop up a message explaining the situation & politely asking to be unblocked.

I suppose there'd be a danger of alert fatigue... maybe some heuristic analysis of the cookies themselves would be in order, to at least tentatively classify them as "tracking" vs "other". Eg, Google Analytics & Piwik cookies could be identified pretty reliably.

Semaphor · on Nov 25, 2016

> I suppose there'd be a danger of alert fatigue...

But that's happening with the current solution as well (not to mention all those crappy sites with their modal popups that teach you to close anything opening asap anyway)

codedokode · on Nov 25, 2016

Single Sign-on can be implemented without third-party cookies and therefore without the cookie warning.

codedokode · on Nov 25, 2016

Cookie warnings are an indicator showing that a website takes part in a global spying network. But I think instead of warnings browser developers should just disable third-party cookies by default. Sadly, the most popular browser developer by conicidence owns a popular tracking service so that is not going to happen.

lewiseason · on Nov 25, 2016

> global spying network

Or maybe they want to persist session information?

Maybe browsers should prompt, instead of each website, but it's too late for that now.

kuschku · on Nov 25, 2016

> Or maybe they want to persist session information?

That’s specifically exempt from the cookie law: Any technical cookie – session information, config options, etc – is allowed without any request.

The Cookie law only requires opt-in for tracking. Such as Google Analytics.

lewiseason · on Nov 25, 2016

Yeah, good point. I couldn't recall if it was third-party cookies, or if it were tracking cookies that the requirement stated.

dbg31415 · on Nov 25, 2016

> Companies don't need analytics to improve software.

I was with you except this line. Data is needed to make better choices and prioritize improvements. How would you like people to collect data? Google Analytics is free, easy, and powerful. Adding the cost of buioding a KPI tracking and reporting later from scratch... simply would be too much for most open source projects.

kuschku · on Nov 25, 2016

Do market studies?

Take a dozen, or two dozen, users and give them your product, let them use it, and get far more results from that?

You can even do these things for free, especially in open source many of your potential users are willing to do this.

cm2187 · on Nov 25, 2016

It's the combination of companies gathering ever more personal data and companies proving to be absolutely incapable to prevent leaking personal data that I find worrying even more than the direct spying.

vsl · on Nov 26, 2016

The EU approach of “let’s regulate everything and let incompetent out of touch bureaucrats write the legislation” left to annoying (much more than ads for me) cookie banners and other such nonsense. Starting next year, cookies — i.e. often random nonsense numbers — will have to be treated as personal data in EU. I envy US.

tdkl · on Nov 25, 2016

>Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

This still doesn't mean we should just shut up and take it for desktop software. What anyone does on its OS shouldn't leave the LAN - same critique stands for recent MS endeavour with Windows 10.

pvinis · on Nov 25, 2016

I definitely don't mean that we should shut up and take it. But before we complain about an open source project that maaaaany devs use happily, let's complain about those other cases first, yea?

blub · on Nov 25, 2016

We can complain just fine about whichever software we want, don't try to make up rules and come to us with fake outrage about even greater injustices.

This is a classical attempt to muddy the waters.

pvinis · on Nov 25, 2016

Ok, sure. At least complain for both. I didn't see the post referring to anything besides brew. I'm all for protesting and trying to fix any size and kind of injustice. But just don't target the less guilty. brew is still guilty for not having an even bigger announcement. But less target others too, at least. I would hate to see brew or any other similar software be the sole target and have to change for the worse, because it was unlucky enough to be the scapegoat.

hueving · on Nov 25, 2016

>I didn't see the post referring to anything besides brew.

That's because this post is about brew. Windows 10 got tons of flak when it pulled this crap as well. "There are worse cases" is not an excuse for a project doing something shitty.

userbinator · on Nov 25, 2016

let's complain about those other cases first, yea?

Search HN for "Windows 10" and you'll get plenty of complaints.

eveningcoffee · on Nov 25, 2016

>Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

No, it is not so what. This is a major problem with Google instead. The fact that a single company can track majority of the web traffic is a huge risk.

wheelerwj · on Nov 25, 2016

We have to start somewhere. Opt in should always be the standard.

pvinis · on Nov 25, 2016

Sure. Opt in would be nice. But only if it was opt in always and everywhere, and all users, technical or otherwise, knew the reasons why analytics are gathered for a product they use. I think in that case, many people would still opt in. I would. And I would hope users of my products would too.

wheelerwj · on Nov 25, 2016

Yes, and maybe it's okay if we don't have perfect visibility. We've been building things for thousands of years without Javascript event notifications, I'm sure we will do just fine with a little less data

tomfluff · on Nov 25, 2016

Organ donation?

TeMPOraL · on Nov 25, 2016

Yes, there are arguments for organ donation to be opt-out. But that's extreme case, when one loses literally nothing - by virtue of being already dead - and another person stands to gain additional years of life[0]. Some other socially beneficial things also have strong arguments for being opt-out (like retirement plans, because opt-out protects people from their own stupidity/short-sightedness).

But that doesn't change the validity of the proposal that opt-in should be the standard - exceptions from which must have solid reasons. Just making it easier for someone to make money off people is not one of those reasons. Neither is vague "making the product better".

--

[0] - INB4 yes, there are also valid arguments that opt-out organ donations will reduce doctors' willingness to fight for patient's life. Human societies are complicated.

tremon · on Nov 25, 2016

Yes, there are arguments for organ donation to be opt-out. But that's extreme case, when one loses literally nothing

Actually, I've opted out because my next of kin stand to lose quite a lot (the harvesting of organs needs to be done quickly, well within the mourning period of the people I care about the most). The decision to leave sight of my body needs to be theirs, and theirs alone.

orf · on Nov 25, 2016

Well if you can't bear the thought of your friends and relatives not being able to stare at your dead corpse for a couple of hours and value that over doing something amazing and saving someone's life then more fool you.

If your in the position to donate organs you most likely died in an accident. You likely won't have that rosy picture of your family and friends around your beside. You might not have anyone.

What a waste.

tremon · on Nov 25, 2016

Yes, fuck you too. Try a little empathy next time, and failing that, reading comprehension.

I said the decision should be theirs. I did not say my body couldn't be parted from them, I said it was their decision to do so. I used to be registered as a donor, but because the law has now changed that my donorship overrides the wishes of my loved ones, I have withdrawn it.

orf · on Nov 25, 2016

> I said it was their decision to do so

And I said they may not be there to give that decision, so that's moot really. Organs like lungs and hearts expire very quickly. Reading comprehension indeed.

> Try a little empathy next time

For the dead person, or the person who misses out on a life saving organ due to the dead person?

It's your body and your choice obviously, but I think saying it's up to my relatives to decide isn't a great reason to be taken off the donor registry. There is plenty of time for them to grieve but only a few hours to take a vital organ. If you want to do something great if you expire unexpectedly then it's up to you, don't put that on your parter/relatives.

junke · on Nov 25, 2016

In France, you are a donor by default if you don't opt-out. In 2018, relatives won't be able to oppose organ donation in case of uncertainty.

jacobush · on Nov 25, 2016

Then it's their social contract in their democracy.

rubber_duck · on Nov 25, 2016

Also somewhat related to this - recently I've realized that this privacy paranoia is going to slow down medical advances coming out from big data so much.

For example your wearables get to collect so much biometric info, if that data can be connected to detecting conditions early it would provide a lot of value down the road. At some point we will have the option to collect data about what you ate, what you did, where you went and then how that affected your biometrics, and we can inexpensively collect huge scale DNA samples, etc. all that data if available publicly could really provide insights in to things that are really not practical in limited group studies.

For reasons such as this I think I'm fine if services collect anonymized (unless we solve identity theft and such security concerns) information about me, I'd just want them to make this data free.

TeMPOraL · on Nov 25, 2016

> anonymized

Remember; there's no such thing as "anonymized data", there's only "not enough other data to correlate out identities from it".

blub · on Nov 25, 2016

It's not paranoia, it's simple understanding of history and human nature.

rubber_duck · on Nov 25, 2016

Freaking out about disclosed anonymized analytics about package usage in an OSS project is paranoia in my book

TeMPOraL · on Nov 25, 2016

May be, but you didn't use the word "paranoia" to describe just this in your previous comment; you described also things like medical information.

rubber_duck · on Nov 25, 2016

I mean the general sentiment. I'm not saying people should have access to your full medical history, personal info, etc. on demand.

I am saying is that these benign things are opt-out not because most people wouldn't want to do them if they weigh the prons and cons but because they don't want to put in the effort of doing so and will just be conservative - which is logical from an individual perspective - but will cause us to lose out on opportunities as a whole.

Also this data is getting collected weather you want it or not, even intelligence people are just taping over their webcams as a security measure - the attitude that we must protect every bit of privacy by default will lead to the future where hidden data collection is the only way to access data - people will be making money off it, it won't be available to general public (for eg. public research) and there will be no transparency about it. And if you think the government will protect you - well they are the biggest transgressor here.

So instead of fighting a lost battle with trying to keep absolute privacy why not just make most of that data public and available and focus on protecting the really sensitive stuff.

zAy0LfpBZLC8mAC · on Nov 26, 2016

> So instead of fighting a lost battle with trying to keep absolute privacy why not just make most of that data public and available and focus on protecting the really sensitive stuff.

Because you can't. You cannot build something that protects the really sensitive stuff out of stuff that's leaking data left and right, making it harder and harder to protect the really sensitive stuff.

AsyncAwait · on Nov 25, 2016

> disclosed

Yeah, carrier contracts also "disclose" everything, there's a reason why it's called "small print".

It would only take to make it opt-in, instead of opt-out to rectify this.

smcleod · on Nov 25, 2016

This. Exactly this, thank you.

mikemcquaid · on Nov 25, 2016

> Some of the times it's to make the product better, or better targeted.

Homebrew lead maintainer here. This is exactly why we do it. To repeat myself from below: we're chronically understaffed and underfunded.

> Furthermore, now that you know, for brew specifically, will you opt out?

It's worth noting: if everyone who uses a niche piece of software opt-out when it breaks we'll look at our analytics and likely remove it from Homebrew instead of fixing it. This may be unfair but given the amount of work: we need ways to prioritise. Analytics have helped us do that so we can prioritise fixes on things that are critical for many people.

quesera · on Nov 25, 2016

You do not need telemetry from everyone to make informed decisions like this.

If just 1% of users opt-in to telemetry, you would have more than enough data for a representative sample set.

This is the whole point. You can do the right thing without any loss of data usefulness. So why persist in doing the wrong thing??

tn6o · on Nov 27, 2016

And then the 1% will start complaining. "Why me?"

The team behind Brew is doing a great job and if analytics can help them do a better job, then be it. Nobody is forcing you to use it and opting out is simple enough.

Tepix · on Nov 25, 2016

Does it have to be Google, the company that already sucks up the most information about pretty much everyone on the internet? Why don't you send the data to your own machines?

mikemcquaid · on Nov 25, 2016

We don't have our own machines that are capable of handling this. If you're offering to host them for us for at least two years for free: get in touch.

justinclift · on Nov 26, 2016

Email sent. :)

hueving · on Nov 27, 2016

Well now we'll see if the cost excuse was just a smokescreen I suppose.

justinclift · on Nov 27, 2016

Hmmm.... probably better to see how it turns out before jumping to conclusions. :)

From initial discussion with Mike, there are two parts that would be needed for a (self hosted) setup:

a) The hosting itself (this is something I can likely help with)

b) Setup and ongoing management of Piwik. This isn't a skill set I have (nor am interested in). However I'm generally thinking the Homebrew Community is big enough to rustle up at least a few people with the right skill set for getting this done (and keeping it done :>).

Trying to get both hosting + Piwik setup/mgmt from the same set of people may be a tall order, so doing them separately is probably more achievable.

As a data point, this kind of approach - sponsored hosting/hw/similar + volunteers to look after Community infra - is used successfully in other projects. PostgreSQL is a good example.

Can't really think of any obvious reasons it wouldn't work for Homebrew too. ;)

mikemcquaid · on Nov 27, 2016

It's the cost of running the servers, the experience and time to run the servers and the time to adjust our analytics code to use the new system (which I'm willing to do).

Additionally, this (kind) offer is pointing to other organisations that may be able to host the servers and discussing potential solutions to the other options. It's not a slam-dunk by any means.

This is all done in my free time so I don't need an excuse to do or not do any of this.

akerro · on Nov 25, 2016

> Data is being sent to Google everytime you do almost anything in almost all websites using the save technology

No, it's not. µblock, µmatrix + clean links.

philjr · on Nov 25, 2016

That would be an opt-out strategy, would it not?

asymmetric · on Nov 25, 2016

Is there a point in running both ublock and umatrix? Also, what do you mean by "clean links"? Striped of "utm..." parts?

akerro · on Nov 25, 2016

µblock blocks and removes ads and trackers, µmatrix does the same for cookies and JS. Clean links improve page loading times when you click on a link on reddit or Google, it extracts destination link and gets you directly to the destination without redirect, also google, facebook don't know that you clicked the link.

asymmetric · on Nov 29, 2016

> µblock blocks and removes ads and trackers, µmatrix does the same for cookies and JS.

Not according to its creator https://github.com/gorhill/uBlock/wiki/Blocking-mode

sesqu · on Nov 25, 2016

Clean Links is a plugin that strips the utm-like trackers, affiliate codes, outbound redirects, window.open, and some other relays from links.

hueving · on Nov 25, 2016

>Yes we know. brew tells you that. And you can disable it.

No, we don't know. This article wouldn't have hit the front page if it wasn't subtle and user-hostile by default (opt-out).

>Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

Brew is not a website, it's a CLI tool. People expect this behavior of websites, not of sysadmin tools. How would you feel if you found out your compiler shipped off a copy of your code to a remote server each time you encountered a compiler error "to improve the compiler" and you had to opt out of this behavior?

>Some of the times it's to make the product better, or better targeted. Some other times it's just for spying on the users.

It's always spying, regardless of the motivation.

>How do we expect open source to become better if everyone is being a crybaby because brew got a history of how often they do brew update?

Somehow open source has functioned all of these years without silently harvesting users' usage data. I don't know how new you are to open source, but this is definitely not the way you do QA for an open source project.

JakeTheAndroid · on Nov 26, 2016

That's a bit extreme. A compiler sending your codebase has potential IP implications. Sending your brew commands back to brew is much different.

What commands do you run in brew that isn't specific to brew? Whereas, compiling code is a very specific-to-you scenario. If brew fails, it's likely to be caused by brew itself or a package that is part of the repository. If you your compiler runs into an issue, it's not inherent to the compiler necessarily. I'm not a fan of brew at all, and I can't really find the malice here. Either opt out, stop using brew, or accept it. Why are you on OSX if you care that much about your package manager anyways? Ive worked at many companies, and any that let you choose a MacBook will let you opt for a Linux laptop instead (YMMV I guess).

Nursie · on Nov 25, 2016

Opt Out is scammy.

Command line tools do not usually do this at all, particularly FOSS command line tools. This is a very worrying development and reduces my trust in FOSS.

--edit-- to whoever downvoted me, can you explain why?

I love FOSS but I also lovemy privacy. I have come to generally have a default level of trust for popular FOSS projects. Things like this make me question that trust. Why is this wrong?

AsyncAwait · on Nov 25, 2016

> This is a very worrying development

Agreed.

> reduces my trust in FOSS

Please don't distrust a whole category because of a single tool. Most FOSS tools don't do this, especially if they're "Free Software" as opposed to just being "open-source", since "Free Software" indicates a moral, not just practical stance on software.

Nursie · on Nov 25, 2016

Well perhaps my assumptions about well-known and widely used open source stuff were just too naive.

I guess I'm not really thinking "distrust them like they actively include malware" just that it looks like I need to at least keep a look out for stuff like this.

I love FOSS. I believe in it, I use it all the time, I've made a few minor contributions here and there too. It's not like I'm saying "OMG FOSS is teh evil!", just that maybe my trust level was calibrated a little high :)

bnegreve · on Nov 25, 2016

> Some of the times it's to make the product better, or better targeted. Some other times it's just for spying on the users. Let's stop complaining

These kinds of complains are fine, I don't think we should stop them.

It's a bit like corruption in politics, it's not going to be solved anytime soon, but complains do raise awareness and prevent abuses. Who know what's going to happen if we stop complaining?

belorn · on Nov 25, 2016

> Will you choose to make its progress slower because of no real security reason?

No real security reason? We can test that. The developer who added this feature can easy earn trust in such statement by changing the disclaimer and making themselves liable if information extracted through this mechanism causes any identifiable harm.

If this is a sure bet, no risk, perfectly guarantied to be safe, then such change should be trivial.

fs111 · on Nov 25, 2016

> Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

not on browsers I control

Chris2048 · on Nov 25, 2016

> you can disable it

if you know about it

> There are many more that do the same things without telling you

This makes it right?

> Will you choose to make its progress slower because of no real security reason

Some will, you think it's OK not to give them the choice?

> How do we expect open source to become better

Is opt-in that hard?

Waterluvian · on Nov 25, 2016

If it tells us, then this is a wake up call to me because I don't recall ever becoming informed of that. I must have rushed past the notification.

I probably would have still say OK but I like to at least be aware. Shame on me.

adrianlmm · on Nov 25, 2016

And why not make it opt in instead of the contrary?

aclsid · on Nov 25, 2016

It takes almost zero effort to implement Piwik instead of giving all your data to Google. A simple donate button will take care of the costs of a $15 per month shared server for this purpose.

Really that is all it takes. Everything else just goes against the fundamentals of open source software, which is, respect user rights.

draw_down · on Nov 25, 2016

Right, as if the feds are coming to get me because they know I installed ImageMagick. Oh no.

hartator · on Nov 25, 2016

I have a friend that uses IM to generate memes. So, yeah IM can be used by dissidents. :)

codedokode · on Nov 25, 2016

Is tracking user's actions the only way to improve software? That is not what one would expect from free and open source software project.

If developers needs an information about how user uses a program, they should ask them to help.

CJefferson · on Nov 25, 2016

this is going to sound insulting, but have you tried being an open-source developer?

I've worked on products with tens of thousands of users, and the vast majority of those users never communicate.

Worse (in my opinion), the users who communicate most tend to be experts, leading to programs tending to become better for expert users, and worse for beginners / occasional users -- see for example how many linux programs have options you can turn on to make them more user-friendly, but of course as a beginner how do you find them?

userbinator · on Nov 25, 2016

Worse (in my opinion), the users who communicate most tend to be experts, leading to programs tending to become better for expert users, and worse for beginners / occasional users

On the other hand, not listening to users but analytics leads to dumbed-down inflexible programs that can't get out of their own way for expert users.

If you have "tens of thousands of users" and most of them don't communicate, I'd call that successful. They're satisfied.

hueving · on Nov 25, 2016

>I've worked on products with tens of thousands of users, and the vast majority of those users never communicate.

And your answer to that is to spy on them unless they read fine-grained details that even the generally expert users on this site missed?

codedokode · on Nov 25, 2016

If you look at Google Play Store you will see a lot of comments from non-expert users (you can easily see it from their messages). Maybe that is because writing a comment in Play Store is easier than subscribing to a mailing list many projects still use or registering in a forum.

Also maybe they do not say anything because everything works ok for them?

kuschku · on Nov 25, 2016

I’ve also worked on products with tens of thousands of users, and while tracking is definitely convenient (I recently added a feature where, on first start, the user is informed that crash reports would be sent, and required them to choose "dismiss" or "opt out" before letting them use the app), but it’s no reason to hide it like this.

Nor is it a reason to even try tracking with Google Analytics.

In my case, I ensure that the backend server is also FLOSS, that no problematic data is transferred, and so on. All FLOSS, everything verifiable by the users.

But running Google Analytics? Without prompting the user?

Great...

codedokode · on Nov 25, 2016

I don't think the problem is using Google or not. The problem is that software collects and sends data to a network without user's authorization. It is a package manager, not a program for reporting what software you are installing and how exactly you do that and what operating system you are using and user might not expect such behaviour.

Free software is supposed to do what the user wants, not what its developer wants.

AsyncAwait · on Nov 25, 2016

> That is not what one would expect from free and open source software project.

Agreed, but please note that Homebrew is only open-source, it is not "free software"[1].

[1] - https://www.gnu.org/philosophy/open-source-misses-the-point....

majewsky · on Nov 25, 2016

> Data is being sent to Google everytime you do almost anything in almost all websites, using the same technology. So what?

For websites, there's a standard process for blocking this shit. For desktop applications, there isn't.

CJefferson · on Nov 25, 2016

I knew, and I was prompted.

I can understand why this might upset people, but for me I hope it will make brew a better product. I'm sure we all know as developers how annoying it can be to not know the problems users have, and what they are using your software for -- that's why most websites have analytics.

I'm hoping to add something similar to software I work on, with an opt-out of course. I believe it will help me serve my users better.

hueving · on Nov 25, 2016

Please make it opt in. If you truly believe people closely read the prompts for having their data harvested, a default to "No" should not impact you at all.

If you don't believe they read the prompts closely, you're an asshole for stealing data by default.

CJefferson · on Nov 25, 2016

Is it "stealing data" to record that certain options in my program are never used, or used frequently, or whenever a user clicks on options A,B and C in sequence the app crashes?

My problem is that I believe that 90% of the users of my app won't care one way or the other if I record these stats. If I default opt-out these people, then I lose all that useful data and in the process, I believe, make my app worse for everyone.

I won't deny this is not an obvious choice, but I think personally on balance it's better to opt-in. Especially when the data is anonymised and doesn't contain any private/identifying information. But I understand how others would disagree. If there was some kind of OS-wide "don't record what I do" option I could hook into, I would use it.

Nursie · on Nov 25, 2016

Your belief may not match the real world. You can only know that by actually asking them.

Look at the outcry when Microsoft put this stuff in Win 10.

At the very least you should present a question on install. Hiding a notice amongst licenses and other information, as often happens, is underhanded and scammy.

hueving · on Nov 25, 2016

>My problem is that I believe that 90% of the users of my app won't care one way or the other if I record these stats. If I default opt-out these people, then I lose all that useful data and in the process, I believe, make my app worse for everyone.

If those people wouldn't volunteer to opt-in, it's just as likely that the reason they are not opting-out is because they missed the notification that someone is collecting data about them.

It's a UX anti-pattern to default behavior to something the users may not want. If you're worried they'll accept whatever default there is, just explicitly ask them if they want to relay usage stats and you'll be surprised how much of the 90% you claim don't care will start caring.

Look at the comments on this thread, there are multiple accounts of people surprised by this. The very fact that this article is on the front page is proof that Brew tricked users.

>Especially when the data is anonymised and doesn't contain any private/identifying information.

You've either made massive breakthroughs in the field of information security or this is a bogus statement. If the user's computer even connects to an analytics service, they've already got an IP address, frequency of connections from that IP, etc and all of the correlation that comes with enabled by their other data sets. Just because it's anonymized by the time it comes out of Google in Brew usage reports doesn't mean it hasn't given Google additional information to profile people.

stan_rogers · on Nov 25, 2016

You should, perhaps, question your beliefs. When I get "do you want to send...", I only very rarely click "yes/OK", and then only if that and the OS were the only things running and the problem is locally reproducible. The state of my machine is none of your damned business.

CJefferson · on Nov 25, 2016

You could be right. I could pop up a message early on, saying I'm doing analytics.

I think, based on the conversation here, I'm going to make it very clear there is analytics, but not allow turning them off. This is (I think, you are welcome to disagree of course) the best situation:

* Everyone knows there is analytics going on, no hiding. * I get good quality analytics from my actual users, as no-one can opt out (well, you could start port blocking, I'm not going to make the program stop working in that situation).

lorenzfx · on Nov 25, 2016

How about choosing no default, and instead ask the user on first run? `Do you want to send usage data to the developer? (yes/no)`

CJefferson · on Nov 25, 2016

I imagine it would get REALLY annoying if every command line program started doing that, every time you logged into a new machine.

Of course, that suggests we should introduce some kind of standard, then bash (or whatever shell you use) could prompt you once, and every other program could use that setting.

jeena · on Nov 25, 2016

At least this would make the problem with how many commandline tools are tracking you visible, and perhaps would start a discussion on the necessity of tracking for tools like brew, ls, cp, dd and so on.

iagooar · on Nov 25, 2016

This. A thousand times this. To me this whole thing feels like the maintainer is trying to justify something that most people won't ever accept as the status quo. Bad times when software this solid is going in a wrong direction. Time for a Homebrew fork?

CJefferson · on Nov 25, 2016

No need for a fork, if someone was willing to put in the time and money to maintain a private anonymous analytics server, the homebrew folks would be happy to use it.

Unless of course you are fundamentally opposed to tracking of any kind, then a fork is required. It is a fairly massive infrastructure to recreate however...

true_religion · on Nov 25, 2016

Calling it theft feels a bit hyperbolic.

tn6o · on Nov 27, 2016

Agreed.

andybak · on Nov 25, 2016

'Stealing' data? Is that like 'stealing' by downloading a movie?

wheelerwj · on Nov 25, 2016

No, it's like stealing personally identifying information and sending it to the world's largest ad company.

daenney · on Nov 25, 2016

The information brew collects I would hardly call personally identifiable. Take a look at what they collect: https://github.com/Homebrew/brew/blob/master/docs/Analytics....

jakobegger · on Nov 25, 2016

They can call it "anonymous data" as much as they want, but in the end it's still data transmitted to Google's servers. If you are simultaneously logged into your Google account on that computer, the information is not anonymous anymore. Google could with high probability correlate connections.

andybak · on Nov 25, 2016

> They can call it "anonymous data" as much as they want, but in the end it's still data transmitted to Google's servers.

You appear to be saying that it's wrong to send any data to Google - which strikes me as an indefensibly extreme position. For example if they were tracking a simple count "number of times anyone anywhere has run brew" and it was stored as a single global integer then it's hard to imagine what issue you could have with it as it's basically no worse than a simple hit counter on a website.

So what are you saying? Surely the problem has to relate to what data is collected?

jakobegger · on Nov 25, 2016

I'm saying that if you make your software talk to Google, don't say that it collects "anonymous aggregate data". The data it sends could easily be traced back to you (by Google).

And after all the Snowden revelations it's very naive to assume that Google (or any other company) will protect your data.

andybak · on Nov 25, 2016

Doesn't that depend on what's being sent? Either it's personally identifiable (or at least de-anonalysable) or it's not. Within reason some things are not of any real interest to anyone.

What is the threat model here?

daenney · on Nov 25, 2016

Sure, but then the problem isn't the information they collect but the fact that it goes to Google. Short from them setting up their own analytics service though, to whichever one you send this data it'll be traceable up to some extent. Google just might have a larger trove of data about you, but that's also through your own doing.

TeMPOraL · on Nov 25, 2016

"Exfiltrating" would be a better word here.

(Also, downloading a movie is a different thing - there you're not stealing nor exfiltrating anything, you're downloading data as intended by uploader, who may or may not have the copyrights for that data.)

(INB4, ripping a DVD and uploading it is not data theft. Exfiltrating a pre-release copy from movie company's servers would be data theft.)

rocqua · on Nov 25, 2016

The ripping would be the 'exfiltration' as would recording a movie in theatre or hacking servers.

The sharing or getting something from a shared thing would merely be copyright infringement.

hueving · on Nov 25, 2016

By your definition, there would be no such thing as stealing IP, PII, launch codes or identities. However, those things are well accepted terms in all industries, including tech. You need to reconsider your strange restriction to mutual exclusion if you ever want to engage in meaningful conversations.

andybak · on Nov 25, 2016

> stealing IP

This is the usage I'm most firmly resisting.

> PII, launch codes or identities

All inaccurate to varying degrees - or at the very least metaphorical/rhetorical uses. If you're allowed to push a meaning in one direction then I'm allowed to point it out and attempt to bend it back a little.

hueving · on Nov 25, 2016

>push a meaning in one direction then I'm allowed to point it out and attempt to bend it back a little.

I'm not pushing anything. Your argument is about whether or not copying information is 'stealing'. I'm just pointing out that it's a very commonly agreed upon vernacular all over many industries. Call your credit card company and tell them someone stole your credit card number and see if they understand what you are talking about and/or try to correct you by saying it was merely copied.

Go ahead and try to change the definition, but don't waste time on HN with that crap when the subject isn't even about whether or not it's "stealing" because it makes for boring conversation.

andybak · on Nov 25, 2016

You don't see anything even slightly problematic about referring to opt-out (rather than opt-in) aggregate and presumably anonymous usage statistics as 'stealing'?

You don't think that is possibly somewhere towards the further reaches of consensus on what is a reasonable application of the word?

You're welcome to disagree with my position but you're going a touch further than that. You're also accusing me of arguing in bad faith and being somehow aware that my argument has no merits. Considering I think I'm stating a fairly reasonable view I find that somewhat disingenuous in return.

andybak · on Nov 26, 2016

I think you've tired of me but I want to finish with one last thought: "Choose your battles wisely" - this might not be the best place to make our last stand. There's a danger of tiring out potential allies before the point things really matter.

Esau · on Nov 26, 2016

This is the correct answer. Opt-out is a jerk move - used by those who value themselves more than their users/customers/fellow humans.

yoran · on Nov 25, 2016

I agree with this. Also note that GA's terms disallow the sending of personable identifiable information so the data they collect has to be anonymous (see https://support.google.com/analytics/answer/2795983?hl=en).

jakobegger · on Nov 25, 2016

I don't think you know what the word "prompt" means.

This is a prompt:

"Do you want Homebrew to enable anonymous aggregate user behaviour analytics (yes/no):"

This is not a prompt:

"Homebrew has enabled anonymous aggregate user behaviour analytics."

michaelt · on Nov 25, 2016

People here are talking about opt out vs opt in but there's a third option: Forced choice. This is where you present the two options to the user (without either prechecked / default) and the user has to choose one or the other to continue.

This avoids both the problem that the user doesn't consent with opt-out, and the problem that nobody cares to opt-in. The disadvantage is it doesn't work with unattended upgrades.

CJefferson · on Nov 25, 2016

Yes, one reason I like linux compared to windows is not having to click through "yes, yes, continue, yes, continue" in every installer. I don't want that bringing back whenever I install a new machine / program.

iagooar · on Nov 25, 2016

No, it won't make Homebrew a better product. Homebrew was so good without analytics. Now it's worse, because I cannot trust it anymore.

drwl · on Nov 25, 2016

How does your lack of trust for Homebrew make it worse?

hueving · on Nov 25, 2016

You can't use it if you don't trust it.

ovao · on Nov 25, 2016

Again, though, the lack of user's truth or faith in a piece of software does not make the software itself worse.

We can argue the merits of distrust and of the user's choice to distrust, but the software itself is the same.

andrewguenther · on Nov 25, 2016

What exactly has brew done to breach your trust?

Edit: downvoted for asking a question?

iagooar · on Nov 25, 2016

Enabling analytics that send whatever information to Google. Yes, I can opt out, but often times I just don't have the time to verify the privacy policies of each and every piece of software I use.

tomjen3 · on Nov 25, 2016

If this is so good, make it out in. Out out is dishonest.

tehbeard · on Nov 25, 2016

Opt in may be "more honest", but you're going to see a a lot less useful data as a result of it, because the pool of people who will opt in is less than the pool of people who wouldn't care what the default is.

hueving · on Nov 25, 2016

If you're going to be subversive to your users by playing on their propensity to ignore prompts, why bother with allowing them to opt out at all?

mhenr18 · on Nov 25, 2016

Let's assume that 90% of people don't bother to turn off telemetry and 10% do. That also means that in your preferred scenario, 90% of people wouldn't bother to turn it on and the 10% also wouldn't turn it on either. If you can't be bothered to turn something off, why would you be bothered to turn it on?

That means that you'll either get something close to 0% telemetry if it's default off, or 90% if it's default on. So, it makes sense to be default on if you want your telemetry dataset to be big enough to be worth it.

But then we get to your question - why bother with an opt out? Well, if the decision is either to use the product and send telemetry or not use the product, that 10% of people aren't going to use your product. They care about not sending telemetry that much.

At that point, as a dev you just have to ask yourself is it really worth losing 10% of your users to an always-on telemetry policy or is it OK to make the concession of allowing an opt out in order to grow your user base?

Personally, I'd rather make the concession and get that 10% of people on board. If they're vocal enough to complain the shortcomings of my product they're probably vocal enough to also talk about the good things and give me free advertising.

hueving · on Nov 25, 2016

> If you can't be bothered to turn something off, why would you be bothered to turn it on?

If people aren't turning telemetry on, do you think they really want to send you data in the first place?

What you are doing is exploiting users assumptions of how normal CLI tools behave. They don't assume it's going to relay my information back to google when I use it.

Your entire argument essentially boils down to, "I'm sure I can get away with taking my users' information by making it the default behavior with a buried notification and I can placate people who care about privacy with an opt-out."

Your entire justification doesn't even mention privacy or caring about users, it only mentions dealing with pesky users who care to get your user-base higher and promote your product. You clearly have very loose ethics when it comes to privacy so I don't think there is much we will agree on. I just hope one day this behavior will be shunned enough that it will stop due to market forces before something like the EU regulates it away.

mhenr18 · on Nov 25, 2016

If people aren't running make with -j, do you think they really want their build to take advantage of all cores in the first place?

My argument is not specific to telemetry, it's a general one. If you have an option to do something that's not a default and it's not part of the software's core functionality, most people aren't going to consider it even if it would be to their advantage. For example, make -j.

It's for that reason that I don't think the "if they aren't turning it on then they don't want it" argument holds as much water as you think it does. That argument groups together three groups of people: people that know about the setting and don't want it on, people that don't know about the setting and don't want it on, and people that don't know about the setting and would be happy if it was on.

Ironically, if you had good telemetry you'd be able to figure out how many people fall in to each group and make decisions about settings based on accurate data. Without that, you're forced to work on assumptions.

> Your entire argument essentially boils down to, "I'm sure I can get away with taking my users' information by making it the default behavior with a buried notification and I can placate people who care about privacy with an opt-out."

I think you're making the assumption that telemetry has to violate your privacy in all kinds of heinous ways and therefore only be a bad thing. If that's your mindset, of course you're going to think that I'm the kind of person that's out to trick and fool people and betray their privacy. And in all fairness to you, it's reasonable to be jaded when companies like Microsoft have horrible things like P2P software update distribution enabled by default. It's reasonable to be jaded when you don't get told exactly what kind of data is being sent back as telemetry. There's a fine line between "this is good" and "you're just relying on people not knowing how to change the defaults in order to try and get away with horrible things" and all too often that line is crossed.

But I'm an idealist. I see the good things that can come out of having telemetry. I want to know if my software has started getting popular in locales that I haven't written translations for yet so I can commission a translation to improve the experience for people in that locale. I want to know if there's a setting many people turn off so that I can consider turning it off by default to match user expectations. I want to know if my users are sticking to older OS versions because if they are I need to keep older hardware around in order to test and provide them the best possible experience.

I don't think anyone would have an issue with software sending back that data (and only that data) if you clearly tell them that's happening, and I also think that most people would be perfectly fine with that being a default behaviour. Of course, there is always going to be a group of people that will have an issue with sending back that data and that's why I made the point about keeping it as an option.

That group isn't just "pesky users who I only care about for promotion" (perhaps I was too flippant about saying that in my original comment). They could be trying to harden a machine so that it only uses the network under known circumstances. At the same time, you'd hope that software intended for use by people who need comprehensive privacy like whistleblowers wouldn't have telemetry at all.

Given that OS X isn't 100% free software, no one should be using brew in that kind of comprehensive privacy situation and so having telemetry isn't inherently bad.

It's the "clearly tell them that's happening" bit that's the issue here. If you have software that has been around for a long time that doesn't do something and then in a new release it starts doing that thing, you need to let people know that! It doesn't matter whether that's telemetry or anything else - if it's something that violates existing expectations you need to tell your users loud and clear.

TeMPOraL · on Nov 25, 2016

Nobody said honest behaviour is easy.

izacus · on Nov 25, 2016

If you listen to Changelog #223 podcast[0], both Mike and the podcast author showed quite a lot of derision and condescension to the people caring about analytics upload, so I don't think it's getting fixed.

[0]: https://changelog.com/podcast/223

hueving · on Nov 25, 2016

Awesome, so not only does he not care about privacy, he feels the need to deride people that do?

Maybe the reason they needed analytics in the first place is because of this myopic perspective. If you assume everyone who disagrees with you is an idiot, you're going to stall really quickly.

tfeldmann · on Nov 25, 2016

To disable this, execute

  brew analytics off

aq3cn · on Nov 25, 2016

How do you turn it off for brew cask?

Anyway this option should have been available in their man page. But anyway they have clearly mentioned it.

https://github.com/Homebrew/brew/blob/master/docs/Analytics....

If they want data, I will happily fill up their survey form but please don't take it for granted that my personal data is yours otherwise I may go to China for their great wall.

opk · on Nov 25, 2016

Any idea what to block if I want to just completely block google analytics at the firewall level?

jbg_ · on Nov 25, 2016

Little Snitch is excellent for this. I remember running brew at some point and being asked by Little Snitch if it could connect to Google. I said "deny forever" and have never worried about it since.

hasperdi · on Nov 25, 2016

The way I accomplish this is by setting up an dnsmasq DNS forwarder service. Find domains list for ad blocking / privacy on the internet and add these entries in dnsmasq's config.

The entries look like this:

address=/doubleclick.net/0.0.0.0

address=/gravity.com/0.0.0.0

address=/outbrain.com/0.0.0.0

...

Then set up the router's DHCP to use this server.

driverdan · on Nov 25, 2016

https://github.com/StevenBlack/hosts

troels · on Nov 25, 2016

/etc/hosts should do it, I suppose?

dictum · on Nov 25, 2016

From https://github.com/Homebrew/brew/blob/master/docs/Analytics....:

> As far as we can tell it would be impossible for Google to match the randomly generated Homebrew-only analytics user ID to any other Google Analytics user ID. If Google turned evil the only thing they could do would be to lie about anonymising IP addresses and attempt to match users based on IP addresses.

Look, by now Google knows my whole stack and what I'm doing with each project, and Homebrew using Google Analytics doesn't bother me much. I still disabled it a while ago and after reading "if Google turned evil" put like a remote possibility I won't turn it back on. There are two extremes in tech — paranoia about everything and magical optimism about everything. This is an example of the latter.

A corporation of Google's scale and relevance to different industries simply cannot afford to not be evil.

Doesn't mean they're absolutely evil and just waiting for an opportunity to partner with a hypothetical fascist government — just that you can't assume the best.

lispm · on Nov 25, 2016

In German we have a word called 'Datensparsamkeit':

http://martinfowler.com/bliki/Datensparsamkeit.html

It's a principle for good software design. This principle is also written down in German law:

https://de.wikipedia.org/wiki/Datenvermeidung_und_Datenspars...

>Damit gilt in Deutschland der Grundsatz, dass die Erhebung, Verarbeitung und Nutzung personenbezogener Daten und die Auswahl und Gestaltung von Datenverarbeitungssystemen an dem Ziel auszurichten sind, so wenig personenbezogene Daten wie möglich zu erheben, zu verarbeiten oder zu nutzen.

Rough translation: Thus in Germany we have there principle, that the collection, processing and using of person related data and the selection and design of data processing systems have to be guided by the aim to collect, process or use as few as possible person related data as possible.

Here, now someone has a list of software installed on various machines and the versions of that software, and whatever else information. It may be enough data to identify me or other users, the computers and the installed software. It may also allow other uses, which I have no idea of.

It's likely that this information spreads to unintended places.

I don't think this is a good idea.

I never liked the idea of using Homebrew, and this makes it even more suspicious.

majewsky · on Nov 25, 2016

For example, data about installed software versions can be used to automatically find systems that are vulnerable to certain exploits.

TeMPOraL · on Nov 25, 2016

Data about installed software can also be used to profile you for headhunters or marketers in general.

rkachowski · on Nov 25, 2016

This is google analytics. It's a bit different than google directly taking the data and using it for google purposes (e.g. streetview cars gathering SSID information for location pinpointing). The data (afaik) is used only by the homebrew team instead of Google.

A more accurate summary would be "Brew is sending usage data to a google owned analytics service".

opk · on Nov 25, 2016

The data still goes to google. While the expectation would be that the data is only used by homebrew, there's no actual way of knowing that google isn't gratefully helping itself. And given the entire way Google's business works, they probably are.

pbiggar · on Nov 25, 2016

Given the number of people who work for Google, and Google's internal culture of openness, I find it extremely unlikely that Google is violating its customers' privacy. That would almost certainly get whistleblown.

hueving · on Nov 25, 2016

>and Google's internal culture of openness

Does not exist like you think when it comes to projects. People are very secretive about lots of projects for all kinds of internal political reasons. Using data they technically have the right to use in a somewhat scummy way is a perfect example of a project that would be kept out of the spotlight.

wingless · on Nov 25, 2016

Like how the PRISM program got whistleblown by Googlers? Oh wait, that never happened.

rocqua · on Nov 25, 2016

Threat of jail / treason conviction is a lot stronger than the threat of being fired by google.

I can totally see someone caving for the one and not the other.

kyrra · on Nov 25, 2016

Google never cooperated with PRISM. As soon as news came out that the NSA was tapping Google's intra-datacenter dedicated lines, Google announced they would accelerate encrypting all that traffic.

https://www.google.com/amp/arstechnica.com/information-techn...

Oletros · on Nov 25, 2016

Do you have any source that can back your suspicion that Google breaks their own policies for Analytics?

beagle3 · on Nov 25, 2016

if google didn't correlate analytics data from multiple sources, it would be about as informative as a local piwik install, which is to say "not much".

Perhaps they don't "use" it directly, but surely they use it to improve their statistical profile of the specific users? Does anyone know if the GA cookie sent by brew is same or correlated with the one in the browser?

missblit · on Nov 25, 2016

From the source code it looks like brew uses a UUID generated locally (e.g. from a source like /proc/sys/kernel/random/uuid). I don't see how it could be correlated with anything in the browser.

In terms of the analytics URL they set the client-ID, which indicates a "particular instance of an application install": https://developers.google.com/analytics/devguides/collection...

tbarbugli · on Nov 25, 2016

Sounds quite reasonable to me when you read it: https://github.com/Homebrew/brew/blob/master/docs/Analytics....

antouank · on Nov 25, 2016

    Opting out [0]

    Homebrew analytics helps us maintainers and leaving it on is appreciated. However, if you want to opt out of Homebrew's analytics, you can set this variable in your environment:

    export HOMEBREW_NO_ANALYTICS=1

    Alternatively, this will prevent analytics from ever being sent:

    brew analytics off

[0]: https://github.com/Homebrew/brew/blob/master/docs/Analytics....

STRiDEX · on Nov 25, 2016

I've seen this crop up a few times recently and a vocal minority seems to panic every time. I ran into it the other day installing pm2 w/ npm which downloads an optional package that they use for analytics https://github.com/Unitech/pm2/blob/master/package.json#L184

Is the problem that they are using google's service? Is it the tracking that people don't like? Is it the messaging/copy to the users? I'm sure there's plenty of people on hacker news that build analytics software for a living. I've worked on email click and open tracking. Doesn't seem terrible if it helps them build their product.

djsumdog · on Nov 25, 2016

You know...you can already get analytics if your mirrors just agree to share logs. It may not be as detailed or complete, for sure, but using another analytical tools does feel a little superfluous.

> Is the problem that they are using google's service?

This is part of it I think. The brew devs get stats, but so does Google which they can aggregate across everything. Standing up your own analytics server/cluster, especially for a project the size of brew, wouldn't be trivial and I can see why they leverage Google's service.

But I can understand people not wanting to have data sent to Google. I moved everything off Gmail to my own E-mail server back in 2013 and moved search to DuckDuckGo.

true_religion · on Nov 25, 2016

Does google simply have access to you analytics data to do whatever they want? I was under the impression that they could only use it in ways that you the account holder defines.

skylark · on Nov 25, 2016

It depends on what you mean by "access to your analytics data." Anonymized, aggregated data is being used to inform product decisions, but engineers don't have direct access to the production database to query whatever they want. There are almost no cases where you'll be granted approval to look at real user data, and if you are granted access, 100% of your activity is monitored.

As a side note, this has been the case at every big company I've worked at, not just Google.

hueving · on Nov 25, 2016

Privacy erodes one complacent action at a time. Every time people secretly (if it weren't subversive the default would be opt-in) funnel information off to Google or some other massive aggregation point, we should raise hell rather than bending over and accepting this hostile behavior.

jakebasile · on Nov 25, 2016

This really doesn't concern me at all. I don't feel I am harmed by Google possibly knowing what software I install, because if anyone wants to look I keep the script that installs it in my public dotfile repo on GitHub.

Can anyone explain to me how this could be used against someone? I'm asking completely seriously as I don't see any harm in this no matter how hard I try.

Edit: I do agree that the notice should be more visible for those that are concerned about this. It should require user input even if it is just waiting for a key press to give you a chance to read it along with info on how to opt out.

yoo1I · on Nov 25, 2016

> Can anyone explain to me how this could be used against someone? I'm asking completely seriously as I don't see any harm in this no matter how hard I try.

This is called the "nothing to hide" argument. If you currently have no foes and are generally closer to the elite of a society than to the margins, then these few little data points collected on you (pseudo-anonymously or not) cannot really hurt your.

But as your digital shadow grows (and with google analytics et al being used to extensively, it certainly is) and you drift to the margins of society, possibly developing some foes in the elites in the process, the information about you that is now available to people in power becomes more threatening for you.

Say, you're now a black, female, delivery driver in Detroit instead of a Silicon Valley software engineer. Once you give cause for scrutiny, say, a conflict with your employer, the digital shadow of data-points can be searched to find anything that, taken out of context or not, can be used against you.

Or, put differently in famous exaggeration by Cardinal Richelieu:

  If you give me six lines written by the hand of the most honest of men, I will find something in them which will hang him.

blub · on Nov 25, 2016

It harms everyone because it's one more actor trying to make collection of data look normal, as can be seem from the multiple misguided posts wondering why this is a problem.

It's not normal that software is sending data to some server somwhere by itself by default. Devs used to have some decency and left this opt-in. They see there's little push-back and they're making it opt-out.

Almost all mobile apps and a significant number of desktop apps have analytics now.

jakobegger · on Nov 25, 2016

Oh... so you're installing Tor? And a Bitcoin client? You're starting to look a lot like a drug dealer.

Let's send a SWAT team and confiscate all your stuff just to make sure.

Nursie · on Nov 25, 2016

Oh hey, you're using foomaker version 1.6? I picked up a great exploit for that earlier!

jakebasile · on Nov 25, 2016

I can no longer edit my comment, but thanks for the examples given. I'm not sure if it's changed my views but I think I understand where the anger is coming from and I have something to think about when things like this come up.

pimpek · on Nov 25, 2016

I agree that it should be opt-in instead of opt-out but worrying what they do else behind the scenes is paranoid.

You can check it out: https://github.com/Homebrew/brew

smcleod · on Nov 25, 2016

Quite honestly, I don't remember brew asking me this on any of my 3 macOS machines, I'm not saying it didn't, but the fact that I'm a 'techie' person and didn't notice or forgot about it worries me greatly.

ProbabilityMoon · on Nov 25, 2016

It probably didn't. I run brew regularly on my machine, and I did not get any prompt asking me to enable analytics. From reading other comments, I see lots of users are in the same position, and homebrew sneaked this in with just a two line notice buried in the verbose output. This worries me deeply, to the point where I'm considering removing this software from my machine.

smcleod · on Nov 25, 2016

While I obviously feel the same way, I'd take a step back and see what Brew's response is to this thread / the backlash of what people want / don't want, because it has, overall been a good thing of macOS.

peteretep · on Nov 25, 2016

I was aware, because I run Little Snitch. If you care, you should be too.

aorth · on Nov 25, 2016

You're right, we probably should be using Little Snitch — just know that it's not a golden ticket! There has been some research about bypassing Little Snitch, for example this one from 2016:

https://speakerdeck.com/patrickwardle/defcon-2016-i-got-99-p...