GitHub was down

kaikai · on July 31, 2017

I would love to see a chart of traffic to other sites when GitHub goes down. My bet is that HackerNews and Twitter both get significant spikes from all those bored developers.

tinalumfoil · on July 31, 2017

> bored developers

Bored? Git's a distributed version control system, so no excuses. Get back to work!

But in all seriousness I kind of wish GitHub provided a way to mirror things like issues and PRs so you never have to be fully reliant on one service. Not being able to read these really does make it impossible to get work done offline.

bradfitz · on July 31, 2017

Go has a mirror of our GitHub project via this thing "Maintner" I wrote (running at http://maintner.golang.org/) that syncs GitHub in realtime to a log of mutations. (As well as syncing Gerrit and all its comments etc).

So then we can slurp all of our GitHub & Gerrit history into RAM (takes about 5 seconds and 500 MB) via https://godoc.org/golang.org/x/build/maintner/godata#Get and walk it in-memory and do stuff with in. (runs our realtime bots, alternate web UIs on planes, alternate search, stats, etc.)

LeifCarrotson · on July 31, 2017

> Maintner is short for "Maintainer"...the name of the daemon that serves the maintner data to other tools is "maintnerd".

Nice work! But as is custom on HN, I'll bikeshed on the name instead of delving into the contents of the tool. Why shorten the word by just 2 letters? Is there something special about the tooling that makes 8-letter projects more desirable than 10-letter projects, or is it linked to the removal of Artificial Intelligence from the process? I'd personally mis-type that name all the time.

macintux · on July 31, 2017

I'd prefer it just for searchability. Good luck finding anything online called "maintainer".

slackingoff2017 · on July 31, 2017

I think he was excgarating the issue a bit. Your logic is sound

j_s · on July 31, 2017

> excgarating

I commit to re-use this as often as possible!

http://www.urbandictionary.com/define.php?term=Excgarated

An inspiring new word invented by a redditor on March 23, 2014. The redditor was actually trying to spell exaggerated. This word was misspelled so horribly that when it was googled the only thing that came up was a link to the comment itself.

slackingoff2017 · on Aug 1, 2017

Glad someone caught it :)

lopopolo · on Aug 1, 2017

ahh, a googlewhack

type0 · on July 31, 2017

Should have called it Matenerd, shorter and much better name

shoyer · on July 31, 2017

I think it's a Go joke. Just be glad it's not called "m"!

yebyen · on July 31, 2017

Oops, instructions not clear, accidentally created another suite of build tooling called "gb"

kornish · on July 31, 2017

> alternate web UIs on planes

Could you expand on this? Sounds interesting. You mean like an offline UI powered by the in-memory mutation list?

dawnerd · on July 31, 2017

I think they just meant lighter. They load quite a bit and on really high latency and slow connections it's painful to use.

saagarjha · on July 31, 2017

> have to be fully reliant on one service

My inner cynic says that that's probably the reason why they don't provide this service.

hinkley · on July 31, 2017

I keep meaning to dig into Fossil (SQLite's VCS/Project Management system), but I have no faith I could convince a team to use it.

Another model to look at is Trac, which had pretty extensive integration with SVN and integrated (ie, cross-linked) issue tracking and wiki, and stored all the data and change history in a svn repository.

SQLite · on July 31, 2017

> I keep meaning to dig into Fossil, but I have no faith I could convince a team to use it.

Developer or Fossil and SQLite here: I agree. In my experience, you'd have better luck convincing the team to switch from vi to emacs. For all its many and well-documented faults, the Git/GitHub paradigm is what people want to use because it is what they are familiar with.

All the same, I intend to keep right on using Fossil, thank you very much!

So here is the idea I've been thinking of lately: What if Fossil were ported or enhanced to use Git's low-level file-formats so that unmodified git clients could seamlessly push and pull against the (enhanced) Fossil server. Call the new system "Fit" (Fossil+Git). Using Fit, you could stand up a GitHub replacement for an individual project in 5 minutes using nothing more than a 2-line CGI script. Git fan-boys could continue to use their preferred interface, while others who prefer a more rational and user-friendly design could use the Fossil-like "fit" command. Everybody could share code, and everybody would have a nice web-based interface with which to collaborate with tickets and wiki and all the other cool (and to my mind essential) stuff that Fossil provides. And nobody who already knows git would be forced to learn a new command-line interface.

I'd be all over writing the code for "Fit", except that I'm already over-extended. Anybody who thinks this is a good idea and would like to pitch in and collaborate, please contact me privately. Thanks.

hinkley · on Aug 1, 2017

The pushback will come in two forms.

First, a backend that works exactly like git is very different from one that works almost like git. You'll be blamed for other people's problems.

Second, you'll have to have a way to deal with commit history the way git does (arbitrarily, and able to be rewritten at any time)

Otherwise the first push -f or rebase breaks the whole thing.

Reading from and writing to a git backend (that is, making fit a client too) might be the safer option. Sort of like git-p4 in reverse.

winter_blue · on July 31, 2017

That sounds like a great idea. By leveraging Fossil's existing UI and getting it to work with Git, it might really gain a lot of traction, and become a viable alternative to roll-your-own-github services like GitLab Community Edition.

EdiX · on July 31, 2017

To be attractive to github users it would have to have a better issue tracker and a better patch review system. Given that sqlite uses a mailing list for both I'm guessing it doesn't have that.

voltagex_ · on July 31, 2017

This'd be a great idea, but I don't have the skillset.

Moto7451 · on July 31, 2017

My guess is their sales pitch would be for you to run your own instance of Github Enterprise or doing something fancy with Web Hooks. At work we did the former for a while before switching to Gitlab. We had less downtime than Github (not that they have much) but since the ops is on you in that situation, YMMV.

weberc2 · on July 31, 2017

I'm of the impression that issues and PRs are all accessible via an API. What more should they do?

curun1r · on July 31, 2017

> What more should they do?

How about storing issues and PRs in the actual git repository? They should index them for the UI, sure, but the source of truth should be a branch of the git repo just like it is with gh-pages. It should be possible to file a new issue by committing a markdown file to the correct branch and pushing it to Github. Their hub command line tool and their own client could facilitate adding all the correct metadata. This would allow people to work while Github.com is offline and synchronize everything once the service outage is over.

If you have access to a distributed database that is already the best available for manual conflict resolution, why wouldn't you want to use it to store this kind of data? A Github outage is like a partition when you think about it from a distributed db perspective, so treat it like an individual node died and still allow reads/writes to the rest of the cluster that can be merged back once the node comes back online.

hk__2 · on July 31, 2017

> How about storing issues and PRs in the actual git repository?

Access to the git repository is regulated; issues and PRs aren’t. Anybody can fill an issue/PR on your repo, but only you (and your team) can modify the repo. You’d need to also store all the comments on all issues and PRs, even closed/rejected ones. In some repositories, that’d be huge.

theshrike79 · on Aug 1, 2017

Of course the issue data in the repo should be considered read only.

And the amount of actual data needed to store issues is peanuts anyway.

hk__2 · on Aug 1, 2017

> And the amount of actual data needed to store issues is peanuts anyway.

It’s not. Take something like github.com/Homebrew/homebrew-core. There are 15k closed PRs there. There have been 20 new ones today. It’s not rare to have 10-20 comments per PRs; some of them even go over 200. Add CI status, actual PR contents (git patches), comments reactions, edits, labels, milestones, assignees, reviews, projects.

IMHO having a tool to fetch issues locally is a good idea; storing them in the repo is not.

tedmiston · on July 31, 2017

From the perspective of someone mid code review during the outage, it's worked as well as it could too. They preserve comment drafts client side that haven't posted to the server yet across page reloads. While it's a little frustrating to have to submit something 3-5x, they're definitely doing something for this use case already.

Also as others pointed out, just because the GitHub app is down doesn't always mean the GitHub git server itself is down.

TheAceOfHearts · on Aug 1, 2017

You might wanna check out Fossil, it's version control system with integrated bug tracking and wiki. It doesn't really work for all projects, though.

[0] http://fossil-scm.org/index.html/doc/trunk/www/index.wiki

sytse · on July 31, 2017

GitLab.com has repository mirroring https://docs.gitlab.com/ee/workflow/repository_mirroring.htm... (this will soon be a paid account feature)

It doesn't mirror issues and PRs but it can do a one time import of them https://docs.gitlab.com/ee/workflow/importing/import_project...

monksy · on July 31, 2017

Shame that data isn't store in a side repository where it could be replicated like the repository.

javajosh · on July 31, 2017

It would be clever to store issues and PRs on a special branch in the repository. I can't currently think of why this wouldn't work (apart from business-wise, it would reduce lock-in considerably).

justinclift · on Aug 1, 2017

Well, it's unlikely to be feasible due to Git's on disk structure.

With git, every commit has an sha checksum of the commit contents & metadata. And each commit's metadata points to the previous commit.

So, if a maintainer wanted to fix a typo or do any other kind of correction/update to an old issue you'd need to rewrite every commit on disk since that one with the updated chain of checksums. eg from the fixed comment, to the head of the special issue/pr branch.

That alone would probably make syncing with the repo's a real pain for anyone working on medium to larger sized projects. ;)

javajosh · on Aug 2, 2017

Your objection is predicated on the assumption that issues would need to be edited "for all time". That's not true for anything else in git, I'm not sure why that would be true for issues. I mean, if you had an "issues/" directory in your repo, and files named "issue1" "issue2" and so on, you would just edit the issue whenever. If someone made a branch for the issue, then they'd want to merge your edit into their branch ASAP - which is good, because it indicates that they read and understood the change. And this merge would be particularly painless since changes to "issues/" are absolutely not going to conflict with anything in "src/".

justinclift · on Aug 2, 2017

Ahhh, I see what you mean. Yeah, that would probably work. :)

ericfrederich · on July 31, 2017

I agree... I wish GitLab would do this too. No reason that everything can't be modeled as a git branch.

GitLab I know models their Wiki as a git repo. I think issues and PRs should be their own repo as well.... a branch for each issue or pr? Could tie it all up using submodules (or not, whatever)... just please someone take the jump first and do this.

brodock · on July 31, 2017

DISCLAIMER: I work for GitLab. We have a feature to replicate GitLab to different locations (EEP). We call it "Geo", which stands for "Geographical Replication". Part of the "Geo" effort is Disaster Recovery which is under heavily development. We want with Disaster Recovery to be able to reliably promote any Secondary node to a Primary, so if your US datacenter melts down under a nuclear war, you can start working with your copy in EU, India, China, etc.

groby_b · on July 31, 2017

If your US datacenter melts down under a nuclear war, I can guarantee that none of your devs will give a damn about contributing to the code base.

You might want to modify your scenarios.

granda · on July 31, 2017

As someone interested in these scenarios, where could I find out more information about what would happen post-nuclear explosion?

groby_b · on July 31, 2017

A lot of this is out in the open

There are active simulations going on: http://www.sciencerecorder.com/article.php?n=scientists-cond...

There are preparedness drills: http://www.independent.co.uk/news/world/americas/us-politics...

There's plenty of policy analysis: https://www.cato.org/publications/policy-analysis/social-eco...

Older .mil reports: http://www.dtic.mil/cgi-bin/GetTRDoc?AD=ADA080907

There are entire books taking apart what happens in specific fields: https://www.ncbi.nlm.nih.gov/books/NBK219154/

Or you could just read disaster porn^W^W post-apocalyptic sci fi.

So, a lot depends on what kinds of info you want to find. But overall, I'd say it's not worth reading, because life's not going to be much fun. Even if you're prepared.

jessaustin · on July 31, 2017

In that event, how much would your "guarantee" be worth? b^)

brodock · on July 31, 2017

it depends on your SLA ;)

optimuspaul · on July 31, 2017

(only partially serious here) I'm not sure recovery from a nuclear event that wipes out US data centers is economically feasible. If an event were to effectively wipe out 20% of the worlds economy I think having my github issues online is the least of my worries.

ericfrederich · on Aug 3, 2017

If it's not Git based then I'm not interested in it.

I see no reason to have a RDBMS for issues, merge requests, pull requests, etc. All of these could be modeled in multiple ways using a Git database. Issues are files in a repo called issue1.md, issue2.md ... or issues are branches on a repo ... or something else.

adventist · on July 31, 2017

Yeah I know I'll be on vacation for a long while!

Jason_malloC · on July 31, 2017

Hope it will be on the moon or in a nuclear shelter. Otherwise all that fallout may ruin your trip.

CUViper · on July 31, 2017

Reminds me of https://en.wikipedia.org/wiki/On_the_Beach_(novel)

syshum · on July 31, 2017

But they want you to Rely on Github... if they could find away to make Git only work with Github they would in a heart bet.

Github is not a open source company, it is not really even a supporter of free software IMO they are a danger to free software for this very reason. They are following the old school Microsoft model of Embrace and Extend... I am waiting to see if they can extinguish,

matthewmacleod · on July 31, 2017

Contentless, illogical mince.

GitHub clearly contributes a fair bit to open source - not only their own projects, but the free hosting for open source projects. You have no basis for this accusation, and frankly we are all stupider for having read it.

syshum · on July 31, 2017

Well I made 2 claims..

1 Claim was that Github is not a open source company, and they are not. Their core business is not open source, yes they have some side projects that are, but so does MS, I do not believe anyone is going to say MS is a "open source company"

the second claim I made is they are a threat to free software, I used free software here for a reason, free software and open source are different.

GitHub may support some Open Source things, they do NOT support free software.

Tom Preston-Werners infamous post on open sourcing only some things (http://tom.preston-werner.com/2011/11/22/open-source-everyth...) Is a key piece of evidence to prove this.

Git hubs general actions over the years show they do not support free software at all, and have limited support for Open Source software.

They are classic Free Software leaches, using and abusing free software not for the ethical reasons around free software but to advance their profit center

See I am support free software for ethical reasons, not monetary reasons.

That is with out even getting into the MASSIVE issue that is having all of these open source projects on a SINGLE platform. Unless you think github is "too big to fail" which I can assure you it is not

Klover · on July 31, 2017

Terribly sorry for this rubbish reply, but here you go: http://i.imgur.com/ueRIgtq.png

I love it, and it's perfect for when you want to say something witty like FUD or D&D, but can't justify it. Now with CIM, I can!

heartbreak · on July 31, 2017

What? Github employs members of the Rails core team including Aaron Patterson. They literally pay for open source software development.

I'm sure they employ direct maintainers/contributors to other open source projects as well.

hueving · on July 31, 2017

Are they paying him to help make an open source alternative to github? If not, then it's not really relevant.

They are actively working against open source tooling for software development by developing features only for their closed platform.

Gitlab is a much better example of a company that fully embraces open source.

daxelrod · on Aug 1, 2017

GitLab does not fully embrace Open Source.

GitLab Enterprise Edition does not have an open source license. Since GitLab.com runs GitLab EE, their two (presumably) primary sources of revenue (EE licenses and GitLab.com paid accounts) come from non-open source software.

But: it's still fair to call GitLab as a company way more Open Source than GitHub. All of their development happens out in the open, the vast majority of their codebase is Open Source, and the source code for GitLab EE is even made available.

This is a spectrum, not black and white.

syshum · on July 31, 2017

And you believe that makes them an "Open Source Company"

To me in order to be a Open Source company your primary product must itself be Open Source.

RedHat as an example... RHEL is open source

GitLab.. GitLab is open source

These are open source companies

Hiring a few devs for work on some side projects that are open source does not make one an Open Source Company

If they open GitHub Core then they can claim to be a open source company

heartbreak · on Aug 1, 2017

Aaron Patterson, who I'll continue to use as an example, works full time on Rails while being paid by Github. Rails is not a side project for him.

Also as mentioned elsewhere in this thread, Github has published plenty of open source software. Electron, Atom, resque, and updates to git itself.

syshum · on Aug 1, 2017

It is a side project for the company, as are all the other projects you mentioned

As I clearly stated, GitHub can not be a Open Source company simply because they hired some devs to work on Open Source

Do you consider Microsoft to be a Open Source Company?

eberkund · on July 31, 2017

The Atom editor is widely used and was a major inspiration for VS Code. Electron, which is a GitHub project is also used for countless other projects.

yellowapple · on July 31, 2017

"major inspiration for VS Code"

Which is a Microsoft product, and Microsoft is the inventor of EEE. Ergo, GitHub is complicit in an EEE attempt. Q.E.D. Case closed.

/s

tracker1 · on July 31, 2017

So anyone who releases open-source code, that Microsoft might use in another open-source project is complicit in EEE? wow.

yellowapple · on July 31, 2017

My apologies; I should have made the "/s" - which denotes sarcasm - much more apparent.

eropple · on July 31, 2017

Yup. GitHub also released/contributed to a lot of other open-source Ruby stuff, like Resque.

_pctq · on July 31, 2017

> if they could find away to make Git only work with Github they would in a heart bet.

Sorry, but this is unfair :) I remember the early days of Gitlab, before they added all the insanely cool features they have now, when they were just a Github opensource copycat. My thought at the time was : "wow, github is super cool to let them go. At least the almost exact design copy could be a legal problem". Clearly, we would have heard about Github vs Gitlab back then, if Github wanted to lock people in.

johannes1234321 · on July 31, 2017

Well, if we "heard about GitHub vs. Gitlab" more people outside HN might hear about GitLab. Might be more dangerous, than the chances of a win shutting GitLab down.

chucksmash · on July 31, 2017

So you fault them for being evil geniuses by not acting evilly?

syshum · on July 31, 2017

More likely they got sound legal advice and realized they could not actually win in court, or their investors killed that idea as a waste of money

You do not hear about many VC funded startups initiating lawsuits for a reason. That is the realm of established companies.

pavement · on July 31, 2017

Just because it's A common motivation and strategy, doesn't mean it's THEIR motivation and strategy.

A more likely strategy/motivation is that the product is you. AKA: farming the users.

Create a desirable pasture, and the animals farm themselves.

syshum · on July 31, 2017

It is partly the MS EEE model and partly the Adobe model, where they give away free or low cost services to get indivuals hooked, often at young age, then when they are employed at larger firms they push the firms to adopt that software internally.

Get a bunch of Open Source Developers to do your marketing at their "real jobs" so they can sell the Enterprise Version.

Since GitHub is a private company it is Unknown (at least I can not find the info) if they are profitable or not, or what their revenue numbers even are so it is unclear if that is a successful plan or not.

GitHub could very well run into the same problems as SoundCloud.

lettergram · on July 31, 2017

"git on the block chain"

sytse · on July 31, 2017

At GitLab.com we're not seeing an increase in usage: https://www.dropbox.com/s/eiz2h7tdnownvfl/Screenshot%202017-...

From http://monitor.gitlab.net/dashboard/db/haproxy-stats?refresh...

But of course it is hard to mirror your repo to GitLab.com when GitHub.com is down, so maybe other sites are a better measure.

cmatija · on July 31, 2017

That's our chart of backend sessions / sec for GitLab.com [1]. There's also a chart for frontend sessions [2], please keep in mind that there's a lot of CI / bot sessions on these dashboards. In addition to that there's much other interesting info on our HAProxy Grafana dashboard [3]. In case anyone's interested in even more metrics, there's a bunch of interesting dashboards on our Grafana instance [4].

[1] - http://monitor.gitlab.net/dashboard/db/haproxy-stats?refresh...

[2] - http://monitor.gitlab.net/dashboard/db/haproxy-stats?refresh...

[3] - http://monitor.gitlab.net/dashboard/db/haproxy-stats

[4] - http://monitor.gitlab.net/?orgId=1

emilsedgh · on July 31, 2017

Its not like your editor is down.

mdellabitta · on July 31, 2017

My team was working on merging pull requests when this happened.

We might actually be at fault! That would be kinda cool.

Aissen · on July 31, 2017

If only you were using a decentralized VCS, you wouldn't have this single point of failure…

scribu · on July 31, 2017

The problem isn't the VCS part so much as all the code review/project management features of Github.

I seem to remember there was a decentralized VCS which included issue tracking besides commits, but it never was mainstream.

Osiris · on July 31, 2017

https://www.fossil-scm.org/index.html/doc/trunk/www/index.wi...

scribu · on July 31, 2017

Yep, that was the one.

WorldMaker · on July 31, 2017

There's also plenty of DIY distributed issue tracker tools to keep issue tracking inside a git repo (often as yaml+markdown). I'm still surprised a good one hasn't stepped up with direct GitHub issue/project tool integration+sync, but that should certainly be a possibility.

pyre · on July 31, 2017

Fossil SCM

fs111 · on July 31, 2017

the linux kernel development works just fine w/o github.

paulddraper · on July 31, 2017

They use self-hosted git repo and bugzilla.

It's not exactly less of a point of failure.

mdellabitta · on July 31, 2017

I think the spf there is Linus' email account, right?

hdhzy · on July 31, 2017

The VCS is decentralized but they made the process centralized.

wooter · on July 31, 2017

but github is honestly so nice. I'll admit in a very subjective way. whats your favorite decentralized vcs?

saagarjha · on July 31, 2017

The joke is that git is decentralized–it's just GitHub that's a single point of failure.

coldcode · on July 31, 2017

We have Gitlab Enterprise and it has trouble scaling.

dblessing · on July 31, 2017

Sorry to hear you're having trouble scaling GitLab. We have many organizations running GitLab and successfully scaling to 10's of thousands of users, and GitLab.com which is the largest GitLab installation. As a GitLab Enterprise customer our support team is happy to help you review your scaling problems and resolve them. Please submit a support request at https://support.gitlab.com.

coldcode · on July 31, 2017

O crap I meant to say GitHUB Enterprise. That's what has trouble. We also have Gitlab Enterprise and that's fine. Sadly for reasons internal we are mostly stuck with GitHub E.

sytse · on July 31, 2017

I'm glad to hear that GitLab is working fine for you. And I'm sorry that you feel stuck with GitHub Enterprise. Please know that GitLab has a high fidelity importer https://docs.gitlab.com/ee/workflow/importing/import_project... and our support team has a lot of experience with assisting people with the migration if the migration is the problem.

sytse · on July 31, 2017

I'm sorry to hear that. GitLab Enterprise Edition should scale fine to 100,000's of users. Are you in touch with our support about this?

wooter · on July 31, 2017

thats why i asked what his favorite was. just curious to see what the feature differences would be

matart · on July 31, 2017

But reviewing, issues, PRs, etc. are. A lot of the planning parts are down.

alpb · on July 31, 2017

> implying developers spend most of their time in editors.

emilsedgh · on July 31, 2017

Well the parent was implying they spend most of their time in Github.

alpb · on Aug 11, 2017

I don't see that implication. I personally spend most of my time on GitHub.

saagarjha · on July 31, 2017

I wouldn't be surprised if this became an issue in the future, with Atom's popularity on the rise…

ci5er · on Aug 1, 2017

Uuuh ... Shouldn't they be, you know, developing? Or has GitHub turned into Reddit when I wasn't looking?

fs111 · on July 31, 2017

what are you doing on GH all day? you are supposed to _write_ code.

alexchamberlain · on July 31, 2017

When I break our GitHub webhooks, I joke it's time for people to practice our Disaster Recovery (DR) procedures. In all seriousness, this is a good opportunity to practice work without GitHub. Any service can go down; can you deploy a critical bug fix without it? If not, why not and what can you do to fix it?

kmfrk · on July 31, 2017

I had to change a username from capitalized to uncapitalized and use my updated remote afterwards, apologies if I broke it for everyone.

mdaniel · on July 31, 2017

To the best of my knowledge, GitHub org and usernames in the URI are case-insensitive for both the website and clone URIs. I haven't tested ssh clone URIs to know if they are also insensitive, but I'd guess they are

You would only need to change it if the "presentation" format bothers you (again, as far as I know)

_euac · on July 31, 2017

Did you know that if you type "Google" in the Google search box you can break the internet? :|

(BTW, this was meant as a joke... although I'm not discarding that kmfrk broke Github just yet!)

cjbprime · on July 31, 2017

It's kmfrk's fault! Get them!

drfuchs · on July 31, 2017

"Git 'em!" FTFY.

_Marak_ · on July 31, 2017

If anyone is interested, I've been working with a git host that is actually distributed across a p2p network using SSB.

see:

https://github.com/clehner/git-ssb

https://github.com/noffle/git-ssb-intro

It's been working fairly well so far. We are using git-ssb to manage a few projects instead of putting them into Github.

swsieber · on July 31, 2017

Hey, that's really cool. Have you considered submitting it for a Show HN?

vertex-four · on July 31, 2017

Is there, at least theoretically, a way to prevent other people from pushing to my repo? That seems like it would suck re griefing for any project that might become even mildly politically sensitive for whatever reason.

_Marak_ · on July 31, 2017

Everything is key based, so only key holders push to your repo. It's all based on the SSB protocol.

I'd suggest reading https://github.com/noffle/git-ssb-intro to get an idea.

vertex-four · on July 31, 2017

That states: "git-ssb's permissionless model has an interesting consequence: anybody can push to anybody else's git repository." The guide doesn't show any key-sharing in order to do that. Are you saying that's incorrect?

_Marak_ · on July 31, 2017

I don't want to give you the wrong answer so I've forward this question to the SSBC network for one of the core developers to answer better.

I'd be surprised if there was no security for pushes. The repos I've worked on did require an invite from the creator.

cel · on Aug 1, 2017

Marak It sounds like you are working with private repos. With git-ssb currently a repo is either public or private. Private repos are encrypted to a fixed set of recipients so only those keyholders can access it. Public repos are unencrypted.

lost_password · on July 31, 2017

Yes, so basically in a centralized permission model some authority (the database) decides if any write is authorized or not, but in decentralized, any peer just writes anything, and then the readers decide whether they interpret that as valid or not.

Here is a description of a model that embraces both any-one-can-edit with degrees of consensus on who is allowed to edit. http://viewer.scuttlebot.io/%25GKmZNjjB3voORbvg8Jm4Jy2r0tvJj... but if you decide that someone cannot edit it, from there perspective they still can, but they are just excluded from your perspective.

vertex-four · on July 31, 2017

Your comment is entirely nonsensical in this context. I want a way to be able to publish a repo and have the people subscribing to the repo be able to only pay attention to my changes in an automated way. This software currently doesn't implement that, as far as the guide that was linked suggests. It is therefore utterly useless - every time someone decides to grief my repo, it requires manual intervention to resolve.

Once you have that very, very basic ability to replicate what people expect when they subscribe to a person's git repository, you can start playing with automatically merging together people's changes - but in practice, merge conflicts are a thing and there's no good way to resolve them. If you can come up with a way to automatically resolve merge conflicts, you'd be rich, frankly speaking.

lost_password · on Aug 1, 2017

you said > Is there, at least theoretically, a way to prevent other people from pushing to my repo?

so I answered, _yes, theoretically_ we have ideas for how to implement that. you can also unfollow and block griefers, but so far pretty much everyone has been nice and we just havn't needed to implement that yet.

vertex-four · on Aug 1, 2017

How do you intend to automatically resolve merge conflicts, which is what your document suggests you want to do?

Why is this not resolved by a good permissions model and the ability to fork? Why should my users have to care about blocking griefers when they just want to pull from repo?

slap_shot · on July 31, 2017

Status now shows Major Service Outage:

12:32 EDTMajor service outage.

https://status.github.com/

indubitably · on July 31, 2017

unicorns all around

luhn · on July 31, 2017

Pages Builds Failure Rate spiked to over 2000%. I don't know how that's possible, but it seems pretty bad.

adrianpike · on July 31, 2017

Maybe 20+ failing retries for any single page build?

westbywest · on July 31, 2017

Guessing absolute failure rate previously not encountered by current graphing tool, incorrect scaling factor. Or, it was decided at some point that always reporting 0.000XXX% failure rate, even if correct, didn't offer an intuitive metric, so zeroes were intentionally truncated.

tedmiston · on July 31, 2017

It'd be nice to have an intuitive explanation on the status page for what PBFR means if it can go over 100%.

frandroid · on July 31, 2017

Bad Metric is Bad.

tambourine_man · on July 31, 2017

Insert remark on why we use a centralized service for a distributed source control system, etc. No one seems to care, unfortunately

jasode · on July 31, 2017

>Insert remark on why we use a centralized service for a distributed source control system,

Because Linus didn't put features into Git that Github solves.

You must break apart the different features of Github:

#1) communication (issues tracking, bug reports, pull requests, README.MD landing page, etc)

#2) hosting disk storage & bandwidth

#3) distributed source code merges based on content hashes (SHA1) instead of using centralized locks/unlocks (check in / check out) model of CVS/SVN.

Git itself only takes care of #3.

Github handles #1 and #2 (and also gets #3 by being built on top of Git).

You can't go back in time and wonder if Linus should have addressed #1 and #2 because he wasn't interested in starting a hosting company. Instead, he focused on the data format (Merkle trees, BLOBs, SHA1) and a sync protocol (git pull, etc) for Git.

If people wonder why we can't just use email for #1 (communications), you have to see that Github has become a "Schelling Point"[1]. Attempting to use email groups & mailing lists will not prevent the emergence of a Schelling point. Email can be a workflow for existing contributors (e.g. contributors of the Linux kernel source) but it's not convenient for discovery of new repositories (e.g. the web's "landing page" of a repo).

As for #2 (hosting), not everybody who wants to share a repository wants to pay $9.99/month VPS or other hosting plan from a web hosting provider. It would also be inconvenient to host it from the home laptop and punch a hole through the ISP router to make it work. Github solves hosting+bandwidth for free for modest non-commercial projects.

To restate, Linus' Git is a distributed _protocol_ but Github is a _service_ acting as a platform for the distributed protocol.

[1] https://en.wikipedia.org/wiki/Focal_point_(game_theory)

DaiPlusPlus · on July 31, 2017

#1 can be done by creating a separate submodule repo that only stores docs+issues files. It's up to the repo's users to agree on a system by which the files should be organized, but it's doable.

I'd propose directories "issues/open", "issues/closed", with each issue filename being "{created:yyyy-MM-dd} - {subject}.md". Symlinks could be used to track ownership/responsibility if each repo contributor has their own directory in the repo too.

apetresc · on July 31, 2017

That's an awful lot of extra work to insure against maybe 1 hour per year of downtime.

sleepybrett · on July 31, 2017

Just because present state is 99.7579% uptime (for the month) doesn't mean it will always be so.

You back up your data, why shouldn't you backup your github data?

manigandham · on July 31, 2017

Backup is one thing, choosing to run a crappy manual system just in case a vendor goes down is entirely different.

DaiPlusPlus · on July 31, 2017

It doesn't have to be a "crappy manual system" - I'm simply suggesting that given that git itself is a damned good distributed versioning database for arbitrary content, then we might as well also use it for distributed issue-tracking. A simple offline-mode browser-based editor that lives in a single HTML file within the repo would provide a nice GUI on top.

Hmm, I think I might be on to something... anyone want to start a project?

jasode · on July 31, 2017

>git itself is a damned good distributed versioning database for arbitrary content, then we might as well also use it for distributed issue-tracking.

For what it's worth, it's interesting to see that the Fossil distributed SCM includes an issue tracker but they made a deliberate architecture decision to not propagate the tickets data.[1] They had a chance to make your "distributed-issues-tracking" idea a 1st-class concept in Fossil but decided against it.

Also, the issues/tickets is just one example feature. Github will continue to evolve to add more and more sophisticatted SDL/ALM (application lifecycle management) like JIRA and Microsoft Team Foundation Server. Those features are not easy to implement in a peer-2-peer SCM with practical usability.

[1] https://fossil-scm.org/xfer/doc/trunk/www/qandc.wiki

DaiPlusPlus · on Aug 1, 2017

Thank you for the link. I read through their justifications and I think using a git-submodule solves their problems of polluting the main project history and permissions issue. Using directories for mutually-exclusive state grouping (e.g. "closed"/"open"/"new") solves the directory problem.

Goladus · on July 31, 2017

The reason is github did a fantastic job of implementing useful features. The visual design is unmatched and they have done a great job implementing developer oriented integrations and social features.

A more federated approach to this sort of thing might have been nice, but so far nothing I have seen comes close to the value-add offered by github.

xutopia · on July 31, 2017

You're sidestepping the main reason I believe it worked so well. It benefits from network effect. It is a collaborative tool and people like to have their work on there so others can collaborate with them.

Goladus · on July 31, 2017

That's important but I think people over-estimate it. It's both. I predict that if you analyzed the github network, you'd find many hubs are based around companies that chose to move their workflows to github based on features other than network effects. Or at least, the existing network was only one of many reasons.

manigandham · on July 31, 2017

As a business, we (and most other companies I know) chose github for features and performance. Nice that other open source stuff is there but doesnt matter for what we pay for.

n-gauge · on July 31, 2017

Totally agree - though I wish they would show files > 2Mb though on the web editor.

I develop directly on github - I even make all my commits to the Master branch, as this allows me to code with a nexus 7 tablet if necessary.

So this outage was a PITA for me. However I have plan B and started up tomcat...

Saved the day!

P.S. Thanks github for the free hosting! I can't really complain .

_skel · on July 31, 2017

Lots of people care, but we also recognize that the advantages of using a centralized system outweigh the disadvantages for many use cases.

tambourine_man · on July 31, 2017

Lots of answers so I'll try to address them all here. It was a rethorical question. We know what GitHub offers. I'm not a fan of its UI, particularly on mobile, but that's beside the point.

The point is, why we failed, once more, to have a distributed solution, even when the underlying tech assumes it.

Email was the last widely successful distributed medium. And it's dying, unfortunately.

Of course centralized services are easier to implement and use. Doesn't mean we should settle.

manigandham · on July 31, 2017

We didnt fail, nobody built it (probably from lack of demand).

tambourine_man · on July 31, 2017

Nobody built it = fail to provide a solution

manigandham · on Aug 4, 2017

Ok, that's not really the same thing but either way, what's the point of having a decentralized project management system?

Code is one thing which clearly allows for many benefits in having the entire local history but pushes more work towards the merging stage. When it comes to issues and discussions, it's often much easier to have a single source of truth without worrying about merge conflicts.

And the issue with github being down isn't an issue of centralization as much as it is about availability of a service. You're free to use github enterprise or gitlab and host the service yourself if you feel you'll get better reliability and performance, however I'm pretty sure you won't beat github's overall without significant investment of time and resources.

Perhaps having a simple read-only offline cache of the latest project management state is a good middle-ground for most of the problem and it shouldn't be that hard to do - but again that's up to how much demand there really is for it.

tambourine_man · on Aug 6, 2017

I'll quote myself for emphasis: Of course centralized services are easier to implement and use. Doesn't mean we should settle.

oliv__ · on July 31, 2017

That's it. I'm starting a github on blockchain.

Rjevski · on July 31, 2017

Count me in when you launch an ICO.

tambourine_man · on July 31, 2017

I wish you weren't kidding.

s73ver · on July 31, 2017

Because most things are easier when you can have one canonical source of truth.

rcthompson · on July 31, 2017

Looking at the status graphs, it seems like there was some clearly anomalous data starting around midnight, about 9 hours before the actual outage "began". Maybe a gradual botnet ramp-up, and 9:27 AM is when it got bad enough to overload some critical service? (Or really any other threshold-based failure scenario.)

aksx · on July 31, 2017

or a bad commit being deployed on their fleet.

sashk · on July 31, 2017

What was happening to Github for a week or so in late June - early July? I see "The status is still red at the beginning of the day" for a whole week.

https://status.github.com/messages/2017-07-03

coderintherye · on July 31, 2017

They seemed to be suffering some external attacks / DDoS but I never saw a post-mortem from them on it, hopefully one is forthcoming (or maybe is out but I missed it)

richardwhiuk · on July 31, 2017

They don't do post mortems.

daxelrod · on Aug 1, 2017

They occasionally do, and they call them Incident Reports.

https://github.com/blog/2106-january-28th-incident-report

https://github.com/blog/2273-incident-report-inadvertent-pri...

chrisferry · on July 31, 2017

Maybe not public ones, but I'm pretty sure they perform internal post mortems when they have outages.

relaxitup · on July 31, 2017

Do these general Github outages affect GH Pages as well, or is that service portion segmented to some degree?

bluehatbrit · on Aug 1, 2017

Pages are static sites served from separate infra from the main github.com from what I've heard, so you should be fine in almost all cases. I've never had one go down unless I pushed a dud build.

leesalminen · on July 31, 2017

I think it started as minor as I was receiving a unicorn once per 10 pages. It's currently happening on almost all.

Of course, I'm trying to dig into a WebKit issue and need the issues to load!

pmoriarty · on July 31, 2017

Where is github hosted?

Do they use AWS or another commercial cloud provider, or do they have their own servers in data centers (hopefully scattered around the globe)?

If AWS, are their services spread among multiple availability groups? I'm just wondering how this could happen.

cormacrelf · on July 31, 2017

... It may surprise some HN readers, but AWS outages aren't the only reason other sites go down. Having multi-AZ just means you're resilient to a localised, single-AZ AWS failure. It doesn't help for the 1000 other ways your service could go down.

sattoshi · on July 31, 2017

I like how this comment has 3 replies saying different things.

I am now feeling less informed than I was after the first reply.

worldexplorer · on July 31, 2017

An ex-github employee who used to give talks on Github scaling challenges revealed they shifted from Rackspace. See yourself - https://github.com/holman/ama/issues/553

worldexplorer · on July 31, 2017

They ran out of rackspace now they are on real hardware. https://github.com/holman/ama/issues/553

moondev · on July 31, 2017

I believe they manage their own hardware in a datacenter

cwingrav · on July 31, 2017

Here's this from 2009. (https://github.com/blog/530-how-we-made-github-fast)

I says Rackspace, but 2009 is a long time ago so it could have changed.

pm90 · on July 31, 2017

They're hosted at Rackspace. I'm not sure if they're hosted on the Public Cloud or whether they have a dedicated hardware or what though.

worldexplorer · on July 31, 2017

No they shifted from Rackspace. https://github.com/holman/ama/issues/553

pramodzion · on July 31, 2017

Github is back online.

Cafey · on July 31, 2017

It has leveled up to a major outage!

0xbadcafebee · on July 31, 2017

Dang. It's too bad their customers' source control files aren't distributed and decentralized, or they could keep working and ignore this.

toyg · on July 31, 2017

The problem is not the files, it's the project management stuff (issues and PR tracking). That is a SPOF. If they could decentralize that, then nobody would care if they are down half the time.

0xbadcafebee · on July 31, 2017

They can - it's called Github Enterprise (or if you're a medium to large corporation, JIRA). I thought occasional downtime was just something people factored into free services. You can always file your issues later, or use a chat service (or, heaven forbid, email) if there's an immediate need for feedback.

mschuster91 · on July 31, 2017

I wonder why you can't export them as a read-only git repository. (Read-only to avoid people messing around with the history)

daxelrod · on Aug 1, 2017

You can actually export them, but the format is undocumented, and is meant only for import into GitHub Enterprise:

https://developer.github.com/v3/migration/migrations/

csense · on July 31, 2017

Because that would make it easier to migrate away from Github, which Github doesn't want you to do.

loomer · on July 31, 2017

Anyone have any knowledge of what specifically happened?

AlphaWeaver · on July 31, 2017

I saw a comment earlier mentioning that GitHub allegedly doesn't release post mortems publicly? If this is true, that's upsetting.

luhn · on July 31, 2017

They have published post-mortems before, but the previous bout of downtime went unacknowledged. Maybe it's a new policy.

Here's a postmortem they did several years ago: https://github.com/blog/1261-github-availability-this-week

Cacti · on July 31, 2017

I don't see why you would expect them to. It's nice and all, but they're under no obligation to tell the world everything that happened every time they have a minor outage. Not every outage is newsworthy or even that interesting, and oftentimes they're little more than a PR piece for the company.

yellowapple · on July 31, 2017

My apologies. I knew my Perl 6 wrapper for GLFW was bad, but never realized it'd be so bad that GitHub would choke to death on it.

ibgib · on July 31, 2017

Are there any other major sites that are down?

bithavoc · on July 31, 2017

All software depending on Github repo releases downloads are down, e.g: Rancher CLI.

smudgymcscmudge · on July 31, 2017

Release downloads are affected?

bithavoc · on July 31, 2017

yep, my Circle-CI deploy fails, the following URL is not available: https://github.com/rancher/cli/releases/download/v0.6.0/ranc...

vishesh92 · on July 31, 2017

It just became a major service outage.

DevKoala · on July 31, 2017

This is happening too frequently now.

Osiris · on July 31, 2017

It's starting to work again for me. I was able to approve a PR and merge it.

xxkylexx · on July 31, 2017

Looks like I am still able to push to/pull from my repos without issue.

DylanBohlender · on July 31, 2017

I've had a few CI jobs fail when attempting to pull, but I've also had some succeed. Seems that it isn't an all-or-nothing outage.

macawfish · on July 31, 2017

Whatever happened to gittorrent?

tevonsb · on July 31, 2017

Thoughts on the cause?

cjbprime · on July 31, 2017

Definitely Bitcoin-related.

midnitewarrior · on July 31, 2017

SegWit's first victim.

orf · on July 31, 2017

Not enough mongodb. clearly they are not webscale yet.

rubicon33 · on July 31, 2017

https://www.youtube.com/watch?v=b2F-DItXtZs

btym · on July 31, 2017

Bug in their Ethereum-backed MongoDB instance.

tabtab · on July 31, 2017

while (! github.works) { this.add(new Buzzword()); }

mirekrusin · on July 31, 2017

AI started backing itself up on github.

komali2 · on July 31, 2017

Defcon let out on Sunday and there's a lot of bored hackers with leftover energy.

Ace17 · on July 31, 2017

Ludum Dare? :-)

GrumpyNl · on July 31, 2017

How does this affects all your dependencies?

kentor · on July 31, 2017

Why would it?