Majority of web apps could just run on a single server

lewisjoe · 2024-03-24T19:40:28 1711309228

I can confirm. I've had quite a few projects that made it to the front page of HN and handled the traffic like cake. All of them ran on 5$ digital ocean droplets.

I accept some projects are more resource expensive than others, but majority of the time you can get away with a bit of asynchronous responses + scheduler/queue to spread the load horizontally over time.

Unpopular opinion: I blame the new age devops culture that made cloud app deployments unnecessarily complicated with k8s and cool new tech (that'd get them high profile jobs.) I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale for sudden spikes of irrational amounts of traffic, so why not just deploy it on a cheap vps?"

tambourine_man · 2024-03-24T20:02:48 1711310568

I'm convinced it's the hiring that has shaped the scene.

If you're hiring, you want to be able to add/replace people as easily as possible. If you're being hired, you want to charge as much as you can. And to satisfy those two demands, the current web stack is great. Almost like it's been built for it.

It's got very little to do with the tech itself, a lot more to do with market dynamics. That's the problem it's trying to solve.

Nextgrid · 2024-03-24T20:14:48 1711311288

The root cause is the decade of zero interest rates which led to companies intentionally overcomplicating their stacks to justify neverending VC rounds. Early prospective employees took notice and adjusted their skills as a result.

The dangerous part is that in the meantime we've got brand new and budding talent that actually took this charade seriously and effectively got high on their own supply, seeing this performance art as the actual normality even in a post-ZIRP world where tech/engineering is primarily there to drive business profits and not a VC mating ritual.

The only winners are the cloud/infra/tooling providers who got an entire generation of "engineers" to do perpetuate their con without even realizing it.

rodgerd · 2024-03-24T20:25:10 1711311910

> The root cause is the decade of zero interest rates which led to companies intentionally overcomplicating their stacks to justify neverending VC rounds.

Extended hot take: the true customer of a company are the current and prospective holders of capital, whether VC, private investment, or public markets. Keeping them happy is the main goal of the exec.

If your owners' portfolios include real estate, you push "back to the office". If it includes container startups, you deploy on k8s. Your "strategic partnerships" are determined by how much your owners care about propping up investment A investment B.

Sammi · 2024-03-24T22:01:38 1711317698

Except zero interest rates stopped almost four years ago, but we're still seeing vcs ape into ai this last year.

So clearly the underlying cause of this squandering of resources must come from somewhere else.

I point the finger at the rising class of super rich who don't know what to do with their money. Why do they exist? Why is taxation seemingly not applying to them any longer?

Jensson · 2024-03-25T09:08:32 1711357712

> Why is taxation seemingly not applying to them any longer?

Investing returns was never taxed, as long as they don't use their wealth for consumption they wont get taxed. This is a good thing since it encourages investments over excessive consumption, building a startup is much better than buying another yacht.

seec · 2024-03-25T13:02:33 1711371753

That's a very simplistic view. Like with all things, there is a point where more investment money does not make things faster or better. In fact, since pretty much everything is still dependent on people and social groups, if all the relevant people are already busy, you won't get much for your money.

And too much money chasing too few choices creates bubbles, wich is exactly what has happened. And not just in tech, it is the problem of real estate market in many places of the rich world, and similar bubbles can be found in many markets.

I believe it is actually one modern problem, a good part of the rich world is getting paid a lot more than there is really a need for; and since we glorified "investment" essentially has a way to make even more money; a too big amount of money is not doing much instead of being useful right now.

If you look at society as a whole, spending every ressource every year would be bad, but also setting too much aside for later is actually wasteful. It's a lot like stocking a pantry, you need to put enough so that you don't run out of food easily but if you have stuff in there that hasn't been used 5 years later it shows the "investment" was too much. Food gets stale and loses nutritional value over time (even canned) but money also gets stale in a way.

As for the startup vs yacht I would say it largely depends on how useful the startup can ultimately be to society and how many people it can employ (how much actual value is being created). Because even though yachts have fossil fuel consumption problems, it actually takes a lot of people to make them, maintain and service them actually. It can potentially have a better social impact than a startup...

And this is the root of the issue, people getting a lot of money actually have a responsibility to redistribute it (in an intelligent manner preferably); but what is happening is that people try to get even richer even though it stops having much value to anyone.

Sammi · 2024-03-26T12:50:06 1711457406

All this money that is aping into an ai gold rush could have been taxed to fund schools and other broadly useful things.

I'm very much a free market capitalist, but seeing all vcs ape into ai at the same time does not make me think there is value in have so many super rich people in the world.

I hate it when discussion devolves into high taxation vs low taxation or high regulation vs low regulation. There must be a thing such as the appropriate type of taxation and the appropriate type of regulation. And that depends on what we want as a society. Do we want a big class of super rich that don't know what to do with their money so they squander it on an ai gold rush? Or do we want to tax them so we can put that money to use that we all benefit from?

dehrmann · 2024-03-24T20:45:41 1711313141

I think you're mixing up cause and effect. Low interest rates caused investors to chase higher returns in things like VC, VC had pressure to make investments, startups had easy money, so they were less scrappy and had more funding for overengineering.

fsloth · 2024-03-24T20:11:45 1711311105

I can’t confirm but it does look like a significant portion of current software engineering zeitgeist is developed for compartmentalized careerism with a good splash of non-value adding complexity.

If that keeps food on the table and the wheels of business spinning, fine, but I’ve seen this leading to situations where a simple thing expands to a role, and then department and … oh, hi, enterprise software, it’s you!

LtWorf · 2024-03-24T21:12:11 1711314731

Hipster devops was overcomplicated long before kubernetes.

In my 1st job we had autoscaling on AWS to handle peaks… except that our servers took about 30 minutes to do the upgrades, download gcc, compile all the needed python modules… We'd always reach the autoscaling limit because the new servers weren't doing anything at all. All of them would be downloading and compiling the same things.

I was very junior, but I told my boss if we shouldn't maybe use base images that already contained everything, instead of a default blank ubuntu for the servers.

The boss said no, because that wouldn't be agile.

The whole thing of using yaml files to configure servers already existed, it worked terribly.

It was basically meme development there. Including using mongodb for no reason at all, and using it badly so that every write was actually moving thousands of records around.

KronisLV · 2024-03-25T09:28:34 1711358914

> I was very junior, but I told my boss if we shouldn't maybe use base images that already contained everything, instead of a default blank ubuntu for the servers.

Honestly, nowadays Docker and other OCI containers do this pretty well. Spinning up new instances and even provisioning nodes has become very easy, in addition to load balancing features those provide.

12 factor apps are also an amazingly concise way to develop and manage multiple services without going into YAML hell: https://12factor.net/

The problem is that the management layers around containers are typically overcomplicated to the point of comedy.

Kubernetes might make sense with something like K0s or K3s might make some sense but is still inherently a full time position to run on prem (with updates and observability) and will cause headaches. Hashicorp Nomad is better, but is meant for scales greater than those of most companies. Docker Swarm hits the spot (especially with something like Portainer), but nobody seems to actually care because it's not a trendy piece of tech.

The day Docker Swarm dies is the day I go back to writing PHP in a shared hosting environment in protest.

LtWorf · 2024-03-25T10:38:43 1711363123

It's not fast if you refuse to create a pre-made image and recompile everything instead. Docker won't make it any more fast.

KronisLV · 2024-03-25T10:44:31 1711363471

> refuse to create a pre-made image

Docker made doing the right thing really easy, to the point where doing that is a no brainer. I guess there are people out there who might still use OCI containers as glorified stateful VMs, but luckily the majority of documentation and examples out there build proper images that have everything included by the time it actually runs and you have to actively go against that if you want the other approach.

LtWorf · 2024-03-25T18:57:31 1711393051

chroot and debootstrap and mkfs have existed for decades before docker.

It would have been no problem to create a ready image. The problem was the refusal to do so.

api · 2024-03-24T19:44:01 1711309441

The problem with devops replacing the old title “sysadmin” was that the dev part dragged in the worst thing about developer culture: the love of complexity and the tendency to build massive towers of it.

Sysadmins usually avoided complexity because their attitude toward it was more sensible: it’s expensive, fragile, and tends to actually multiply failure modes.

I also blame cloud marketing. This stuff is a gigantic money printer for cloud companies. It’s in their interest to encourage as much over engineering as possible, especially if it locks you into things like Kubernetes that are hard to run and thus usually used as services.

dukeyukey · 2024-03-24T20:16:42 1711311402

I swear, I've come to identify myself simple stacker. Complexity should be avoided wherever reasonably possible. But where it's not, it's best left and maintained with the devs who want it.

worik · 2024-03-24T20:37:44 1711312664

> The problem with devops replacing the old title “sysadmin”....

That is a very good observation.

I think that now I am going to stop using the term Dev Ops and start using System Administration.

Wise words, thank you

red-iron-pine · 2024-03-25T12:56:41 1711371401

> I also blame cloud marketing.

burying the lede, mon ami.

biggest tech companies in the world, the ones using this tech, are also the ones with massive ad and marketing complexes.

"of course cloud and devops and k8s are great, buy buy buy!"

a decade later OpEx is out of control and wrangling the craziness requires its own brand of "FinOps"

hparadiz · 2024-03-24T19:53:53 1711310033

If you guys don't see the value of being able to click on a button on a website to deploy, perfectly, everytime over some guy sshing into the box and running git pull I dunno what to tell you.

rmbyrro · 2024-03-24T20:10:12 1711311012

The discussion here is not the method of deployment, but the infra architecture.

You don't need k8s, containers, and multiple cloud instances to automate deployment.

It's perfectly possible and simple to implement a button to deploy on a single machine.

Heck, on OVH or Hetzner, you can have a dedicated bare metal machine with many cores and RAM exclusive to you, cheaper than famous cloud instances. These bare metals will handle what takes sometimes hundreds of containers to handle, with a much simple and easy to maintain infra.

zer00eyz · 2024-03-24T20:18:01 1711311481

> cheaper than famous cloud instances

Not just cheaper, fixed cost.

Everyone who cried about exit fees on AWS needed the lesson of failing to plan ahead...

api · 2024-03-24T22:09:10 1711318150

You can also have ugly manual ad hoc Kubernetes workflows where random devs hack YAML in prod directly. I’ve even seen people ssh into running containers and change things.

This isn’t any better than Joe sysadmin and could be worse since the complexity is higher and there’s a greater chance to do more damage with one change. Adding complexity makes bad process worse.

moondev · 2024-03-24T20:16:07 1711311367

You can deploy k8s on a single machine though. This way you get the benefit of a standardized, documented API without the burden of multiple machines.

zer00eyz · 2024-03-24T20:20:26 1711311626

Your still setting up K8's... I have scp'd my go binary and set it up in system d.

Nothing about K8 or containers are light... they are buckets where bad devs hide the bodies in most cases.

moondev · 2024-03-24T20:29:15 1711312155

kubernetes is also a static go binary (kubelet) deployed with systemd!

zer00eyz · 2024-03-24T20:42:02 1711312922

SO I should put my static go binary in a container, so you can deploy a static go binary to deploy the container with my static go binary in it.

Do you see how many extra steps that is.

hparadiz · 2024-03-24T20:34:35 1711312475

Why use k8s over just a docker container?

LtWorf · 2024-03-24T21:42:55 1711316575

Just use a base OS that has what you need? :D

hparadiz · 2024-03-24T22:01:52 1711317712

Was asking seriously. Haven't had the opportunity to use k8s but I've used container hosting elsewhere so I'm curious why k8s specifically.

pdimitar · 2024-03-26T02:41:43 1711420903

Yes, do that 99% of the time actually.

zer00eyz · 2024-03-24T20:42:55 1711312975

Dont deploy docker in production. Podman.

You could but why all the extra overhead?

hparadiz · 2024-03-24T21:20:31 1711315231

Docker has no overhead on the Linux kernel. It just runs things for you via LXC. The overhead is no different from any other process.

zer00eyz · 2024-03-24T22:30:16 1711319416

The last time I checked docker was libcontainer and not running lxc any longer.

Libcontaier lxc all use the same api calls to the kernel... so in effect your still correct (mostly)

They all have overhead, if you're starting to use their features... cgroups can have some very funny impacts on app performance more so if your running containers ontop of a linux install over a hypervisor vs nix on hardware.

ethanwillis · 2024-03-24T20:47:00 1711313220

This is the best joke I've heard all week. Standardized in what way? By who or what governing body?

rmbyrro · 2024-03-26T12:04:50 1711454690

Why would someone want to self-torture into k8s if they only need a single machine?

If you need isolation for multiple services, just use a container... I don't see an orchestration need that justifies using k8s on a single machine. For such context, it's too much hassle for very little value in return.

banku_brougham · 2024-03-24T20:02:56 1711310576

People arent missing the value, nobody is saying it wont perform as advertised -- the discussion is about the cost.

hparadiz · 2024-03-24T20:08:44 1711310924

Setting up a testing & deploy via a CI script is basically free. AWS gives away CodeDeploy for free. Ansible is open source. I learned all this stuff working on open source stuff where platforms like GitHub give you free compute time.

ProblemExplorer · 2024-03-24T20:48:35 1711313315

> If you guys don't see the value of being able to click on a button on a website to deploy, perfectly, everytime over some guy sshing into the box and running git pull I dunno what to tell you.

Maybe clicking a button that sshes into the box and runs git pull?

mmmm2 · 2024-03-24T19:55:58 1711310158

Emacs has a button for that :-).

erik_seaberg · 2024-03-25T02:33:52 1711334032

PSA: please don't follow a manual checklist over ssh. At least write an Ansible playbook that does those things repeatably, or even better, script idempotent changes in an .rpm or .deb to install.

JodieBenitez · 2024-03-24T20:40:39 1711312839

Hey ! I click on a button to deploy. Turns out that button does the sshing but that still counts, right ?

marginalia_nu · 2024-03-24T20:36:20 1711312580

Yeah my search engine, back when it was hosted on a PC in my living room off domestic broadband would shrug off HN[1][2] without the fans even spinning faster than usual.

And like, internet search should be more resource heavy than the sort of websites that regularly do keel over to HN. Every query is like up to 50 MB in disk reads.

[1] https://news.ycombinator.com/item?id=28550764 [2] https://news.ycombinator.com/item?id=35611923

Nextgrid · 2024-03-24T23:31:55 1711323115

The shift to cloud-based workloads (with oversubscribed CPUs and mandatory networked storage) means that a lot of people lost track of just how fast physical hardware (even mid-range consumer-grade) has become.

paulmd · 2024-03-25T01:27:31 1711330051

That’s also a deliberate thing - cloud providers have consciously avoided increasing the per-unit performance of a vcpu, you still get the same Sandy Bridge performance in 2024 as you did in 2012. They actually go all the way to the extent of having AMD design them smaller, higher-density “cloud” cores that don’t clock as high, to avoid ever increasing that vcpu unit.

MattPalmer1086 · 2024-03-24T22:20:50 1711318850

Not so unpopular. I worked at a startup where they wasted huge amounts of money on a massively complex set up using kubernetes (and this was the early days of kubernetes). Despite this, or maybe because of it, our AWS bill was killing us.

The irony was that the cloud was supposed to be the simplest bit. All the computation and cryptography occurred on the mobile clients, the cloud was really just to provide storage. The same team then rewrote the mobile clients to have a "beautiful" API that took 100s of times more resource than the original code.

I guess they just loved complexity for the sake of it.

benreesman · 2024-03-24T20:40:28 1711312828

Maybe that still counts as an unpopular opinion in the large, but I’ve never seen it be an unpopular opinion by the standards of really effective teams or really productive hackers I had the privilege to be around at times, and it seems to me that your good idea is coming back in a big way recently.

When I worked on teams that denominated egress in terrabits/s, TPS in millions or higher (sometime much higher), and daily warehouse ingest in petabytes it was just the default thing to spin up an instance and a hot standby (sometimes per region or something) if that’s all that was required, and containers and whatnot were used only in the context of bare-metal: you do usually want one level of indirection so containers are really useful if you’re racking metal from a small menu of SKUs.

But as for why it ever became conventional wisdom to wrap a venv inside a container on a hypervisor, often with multiple images composed on a dizzying array of low-friction SKUs?

I never got it and still don’t.

unclebucknasty · 2024-03-24T20:19:46 1711311586

>I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale...

I'd argue it's just as much the dogmatic nature of all things software-related, together with the attendant shiny new object syndrome.

See SPAs, NoSQL, micro-services, etc. There's generally a use case for all of these, but they tend to be too easily extrapolated into, "if you're not using these, you're not doing it right."

colechristensen · 2024-03-24T20:35:52 1711312552

> I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale for sudden spikes of irrational amounts of traffic, so why not just deploy it on a cheap vps?"

I’ve lost startup jobs for basically this. As in “let’s focus on our mvp instead of adopting k8s and discussing “the definition of done” for literally six hours a week”. That one ran out of money and folded, sadly. They had tons of potential and a few bad hires.

But also on the other side, my time is expensive, engineering time is expensive, downtime is catastrophic for a new business. Some people want to spend 4 5 or 6 figures to save twenty dollars a month. Imagine my time is worth $500 an hour, if the effort to make something cheaper doesn’t pay for itself in a year then it’s probably a waste of resources.

throwaway17824 · 2024-03-25T02:27:15 1711333635

Once when working in devops I asked my team lead why we didn't just move everything to Heroku, rather than reinventing a bad in-house version of their features.

I was firmly told not to suggest it again unless I wanted to put all of us out of a job.

That seemed ridiculous to me — we had a laundry list of ways we could help the business if we got basic platform stuff off of our plates. But I sure learned something about how incentives affect otherwise-good engineers.

OhMeadhbh · 2024-03-24T20:55:27 1711313727

But there's a middle ground somewhere.

The current DevOps culture comes from the FAANG guys who really are getting a bazillion requests per second. In the last decade I worked for Amazon, Avalara and Audible Magic. None of these could build an app around a Digital Ocean droplet.

But I think you're pointing out there are PLENTY of useful webapps that can run on a minimal system.

I'm just curious where the middle ground is. Cause I think we've all seen sites blow up after a reference on HN or SlashDot or whatever and by the time you get there the only thing you see is a stock error from the PHP engine saying "My MySQL Engine is Melting" or somesuch.

It would be very cool if one could write an app using {NODE|Ruby|Python|Whatever} and have the infrastructure around it notice when things spike and do some magic under the hood to spin up new containers in geographically distributed data centers and scale up a simple persistence tier.

That way you could move forward with a SIMPLE application instance and not freak out that you'll disappoint new users if there's a spike in demand. You know, sort of like what AWS Lambda was supposed to be.

Hmm... I think I might have come up with a plan for my next startup. Thank you for speaking your truth.

[Edit: To re-iterate, I think I'm saying it's just as wrong to think a small, simple app needs Amazon-level redundancy and immediate scalability as it is to say Amazon could run on a single machine. But... the "Slash-Dotted Website Goes Down" scenario is real and there should be SOMETHING the industry could do that's easier on people than to force them into a custom AWS ECS solution across multiple continents.]

LtWorf · 2024-03-24T21:40:11 1711316411

Yeah most people don't like getting an invoice for thousands of € because their website was under a DDOS.

OhMeadhbh · 2024-03-25T04:35:31 1711341331

sure. but there's a middle ground there somewhere. it's not just service blackout after 5 requests per second or a $10k monthly aws bill. if your budget was $500, you could set off alarms or auto-shutoff after some threshold. and turn on syn cookies if you're worried about the kiddies.

the point is... there's a middleground there somewhere and I think different people put the cost/availability tradeoff at different places.

everforward · 2024-03-27T14:09:38 1711548578

> I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale for sudden spikes of irrational amounts of traffic, so why not just deploy it on a cheap vps?"

I work on the Ops side partially, and our platforms are defined by the highest level of complexity we need to support. That is to say that your basic web app can run on k8s, but that auto-scaling, messaged-based behemoth that 85% of the business flows through will not.

So I can either support k8s, or I can support k8s _and_ old-school rsync deployments to VPS'

The complexity of running a basic web app on k8s is entirely too high, but the cost of keeping an entirely separate deployment/monitoring/oncall/permissions stack for VPS' is worse. Better hope your monitoring vendor has an agent that can run in a VPS' on the OS you want, or you're back to running Nagios yourself.

If at least 1 group is going to write an app that runs on k8s, you might as well run most of the company on k8s. Otherwise you're either going to manage 2 separate stacks, and/or try to write an abstraction layer that will probably itself resemble k8s.

znpy · 2024-03-24T20:20:28 1711311628

> I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale for sudden spikes of irrational amounts of traffic, so why not just deploy it on a cheap vps?"

Was a devops engineer at my previous job.

We already had k8s clusters setup, pre-made CI templates and pre-tailored helm charts (along with monitoring and much more). All those things you (a developer) could mostly clone, slightly customize and ship both to a development k8s cluster and to a prod k8s cluster (with all the safety nets already in place).

Creating (and maintaining) a single vm for a pet project is way more work than using the pre-made and pre-customized and curated toolkit.

This was at a 100+ developers organisation.

If you think you could easily get off with a single vm then you've never seen devops done right, i'm fairly sure.

EDIT: I probably fell for the bait, but the post i'm replying to really made me remember why we went on a killing spree in order to eradicate everything that was not k8s at my previous job and removing as much developer access to prod as possible. Some idiot developers think they know better, usually end up re-inventing a square wheel that breaks as soon as it's not running on their laptop anymore.

worik · 2024-03-24T20:35:01 1711312501

Yes you took the bait!

> This was at a 100+ developers organisation.

IMO the vast majority of software development happens in much smaller organisations than that. Dev Ops still matters there, and the requirements are different.

I am working in an organisation with one and a half developers. I am lobbying that the third of fourth developer concentrates on Dev Ops here.

It is very important. Look at the back up/recovery procedures at your organisation. Has there ever been a fire drill? Are you sure the back ups are sound? Can you recover? What if data corruption occurred a week/month ago. Have you a back up of the uncorrupted data?

That is a very unsexy aspect of Dev Ops, and without somebody dedicated to the job, your backups will not be any of those things.

Bitter. Experience.

znpy · 2024-03-24T20:55:00 1711313700

> I am working in an organisation with one and a half developers. I am lobbying that the third of fourth developer concentrates on Dev Ops here.

not sure you're doing the right thing here. you might want to consider hiring some kind of linux guy that can can do some basic devops. or maybe hire some devops contractor that can work with you on a part-time basis and "curate" some specific aspects of your operations.

I've seen this done in the past: so you've got this consultant on retainer, and you tell them something like: "i've got this issue, can we do something about it? our constraints would be x y z" and the consultant would make 1-3 proposals (different approaches, different pricing levels, different ETAs etc) and then you agree on what gets done. The key aspect here is that a good devops consultant can get stuff done very quickly.

> It is very important. Look at the back up/recovery procedures at your organisation. Has there ever been a fire drill? Are you sure the back ups are sound? Can you recover? What if data corruption occurred a week/month ago. Have you a back up of the uncorrupted data?

Yes (to all questions). I ended up in working in heavily regulated environments. All the things you mentioned were not just niceties, but mandatory by legal requirements.

> That is a very unsexy aspect of Dev Ops, and without somebody dedicated to the job, your backups will not be any of those things.

That's basic system administration. Most devops engineers are former sysadmins.

jameshart · 2024-03-24T21:16:12 1711314972

> IMO the vast majority of software development happens in much smaller organizations than that

I guess it's okay to have an opinion about that, but this seems like something that should probably be a fact. Unfortunately, not sure I can find reliable stats on sizes of engineering organizations.

The thing is, while there are obviously lots of small companies, there are also some really big software development organizations out there. A company like Netflix has 2,500 engineers. Microsoft employs over 100,000 engineers. Walmart employs over 15,000 software developers.

You need a lot of little 10-50 engineer dev shops to add up to the combined size of the engineering orgs of the Fortune 500.

According to https://www.statista.com/statistics/507530/united-states-dis..., at least, 29.4+25.8 = 55.2% of the US "IT Industry" workforce are employed in companies with >100 employees. That's a Long way from telling us about sizes of engineering orgs though.

But still... I'd be careful assuming that the vast majority of developers are in organizations of less than 100 people.

pdimitar · 2024-03-26T02:51:36 1711421496

Why do you assume that only three US landscape is discussed here?

Plus I'm not very sure those statistics are reliable.

Anecdotal evidence: I have 22 years of career and I've only worked in big organizations twice, for the total of a year. Everything else was much smaller.

Solvency · 2024-03-25T01:21:28 1711329688

How the hell does Walmart need 5x as many developers as Netflix?

jameshart · 2024-03-25T02:11:19 1711332679

Walmart has 2.3 million employees in total. The only larger employers than them in the world are armies: https://en.wikipedia.org/wiki/List_of_largest_employers

richardw · 2024-03-24T20:38:35 1711312715

But you’ve got 100 developers. That’s not “most web apps”, that’s firmly in the set of companies that need standardisation and potential scale, where the devops team makes the life of many devs far better. When it’s just a few devs and cash is limited, the business doesn’t need the complexity. It’s mostly for ego and branding.

ongy · 2024-03-24T20:40:10 1711312810

That's really the kicker for me.

When the company is set up well on k8s, then choose k8s. If the company is set up well on VPS, then choose VPS.

If it got neither, I'm unsure what's the better way to go on a greenfield. k8s has nice tooling, but part of that is required because it is massively complex.

But with managed k8s providers (e.g. GKE with autopilot) you can just put in proper CPU and memory limits and essentially get yourself a VPS provisioned without having to worry about anything by k8s measures.

If you have different pricing or location constraints, then it might be better to go with different models and some custom orchestration.

GauntletWizard · 2024-03-24T20:35:13 1711312513

Yeah, I'm an independent, selling these solutions. My minimum stack costs about $500/mo in AWS costs- and you could save 90% of that. But then you'd be paying me $20k more to set it up again when you expand to another dev team, while this way I can add your second dev team for $10k and practically zero additional AWS cost.

Going straight to overkill is good business sense for any company that's going to make it to a medium-sized business, and it's not going to be the differentiating factor for a company that burns out.

zer00eyz · 2024-03-24T20:31:32 1711312292

You're not wrong.

K8s, containers in general, aren't solving the problem.

Developers write apps, engineers write software.

One is installable, it respects the system it lives on, It can be tuned to use the hardware it has or share.

The other, the other is a bucket of shit in a bag. The bag protects you and everyone else from it contaminating everything it touches.

Containers dont solve the problem they just enable it.

worik · 2024-03-24T20:36:01 1711312561

> Developers write apps, engineers write software.

That is a classic distinction without a difference!

Engineers have been to engineering school. If you have not been to engineering school, you are not an engineer!

zer00eyz · 2024-03-24T20:40:58 1711312858

It's not a dead language, Im sorry the valley stole your title but that ship has sailed.

ProblemExplorer · 2024-03-24T20:43:45 1711313025

> All those things you (a developer) could mostly clone, slightly customize and ship both to a development k8s cluster and to a prod k8s cluster (with all the safety nets already in place).

How did/does the devs create, test new code and debug issues? Can they do that locally on their local laptop? If so, how?

zer00eyz · 2024-03-24T20:59:10 1711313950

>> How did/does the devs create, test new code and debug issues? Can they do that locally on their local laptop? If so, how?

I used to buy the idea of this when there were monoliths.

Then I had the joy of running a shop with dozens of web properties.

Local dev became untenable. My systems admin was an early adopter of Xen. That shop ran like a dream... devs could come in and have a new environment update in place and just start to work. Staying in sync with prod was never a problem.

By making systems guys keep devs fed, by making devs work closely with systems folks you get better software. Containers just hid developers shitty decision making in a wrapper that Systems folks can tolerate.

And how DO you debug... cause what you do to figure a problem out locally is not how you trouble shoot when the shit hits the fan in prod. These tools should be the same, sane and well used and loved by everyone. Local debugging is part of the problem, even more so if your service based app that lives and dies on the wire.

znpy · 2024-03-24T20:45:13 1711313113

> How did/does the devs create, test new code and debug issues? Can they do that locally on their local laptop? If so, how?

All those questions are irrelevant to how software gets deployed. Do whatever you want on your laptop.

ProblemExplorer · 2024-03-24T20:53:53 1711313633

Then I assume you have a custom Kubernetes LB that can handle non-HTTP TCP and UDP traffic because you choosing Kubernetes and the design restrictions that comes with it does not affect how the dev solves problems?

The underlying orchestrator definitely affects how the software needs to behave and is definitely not irrelevant.

znpy · 2024-03-24T20:56:35 1711313795

> Then I assume you have a custom Kubernetes LB that can handle non-HTTP TCP and UDP traffic because you choosing Kubernetes and the design restrictions that comes with it does not affect how the dev solves problems?

nginx-ingress-controller does that. no custom stuff required.

https://kubernetes.github.io/ingress-nginx/user-guide/exposi...

jameshart · 2024-03-25T02:29:01 1711333741

Follow the signposts laid out by https://12factor.net.

Think about how to apply those ideas to containerized applications.

Consider that such applications can be run in the same way locally, on a server, or in an autoscaled cluster.

Not sure why TCP/UDP load balancing complicates matters.

ongy · 2024-03-24T21:09:25 1711314565

Which LB doesn't support non-http flows?

maccard · 2024-03-24T20:30:00 1711312200

I think that the $5 tier is a little tight for a web app as opposed to a crud app, but 2x $40 tiers is enough for a decent amount of traffic, with one as a failover.

The problem is that containers are excellent, and IMO there's a gap in the market between "I want to run one container" and "I want a fully managed k8s cluster"

alibarber · 2024-03-24T20:41:17 1711312877

For my personal projects (that right now seem to revolve around April 1st jokes for people in my industry) - I've found a good middle ground to be K3s on a single Hetzner VM that I scale up and down if I think more/less people are going to be looking at it (i.e. between the 5 - 10 dollar/month price range)

I set this up with Terraform and some bash scripts. Infra as code is just too convenient to pass up on for something that I might not come back to for months at a time (and then ask - how did I set that up?), and containers too mean that I can play with some shiny fun technology one time, leave it in a box, and then come back to a clean-slate for the next thing that I want to play with without having to pay for a new VM etc.

maccard · 2024-03-24T21:54:06 1711317246

The other great thing about containers is that you can have entire isolated stacks for your projects, and you can stand the entire thing up with `docker compose up -d` in a matter of seconds. Gone are the days of accidentally connecting to the wrong database with WAMP.

KronisLV · 2024-03-25T11:22:47 1711365767

> The problem is that containers are excellent, and IMO there's a gap in the market between "I want to run one container" and "I want a fully managed k8s cluster"

A single container: Docker

A few containers on the same node: Docker Compose

Containers across multiple nodes with load balancing and networking: Docker Swarm (with Portainer to manage it)

Alternatively, Podman is also pretty nice.

If you need something that runs not just containers in clusters: Hashicorp Nomad

If you want to go for Kubernetes but without it being too hard: K0s or K3s or MicroK8s or RKE (with Portainer or Rancher to manage it)

justsomehnguy · 2024-03-25T11:54:10 1711367650

I found the hard way Swarm is better than Compose even on one node. Bonus point is what you can scale it easily with just a minor forethought.

pdimitar · 2024-03-26T02:36:35 1711420595

Are you willing to share your lessons learned? I'm very curious.

duckmysick · 2024-03-25T12:08:02 1711368482

Does Podman have something like Docker Swarm? Last time I checked (a few years ago) there wasn't and the suggestion was to use kubernetes.

mnahkies · 2024-03-24T20:43:18 1711312998

For my personal stuff that doesn't get any traffic I cobbled together some scripts to manage containers / SSL here https://github.com/mnahkies/shoe-string-server

I don't think I ever got around to making it self healing if a container dies, but it does support gitops style deployments through a cronjob / conf repo similar to argocd

It's been running happily on a <$10 / month aws lightsail instance for a few years now, though tbh I'd still reach for k8s for anything serious

auspex · 2024-03-24T20:52:49 1711313569

Why not use a single Fargate task?

maccard · 2024-03-24T21:51:58 1711317118

All my teams infra runs on fargate + ECS so I'm pretty familiar (and happy) with it. Running in fargate requires knowlege of AWS: VPC's, public and private subnets, security groups, ECR, ALB's, Target Groups, IAM policies + roles. Then, when you want to add a databse, you're back into all of the above, plus the database specific ones like database subnet groups.

When it comes to health checks you have docker health checks, container health checks, load balancer health checks - all of which are configured separately.a Not to mention, doing it "properly" where your task doesn't have a public IP and is only accessible through your load balancer - [0] might be one of the most infuriating responses on stackoverflow.

Meanwhile, with DO droplets, it's pretty much "here's a registry URL, and some configuration in YAML, go for it".

[0] - https://stackoverflow.com/questions/61265108/aws-ecs-fargate...

paulmd · 2024-03-25T01:24:00 1711329840

A ton of the problem is people turning everything into dynamic this or that. Wordpress culture, basically.

Most pages can really just serve a jekyll/astro static-generated page and be fine. But if you shove a database and php in the middle, it’s gonna be multiple orders of magnitude slower.

allarm · 2024-03-27T21:01:14 1711573274

> why not just deploy it on a cheap vps?

I’m that kind of a person and the usual response I get is - hey, that’s too complex!

febed · 2024-03-25T05:37:07 1711345027

Since you seem to have multiple successful and scalable deployments under your belt, what is your goto tech stack?

ProblemExplorer · 2024-03-24T20:39:21 1711312761

> unnecessarily complicated with k8s

Kubernetes is/was a way to fight off walled gardens from cloud providers. The other path would have been to learn the bespoke implementation of each cloud provider depending on what that employer ended up using.

Kubernetes was at the right place, at the right time just as AWS was trying to force feed people their own proprietary solution, as Azure was trying to wall off people into their own walled garden, as GCP was being Google just not giving a damn about any other usecase than what works great at a massive search company.

With Kubernetes, developers can learn one API to deploy their applications and hopefully it works on AWS, Azure, GCP, DO, OVH or a laptop at home.

So that way, developers can learn one thing and transfer their knowledge at an employer that hosts on AWS, and then another that hosts on Azure and so on.

This is in contrast to the experience of a Python developer who's mastered FastAPI/Flask/SQLAlchemy and feels absolutely lost in a Django project or an Angular developer who stares a Next.js project wondering what the heck is happening and how it all works. Neither a Next.js or an Angular developer would start off with an AWS Amplify solution if they could help it.

oceanplexian · 2024-03-24T21:00:05 1711314005

> With Kubernetes, developers can learn one API to deploy their applications and hopefully it works on AWS, Azure, GCP, DO, OVH or a laptop at home.

That's one of the lies developers tell themselves, because at some point you're going to need to manage Accounts, VPCs and ELBs, Certificates, Security Groups, IAM policies, and everything else. All of those underlying primitives that are required and have massive differences in behavior that are expressed differently in GCP, Azure, and AWS.

On top of that Kubernetes is itself a walled garden.

You will inevitably end up cargo culting the entire ecosystem of plugins, like Cilium and Helm and so on. All of this IaC is meaningless outside of Kubernetes. Soon enough, you have 10,000 lines of YAML configuring highly proprietary infrastructure with multiple variants for each cloud. At some point you will have to rewrite controllers to add functionality or correct bugs the upstream maintainers don't want to prioritize, and so on.

Your "knowledge" of the stack ends up being the ability to orchestrate 15 levels of templated YAML. Eventually your company ends up hiring people who only know how to copy/paste YAML, and lose institutional knowledge of how underlying systems work. You didn't break out of the walled garden, you created an elaborate prison. And Amazon and GCP and Azure love you, because you're their #1 customer. The more complex you make it to deploy a CRUD app the more they profit.

LtWorf · 2024-03-24T21:47:39 1711316859

> This is in contrast to the experience of a Python developer who's mastered FastAPI/Flask/SQLAlchemy and feels absolutely lost in a Django project

If he has more than 6 months of experience he will figure it out.

littlestymaar · 2024-03-24T19:43:52 1711309432

> I've never come across a devops person who'd say "Hey, that sofware is too simple to prematurely scale for sudden spikes of irrational amounts of traffic, so why not just deploy it on a cheap vps?"

It Is Difficult to Get a Man to Understand Something When His Salary Depends Upon His Not Understanding It

banku_brougham · 2024-03-24T20:04:16 1711310656

- Upskill Synclayer

baridbelmedar · 2024-03-24T20:43:20 1711313000

> Unpopular opinion: I blame the new age devops culture that made cloud app deployments unnecessarily complicated with k8s and cool new tech (that'd get them high profile jobs.)...

Ah yes, time for the annual debate on the complexities of Kubernetes versus the unparalleled genius of custom scripts that seem to work...sometimes. Because reinventing the wheel is always superior to something with a standardized API.

And let's not forget the sheer elegance of a homegrown scripts that rivals the structured approach of Kubernetes. A true testament to intuitive design.

Sure, Kubernetes might have a few minor benefits beyond 'scaling out'. But honestly, who needs the ability to manage complex applications with any semblance of ease?

ProblemExplorer · 2024-03-24T20:46:53 1711313213

> honestly, who needs the ability to manage complex applications with any semblance of ease?

The argument being made here is majority of applications are rarely complex and hence dont require managing that complexity.

A simple webservice fronted by a simple reverse proxy like Caddy running on a single "modern PC" can do wonders without any Kubernetes needing to get involved.

acabal · 2024-03-24T20:19:06 1711311546

At Standard Ebooks we serve a respectable number of page views and ebooks each month - and have been on the front page of HN three or four times - all of it done with a single 4GB VPS. And the only reason we upgraded to 4GB from 2GB is because we needed more RAM for the server to build the extremely large Decline and Fall of the Roman Empire ebook - if it weren't for that, our 2GB server would still have been just fine for all that traffic.

I wrote a blog post about how we do it here: https://alexcabal.com/posts/standard-ebooks-and-classic-web-...

jameshart · 2024-03-24T20:56:12 1711313772

More applications should consider git as a content management database. It's great architecture. Statically serving files built by a CI process running on the server is very tidy.

But let's be clear, your serving infrastructure is able to be that simple because you outsource donation management to https://fundraising.fracturedatlas.org, contribution management to https://github.com/standardebooks, collaboration, membership and communication to https://groups.google.com/g/standardebooks, and marketing signup management to... looks like postmark.

So sure, you can run your stuff on a single server, but you're relying on a bunch of other people running much more sophisticated services on a lot more infrastructure in order to do it.

acabal · 2024-03-24T21:26:45 1711315605

Sure, at some point you're going to be relying on some 3rd party somewhere. We use a VPS and not a bare-metal hand-installed rack, and we rely on an electrical company and not a hand-turned crank to power our servers. As far as email goes, It's simply not possible to self-host transactional email in 2024 if you want it to arrive in an inbox and not a permanent spam blackhole; this is a people problem and not a technical one. Likewise, another people problem is that it's not possible to accept money online without involving a 3rd party service like Fractured Atlas or Stripe or PayPal.

(Moving away from GitHub towards a self-hosted Git solution, and away from Google Groups to a self-hosted mailing list, is actually on our long-term todo list[1]).

All those things don't mean one can't run one's web app on a single tiny server, like we do. I still argue that outsourcing the basic fundamentals of one's web app, like the OS, runtime, or database to some cloud service, or resorting to flavor-of-the-month frameworks or containers, or doing silly things like using Javascript to render one's entire frontend, often simply result in complexity, slowness, and bloat.

[1] https://github.com/standardebooks/web/blob/master/README.md#...

jameshart · 2024-03-24T21:32:10 1711315930

Fully agree and endorse.

But my point is... those are all web applications too, and they don't have the option of outsourcing everything. Someone has to build a system that does more than just serve static files.

The claim that 'the majority of web applications can run on a single server' is kind of belied by the example of a site where not even the majority of sub-applications that are required to provide the full functionality of the system are running on a single server.

acabal · 2024-03-24T21:37:23 1711316243

Of course! Stripe, Paypal, Postmark/Sendgrid/whatever are not part of the majority I'm talking about. (Although Fractured Atlas, which is merely a wrapper for the Stripe API, is; we use it because it solves a people problem, not a technical one.) There are certainly projects and businesses that will require many high-power servers and more complex technical machinery than basic LAMP. However most people on HN are not developing the next Stripe, even if they don't realize it yet.

jauntywundrkind · 2024-03-24T23:31:56 1711323116

Those other workloads don't sound particularly taxing to me. Many get very sparse traffic; hosting a donation page, web newsgroup/discussions, and user management need not drastically scale up the serving footprint here.

Those hosted services mainly are about not needing to pay the human management/ownership costs.

jameshart · 2024-03-25T00:18:54 1711325934

The real win this architecture has is outsourcing its content management database to GitHub. That's where all the complicated stuff like permissions and authentication and change notifications and approval workflows, that make up the bulk of complexity in most bespoke business applications, as well as all the tricky stuff of managing the actual files that the contributors are managing, is all happening. It's a smart decision! There's a lot involved in running a system like that reliably - outsourcing it is a great idea.

If they were to do that in house, by switching to a self-hosted GitLab, say, well... that could be run on a single machine (https://docs.gitlab.com/ee/administration/reference_architec...) at the cost of having to manage scheduled downtime for upgrades. If the user base or activity level grew beyond what that server could handle (and given that the intention of this project is to cultivate an ever growing community of contributors that might be a concern)... the next stop up is an eight node system: https://docs.gitlab.com/ee/administration/reference_architec....

jauntywundrkind · 2024-03-25T23:33:38 1711409618

Again though, that's not actually computationally expensive. It's just hard to get right & maintain, with few do-all open-source offerings ready to stand in. It's not like having 10x the computing resources at hand would highly motivate someone to bring it onboard: it's a difficult thing to manage well, but not actually computationally expensive to do so.

eyegor · 2024-03-24T20:33:39 1711312419

A lot of resources go to scaffolding for containers, runtimes, caching, etc. 4gb would make for a chugging experience if you run a Java spring boot backend for example. But an old school php + postgres stack would be fine, or modern dotnet, rust, etc. And honestly I'm not sure it matters since ram and compute is cheap for most small-medium sites (at worst maybe $30-50/mo vs $10-20/mo).

VPS services are pretty cheap these days. Hetzner, digital ocean, genesis, etc. Or you spend a couple grand to build a dedicated machine in raid6 and only pay utilities.

acabal · 2024-03-24T20:39:04 1711312744

Of course. The entire point is that so much of that is just unnecessary for 90% of web projects. Most projects that are basically a front-end website backed by a local DB can get by on a 2GB VPS if they embrace classic web tech and don't get sucked in to whatever framework/cloud service/container craziness is hot this week. Once they start hitting millions of page views a month, we can talk about upgrading to 4GB :)

LtWorf · 2024-03-24T21:49:46 1711316986

Remember that rust doesn't catch all the memory leaks… So a long running rust process might degrade the performances in the machine, slowly.

pdimitar · 2024-03-26T03:01:49 1711422109

If you can give examples on how does Rust leak memory, that would be informative.

I had two services run uninterrupted for months in 2GB RAM containers. Memory increased steadily the first 5-10 minutes and then stayed there indefinitely long.

vinnski · 2024-03-25T04:58:07 1711342687

Oh man I absolutely love the work that you guys do. I'm actually in the process of learning Ebook production using the 'Step by Step' guide on your website. I'm essentially learning it all from scratch as I have little to no programming/SWE experience (I learned a bit of Lua because of KOReader[1]) but the technical side of ebook production has always fascinated me enough to keep learning. Also because I wanted to contribute more than just typos and grammatical errors (as important as they are).

[1] https://github.com/koreader/koreader

bysja · 2024-03-24T20:49:34 1711313374

Great book

eddd-ddde · 2024-03-24T19:32:05 1711308725

We use k8s at $JOB for literally no reason other than some engineer decided he wanted to learn k8s.

It's the worst, I don't even get to see my server logs because "that would mean giving you access to the entire thing". I'm not a k8s person but surely that has to be missing something.

We could literally make do with a cloudflare 5$ plan and have left to work with. With better integration, and better DX.

ActionHank · 2024-03-24T19:43:28 1711309408

Current project I work on sees:

* everything running in docker containers, but you aren't supposed to run that locally, only local dev\test is through unit and component tests.

* all k8s is maintained by a separate group pulling in those images through a seemingly very over engineered solution

* when it is all combined in a deployment, things break, or more scary, they don't and you need to manually test and hope it's working.

Issues take too long to track down and resolve and the reasons are very obvious to everyone but technical leadership who are constantly high fiving each other over their amazing creation that is already collapsing under it's own weight.

vbezhenar · 2024-03-24T19:47:29 1711309649

> It's the worst, I don't even get to see my server logs because "that would mean giving you access to the entire thing". I'm not a k8s person but surely that has to be missing something.

I'm pretty sure it's possible to configure access to logs, not to the entire thing (whatever that means). He's probably lazy and does not want to bother.

Besides, you should have centralized logging using loki or something similar. Kubernetes logging is not enough for any reasonable use-case.

eddd-ddde · 2024-03-24T20:12:29 1711311149

I know, but we don't get any. I legit had to do logging on my end to a _database_ because they weren't a fan of something like say new relic or similar.

josephcsible · 2024-03-24T20:17:19 1711311439

Isn't that just because Kubernetes makes it unreasonably hard to configure access to logs like that?

ongy · 2024-03-24T20:52:55 1711313575

It's really not that hard if you got proper users.

But doing proper users on k8s is hard. I suspect they just run with admin credentials and no real way to generate users.

vbezhenar · 2024-03-24T21:33:24 1711316004

For small cluster you can just create service account for a user, create token for it and write it in the kubeconfig. Then assign role to this service account and that's about it.

The main issue with this approach is that you can't organize those "users" into a groups. But for a small number of users you can just create all rolebindings and be done with it.

ongy · 2024-03-24T21:48:31 1711316911

Yes. I was thinking of that for a moment as well. But it requires some understanding and is annoying in various ways.

I should look into how to provision users at some point. but OTOH GKE takes care of it for our prod, and other clusters can be run with just admin.

throaway2342551 · 2024-03-24T20:50:00 1711313400

No, it is simple to setup.

ufmace · 2024-03-24T21:03:41 1711314221

I dunno if k8s is actually right for your company, but it most certainly is possible to configure it to give engineers limited access, including viewing logs, pod status, services and deployments, etc, but not reading sensitive data or updating anything. In my opinion, deploying production services using devops technology nobody on the team has much understanding of is what's a bad idea, not any particular technology.

tayo42 · 2024-03-24T19:39:43 1711309183

That's not a k8s thing. It's an idealistic DevOps thing. You'd probably deal with that in alot of places.

amtamt · 2024-03-24T19:44:02 1711309442

That for sure is not "idealistic devops", but someone missing fixing RBAC to let devs have access to logs of their workload.

tayo42 · 2024-03-24T20:19:25 1711311565

Maybe without details or knowing exactly what he means. But I've worked with people that decided you don't get to see any logs on a server except for a select few through some web ui or a log aggregator.

amtamt · 2024-03-25T07:14:58 1711350898

There are production systems handling PII (finance, healthcare etc), where logs are treated as radioactive and only people with trust, training and protective gears should be touching them. This is the case where these are legal/ compliance requirements. Non-prod of those systems can have wide open access to logs, unless someone decides to import prod data for "testing" in non-prod. While I agree access to logs may seem too restrictive, there are some very solid reasons for that... and then we have some cargo-cult and some lack of time/ understanding which causes too restrictive log access where it might not be required.

eddd-ddde · 2024-03-24T20:38:17 1711312697

I know what you are talking about, unfortunately it's way worse than that.

The same guy that runs the k8s is the guy that "reviews" my PRs. It's also the same guy that can't get eslint into our PR process, and the same guy that merges code from other people that just out right brick the server because it had some bad syntax.

I _wish_ my problems were caused from too strict standards and procedures. But really is just a lack of understanding and cooperation.

5Qn8mNbc2FNCiVV · 2024-03-26T13:37:14 1711460234

> that would mean giving you access to the entire thing

There's clearly a skill issue at play here and your intuition is correct

feverzsj · 2024-03-24T20:01:28 1711310488

If your app doesn't server more requests than sqlite.org daily, you shouldn't pay more than it.

> sqlite.org answers more than 500,000 HTTP requests per day (about 5 or 6 per second) delivering about 200GB of content per day (about 18 megabits/second) on a $40/month Linode. The load average on this machine normally stays around 0.5.

[0]: https://sqlite.org/althttpd/doc/trunk/althttpd.md

charrondev · 2024-03-24T20:23:40 1711311820

I mean it really depends on what your doing.

If you’re just serving static content then of course you can get away with one small box.

If you have user content that’s being constantly updated and inserted you need a lot more.

You need databases, caches, and in our case elastic search (with ~10 billion documents). The data needs to be indexed 16 ways to Sunday to make sure that a user hitting the page with this filter or that sort selects just the right records.

If you care about reliability you should have read replicas of your DBs as well.

Where are your logs going? I’d expect servers for that as well.

Now alternative to a bunch of this you could just use SaaS products but that costs an arm and a leg.

feverzsj · 2024-03-24T20:34:28 1711312468

> dynamic pages on the SQLite website typically do about 200 SQL statements each.

[0]: https://www.sqlite.org/np1queryprob.html

johannes1234321 · 2024-03-24T21:08:12 1711314492

How many of the request on sqlite.org go to the "dynamic pages"? I would assume by far most are on the static docs. The forum and other "dynamic" content is quite hidden.

SQLite · 2024-03-25T09:51:50 1711360310

About 10% overall (9.96% to be precise), according to server logs over the previous 10 days.

Robots hit dynamic content at about twice the rate as humans: 14.12% versus 7.8%. About 34% of traffic is from robots, from what I can tell (though to be fair, many robots these days work hard to disguise themselves has human, so the actual percentage of robot traffic is likely much higher.)

therealdrag0 · 2024-03-24T23:21:44 1711322504

I don’t think this is controversial. 5rps is nothing.

Anyone saying otherwise failed to do basic capacity planning. Granted at any given time the number of junior/untrained/nontraditional engineers outnumber senior engineers so it’s not surprising things are build out of proportion, and I suspect that’s what’s behind the majority of the anecdotes.

toonewbie · 2024-03-24T20:29:00 1711312140

I have been running multiple web apps with decent traffic (each ~10K requests per day) at no cost at all. Oracle Free Tier [1] for backends with Cloudflare tunneling [2] and pages [3] for frontend integration. Works fully seamless.

[1] https://www.oracle.com/cloud/free/ (4 cores, 24 GB ram for free; switch to PAYG to avoid idleness shut down, but you still pay nothing)

[2] https://www.cloudflare.com/products/tunnel/ (protects from attack and automatically connects your backend IP into your domain, e.g. https://domain.com/api will redirect to your backend)

[3] https://pages.cloudflare.com/ (easy static page hosting, very fast deploys)

Aachen · 2024-03-24T20:38:28 1711312708

Somehow hearing of getting free hosting from Oracle makes me think of the advice to not stick your "thing" in crazy. It's probably fine, but I'd not touch it if it can be avoided, especially if I can just pay a small and fair fee at another provider (or, in my case, self host it)

toonewbie · 2024-03-24T21:01:51 1711314111

It was my reaction at first too. Luckily, I am not running anything super important or something requiring much privacy there (it's mostly computational services that don't store data). Since I am a student with no residence (and bad university network uptime), I can't self-host, but will do as soon as I move-in to my own place.

EasyMark · 2024-03-25T02:25:08 1711333508

I'm guess they probably use open source tech and such and can move it over to a new VPS in less than a day...

evrimoztamur · 2024-03-24T20:02:44 1711310564

In fact, a single five euro server can run:

My personal website,

Two multiplayer game servers for the games I built,

frp for tunnelling;

and I'm planning on squeezing more in until it gives up.

All reverse-proxied by nginx under subdomains of my personal domain address, absolutely seamless.

Similarly, other applications I built so far all also run under five euro VMs. There's no denying you might need more because you have serious peaks in traffic that you cannot handle with only one server, but do the accounting. It's a worthwhile decision to think through.

makerdiety · 2024-03-24T20:25:11 1711311911

> Similarly, other applications I built so far all also run under five euro VMs. There's no denying you might need more because you have serious peaks in traffic that you cannot handle with only one server, but do the accounting.

A simple proportional-integral-derivative controller equipped onto your server resource can help you see if future traffic spikes are occurring. The question is, what kind of person is able to have that kind of perception ability and is willing to integrate that into his daily life? Not the average software developer, we can definitely say with great confidence.

evrimoztamur · 2024-03-24T20:34:41 1711312481

I'm partial to forecasting volatility via GARCH models as well!

polishdude20 · 2024-03-24T21:18:30 1711315110

Wait what would a PID controller do? What would it be controlling?

makerdiety · 2024-03-24T23:55:37 1711324537

You're supposed to be repurposing the controller into a reporter. Since you don't need control, you can instead just use the PID portion of the PID controller.

But you can also attach an actual automatic control for when the sensor reports a positive traffic influx prediction value.

And I think stuff like Kubernetes includes this feature. Go figure.

polishdude20 · 2024-03-25T16:41:55 1711384915

What value does the reporter report? Isn't it just a measurement? In which case you don't need a PID?

makerdiety · 2024-03-25T21:10:29 1711401029

It's a sensor that informs its users of it how much control is needed for a networked server that can experience volatility in the form of network traffic spikes. The numerical calculus from PID controllers helps provide that crucial information. A bang-bang controller would, for example, provide very sub-optimal control, as volatility doesn't work under simple on-and-off models.

mcluck · 2024-03-24T20:17:07 1711311427

I once worked a job where I easily spent 4 times as much time fighting with Terraform files than actually writing features. This company got less than 1,000 hits per day. I think about this a lot

Nextgrid · 2024-03-24T20:24:02 1711311842

I've had gigs where it felt like the company was spending more time fighting self-inflicted consequences of their "best practice" "cloud-native" architecture than developing actual revenue-generating features.

So many problems they were dealing with would magically disappear if all the services were running on a single high-end physical machine (the scale didn't mandate anything bigger than that) with a standby one sitting in a different DC for redundancy purposes with incremental DB snapshots shipped to it every 15 mins.

The cloud-native, "modern" infra became a liability and impediment to business but too many people would lose face to admit it.

zer0tonin · 2024-03-24T19:38:02 1711309082

Yeah, a 4$/month VPS can handle thousands of queries per second: https://alicegg.tech/2023/02/06/4dollar-vps

master_crab · 2024-03-24T19:43:43 1711309423

Can the majority of webapps run on a single machine? Probably.

Do most of your customers expect close to 100% uptime? Yes.

Does one machine provide the uptime required by your customers? Most definitely not.

fabian2k · 2024-03-24T20:05:52 1711310752

I don't think the point here changes all that much if you change this to two servers and a load balancer. That's still a pretty simple setup.

But I also think that many applications are far more tolerant of small outages than you imply. And a more complex setup also adds more points that can lead to an outage even though it reduces the chance that hardware will cause one.

Aachen · 2024-03-24T20:41:34 1711312894

Wouldn't you also need two load balancers then? Otherwise you've still got a single point of failure. And how do you keep the failover system in sync? It's a whole can of worms to promise 100% uptime. It's super rare that a server physically breaks and suddenly goes dark with less than a day's warning, and for most applications such a once-in-5-years event is tolerable if that means the hosting costs are divided by five as well (so far I'm ~10 years in and haven't experienced such an event yet; disks in RAID have broken but not a cpu or other SPOF)

erik_seaberg · 2024-03-25T02:49:50 1711334990

> super rare that a server physically breaks

I kinda want to know where you shop, because commodity hardware has a reputation as just utter crap. Before cloud went mainstream, I worked somewhere that had some racked servers in a colo literally catch fire. (The remote hands put them out, disconnected, and refused to touch them ever again.)

kasey_junk · 2024-03-24T20:22:08 1711311728

Nearly no one is competent and able to setup a true load balancer. If you have one either it’s provided by your (most likely vm based) hosting provider or you are a big enterprise.

If you are in one of those two categories then setting up something more robust than a single server isn’t much more complicated than a single server.

That is to say, I think it very much does change the equation around the original statement.

Puts · 2024-03-24T19:55:43 1711310143

Well if you can spin up a new environment in less than a minute – does it matter? I mean if you are Facebook yes, but having worked with a lot of people building things like web shops – having a minute downtime I would say is fine because I've seen so many of these people shoot themself in the foot trying to build high-availability solutions and having constant downtimes just because they can't handle their own complexity.

ardel95 · 2024-03-24T20:04:18 1711310658

If you have the ability to spin up a new machine when the old one fails, and deploy your app onto it in one minute, it’s not a big leap to also run your app on two machines and avoid that downtime altogether.

Nextgrid · 2024-03-24T20:18:48 1711311528

Running two instances of a stateful application in parallel forces you to consider nasty and hard problems such as CAP theorem, etc. If your requirements allow, it's much easier to have an active-standby architecture over active-active.

ardel95 · 2024-03-24T22:09:27 1711318167

Totally. But most applications are not stateful.

Nextgrid · 2024-03-24T23:14:45 1711322085

Most applications as a whole are absolutely stateful. Individual components of them might not be (app servers are stateless with the DB/Redis containing all state), but the whole app from an external client's perspective is stateful.

If we're talking about reliability/outage recovery, we're considering the application as one single unit visible from the external client's perspective - so everything including the DB (or equivalent stateful component) must be redundant.

Sadly this is also where a lot of cloud-native tooling and best practices fall short. There are endless ways to run stateless workloads redundantly, but stateful/CAP-bound workloads seem to be ignored/handwaved away.

I've seen my fair share of stacks that are doing the right thing when it comes to the easy/stateless parts (redundancy, infinite horizontal scalability), but everyone kinda ignores the elephant in the room which is the CAP-bound primary datastore that everything else depends on, which isn't horizontally scalable and its failover/replication behavior is ignored/misunderstood and untested, and they only get away with it because modern HW is reliable enough that its outage/failover windows are rare enough that the temporary misunderstood/unexpected/undefined behavior during those flies under the radar.

ardel95 · 2024-03-24T23:56:00 1711324560

That’s a pretty pedantic interpretation of the word application. In the context of software owned by most teams, that they may decide to run on single vs multiple hosts most applications are absolutely stateless. Most applications outsource state to another system, like a relational database, a managed no-SQL store, or an object store.

And so no, most teams don’t need to worry about the hard problems you bring up.

echoangle · 2024-03-24T22:46:13 1711320373

Is it really an application if it’s not stateful? Maybe you’re managing the state client-side which makes it easier but I wouldn’t call a plain website an application, or am I missing something?

Nextgrid · 2024-03-24T23:22:16 1711322536

At the smallest level, even every byte of an in-flight HTTP request is still state. State, and for that matter "uptime" really depend on what the application/service ultimately does and what the agreement/SLA with the end-customer is.

The correct high-availability solution should take business requirements into account and there is no silver bullet. Running everything on a $5 VPS is no silver bullet, but neither is your typical "cloud-native" "best practice" stack that everyone keeps cargo-culting which often leads to unnecessary cost while leaving many hard questions (such as replicating CAP-bound stateful databases) unanswered.

pvorb · 2024-03-24T20:04:22 1711310662

How long does it take to figure out there's a problem with your environment and react to the failure? That's likely more than a minute.

And spinning up a new VM typically takes longer than a minute. Even on cloud providers.

maccard · 2024-03-24T20:35:20 1711312520

Our environments are entirely automated on AWS. It takes about 10 minutes to get an ALB and a mysql instance just on the AWS side. We've configured ECS to have the quickest health checks we can manage, and it still takes 3 minutes from the initial call before everything is happy.

Our environments are easily repeatable, they're very maintainable, but they're not quick to start up

ocdtrekkie · 2024-03-24T20:16:01 1711311361

You'd be surprised how close to 100% uptime you can get on one machine with well-engineered software. Schedule an automated update and reboot for a time and day of the week you have the fewest customers monthly, and if you have an SSD as the boot drive, unless you are a huge company nobody will notice or care that at 4 AM on Sunday your service was down for a minute and a half.

NorwegianDude · 2024-03-24T21:04:08 1711314248

You can also avoid most reboots by livepatching the kernel.

ocdtrekkie · 2024-03-24T21:17:04 1711315024

Indeed. I come from the Windows world where things are a lot slower and there are less options for things like livepatching. So my "minute and a half downtime" is definitely pessimistic compared to what you can accomplish on a Linux server.

maxcoder4 · 2024-03-24T20:29:56 1711312196

>Do most of your customers expect close to 100% uptime? Yes.

I think this is mostly a self imposed requirement. Banks regularly have overnight technical breaks. My country national rail has 30 minutes of downtime every night (!).

Unless your product is already global, most people won't have a problem with occasional overnight scheduled downtime.

Solvency · 2024-03-25T03:26:04 1711337164

Banks have most people by the balls. Most SaaS don't and wish they did.

Jgrubb · 2024-03-24T19:50:25 1711309825

The correct answer to question 3 is "it depends".

SkyPuncher · 2024-03-24T20:42:04 1711312924

You’d be surprised.

Most people never even test number 2. If your a novel or niche product, you can afford a “lot” of downtime

For number 3, it really all depend on your load. I’ve run most early stage startups on an incredibly simple setup with a proxy placed in front of an app server.

Occasionally, our monitoring will start reporting high p99’s so we go in and bump the servers to the next tier or scale horizontally just a bit more. Eventually, that breaks down but many startups will be well into Series A or Series B point. At that point, you know your customer’s needs and can hire dedicated engineers to solve reliability and uptime.

sjducb · 2024-03-24T19:51:09 1711309869

Master slave failover will give you less than a second of downtime.

hparadiz · 2024-03-24T20:03:38 1711310618

Pretty much any provisioned vm can be split down the middle. Got 4 cpus and 4gb of ram? Congrats you have two vms with 2 cpus and 2gb of ram. Throw them into a 50/50 load balancer and never have any downtime.

toast0 · 2024-03-24T20:13:59 1711311239

Two vms on the same machine doesn't help when the machine fails.

Also, I sure hope your loadbalancers don't suck. I've worked with some that had worse uptime than my servers, and worse capacity too. Went back to putting the two host addresses in DNS, which is mostly fine, but means a lot of waiting when you want to take a host out of rotation for disruptive maintenance.

hparadiz · 2024-03-24T20:21:18 1711311678

I do this on AWS so the load balancer is managed for me. Never had issues. Two randomly provisioned vms are unlikely to be on the same bare metal.

aflukasz · 2024-04-01T09:20:37 1711963237

You do have some control (quite typically?): https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/placemen..., https://docs.hetzner.com/cloud/placement-groups/overview.

viraptor · 2024-03-24T20:20:04 1711311604

VMs going down is not an unexpected event. It's not going to happen even day, but hosts crash, network goes down, datacentres experience thermal events, etc. Your solution may work for your own system crashes, but not most of the rented platform failure modes.

userbinator · 2024-03-24T19:46:23 1711309583

Make it two machines then.

master_crab · 2024-03-24T19:49:24 1711309764

Ya. 1<n<6 has worked 99% of the time for me.

watermelon0 · 2024-03-24T20:51:02 1711313462

Multiple machines don't automatically mean a better uptime. Outside of the regular maintenance, many problems can affect multiple instances, not just one.

(Regular maintenance meaning OS updates needing a reboot, which happens like what? Once per month for a minute?)

NorwegianDude · 2024-03-24T20:59:20 1711313960

Servers are a lot more stable than you think.

nektro · 2024-03-24T20:43:29 1711313009

how many websites out there realistically need close to 100% uptime?

zeroCalories · 2024-03-24T20:27:44 1711312064

Gonna throw out an anti-cirlcenjerk take:

We know. Absolutely no one has ever said that your super basic webapp that has the latency requirements of "make sure it works", an SLO of "it's fine most of the time", and a code base under 10k lines of code worked on by one guy, needs these super complicated systems. Even FAANG will run internal services and dashboards on a single binary, but when every second of downtime can be translated into lost money, then you begin to care.

Yeroc · 2024-03-24T20:51:49 1711313509

Unfortunately many of these super-complicated systems run at odds to having a reliable and fast, system even when every second of downtime can be translated into lost money.

zeroCalories · 2024-03-24T21:44:18 1711316658

Realistically, they are required. I don't care how you do it, but you need 1. Seamless fail overs 2. Easy horizontal scaling 3. Location based load balancing 4. A way to deploy and rollback all of these automatically. As a result, things are gonna get complicated no matter what you do. But as it turns out, using something like Kubernetes, Elixir, or AppEngine can make all of that very manageable.

Yeroc · 2024-03-25T01:11:37 1711329097

#3 is debatable for many, many businesses. The other three are (almost) trivial with a decent load balancer and blue/green deployments. This is something very achievable without an over-complicated architecture.

zeroCalories · 2024-03-25T03:58:47 1711339127

3 is incredibly important for global businesses. You ever try using a Japanese website?

As for how "easy" the others are, the truth is that the options I provided will be easier to make, easier to maintain, and work better than your home grown solution once you need to account for multiple machines.

leeoniya · 2024-03-24T21:16:13 1711314973

this is the correct take, imo. i think a lot of projects start out overly complicated with the expectation of getting huge and needing to scale horizontally in a hurry, which a monolith cannot really do well. of course these expectations rarely become reality and you end up with some huge web of microservices that two devs need to orchestrate and manage for no benefit.

at Grafana we're now decoupling a monolith primarily so teams working on different parts can manage their own deployments/rollbacks/incidents independently of "official" monolith versions. it's a very hard/expensive refactor once you're at a scale of millions of users and hundreds of engs. but really an impossible thing to predict at inception.

dehrmann · 2024-03-24T20:55:04 1711313704

> every second of downtime can be translated into lost money, then you begin to care

You don't even need to be at FAANG-scale for this. Once you have paying customers, you care.

arnaudsm · 2024-03-24T20:46:34 1711313194

I wouldn't say complexity and reliability are correlated, worse, it's an inverse correlation.

zeroCalories · 2024-03-24T22:07:06 1711318026

Yes, programs that don't need to do anything can be very reliable.

rr808 · 2024-03-24T20:42:16 1711312936

Most over engineering is career driven development. You may laugh at the people who unnecessarily use loads of microservices, specialized tools and libraries, exotic cloud solutions etc. But the architects and senior developers who do this stuff now have great high paying jobs. If you choose the simple cheap solution the client might love you (realistically they wont know the difference) but you wont be able to get those high paying jobs. Ask me how I know.

nadermx · 2024-03-24T19:29:14 1711308554

A $5 VPS, with almost any app, can handle the front page of hacker news. If I remember correctly I'd even venture to say hacker news runs on a one or two core box. It should be the standard

abound · 2024-03-24T19:50:40 1711309840

Hacker News might not be the best example, as it seems to get overloaded once every week or two, usually when a contentious/high volume topic comes up.

polishdude20 · 2024-03-24T21:22:03 1711315323

And despite that, hackernews is as popular as ever. I think the moral of this story is that having some downtime isn't a big deal.

Macha · 2024-03-24T20:37:18 1711312638

My impression is that your standard wordpress + Apache + mod_php + mysql on same server install cannot, without plugins that improve wordpress' caching behaviour, and that's a large portion of the sites that do fall over.

Animats · 2024-03-24T19:42:55 1711309375

Yes.

You don't even need containers. Write your program as one Go executable and drive it from fcgi. Go is fast enough that you can get a lot of work done on a minimal server. Fgci provides "orchestration" of multiple processes, plus crash and restart handling.

Also, there's much less attack surface. If your minimal Go program can only respond to specific requests, there's not the problem of an attacker targeting some unused feature of the site. Go has subscript checking, so you don't have buffer overflow vulnerabilities.

throwaway63467 · 2024-03-24T19:56:17 1711310177

Go can even serve traffic directly without a frontend server, ACME support for TLS is also great.

moondev · 2024-03-24T19:49:12 1711309752

You don't even need to write your program in go or configure and deploy with fcgi. Write it in whatever you want and stick it in a container.

Animats · 2024-03-24T21:01:15 1711314075

Right, but the interpreted languages will be on the slow side, which matters with the limited resources of shared hosting.

nojvek · 2024-03-25T12:11:58 1711368718

Slow is relative unless you actually benchmark.

V8 has come a really long way for JS.

LtWorf · 2024-03-24T21:55:51 1711317351

Then write C++ if you want fast :D

jameshart · 2024-03-24T20:24:09 1711311849

What kind of systems are people imagining when they picture 'the majority of web apps'?

In this thread, I see people citing HN (a simple bulletin board system); or sites that will get linked from HN and have to handle load - so, presumably blogs, or CMS-backed marketing sites, maybe with a sign-up form or a single product ecommerce storefront?

Sure, I can buy that you can run a pretty robust bulletin board or CMS on a single server. PhpBB and Wordpress with a Postgres backend will get you a long way.

But if you think that's what all web apps are like, you're not thinking anywhere near big enough. "I've seen some stuff that's over engineered" is not evidence for "everything is over engineered".

sjducb · 2024-03-24T19:58:01 1711310281

The real reason for cloud micro service architecture is legacy code.

Imagine you’ve inherited a 10 year old PHP monolith. The code is almost indecipherable and the business wants new features in a timely and predictable manner.

The easiest way is to implement the new features in their own micro services.

Sure it makes the complexity problem worse, but that’s a problem for someone else in five years time.

bcaxis · 2024-03-24T22:58:54 1711321134

You could solve this other ways. Build a new better monolith and have a reverse proxy route between them and slowly update and move routes over. You will get through a rewrite slowly one bit at a time.

If you keep the same database schema this should generally work pretty well.

I migrated a terribly written web app this way, it worked pretty well.