A 15-minute skim of this - specifically the sections on the stuff I've worked with the most - suggests this is a very, very good complement to the official documentation.
At a minimum, I'd recommend anybody/everybody considering AWS read and think about the "When to use AWS" section. Whilst AWS is an excellent set of tools that has completely changed the economics of deploying software, there are times when you should use Google Cloud, times you should use bare metal, and times you should use Heroku. AWS is a complex beast. Heroku is simple, but has limitations.
There are a bunch of apps I'm thinking about building at the moment where I realise a hybrid approach is best: some of GCP's stack, some of AWS', and a small amount of my own bare metal. Knowing when to choose which is not intuitive and comes with time, but there are big, big clues that will help the uninitiated in that section of this open guide.
Also, if you're looking to the future, AWS Lambda and Google Cloud Functions are perhaps the most exciting things to start building knowledge of now if you're a developer, I think.
> There are a bunch of apps I'm thinking about building at the moment where I realise a hybrid approach is best: some of GCP's stack, some of AWS', and a small amount of my own bare metal. Knowing when to choose which is not intuitive and comes with time, but there are big, big clues that will help the uninitiated in that section of this open guide.
Unless you have a metric shitton of money to blow, there's never a good reason to start with that.
The most expensive part of any of those cloud providers is networking. If you need to transfer data between bare metal <-> AWS, you'll need Direct Connect, which basically charges an arm and a leg.
Transferring between AWS <-> GCE is expensive for the same reason. Sure, if you're Apple-scale and need better data redundancy, maybe it's okay. Maybe. But that's not an app you think about building as an individual or small company.
I also don't think GCP's stack has anything whatsoever that AWS's doesn't, so it's odd to mention it in that phrase.
If you'd be so kind as to provide an example application you're thinking about, and the reason each of those is needed for some part of it, I'd be happy to hear it!
Personally, I'm not convinced the price will come down low enough for cloud functions / AWS Lambda to ever be cost-effective. We've looked at it plus API Gateway, and it would be orders of magnitude more expensive than our current giant fleet of webservers.
Kubernetes (and similar technologies), on the other hand, makes it possible to get the same economics as cloud functions while still tying your cost directly to the computing resources you use. It also gives you the freedom to (with some pain) move your entire platform to a different provider.
This was exactly my reaction. The tips around Amazon Redshift were spot-on, including a few obscure-but-critical ones, e.g. the one about many small tables taking up a ton of disk space!
I recommend you also make the content available in a one-topic-per-page format ASAP, before someone else does and takes credit for it.
WHY: Google still doesn't handle anchor-links very well. You have 1000 amazing articles on a single page. Each section (e.g.: "High Availability on AWS") would be a great resource for someone searching on that topic in Google. But when you put it all on one page Google infers "1/1000th of this page is about high availability on AWS" and gives better rankings to a page that is 100% about high availability on AWS.
I'm sure it would be pretty simple to write a script that breaks up topics into individual pages. I love the style of having it all on one page but I think it would be a waste of your hard work not to get all this great writing in front of search.
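To sketch what I mean - a rough, untested script that splits a single README.md on its "##" headings into one page per topic. The paths and heading level are assumptions about how the guide is laid out:

```typescript
// Split README.md into one markdown file per "## Topic" section.
import { readFileSync, writeFileSync, mkdirSync } from "node:fs";

const source = readFileSync("README.md", "utf8");
mkdirSync("pages", { recursive: true });

// Split just before every line that starts a "## " heading.
const sections = source.split(/\n(?=## )/);

for (const section of sections) {
  const heading = section.split("\n", 1)[0].replace(/^#+\s*/, "").trim();
  const slug = heading.toLowerCase().replace(/[^a-z0-9]+/g, "-").replace(/(^-|-$)/g, "");
  if (!slug) continue; // skip anything without a usable heading
  writeFileSync(`pages/${slug}.md`, section);
}

console.log(`Wrote ${sections.length} pages.`);
```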
I understand the concern. We'll try doing something about that. That said, single page on GitHub for the moment means (1) discoverability directly on github.com, which helps everyone and (2) browser search on the whole guide (which actually is more helpful than you might think!).
Completely agree, once I discover a guide like this, I bookmark it, come back to it, and really value the ctrl-f-ability.
I was recommending the one-topic-per-page idea for others who haven't yet found this nugget. I think a lot more people will discover it and benefit from it if they are finding it from specific google searches.
I know HN can be a source of a lot of unfounded flyby critiques, and I don't want to contribute to that trend. I see you have a pretty good contributing guide, so maybe I'll try and submit a PR with a solution in the spirit of Hacktoberfest!
As I'm sure you're aware, a lot of documentation is made available in several formats, such as 1) single page HTML, 2) multiple page HTML (e.g. one page per section), and 3) single PDF.
The different versions are automatically generated from a single common source but that would probably require a major change in how you create your guide and so may be more work than you want to take on.
To illustrate why this is useful, I'm a network engineer who primarily works with Cisco gear. Cisco has an absolute wealth of information -- product manuals, configuration guides, etc. -- accumulated over a couple of decades spread out across their web site(s). Unfortunately, their web site team likes to change things -- A LOT! -- and pages "move" frequently and it's often impossible to find them again. Because pretty much everything I'm interested in is available in PDF format, I save these versions locally where I can find them and refer to them later. Quite often, the times that I really need to look up some obscure feature are times when I am somewhere that either 1) I cannot connect my laptop to the network or 2) Internet access is unavailable, heavily filtered, or outright prohibited (of course, that's probably not going to apply for someone working with AWS.)
Regardless, you've put together a wonderful, comprehensive resource here. I'm a "minimal" user of AWS (primarily S3) but I am familiar with the different products, and you've done an awesome job of summarizing Amazon's "dense" documentation down to its key points.
This is great. I've been working on AWS for close to 10 years now and an open guide is something I both need and want to contribute to.
Many of us have simple goals on AWS. The official AWS docs are thorough, but too technical. There are blog posts about everything, but they can be hard to find or go out of date.
I hope this open guide helps us all get our jobs done faster and easier!
Very glad to hear. It's this sentiment exactly that led us to get this started. We all have 100s of valuable tricks and gotchas we've learned over the years, but 99% of the time we fail to write them down and share them helpfully. Do join us on Slack/GitHub and help us get your tips included, too.
I have the same issue with not writing stuff down, and there are plenty of gotchas. I finally got round to starting a 3-part blog series on AWS vs Azure, vendor lock-in, and pricing confusion [^0], but I'll see if I can contribute to this too.
I am relatively new to building larger apps. I've worked for a couple of years building with Drupal and hacking PHP. Now I only want to develop full-stack JavaScript; I really enjoy its messy nature. Last week I discovered that user-uploaded files are not persistent on a Heroku-hosted app. To solve that problem, I created an AWS account for S3, which was the first time I'd used AWS. I quickly figured out how to swap the Node.js fs functions for the AWS SDK. Setting up a bucket and a test bucket was easy, and configuring IAM rules is intuitive.
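For anyone making the same swap, this is roughly what it looks like - a minimal, untested sketch using the AWS SDK for JavaScript v3; the bucket name, key prefix, and region are placeholders:

```typescript
// Instead of fs.writeFile to local disk (which Heroku wipes), push the
// upload straight to S3. Bucket, key prefix and region are placeholders.
import { S3Client, PutObjectCommand } from "@aws-sdk/client-s3";

const s3 = new S3Client({ region: "us-east-1" });

async function saveUpload(fileName: string, body: Buffer): Promise<void> {
  await s3.send(
    new PutObjectCommand({
      Bucket: "my-app-uploads",   // placeholder bucket
      Key: `uploads/${fileName}`, // placeholder key prefix
      Body: body,
    })
  );
}

// Usage: saveUpload("avatar.png", fileBuffer);
```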
You're right. Their docs are far beyond the scope of what I needed to get started. Interestingly, I would rather have Google searches about AWS show Stack Exchange answers, but most of the first results are Amazon documentation, which is far more difficult to read and sort through.
Wow, the link to http://www.ec2instances.info/ alone is so helpful. I wish I'd had this set of resources a year ago when I spent weeks trying to understand AWS' own documentation.
If you want to answer the question "What's the cheapest way to get 16gb of ram and 4 cores?" (or the same for a 1 year term) then having a list you can filter and sort is much more helpful than Amazon's pricing pages.
Upvoted for this link alone. I am so, so tired of the scroll, squint, hunt & jump I have to do on the current Amazon EC2 pricing page to compare costs and features of instances. Especially when trying to compare legacy instances (which we still have a lot of) to newer or VPC ones.
Remember, this isn't a blog, it's a living GitHub project: if you see value in info like this, consider contributing or giving feedback to improve it. :)
What I would consider one of the most important pieces of this guide is closer to the bottom (https://github.com/open-guides/og-aws#aws-data-transfer-cost...) where it covers cost management strategies. The Data Transfer Costs diagram makes the buried details of AWS networking costs stand out in a digestible way. I've read the AWS docs on this many times and still missed out on some of the nuggets exposed in the diagram.
As a consultant that often recommends migration to AWS services for clients, this is a treasure-trove of information when looking at each individual use case and making a determination about how best to advise. It's often difficult to know with certainty whether AWS vs Google Cloud vs bare-metal is the best course of action, and the advice and information here goes a long way in helping make those decisions easier.
One of the biggest lessons I've learned is that you need occasional EBS-to-EBS backups. Anyone who has had to recover from snapshots knows the painful reason why...
I get a lot of shit for not giving straight answers... just spin up an instance, put a gig of data on an EBS drive, snapshot it, create an EBS volume from the snapshot as if you were recovering (rough sketch below), and try pulling 100+ megs of data off it... you'll never not keep EBS copies again. Big clue: pre-warming.
It will take you an hour to do, and you'll be years wiser.
This is probably the number one reason people experience extra downtime when rebuilding after whatever issue... and EBS volumes in certain regions can and will die silently.
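To make the experiment above concrete, here's a rough, untested sketch of the snapshot -> restore half using the AWS SDK for JavaScript v3 (the volume ID, availability zone, and region are placeholders); the slow part is step 3, actually reading the data back:

```typescript
// Minimal sketch of the snapshot -> restore experiment, using @aws-sdk/client-ec2.
import {
  EC2Client,
  CreateSnapshotCommand,
  CreateVolumeCommand,
} from "@aws-sdk/client-ec2";

const ec2 = new EC2Client({ region: "us-east-1" });

async function snapshotAndRestore(): Promise<void> {
  // 1. Snapshot the source volume (the one holding your ~1 GB of test data).
  const snap = await ec2.send(
    new CreateSnapshotCommand({
      VolumeId: "vol-0123456789abcdef0", // placeholder
      Description: "restore-speed test",
    })
  );

  // 2. Once the snapshot completes, create a new volume from it,
  //    as if you were recovering after a failure.
  const restored = await ec2.send(
    new CreateVolumeCommand({
      SnapshotId: snap.SnapshotId!,
      AvailabilityZone: "us-east-1a", // placeholder
      VolumeType: "gp2",
    })
  );

  // 3. Attach the new volume to an instance and time how long it takes to
  //    read 100+ MB off it. Blocks come back lazily from S3 on first read,
  //    which is why a freshly restored volume is painfully slow until every
  //    block has been touched ("pre-warming").
  console.log("Restored volume:", restored.VolumeId);
}

snapshotAndRestore().catch(console.error);
```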
The "use IAM roles for EC2" recommendation is a bit sketchy. The current security zeitgeist, not just after Colin's post but also after DerbyCon and Black Hat, is that EC2 roles are dangerous and, when under attack, not very predictable.
Using IAM roles for EC2 is far and away better than what beginners would otherwise do, which is create a set of permanent credentials and deploy them everywhere.
"Have the application retrieve a set of temporary credentials and use them." "In the case of Amazon EC2, IAM dynamically provides temporary credentials to the EC2 instance, and these credentials are automatically rotated for you."
An attacker should only have access until the creds expire, no?
That's right. Instance profile credentials have an expiration time of a few hours. However, if the instance policy is very open, you could create yourself a new IAM user or use STS to maintain persistence after the generated credentials expire.
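For context on what those temporary credentials look like in practice, here's a minimal sketch of fetching them from the instance metadata service; the role name is a placeholder, and it assumes Node 18+ (global fetch) and the older IMDSv1 behaviour (IMDSv2 needs a session token first):

```typescript
// Fetch the temporary credentials an instance role exposes via the
// instance metadata service. "my-app-role" is a placeholder role name.
const BASE = "http://169.254.169.254/latest/meta-data/iam/security-credentials";

async function showRoleCredentials(): Promise<void> {
  const creds = await (await fetch(`${BASE}/my-app-role`)).json();
  // The payload includes AccessKeyId, SecretAccessKey, Token and, crucially,
  // an Expiration timestamp - AWS rotates these automatically, so anything
  // copied off the box stops working once it expires.
  console.log(creds.AccessKeyId, creds.Expiration);
}

showRoleCredentials().catch(console.error);
```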
This is why it's important to lock down instance profiles to do only what the application needs and no more. For example, you may grant permission for s3:DeleteObject, and in the event that the box is compromised the attacker would be able to delete files in your S3 bucket. However, if you don't grant access to s3:DeleteObjectVersion, you can evict the attacker and restore the deleted objects with relative ease.
This is why I would not recommend giving s3:* to an instance profile (or indeed, to any production credentials).
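As a rough illustration of that kind of least-privilege instance profile - a minimal, untested sketch with the AWS SDK for JavaScript v3; the role, policy, and bucket names are placeholders:

```typescript
// Attach a narrowly-scoped inline policy to an instance role: the app can
// read, write and delete objects, but not delete object *versions*, so a
// versioned bucket stays recoverable if the box is compromised.
import { IAMClient, PutRolePolicyCommand } from "@aws-sdk/client-iam";

const iam = new IAMClient({ region: "us-east-1" });

const policy = {
  Version: "2012-10-17",
  Statement: [
    {
      Effect: "Allow",
      Action: ["s3:GetObject", "s3:PutObject", "s3:DeleteObject"],
      Resource: "arn:aws:s3:::my-app-bucket/*",
    },
  ],
};

async function attachPolicy(): Promise<void> {
  await iam.send(
    new PutRolePolicyCommand({
      RoleName: "my-app-instance-role",
      PolicyName: "my-app-s3-access",
      PolicyDocument: JSON.stringify(policy),
    })
  );
}

attachPolicy().catch(console.error);
```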
Thank you for the reply - that makes sense to me; least privilege seems to be the primary defense in that case. Having explicit creds you rotate yourself I could see having benefits as far as control, but it also requires more work and more potential for implementation mistakes.
Well, the AWS credentials auto-rotate. It does, however, provide a familiar place for an attacker to go to get the instance credentials, but that doesn't really help them much. At some point, those credentials must exist in plain text for you to use them. If they're in a config file, they can be read out; if they're in RAM, they can be pulled out with a debugger. At least if your box is temporarily owned due to a zero-day that you later patch, the credentials aren't going to be valid for long - although that situation would be hardly ideal!
You've also got to go to the trouble of getting the credentials onto your box in the first place. With instance roles, you can launch an instance and have it immediately capable of doing what your application needs. In the case of most applications my company runs, the instance profile is enough and no further security credentials are required. When database credentials are required, they're retrieved via S3, authenticated by the instance profile.
We use IAM roles and credstash (DynamoDB and KMS) for retrieving database credentials. My comment was mostly about the fact that we cannot control the rotation for roles - say, in the event of a breach where someone committed keys to GitHub, I can explicitly expire/rotate them (assuming those keys were not themselves temporary and have not already expired :)).
I believe you can, actually [0]. In a production setting it's a lot harder to accidentally leak the credentials - my concern would be if someone compromised the instance, or if it was tricked into opening the instance metadata up to the net, such as via a badly configured nginx instance (how you'd do that accidentally, though, I have no idea).
I would really, really appreciate if you would elaborate on this. Security seems to have the most unspoken community knowledge of anything I need to know.
Yep, not sure where this perception of "nobody's using it" comes from, but I have been using it at 2 different companies in the last 3 years as well, with nothing but love. In fact, if it were the case that "nobody's using it for good reasons", maybe we ought to know the reasons?
Been using OpsWorks for about a year now, and while it has very significantly streamlined our provisioning/deployment tasks, "nothing but love" is not quite how I'd describe it.
You could code up something in the deploy hook to select a master node (usually the first instance in the layer) to run migrations, and you could disable "Run Migrations" when you deploy. I do this for the Rails app at my company.
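One possible (untested) way to express that "first instance in the layer" check from the box itself, assuming it can reach the OpsWorks API and the instance metadata service - the layer ID and region are placeholders:

```typescript
// Only the oldest online instance in the layer runs migrations.
import { OpsWorksClient, DescribeInstancesCommand } from "@aws-sdk/client-opsworks";

const opsworks = new OpsWorksClient({ region: "us-east-1" }); // placeholder region

async function shouldRunMigrations(): Promise<boolean> {
  // Which EC2 instance am I?
  const myId = await (
    await fetch("http://169.254.169.254/latest/meta-data/instance-id")
  ).text();

  const { Instances = [] } = await opsworks.send(
    new DescribeInstancesCommand({ LayerId: "layer-id-placeholder" })
  );

  // Sort online instances by creation time and elect the oldest one.
  const online = Instances.filter((i) => i.Status === "online").sort((a, b) =>
    (a.CreatedAt ?? "").localeCompare(b.CreatedAt ?? "")
  );

  return online[0]?.Ec2InstanceId === myId;
}

shouldRunMigrations().then((run) => console.log("run migrations:", run));
```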
I actually solved this with our custom deploy script. We choose the machine that runs the migrations.
The necessity of building and maintaining a custom deploy script is the biggest wart for us (though I admit that the API is pretty good, and said script has not had that much maintenance overhead).
Those of us contributing to the Guide so far have generally been at companies where it's not used. I'd love to see a contribution (a few bullet points and/or links) that better covers the basics and reflects how/when it's useful.
Please write an update and submit a PR. I'm moving from Ansible to Chef and would love some real world advice on what Opsworks has to offer me without another dreaded POC.
It's likely the original authors aren't using Chef or just use Chef server as I do now.
My guess is that there are companies with "legacy" applications, that can't really be re-written into a distributed system, have a large footprint, but still need to be run.
A special sub-category of those is huge RDBMS instances - a pretty common choke point in growing companies with weaker engineering teams. Some of those companies would pay basically any price to keep those DBs running.
I've temporarily scaled up to c4.8xlarge for a few hours every now and then to get some parallelized computations done quickly. Plays nicely with Clojure's (pmap) function.
Applied ML research here also -- a lot of interactive (but highly parallelizable) modeling and graphing. With medium-size data sets of around 3-4GB in RAM, by the time you've forked the process a few times you easily end up beyond the m4.10xlarge or c4.8xlarge limits.
IMO there's an awkward space between small data and big data where it isn't really worth spending a long time treating it like a real "big data" problem, and the x1 instance gives you an easy out.
> A single EBS volume allows 10k IOPS max. To get the maximum performance out of an EBS volume, it has to be of a maximum size and attached to an EBS-optimized EC2 instance.
Out of date; EBS volumes can be up to 20k IOPS per volume, and what is "maximum size"? Getting the maximum performance out of a volume depends on the workload, the instance size you've attached it to (rather than EBS optimization), the number of IOPS provisioned, and whether you've pre-warmed it from a snapshot restore or not.
> A standard block size for an EBS volume is 16kb.
A block can be 1kb -> 256kb in size. It depends on the application.
> EBS volumes have a volume type indicating the physical storage type. The types called “standard” (st1 or sc1) are actually old spinning-platter disks, which deliver only hundreds of IOPS — not what you want unless you’re really trying to cut costs. Modern SSD-based gp2 or io1 are typically the options you want.
The ST1/SC1 wording is misleading. With ST1 you only need "100s" of IOPS when you're dealing with big blocks, and SC1 isn't performance-oriented at all.
Any IOP on EBS is measured at 16kb granularity. Not the same as block size, but helpful to know because it lets you set your read-ahead and other values to no lower than 16kb. At least this was the case for many years. Trying to find the official docs now.
Great work! I started using AWS back when it was just for simple websites, and the plethora of services now (50!), and the pricing (especially the pricing!), is overwhelming to track.
So overwhelming, in fact, that I decided it was easier to get some VPSs and use common, work-anywhere tools (e.g. SaltStack) to manage them than to skill up on AWS-specific stuff.
Thanks a lot for posting this. I went to a Linux conference over the weekend and was talking with some friends about their datacenter jobs. I felt hopelessly lost trying to understand all the intricacies at the routing, storage, and backup levels, whereas this guide gives a good bird's-eye view of the stacks.
I would add as a VPC gotcha the use of the EIP_Disable_SrcDestCheck flag [1] to enable layer 2 capabilities. This is a feature that is only present in AWS; neither Google Compute Engine nor Microsoft Azure has it. So, if you craft an Ethernet frame modifying the destination MAC address but not the destination IP within your local subnet, the packet will be delivered based on the IP rather than the MAC address, unlike what you'd expect on a real Ethernet network.
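For reference, the underlying EC2 attribute that flag maps to (as far as I know) is SourceDestCheck. A minimal sketch of flipping it with the AWS SDK for JavaScript v3, with a placeholder instance ID and region:

```typescript
// Turn off source/dest checking on an instance so it can forward traffic
// that isn't addressed to its own IP (NAT boxes, routers, etc.).
import { EC2Client, ModifyInstanceAttributeCommand } from "@aws-sdk/client-ec2";

const ec2 = new EC2Client({ region: "us-east-1" });

async function disableSourceDestCheck(instanceId: string): Promise<void> {
  await ec2.send(
    new ModifyInstanceAttributeCommand({
      InstanceId: instanceId,
      SourceDestCheck: { Value: false },
    })
  );
}

disableSourceDestCheck("i-0123456789abcdef0").catch(console.error); // placeholder ID
```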
I have recently started out on AWS (I initially used AWS like I used to use DigitalOcean; however, after trying out Serverless, I'm of a different mind and am changing my ways to do it the AWS way), so this is pretty awesome!
I had tried a lot of databases (Postgres, Mongo, Couch, and very recently RethinkDB) before trying out DynamoDB. So I just jumped in, started something basic, and read tutorials as I went along.
There's still a lot of stuff I don't fully understand (for example, the read/write capacity settings - I left them at the default of 5), but I guess I'll learn as I go along.
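For anyone else wondering where that "5" lives: it's the provisioned read/write capacity units, set per table. A minimal sketch with placeholder table and attribute names:

```typescript
// Create a DynamoDB table with explicit provisioned throughput.
import { DynamoDBClient, CreateTableCommand } from "@aws-sdk/client-dynamodb";

const ddb = new DynamoDBClient({ region: "us-east-1" });

async function createTable(): Promise<void> {
  await ddb.send(
    new CreateTableCommand({
      TableName: "my-test-table", // placeholder
      AttributeDefinitions: [{ AttributeName: "id", AttributeType: "S" }],
      KeySchema: [{ AttributeName: "id", KeyType: "HASH" }],
      // Read/write capacity units: the console default of 5 each is fine for
      // experiments; you pay for what you provision, not what you use.
      ProvisionedThroughput: {
        ReadCapacityUnits: 5,
        WriteCapacityUnits: 5,
      },
    })
  );
}

createTable().catch(console.error);
```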
Great guide. I've been using AWS since there were only a handful of services and it's become increasingly hard to keep up with all the additional ones that have been added in the last few years.
EFS had completely passed me by. Does anyone have experience with it? I'm wondering what it would be like to use for Whisper/Graphite (just on a single machine). I'm less interested in concurrent access and more interested in not having to resize drives as data grows / overprovision drives all the time.
The latency is higher than I had hoped. I wrote 10,000 files with 10 kb in each. It took 23 ms per file on average. Then I read them back. That took 8 ms per file on average.
That's way too much for the use case I was contemplating, so I didn't investigate further.
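Roughly the kind of test I ran, for anyone who wants to reproduce it on their own mount - a minimal sketch in Node; the mount path is a placeholder:

```typescript
// Small-file latency test: write and read back 10,000 files of 10 KB each.
import { mkdir, writeFile, readFile } from "node:fs/promises";
import { join } from "node:path";

const MOUNT = "/mnt/efs/latency-test"; // placeholder EFS mount point
const FILES = 10_000;
const payload = Buffer.alloc(10 * 1024, "x"); // 10 KB per file

async function bench(): Promise<void> {
  await mkdir(MOUNT, { recursive: true });

  let start = Date.now();
  for (let i = 0; i < FILES; i++) {
    await writeFile(join(MOUNT, `f-${i}`), payload);
  }
  console.log(`write: ${((Date.now() - start) / FILES).toFixed(1)} ms/file`);

  start = Date.now();
  for (let i = 0; i < FILES; i++) {
    await readFile(join(MOUNT, `f-${i}`));
  }
  console.log(`read: ${((Date.now() - start) / FILES).toFixed(1)} ms/file`);
}

bench().catch(console.error);
```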
It definitely _felt_ a bit slow rsyncing to it last night. In the Whisper use case, there are a ton of small appends to do every minute, so that could be an issue. I'm going to set up a machine with a Linux 4.x kernel today to try it on (as that's what they recommend, along with async mode).
As I recall, it's significantly more expensive than EBS, which kept me away. I've had a few use cases come up where shared access would be nice, but I was always able to use objects in S3 instead, which is far cheaper.
It's about 3x the price of EBS by the looks of it, but then again, I probably run an EBS drive 10x the size required so I don't have to deal with scaling it often...!
A good addition, but I wish there were a place for horror stories about this tech. For instance, we can't launch more than 4 or 5 containers a second on our ECS clusters.
This is so needed. I find Amazon's official documentation to be way too full of buzzwords and marketing speak. I just want someone to tell me what the thing does!
I think a better approach would be to use annotations on the current AWS docs, so that additional information is inline with the official documentation and you have both in the same place. The Hypothesis project is working on a browser plugin that does this, for example, and is already having success with academic research. https://hypothes.is/
You probably are aware, but AWS has a container orchestration service built into the platform with ECS. The container agent is open source (https://github.com/aws/amazon-ecs-agent).
In my experience, ECS is easy to run, as it's a first class part of the platform. Boot up the right "cattle" AMIs with the right ASG configuration and you're good to go.
K8s, Docker Swarm, Mesos, and Nomad have plenty of documented successes, but you have to stand up and operate the orchestration layer yourself. That means booting up "pet" AMIs and making sure they are monitored, etc. Then you boot up your "cattle" AMIs to run your apps.
The Convox philosophy is that you get application portability by packaging your app correctly with Docker. The orchestration layer should be invisible, something that you shouldn't build or operate yourself.
We run Rancher [1], which is open source, across multiple AWS regions, using a single ELB endpoint for container orchestration into different environments. You can use the stock AWS AMIs for the instances, and Rancher also provides RancherOS AMIs that work extremely well.
Rancher also has k8s as an option and makes deploying it much easier.
Although I'm familiar (at a high level only) with numerous topics/services related to AWS, I'm still doing things the legacy way on providers like DigitalOcean (which I'm 100% happy with), and I'm by no means an AWS guru... so this guide looks awesome for someone like me!
Really like the single page format. Much easier to search compared to scattered documentation on AWS's own site.
Definitely like the 1:1 mapping to Google/Azure.
Well, you could start by submitting a PR with everything you already know about Beanstalk:
1. That would be very valuable for everyone else.
2. A section that does not look overwhelmingly empty would attract more and higher-quality contributions from others. Kind of a reverse broken windows theory (https://en.wikipedia.org/wiki/Broken_windows_theory).
I've had the impression that Elastic Beanstalk (which I use) has suffered the fate of a few other AWS offerings in that it's seen as less trendy than Docker/ECS. (See also: CloudSearch vs Elasticsearch.) But EB can do some things very well and very painlessly.
EB tends to work very well when your requirements fit within its framework - and very badly when you try to do anything differently. We've moved to CodeDeploy because EB was slow to deploy, often left applications in an "unknown" state after deployment, ties application configuration to deployment, and generally felt fairly restrictive.
I've been compiling a lot of tips and tricks personally that I use to help train coworkers. I'm definitely going to cross reference and see if I can open a few useful PR's.
This is fantastic. I was thinking about it in the afternoon and I see it now! Very useful for guys like me who are just booting up in the back end and devops side!