Untangling Jenkins

oblio · on Dec 25, 2017

I've been a build/release engineer for quite a few years now, most spent using Jenkins.

It can be quite a heavyweight beast, especially since it's written in Java (so heavy in terms of resource usage) and since it's old (so lots of legacy in terms of architecture).

Still, in terms of costs and scalability (up to the level where you have thousands of developers and you probably want to roll your own), I think it is one of the best tools out there for its role as a prettier cron/queuing system.

A few tips:

* if you can afford it, stop ASAP using the UI for configurating jobs; look into Pipelines or the JobDSL, based on your needs

* once you have more than a few jobs, stop running anything on the master node, for increased stability

* use the minimum number of plugins and tie as few things to Jenkins as possible; script as many things and put the scripts into version control

* backup the global configuration; you can try to version control it but it's tricky; at least back it up periodically and most of the time it is enough

That's it for now, for more details, $200/h :p

grogenaut · on Dec 25, 2017

Going off of this I wouldn't even use job dsl or pipelines as they allow you to execute arbitrary groovy. I would consider emulating Travis ci or Amazon pipelines where it is all declarative not imperitive. This way you can re-impliment and consolidate without doing a Turing complete parse of tons of custom groovy scripts

Matt3o12_ · on Dec 25, 2017

I‘m not sure what’s bad about executing arbitrary groovy scripts because you are running arbitrary code anyways (unless Jenkins is not running any code at all – e.g. compile static assets, do something else). If some developer wanted access to the secret files – config files which might have sensitive information such as the database password for the staging environment (I don’t know what they are called in Jenkins) they could just modify the code to dump them to stdout.

I am asking this because I don’t see a problem with running turning compete scripts if the application is most likely to run arbitary code anyways, which has access to syscalls any many other dangerous stuff anyways. You need to trust your employees anyways and if you can’t trust them with not dumping the staging environment in purpose, you should not have hired them. It is possible to include harmless looking code which can be malicious and still be considered a bug if found during code review’s.

grogenaut · on Dec 25, 2017

The point is that you get mu hich better reuse by not allowing arbitrary groovy and building up interface contracts. This is exactly like any other coding. Using groovy is akin to allowing arbitrary SQL from your web devs intead of apis (from an interface pov not security).

Instead just invoke something outside of Jenkins groovy framework like maven, make, etc. Makes you much more portable and upgradable.

Jenkins should just be your execution/workflow environment not a dev framework.

Also never mutate the worker boxes, everything should be completely isloated from itself and each other.

Jenkins is like any other software architecture.

Finally doing too much in Jenkins makes it impossible to test Jenkins locally, which means you can neither debug builds locally without pushing new code to build, nor can you test alterations to Jenkins itself locally which makes it much much more fragile.

How do I know? I'm a sr dev coming in and fixing a mid sized engineering org (400+ devs) jenkins 4 years of accretions. These are major issues we've identified in just a week of looking.

Build systems should be simple, scalaboe, agile, and composable, not monolithic. Keep the monolithic to the individual builds not cross company.

Matt3o12_ · on Dec 26, 2017

Got it, my groovy files have always been setup the environment (I.e. write config files for the test database, API keys, etc everything that should not be in git), and then just execute make to make so that I don’t have a completely different setup each environment. Thank you very much!

voidr · on Dec 27, 2017

> Finally doing too much in Jenkins makes it impossible to test Jenkins locally, which means you can neither debug builds locally without pushing new code to build, nor can you test alterations to Jenkins itself locally which makes it much much more fragile.

If you would have used Jenkins pipelines, you would know that you can debug it directly from the UI. I can make my builds hard to run locally on configuration-based systems as well, this is not really an argument against Pipelines.

> How do I know? I'm a sr dev coming in and fixing a mid sized engineering org (400+ devs) jenkins 4 years of accretions.

You haven't used Jenkins Pipelines though, if you would have used it, you would know about the debugging part I mentioned above.

> Build systems should be simple, scalaboe, agile, and composable, not monolithic. Keep the monolithic to the individual builds not cross company.

These have nothing to do with the actual topic, these are just generalisations.

> The point is that you get mu hich better reuse by not allowing arbitrary groovy and building up interface contracts.

You can reuse the groovy code you write, which means you actually get more reuse.

> This is exactly like any other coding. Using groovy is akin to allowing arbitrary SQL from your web devs intead of apis (from an interface pov not security).

That is a bad analogy, a correct one would be something like: writing NodeJS build scripts in JavaScript instead of using Grunt or Gulp, or Groovy vs. Maven/Ant.

> Jenkins should just be your execution/workflow environment not a dev framework.

What is a "dev framework" in your mind?

> Also never mutate the worker boxes, everything should be completely isloated from itself and each other.

What does that have to do with using a programming language instead of a config DSL?

> Jenkins is like any other software architecture.

Fail to see how that vague statement supports your argument.

grogenaut · on Jan 4, 2018

>If you would have used Jenkins pipelines Nice, way to start off with an ad hominem attack. Yes I'm lying to you about having used pipelines. That's what I do, I go on hacker news and lie about what I do, especially jenkins, to throw people off the scent. Seriously... if people dont know that pipelines are easily debuggable, that's a problem with the documentation, not with the person. It's also not a reason to attack people.

I have used jenkins pipelines. However I didnt use it from scratch so no, I couldn't do what you said due to all of the other issues. My point is that it does let you hang yourself, I prefer the config based systems as it's harder to hang yourself.

Your NodeJs anology is more apt but who cares you got the point. This is a discussion not a literature critique.

A dev framework to me is the type of stuff you've built up around a typical compile of a given "stack". For instance, doing ruby on rails, you're likely to have a bunch of build tooling you've built up to make that easier. I would prefer to distribute that stuff more generically and unrelated to jenkins vs some other build system, as opposed to building it all into jenkins.

As for mutating workers... you seem to be under the mistaken notion that I'm discussing the one thing you are discussing. But in general, if you have a turing complete language with complete system access like you do in the groovy, you can do anything and anything isn't always a great idea. I like the config based systems as it doesn't allow you to do anything, it allows you to do things we've deemed as good for building software. Jenkins boxes end up being shared resources in a large company so it's very difficult to audit what everyone is doing and side effects they are causing.

>Fail to see how that vague statement supports your argument.

Ok

>You can reuse the groovy code you write, which means you actually get more reuse.

You can but most people don't know how to or dont. It's a break from your typical dev loop and it's yet another framework to learn. I don't think it's effective unless you dive in and most people don't dive in. So to me it's a bad model. This is opinion.

oblio · on Dec 25, 2017

There's a Declarative Pipeline these days plus you can turn on the Groovy sandbox. There's also a YAML based thing but I forgot its name.

grogenaut · on Dec 25, 2017

Can you provide links? I'm on a plane so very hard to Google.

Seriously? Downvotes? On a holiday on a plane and I get down votes asking for a little help? I did try and google and didn't find the stuff.

jeremya · on Dec 26, 2017

I agree it's not easy to Google for the Declarative Pipeline or Groovy Sandbox. Here's the comparison of the two syntaxes available in Jenkins Pipeline (Scripted and Declarative):

https://jenkins.io/doc/book/pipeline/syntax/#compare

Here are details on the Groovy Sandbox (provided by the Script Security Plugin and installed by default when installing recent Jenkins). The Groovy Sandbox is on by default for Jenkins Pipeline:

https://jenkins.io/doc/book/managing/script-approval/

grogenaut · on Dec 26, 2017

thanks

RyJones · on Dec 26, 2017

We use JJB, which is YAML based.

https://github.com/hyperledger/ci-management/tree/master/jjb

for instance

harlowja · on Dec 26, 2017

Just fyi :)

Docs:

https://docs.openstack.org/infra/jenkins-job-builder/

Reviews:

https://review.openstack.org/#/q/project:openstack-infra/jen...

Code:

https://git.openstack.org/cgit/openstack-infra/jenkins-job-b...

grogenaut · on Dec 26, 2017

thanks

jnsaff2 · on Dec 26, 2017

Yea but the declarative pipelines have cheat codes that allow cheeky (read lazy) people to still run code in the pipeline:(

vorg · on Dec 27, 2017

> Travis ci or Amazon pipelines where it is all declarative not imperitive [...] a Turing complete parse of tons of custom groovy scripts

Even worse than Apache Groovy being an imperative DSL (compared to Travis ci and Amazon pipelines being all declarative) is that the most declarative portions of Groovy (i.e. its collections API) are disabled when used in the Jenkins pipelines.

mikewhy · on Dec 25, 2017

Another downside to Jenkins: $200/hr to learn other products do it just as well with much less overhead.

emmelaich · on Dec 26, 2017

I think $200/h is a low-end average estimate for any type of enterprisey software, open source or not.

If you don't buy expertise in the short term, it may cost you a lot more in the long run.

mooreds · on Dec 26, 2017

$200/hr sounds expensive until you realize you are gaining access to all the mistakes they've made on someone else's dime.

closeparen · on Dec 26, 2017

Consultants are long gone by the time their mistakes are apparent.

oblio · on Dec 26, 2017

That was actually a joke since I've never done any Jenkins consulting but Jenkins, including the setup cost is waaaay cheaper than other solutions I've seen for what I used it for: distributed cron, queuing system for tasks, higher level dashboard for managers, CI server, etc.

piva00 · on Dec 27, 2017

I could tell it was a joke when you mentioned keeping backups of your main configuration instead of porting ALL of your Jenkins config to code and maintaining that in version control.

At my current job I started as a SRE for the team supporting Jenkins and other CI/CD tools for the rest of engineering (around 50+ other teams), I moved back to a SWE position but as an insight: don't run a single master Jenkins for all of your teams, make it federated (per-team/project or whatever suits your organisational structure) AND do all of Jenkins' configuration through scripts.

Never rely on backing up the old configuration (problematic in cases of rollback to previous Jenkins versions, brittle for recovery), just start it fresh every time you need to recover and re-run your provisioning scripts, that also includes NEVER maintaining jobs through the UI, only through JobDSL, JJB or Pipeline scripts.

oblio · on Dec 27, 2017

You need the team you're part of to have enough discipline for that. Not always the case. Sometimes you need to know when to cut your losses and go for the cheap, plastic, solution (i.e. backups).

voidr · on Dec 27, 2017

That's just the poster's consulting price, you can learn everything for free online.

correlation · on Dec 25, 2017

I have come up with a slogan for Jenkins: "Jenkins - The butler that does not hesitate to do a disservice".

It is currently the most brittle piece of infrastructure we run, with jobs spiralling out of control and slaves crashing with cryptic exceptions. Add to that a horrible plugin ecosystem (tried to install the Slack plugin, it crashed all builds and our webhook endpoint). Add to that groovy (I don't feel like becoming an archeologist).

Gosh, can't someone just build a lightweight ci/cd system ontop of e.g. k8s jobs?

Or should we just give up and do managed ci/cd?

CodeAlong · on Dec 25, 2017

My workplace has primarily switched all net new ci/cd workflow from Jenkins to Drone (https://github.com/drone/drone - Drone is a Continuous Delivery system built on container technology.). Having managed masters with 20,000 + configured jobs I agree that Jenkins is a very sore spot for our entire pipeline. We’ve had greater success with increasing the number of masters (one per team or application) and using the kubernetes plugin for slaves. However, most of our pipeline complaints have been mitigated by getting teams to migrate from Jenkins to Drone.

While I don’t think Drone currently supports k8s job, it is very lightweight in comparison to Jenkins. Most of our developers would agree that it isn’t a perfect product, but it has reduced friction points by a large amount.

SOLAR_FIELDS · on Dec 27, 2017

Interesting war story. We have a couple of master instances that go down all the time because of several thousand jobs running. Gonna ask our devops what they are cooking up to resolve and compare to this solution.

Everlag · on Dec 25, 2017

Allow me to shill for Concourse[0]. It's oriented around 'tasks' composed into 'pipelines'. Each task either fails or succeeds with the yaml pipeline definition providing hooks into either state to proc other tasks. A task is a yaml definition that invokes a bash script. Each task runs inside a preconfigured docker container for full reproducibility.

It's declarative and everything needed to bring up a pipeline can be stored in git alongside the software it supports. For an example pipeline definition, check out the one for bosh-gcscli[1] and its accompanying tasks[2].

Now, there is the downside that upkeep is a hassle because it requires buying into the BOSH ecosystem to deploy Concourse. That pays for itself by being explicitly anti-brittle, with support for migrations and canary rollouts builtin.

Disclaimer: I worked with Concourse at my previous position but currently have no financial stake in its success.

[0] https://concourse.ci/ [1] https://github.com/cloudfoundry/bosh-gcscli/blob/c908176/ci/... [2] https://github.com/cloudfoundry/bosh-gcscli/tree/30316d6/ci/...

mikewhy · on Dec 25, 2017

We use Gitlab's built in CI and it's been a dream. Pipelines, Docker, artifacts, caching, coverage, in-repo config, Windows/macOS/Linux.

infamousjoeg · on Dec 26, 2017

You're not the first I've heard going this route... +1

slavik81 · on Dec 26, 2017

That matches my experience. Jenkins is built out of a spiderweb of plugins and that makes it very fragile. Though, I did find I could mitigate the problem.

* only use Jenkins stable releases

* snapshot the system before updating or installing anything

* make regular backups of your Jenkins server

* never install a plugin unless it's actually neccessary

* always update Jenkins before installing any new plugins as otherwise you may end up with an untested mishmash of old and new plugins

I like having control over my CI, but it does seem a lot harder than it should be.

mianos · on Dec 25, 2017

I replaced it with saltstack. I had so many things not working I thought it was me just being dumb or not spending enough time learning to use it. A few salt states on the end of a git hook did everything, auto deployment, testing, staging, monitoring. I am probably missing something about the awsome power of jenkins but know firms still running my saltstack CI/CD systems years after I left. Weird.

stephen_g · on Dec 25, 2017

We use JetBrains’ TeamCity CI.

It’s really nice, but expensive if you exceed the limits for the free version (3 agents and 100 build configurations). Luckily we are a fair way off that so can go along fine without having to pay yet.

chrisanthropic · on Dec 26, 2017

The best Jenkins advice I ever received was from here: https://medium.com/mindera/jenkins-a-la-travis-6c5a8debbb5b

On setting up Jenkins Job Builder using a 'seed job'. This setup allows you to add a .jenkins directory to you project, define a yaml based jenkins job inside that directory, push to github, and Jenkins will create the job you defined - NO web UI required.

In other words you make Jenkins behave like Travis.

Set this up with your first job being a job to backup Jenkins to S3 and things aren't as bad to manage.

finspin · on Dec 27, 2017

I'm currently looking into how JJB could be used for simple scheduled jobs. But isn't the setup described in the article basically the same what you can do with Jenkinsfile?

ddevault · on Dec 25, 2017

If I may suggest my own alternative to Jenkins: builds.sr.ht. It runs build manifests submitted through an API and does not support job configuration itself; you must use an external integration to automate your builds. Build manifests look something like this:

https://git.sr.ht/~sircmpwn/wlroots/tree/.build.yml

More complex example:

https://git.sr.ht/~sircmpwn/builds.sr.ht/tree/.build.yml

Builds run with full virtualization (KVM) on any number of distributed (optional trust) build runners. The whole thing is open source:

https://git.sr.ht/~sircmpwn/builds.sr.ht

It's also self-hosting, it builds and deploys itself and the rest of the *.sr.ht network:

https://builds.sr.ht/job/602

If anyone wants to give it a try, please get in touch (email in my profile) and I'll hook you up with an account on my hosted instance. I could use more people to kick the wheels and help me find the pain points.

jonthepirate · on Dec 25, 2017

I am spearheading an effort to release a CloudFormation template that will launch Jenkins clusters into your AWS account. It represents everything learned while working with Jenkins for over 6 years including 2.5 years of Jenkins at Lyft and 1 year at DoorDash. If you would like to collaborate, please contact block.jon@gmail.com. The project was written with an eye towards making the launching of a Jenkins cluster very very easy. It's designed with Docker workloads in mind. Some of the gems include autoscaling, instrumentation & metrics, automatic Let's Encrypt certificate registration, GitHub webhook integration and retries during pipeline slave disconnects which makes spot instances viable even when they abruptly go away... something that saves you 75%!! on your AWS instance costs.

mooreds · on Dec 26, 2017

I don't have cycles to help, but please post this on HN when you are ready!

jonthepirate · on Dec 26, 2017

Here's a preview of what I'm working on that I just published to YouTube:

https://www.youtube.com/watch?v=GkflqV6KenM

I'd love to hear more about what problems any of you are facing with Jenkins so I can be sure to address them in the work I'm doing.

Jon

BlanketApple · on Dec 25, 2017

Man, I'm having this problem right now at work and it's been a nightmare. Admittedly the problem is like 90% cultural.

People (on my team) don't understand pipeline and they absolutely don't understand Groovy. Before, they would just write a job with a shell script that e.g. ran valgrind, etc. Lots of small repos with similar steps, so they'd write one job and apply it to 20 different repos. It worked pretty well.

That's still an option with pipelines, but it feels much more discouraged (to e.g. write a shared pipeline job that runs a multi-line shell script).

It's also just a bit of an organizational nightmare to see 200+ jobs on the main screen (as opposed to 20 jobs that did the same thing for 200 repos).

Unfortunately, it also seems like declarative pipeline is limited enough that we end up writing a lot of Groovy. I sorta get why Groovy became the official scripting language, but it's like pulling teeth getting people to learn even basic Groovy.

I realize, again, that a lot of this is a cultural problem. Mostly just trying to give a counterpoint that pipelines don't always look as clean as the article implies, especially for orgs with a lot of repos (where committing the same 'test.sh' file to each repo doesn't make a lot of sense).

Someone · on Dec 26, 2017

”It's also just a bit of an organizational nightmare to see 200+ jobs on the main screen”

I never look at the main screen. You can create tabs that segment those 200 jobs. If your job names are consistent (as they should), you can do the segmenting by regular expression (I’m not sure whether creating tabs requires a plug-in)

As to doing (almost) the same thing for different projects: you can POST XML job definitions to the web interface, so if you have the rights to create jobs and to run code on your local system, you can script job creation. In my experience, that’s the way to keep job definitions consistent. To figure out what the XML to POST should look like, grab it from your browser by GETting job/jobname/config.xml (https://support.cloudbees.com/hc/en-us/articles/218353308-Ho...)

Yes, that duplicates lots of stuff in the Jenkins job definitions, but you shouldn’t treat that as source, but as the output of your job creation scripts.

tokenizerrr · on Dec 26, 2017

If you get to that point, check out the Job DSL plugin.

Someone · on Dec 26, 2017

I would have to convince the system administrator to install that plug-in.

I also don’t like having to use yet another language (or two, counting Groovy and the Job DSL separately), but that, I could live with.

jonthepirate · on Dec 26, 2017

Rather than run monolithic Jenkins with a kagillion jobs in it that will easily bog down, you could consider running distributed Jenkins in which each team, or even each microservice, gets its own Jenkins Cluster with a small number of slaves and a small number of jobs. See my earlier post in this thread where I posted a YouTube video of the method I use for doing this using a CloudFormation template.

Also, when you say that your developers don't understand groovy, you could teach them how to ding make targets inside of their docker containers. That's the pattern we use at DoorDash... very minimal Jenkins scripting which exists to ding make targets inside Docker images.

grogenaut · on Dec 25, 2017

Why not make it easy to reuse functions to do the steps the groovy is doing and build all of the common steps in a reusable way? Eg what the article is suggesting

BlanketApple · on Dec 25, 2017

Yeah, we've done that. And again, this is totally a cultural problem.

But inevitably I end up writing that shared groovy code (because I have the most familiarity), and so when things go wrong, or it needs to be changed, I end up being the one having to debug it. Definitely a huge bus factor.

It'd be nice if it was a slightly more common language (e.g. Python) so it wasn't such a pain.

grogenaut · on Dec 26, 2017

Aah. Cultural problems are different. What's the main issue there?

Making a Python language plugin wouldn't be hard though.

nunez · on Dec 26, 2017

Jenkins and any other CI/CD tool is a prime candidate for immutable infrastructure; it’s the one thing that nobody cares about too much until it no longer works, and it is relatively easy to get going with scripts or Docker

emmelaich · on Dec 26, 2017

Has anyone here used BlueOcean and have anything to say about it good or bad?

https://jenkins.io/projects/blueocean/

tokenizerrr · on Dec 26, 2017

Meh. The interface for viewing logs lags my browser.

emmelaich · on Dec 26, 2017

It is beta-ish still, so that may go away. I was more looking for comments about the features, pipeline etc.

Also, regarding logs the site says this:

"Diagnose problems instantly and say good-riddance to endlessly scanning through logs."

Won't believe till I try it but it's a good goal!