Cyphernetes: A Query Language for Kubernetes (cyphernet.es)
143 points by fatliverfreddy 45 days ago | 65 comments



During many years of operating production Kubernetes clusters with several thousand nodes, I've never seen any of these observability tools that query kube-apiserver work at that scale. Even popular tools like k9s make extremely expensive queries, like listing all pods in the cluster, which can tip your Kubernetes apiserver over and cause an incident if you don't have enough load protection in place. If you're serious about these querying capabilities, I highly recommend building your own data sources (e.g. watch objects with a controller and dump the data into a SQL db) instead of hitting the apiserver for these things. You'll be better off in the long run.
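A minimal sketch of that pattern, purely for illustration (client-go shared informers feeding a local SQLite table; the store, schema, and resync interval here are all assumptions, not a prescribed setup):

    package main

    import (
        "database/sql"
        "log"
        "time"

        _ "github.com/mattn/go-sqlite3"
        corev1 "k8s.io/api/core/v1"
        "k8s.io/client-go/informers"
        "k8s.io/client-go/kubernetes"
        "k8s.io/client-go/tools/cache"
        "k8s.io/client-go/tools/clientcmd"
    )

    func upsert(db *sql.DB, p *corev1.Pod) {
        db.Exec(`INSERT INTO pods (namespace, name, phase) VALUES (?, ?, ?)
                 ON CONFLICT(namespace, name) DO UPDATE SET phase = excluded.phase`,
            p.Namespace, p.Name, string(p.Status.Phase))
    }

    func main() {
        cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
        if err != nil {
            log.Fatal(err)
        }
        client := kubernetes.NewForConfigOrDie(cfg)

        db, err := sql.Open("sqlite3", "cluster.db")
        if err != nil {
            log.Fatal(err)
        }
        db.Exec(`CREATE TABLE IF NOT EXISTS pods (namespace TEXT, name TEXT, phase TEXT, PRIMARY KEY (namespace, name))`)

        // One LIST at startup, then only incremental watch events hit the apiserver.
        factory := informers.NewSharedInformerFactory(client, 10*time.Minute)
        factory.Core().V1().Pods().Informer().AddEventHandler(cache.ResourceEventHandlerFuncs{
            AddFunc:    func(obj interface{}) { upsert(db, obj.(*corev1.Pod)) },
            UpdateFunc: func(_, obj interface{}) { upsert(db, obj.(*corev1.Pod)) },
            DeleteFunc: func(obj interface{}) {
                if p, ok := obj.(*corev1.Pod); ok {
                    db.Exec(`DELETE FROM pods WHERE namespace = ? AND name = ?`, p.Namespace, p.Name)
                }
            },
        })

        stop := make(chan struct{})
        factory.Start(stop)
        factory.WaitForCacheSync(stop)
        select {} // ad-hoc queries now go to the local DB, not kube-apiserver
    }

(An informer already keeps a client-side cache; the SQL copy is just what makes arbitrary ad-hoc queries cheap.)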


There is a funny parallel I see with Kubernetes that I also saw a lot with Linux in the early years. There are thousands of packages and tools you can install on Linux (think phpmyadmin for example) and new users sometimes go wild installing every single package they read about.

After a while, the more mature Linux engineers start going the other way. Ripping out as much as possible. Stripping down to the leanest build they can, for performance but also to reduce attack surface and overall complexity.

Very similar dynamic with k8s. Early days are often about scooping up every CNCF project like you're on a shopping spree. Eventually people get to shipping slim clusters running 30 MB containers built with Alpine or Nix, using Kubernetes essentially as open-source clustering for Linux.


What's surprising to me is that there's no way to listen for all object types at once. You have to know the "kind" beforehand, because the watch API requires it. To watch every object in the system, you have to start a separate watch request for each type, which may in turn be expensive.

If you have direct access to Etcd (which may not be possible in a managed cloud version of Kubernetes?), putting a watch on / might scale better.

(As an aside, with the Go client API you have to jump through some hoops to even deserialize objects whose kinds' schemas are not already registered. You have to use the special "unstructured" deserializer. The Go SDK often has to deal with unknown types, e.g. for diffing, and all of the serializer/codec/conversion layers in the SDK seem incredibly overengineered for something that could have just assumed a simple nested map structure and then layered validation and parsing on top; the smell of Java programmers is pretty strong.)
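For what it's worth, here's roughly what that looks like with the dynamic client (the GVR and kubeconfig path below are assumptions): you still have to name the group/version/resource up front, and everything arrives as unstructured nested maps.

    package main

    import (
        "context"
        "fmt"
        "log"

        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
        "k8s.io/apimachinery/pkg/runtime/schema"
        "k8s.io/client-go/dynamic"
        "k8s.io/client-go/tools/clientcmd"
    )

    func main() {
        cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
        if err != nil {
            log.Fatal(err)
        }
        client := dynamic.NewForConfigOrDie(cfg)

        // The kind still has to be known up front; there is no "watch everything"
        // endpoint, so you need one of these per resource type.
        gvr := schema.GroupVersionResource{Group: "apps", Version: "v1", Resource: "deployments"}

        w, err := client.Resource(gvr).Namespace(metav1.NamespaceAll).Watch(context.Background(), metav1.ListOptions{})
        if err != nil {
            log.Fatal(err)
        }
        for ev := range w.ResultChan() {
            obj, ok := ev.Object.(*unstructured.Unstructured) // a nested map[string]interface{} underneath
            if !ok {
                continue // e.g. an error Status object on the stream
            }
            fmt.Println(ev.Type, obj.GetNamespace(), obj.GetName())
        }
    }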


The watch API has a horrible user experience on every platform. One must send a GET and keep the pipe open, waiting for a stream of responses. If the connection is lost, changes might be lost. If one misses a resource version change, then either the reconnection will fail or a stale resource will be monitored.
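For concreteness, a minimal sketch of that reconnect dance (kubeconfig auth and core/v1 Pods are assumptions here); client-go's Reflector/Informer machinery exists largely to hide exactly this:

    package main

    import (
        "context"
        "fmt"
        "log"
        "time"

        corev1 "k8s.io/api/core/v1"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
        "k8s.io/client-go/kubernetes"
        "k8s.io/client-go/tools/clientcmd"
    )

    func main() {
        cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
        if err != nil {
            log.Fatal(err)
        }
        client := kubernetes.NewForConfigOrDie(cfg)

        lastRV := "" // empty means "start fresh"
        for {
            w, err := client.CoreV1().Pods(metav1.NamespaceAll).Watch(context.Background(),
                metav1.ListOptions{ResourceVersion: lastRV})
            if err != nil {
                // e.g. our resourceVersion has been compacted away (410 Gone):
                // drop it and start over, accepting that we may have missed changes.
                lastRV = ""
                time.Sleep(time.Second)
                continue
            }
            for ev := range w.ResultChan() {
                pod, ok := ev.Object.(*corev1.Pod)
                if !ok {
                    continue // e.g. an error Status object on the stream
                }
                lastRV = pod.ResourceVersion
                fmt.Println(ev.Type, pod.Namespace, pod.Name)
            }
            // ResultChan closed: the long-lived GET dropped; loop and resume from lastRV.
        }
    }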

The Java client does this with blocking, resulting in a large number of threads.

I truly like Kubernetes, and I think most detractors who complain about its complexity simply don't want to learn it. But the K8s API, especially the Watch API, needs some rigorous standards.


How are Kubernetes apiservers suffering this much from this kind of query? Surely even in huge systems the amount of data that would need to be traversed is super small, right?

Is this a question of Kubernetes just sticking everything into "standard" data structures instead of using a database?


My knowledge is out of date now, but the main issues IMO are/were:

- No concept of apiserver rate limiting, by design. I see there is now an APF thingy, but still no basic API / edge rate limiting.

- etcd has bad scalability. It's a very basic, highly consistent kv store with tiny limits (an 8GB limit in the latest docs, with a default of 2GB). It had large performance issues throughout its life while I was using k8s; I still don't know if it's much better now.


Long ago I wanted to re-implement at least part of kubectl in Python. After all, Kubernetes has a documented API... What I quickly discovered was that kubectl commands don't map to the Kubernetes API. Almost at all. Many of these commands require multiple queries going back and forth to accomplish what the command does. I quickly abandoned the project... so maybe I've overlooked something, but, again, my impression was that instead of offering a generic API with queries that can be executed server-side to retrieve the necessary information, the Kubernetes API server offers a very specialized, disjoint set of endpoints that can each only retrieve one small piece of interesting info at a time.

This, obviously, isn't a scalable approach, but there's no "wrapper" you could write in order to mitigate the problem. The API itself is the problem.


Pretty sure the apiserver just queries the etcd database (and maybe caches some things, not sure), but I guess it could be the apiserver itself that can't handle the data :P


Kubernetes only lets you query resources by object type, and that's only a prefix range scan on the etcd database. There are no indexes whatsoever behind the exhaustive LIST queries, and kube-apiserver handles serialization of the objects back and forth between multiple wire formats. Over the years there have been a lot of optimizations, but you don't wanna list all pods in a 5000-node high-density cluster every time you spin up a client-side tool like this.
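To make the "prefix range scan" concrete, this is roughly what a LIST of all pods reduces to at the etcd level (a sketch assuming direct etcd access; the endpoint is made up and real clusters need TLS config). Objects live under /registry/<resource>/<namespace>/<name>, and there are no secondary indexes, so label and field selectors are filtered after the full range read:

    package main

    import (
        "context"
        "fmt"
        "log"
        "time"

        clientv3 "go.etcd.io/etcd/client/v3"
    )

    func main() {
        cli, err := clientv3.New(clientv3.Config{
            Endpoints:   []string{"https://127.0.0.1:2379"}, // assumption; TLS config omitted
            DialTimeout: 5 * time.Second,
        })
        if err != nil {
            log.Fatal(err)
        }
        defer cli.Close()

        // Listing every pod in the cluster is essentially this one prefix scan;
        // WithKeysOnly just avoids pulling the (protobuf-encoded) values here.
        resp, err := cli.Get(context.Background(), "/registry/pods/", clientv3.WithPrefix(), clientv3.WithKeysOnly())
        if err != nil {
            log.Fatal(err)
        }
        for _, kv := range resp.Kvs {
            fmt.Println(string(kv.Key)) // /registry/pods/<namespace>/<name>
        }
    }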


In my experience, they don't; you can just run more of them and stick them behind a load balancer (a regular HTTP reverse proxy). You can scale both etcd and the apiserver pretty easily. Of course you have less control in cloud environments; I have less experience with that.


I no longer know anything about Kubernetes, but share your surprise! From first principles it seems the metadata should be small.


This is the approach we took while building our Internal Developer Platform: watches (via client-go informers with client-side caching) sync data into a Postgres database as JSONB. Changes are tracked using JSON patches and Kubernetes events. To avoid a watch on every resource kind, we perform incremental object fetches for the objects involved in watched events.

Getting this to perform well required several optimizations at both the Go and Postgres levels. On the Go side, we use prioritized work queues, event de-duplication, and even switched to Rust for efficient JSON diffs. For Postgres, we leverage materialized views and trigger-based optimistic locking.
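A rough sketch of the JSONB-plus-patch idea (not the poster's actual code; the table names, DSN, and the evanphx/json-patch library choice are all assumptions): store the current object as JSONB and append a merge patch per update.

    package main

    import (
        "database/sql"
        "encoding/json"
        "log"

        jsonpatch "github.com/evanphx/json-patch/v5"
        _ "github.com/lib/pq"
        corev1 "k8s.io/api/core/v1"
    )

    // recordUpdate would be wired into an informer's UpdateFunc.
    func recordUpdate(db *sql.DB, oldPod, newPod *corev1.Pod) error {
        oldJSON, err := json.Marshal(oldPod)
        if err != nil {
            return err
        }
        newJSON, err := json.Marshal(newPod)
        if err != nil {
            return err
        }
        // The merge patch is the minimal document you'd apply to oldJSON to get newJSON.
        patch, err := jsonpatch.CreateMergePatch(oldJSON, newJSON)
        if err != nil {
            return err
        }
        // Current state lives in a JSONB column; history is an append-only stream of patches.
        if _, err := db.Exec(`INSERT INTO resources (namespace, name, object) VALUES ($1, $2, $3::jsonb)
                              ON CONFLICT (namespace, name) DO UPDATE SET object = EXCLUDED.object`,
            newPod.Namespace, newPod.Name, string(newJSON)); err != nil {
            return err
        }
        _, err = db.Exec(`INSERT INTO resource_changes (namespace, name, patch) VALUES ($1, $2, $3::jsonb)`,
            newPod.Namespace, newPod.Name, string(patch))
        return err
    }

    func main() {
        db, err := sql.Open("postgres", "postgres://localhost/cluster?sslmode=disable") // assumed DSN
        if err != nil {
            log.Fatal(err)
        }
        oldPod := &corev1.Pod{Status: corev1.PodStatus{Phase: corev1.PodPending}}
        newPod := &corev1.Pod{Status: corev1.PodStatus{Phase: corev1.PodRunning}}
        oldPod.Namespace, oldPod.Name = "default", "demo"
        newPod.Namespace, newPod.Name = "default", "demo"
        if err := recordUpdate(db, oldPod, newPod); err != nil {
            log.Fatal(err)
        }
    }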


That's how https://github.com/KusionStack/karpor did it. It has a resource-syncer component that synchronizes resources in real time to Elasticsearch, and then lets users search for K8s resources with SQL and natural language through a search bar on a web UI.

In fact, it is currently preparing to integrate Cyphernetes as a new search method. I believe this will be a great addition!


How fun was kube-ops-view though


This is a very good point and is on the roadmap.


I'm not against replacing jq/jsonpath with the right tool; they're not the most ergonomic. What isn't clear to me, though, is why this isn't SQL? It's so nearly SQL, and seems to support almost identical semantics. I realise SQL isn't perfect, but the goal of this project isn't (I assume) to invent a new query language, but to make Kubernetes more easily queryable.


It's based on Cypher, which is a query language for graph databases. The author(s) probably thought the data is more graph-like than relational.


Ah. I’ve not heard of Cypher before.

I’d disagree and say that Kubernetes is much more relational than graph-based, and SQL is pretty good for querying graphs anyway, especially with some custom extensions.

This does make more sense though.


Graph DBs are generalized relationship stores. SQL can work for querying graphs, but graph DB DSLs like Cypher become very powerful when you're trying to match across multiple relationship hops.

For example, to find all friends of a friend, or friends of a friend of a friend: `MATCH (user:User {username: "amanj41"})-[:KNOWS*2..3]->(foaf) WHERE NOT((user)-[:KNOWS]->(foaf)) RETURN user, foaf`


Reading your comment made me think that they're so close to "OSQuery for k8s", but that already seems to exist: https://www.uptycs.com/blog/kubequery-brings-the-power-of-os...


I haven't tried it, but Steampipe has a k8s plugin which lets you use PG/SQLite: https://hub.steampipe.io/plugins/turbot/kubernetes/tables


For SQL-based queries, you could take a look at Karpor (https://github.com/KusionStack/karpor), which provides powerful, flexible queries across multiple clusters; in the next release we will also support Cyphernetes as another query method.


This looks great for scripting. I will say that the query language looks a bit too verbose for daily use, meaning when you're interacting with a cluster to diagnose a problem, follow a job, test the rollout of something experimental, or similar.

For example, I'd love to be able to just do this as the whole query:

    metadata.name =~ "foo%"
or maybe:

    .. =~ "foo%"  // Any field matches
or maybe:

    $pod and metadata.name =~ "foo%"  // Shorthand to filter by type
I think a query language for querying Kubernetes ought to start with predicate-based filtering as the foundation. Having graph operators seems like a nice addition, but maybe not the first thing people generally need?

It's not quite clear who this tool is for, so maybe this is not the intended purpose?


Hey, thanks for the feedback!

re. intended purpose: Initially I started writing this to help tackle bigger problems - the stuff you'd normally need multiple nested kubectl commands for, or have to write a lot of code against the api-server to do.

Over time, I developed the shell environment around it and it became a daily driver for me as well. Indeed, there's a threshold where writing Cyphernetes becomes more economical than using kubectl, but for most of the simple day-to-day stuff writing Cypher is too verbose.

The Cyphernetes shell has an early-stage feature that allows a syntax like you suggested - there's a tiny "macros" feature that lets you define custom procedures of one or more queries (currently shell only, not supported in the web client yet).

Macros are prefixed by ":" and you could define something like:

    :pod condition
    MATCH (p:Pod) WHERE $condition
    RETURN p.metadata.name, p.status.phase; // and whatever other fields you'd like

Then use it like this:

    > :pod .metadata.name=~"foo%"

So it gives you a tiny way to customize how you do this day-to-day stuff. It ships out of the box with common kubectl equivalents like :getpo, :getdeploy, :createdeploy, :expose and so on - definitely a feature that could be developed further to make this more of a daily driver.


The

  brew install cyphernetes
at the top of the page is an immediate turn-off.


Agree, but I'm not sure why. I'm not a Mac user, so the initial impression is like "this isn't for you, go away". At least add a Linux command alongside it!


Even on macOS, brew is wildly inferior to MacPorts; to be fair, brew is “blessed” by Swift Package Manager whereas MacPorts is not, but this is ironic given the guy behind MacPorts both worked at Apple and designed the original FreeBSD ports system.


That's the very weird thing about the little universe within Apple's macOS, there are pockets of very high quality here and there, next to "what the fuck are you guys even doing?", and a Unix core that's been effectively bastardized, abandoned, and frankenstein'd

Instead of Compton being a short ride from Beverly Hills, it's like the houses from those two wildly different hoods were all stacked in a repeatedly alternating sequence.


I think the overall quality level on Macs is leagues above anyone else's, even the UNIX stuff (I personally prefer the BSD utils to the GNU ones, which pretty much everyone else running POSIX stuff is using unless they're daily-driving an actual BSD).

I just don’t get why Apple would hire Jordan Hubbard for their UNIX team, see him implement a very well thought out version of the standard package manager of all time that he also wrote, and then decide to use brew as their blessed package manager for their various open source releases, .systemLibrary in Swift packages, etc.


Thanks for the feedback. Will add more commands there on rotation to show the different installation options.


Homebrew has a Linux variant, but I assume almost nobody uses it.

I personally use a Mac with Nix, and so do many of my coworkers. Assuming Homebrew, even for a Mac user, leaves a bad impression on me.


I also prefer Mac with Nix over homebrew.


      go run github.com/avitaltamir/cyphernetes/cmd/cyphernetes@v0.14.0 --help


It would be good to have some example commands that can be run right after installation, rather than having to figure out how to run the queries.


why?..


Kubernetes only runs on Linux, so it stands to reason that if you care about k8s you should care about Linux. My experience is also that good, experienced sysadmins often use Linux for their own machines as well.

Targeting a tool at macOS users, and omitting Linux instructions, gives the impression that the tool isn't targeted at sysadmins or hackers (i.e. at us), but rather at beginners, frontend developers, etc.


Saying it's targeted at beginners because it supports macOS shows a real disconnect from what many DevOps people use these days. The year of the Linux desktop has yet to arrive, and Mac is king for people in IT (at least in the US).


I have yet to meet a competent sysadmin that cares much about "desktop", and to the extent they do they mostly seem to invent their own graphical tools, with Tcl/Tk and so on.

Are they common where you live?


I'm a "sysadmin". I only run Linux on my workstation. I even run NixOs on a home server. I manager Kubernetes clusters. Yet, I use Homebrew on Linux.


Most, however, do not, nor should they be expected to. Homebrew is not a safe or viable package manager, especially when better and safer package managers exist in the Linux ecosystem.


Brew runs on Linux too.


What? I love seeing this. I want to see how to get it quickly via package manager.


Not everyone uses the same package manager that you use.


This is fantastic. I've always enjoyed the Cypher language that the Neo4j team created for querying graph data. The connected k8s API objects seem like a great place to apply that lens.


I really, really like Steampipe for this kind of query: https://steampipe.io. It's essentially PostgreSQL (literally) for querying many different kinds of APIs, which means you have access to everything PostgreSQL's SQL language can offer for requesting data.

They have a Kubernetes plugin at https://hub.steampipe.io/plugins/turbot/kubernetes and there are a couple of things I really like:

* it's super easy to query multiple Kubernetes clusters transparently: define one Steampipe "connection" for each of your clusters, plus an "aggregator" connection that aggregates all of them, then query the aggregator connection. You will get a "context" column that indicates which Kubernetes cluster each row came from.

* it's relatively fast in my experience, even for large result sets. It's also possible to configure a caching mechanism inside Steampipe to speed up your queries.

* it also understands custom resource definitions, although you need to help Steampipe a bit (explained here: https://hub.steampipe.io/plugins/turbot/kubernetes/tables/ku...)

Last but not least: you can of course join multiple "plugins" together. I used it a couple of times to join content exposed only in GCP with content from Kubernetes, that was quite useful.

The things I don't like so much but can be lived with:

* Several columns are just exposed as plain JSON fields; you need to get familiar with PostgreSQL's JSON operators to get something useful out of them. There's a page in Steampipe's docs explaining how to use them better.

* Be familiar also with PostgreSQL's common table expressions: they are not that difficult to use and make the SQL code much easier to read.

* It's SQL, so you have to know which columns you want to pick before selecting the table they come from; not ideal for autocompletion.

* The Steampipe "psql" client is good, but sometimes a bit counterintuitive; I don't have specific examples, but I have the feeling it behaves slightly differently from other CLI clients I've used.

All in all: I think Steampipe is a cool tool to know about, for Kubernetes but also other API systems.


Steampipe project lead here - thanks for the shout out & feedback multani!

I agree with your comment about JSON columns being more difficult to work with at times. On balance, we've found that approach more robust than creating new columns (names and formats) that effectively become Steampipe specific.

Our built-in SQL client is convenient, but it can definitely be better to run Steampipe in service mode and use any Postgres compatible SQL client you prefer [1].

You might also enjoy our open source mods for compliance scanning [2] and visualizing clusters [3]. They are Powerpipe [4] dashboards as code written in HCL + SQL that query Steampipe.

1 - https://steampipe.io/docs/query/third-party
2 - https://hub.powerpipe.io/mods/turbot/kubernetes_compliance
3 - https://hub.powerpipe.io/mods/turbot/kubernetes_insights
4 - https://github.com/turbot/powerpipe


I really like Steampipe too. Writing the plugins is quite fun.


Our project https://github.com/stackql/stackql has a k8s provider which might be of interest here. We implement our own front-end SQL parser and expose all control plane routes (and data plane routes, in many cases) through overloaded SQL methods. This is not FDW-based and does not require a server (Postgres, etc.).


This is way cool. The ability to visualize the k8s object model as a graph and query it as such makes so much sense! The hottest feature in my mind is applying this in an operator - maintaining state as defined by a simple graph query. It is much more readable, and does so with very little code. Well Done!


Since you mentioned visualizing the k8s object model as a graph, I guess you might be interested in https://karpor-demo.kusionstack.io/ - hope this brings some new possibilities for you.

Disclaimer: I am the creator of Karpor.


What does this offer over jq which I can also afford?


Cyphernetes seems capable of graph/relational logic.

The example on the homepage is literally "give me deployments with more than 2 replicas with pods that are not Running, and give me the IP address of the service they're serving"...

Any idea how to do that with kubectl | jq? Their solution seems elegant to me.


Can just use normal jq select filters unless I'm missing something?


the thing is you'd need 3 k8s queries, one for pods, one for deployments, one for services, then link all of them, and filter... jq helps with the filtering, kubectl can query, but you still need to join the 3 resources to answer the query...


Right, so it's doable, just a bit more effort to do 3 queries into pipes or tmp files.


This is the Dropbox comment all over again. Lots of things are doable with more manual effort.


True - it's a trade-off like everything in life: do I want to learn yet another language's syntax, or master one like jq?

Personally I feel like mastering jq has more value across a lot more things.


I am a big fan of Cypher, so I love this. I really wish actual Cypher supported the dot notation for nested keys.


The one thing I have been waiting for


Since it's Cypher-based (instead of SQL), isn't the key question whether my k8s data is more graph-like or relational?

Adjacent, but there are lots of experts here - independent of Cyphernetes or specific tooling, what are you doing to secure the k8s API / kubectl / the k8s control plane?


I dunno, Kubernetes has a query language, it's called jq. As in, kubectl get pods -A -ojson | jq -r '.items[] | ...'. Cyphernetes seems simpler perhaps but it's not the 10x improvement I need to switch and introduce a new dependency.


I guess they would say that you have to send the output of that to be inputs of another kubectl command like

  $ kubectl logs -n foo $(kubectl get pod -n foo | awk '/Running/{print $1}')
because one of their selling points is "no nested kubectl queries".

I don't see how their queries can be more efficient than hitting the kube-apiserver multiple times, unless they have something that lives clusterside observing lifecycle events for all CRDs and answering queries with only one round-trip instead of multiple.

Or maybe they're selling "no nested kubectl queries" as an experience feature, saying that a query language is more ergonomic than bash command redirection. My brain has been warped into the shape of the shell, for better or for worse, so it's not a selling point for me.


You usually don't need that, since kubectl supports jsonpath.


I am firmly in the camp of jq because (a) I am able to bring my years of muscle memory to this problem (b) jq is without a doubt more expressive than jsonpath (c) related to the muscle memory part I have uncanny valley syndrome trying to context switch between jsonpath and jmespath (used by awscli for some stupid reason) so it's much easier to just use the one true json swiss-army tool upon the json those clis emit


Note on (b): As I understand it, JSONPath is by design limited to selecting things from the input, so it can't build a new object, array, etc.



