
Knowledge is power, and no entity powerful enough likes to have it flowing freely. It will become free at about the same time the monetary system becomes obsolete, in other words never. That doesn't mean the opposition to the powerful should stop, and that opposition is what Sci-Hub is.


> flowing freely

Another reference to Saraswati.


like doctors don't have an incentive to prescribe drugs...


You allow them to grow up, how magnanimous of you.


While working at my previous job with some clients in the USA, I regularly had meetings from 6 PM up to 10 PM local time. I'm from Bulgaria; we have a 7-hour difference with the East Coast. I accrued so many compensation/overtime hours that I was earning 20-25% on top of my salary (hours outside office hours are paid with multipliers) and had extra vacation days. Nowadays I'm not so inclined to do that much overtime, but I'm OK working shifted office hours.


Most of Europe has relatively nice public transport. I used to read books on my way to work; that was 1.5-2 hours per day for my library. My previous office was a 30-minute walk from my home, a great way to wake up, as opposed to coffee. While I was in the US, there were scarcely any sidewalks; only in big cities did it seem practical to walk anywhere.


You don't factor in the electricity/water for cooking & cleaning, or the time spent doing it (value it at the hourly rate of a mid-range cook, since you're the one standing there).
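To make the point concrete, here is a back-of-the-envelope sketch of that calculation. All the figures (cook's hourly rate, utilities, time per meal) are made-up assumptions for illustration, not data from this thread:

```python
# Hedged sketch: illustrative numbers only -- every rate below is an
# assumption, not a measured cost.
HOURLY_RATE_MID_RANGE_COOK = 15.0   # $/h, assumed market rate for your labor
UTILITIES_PER_MEAL = 0.50           # $ electricity/water for cooking & cleaning
HOURS_PER_MEAL = 1.0                # cooking + cleanup time
INGREDIENTS_PER_MEAL = 4.0          # $ groceries

def true_cost_per_meal():
    """Cost of a home-cooked meal once labor and utilities are counted."""
    labor = HOURS_PER_MEAL * HOURLY_RATE_MID_RANGE_COOK
    return INGREDIENTS_PER_MEAL + UTILITIES_PER_MEAL + labor

print(true_cost_per_meal())  # 19.5 with these assumptions
```

With these (assumed) numbers, the "cheap" home meal costs far more than its grocery bill once your time is priced in.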


We are not microservices. People in a team have a lot in common: they systematically do the same type of work, albeit in different contexts, and they can be a lot more useful to each other when they communicate seamlessly instead of in scheduled time slots.


I used to read books during commute. No time was wasted. In fact, now I seem to have a lot less time for reading, because there is always something more pressing at home.


It is surprising how antisocial the new generation is. The dynamic between colleagues is something that comes naturally when the team has great cohesion. Half the time I didn't have to finish a sentence of the kind you describe before they were already on it. I can't remember how many great ideas have come up during the "hey, let's get a cup of coffee" break. Now we look less like humans and more like avatars.


It is surprising how social the old generation is. So many face-to-face interactions could have been handled with a one-line email.

Like, a couple of weeks ago I had to waste half my day going into the office to sit in a 3-hour meeting. I got some useful information out of it, but there was SO. MUCH. OVERHEAD. I could have watched the WebEx from home and saved so much time, because the relevant parts were two 10-minute sections spaced about 2 hours apart.

Plus I hate sitting close to people who had onions for lunch. And alcohol. And it's always that dude you've gotta pair program with that smells like a gorilla and has bad breath. xD


MongoDB IMHO also has resource allocation issues: for a simple two-"node" distributed setup you have roughly four times the resource demand of a "regular" clustered service like Postgres or Oracle. Added to that, the actual promised efficiency relies heavily on the data model and has no easy way to be corrected once deployed.


How do you get four times the demand for resources from two nodes?

The "promised efficiency" of any system is a complex question. The dynamic schema approach of MongoDB makes it trivial to update schema design on the fly so I am confused by "no real easy way to correct once deployed".
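A minimal sketch of what "dynamic schema" means in practice, using plain Python dicts in place of a live collection (the `users` collection name and all field names are hypothetical):

```python
# Two documents destined for the same hypothetical "users" collection.
# MongoDB does not require them to share a schema, so adding a field is
# just a matter of writing documents with the new shape -- no ALTER TABLE.
old_shape = {"name": "Ana", "email": "ana@example.com"}
new_shape = {"name": "Boris", "email": "boris@example.com",
             "preferences": {"theme": "dark"}}  # newly introduced field

# Against a live server this would be, roughly:
#   from pymongo import MongoClient
#   coll = MongoClient()["app"]["users"]
#   coll.insert_many([old_shape, new_shape])

# Application code handles the mixed shapes with a default:
for doc in (old_shape, new_shape):
    theme = doc.get("preferences", {}).get("theme", "light")
    print(doc["name"], theme)
```

The trade-off is that the "migration" moves into application code: every reader has to tolerate both shapes until old documents are rewritten.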

BTW: the recommended production setup is three nodes (one primary, two secondaries). This is what you get by default in MongoDB Atlas (you can't go smaller).

(I work for MongoDB)


I was talking about a two-"node" Mongo setup that would mirror the capabilities of a classic two-node setup of, for example, Oracle RAC. Since in Mongo this is not a clear 1:1 match due to a different model of operation, here is what I mean by getting the same performance/continuity as RAC.

If I want to gain a relative 100% increase in concurrent read/write access and transparent session failover for my application, I have to procure two servers (with roughly the same performance; let's put it at rawPerf = 1.0 for simplicity) and a SAN/NAS with SCSI HBAs, and that would be it. All the necessary components will run on those two servers, and the application will have access to almost all capabilities of a single instance, with roughly twice the performance plus fault tolerance.

To do that in Mongo, I would have to procure two replica sets with three instances each (the bare minimum to get fault tolerance without using arbiters, which according to MongoDB University is a bad idea) for the actual data, plus another RS for the configuration metadata, which is another three instances. I will also need instances running "mongos" to route/consolidate my requests, and for fault tolerance I will need at least two of those.

Now to break down the different instances. From the two replica sets for data, I can use only two of the instances for writes, and the remaining four only for consistent reads, and that only with the appropriate write concern. And while I have two instances for writes, I am not guaranteed a consistent increase in write operations unless the data is at least somewhat evenly distributed. I will need six rawPerf 0.5 servers (assuming a 0.5 rawPerf server is capable of handling roughly half the data), which is 3.0 so far. The configuration RS, while not as demanding as the data RSs, will still require resources, so I count it as roughly 0.5-1.0 total for all three instances. Finally, there are the "mongos" instances, which need no disk space but do need some CPU and the same amount of RAM as a regular instance in case the queries are not optimized, so I have to give them 0.5 per instance.

The total amount of resources comes to 3.0 + 0.5 + 1.0 = 4.5. With Oracle RAC, on the other hand, I would have needed 2.0.
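The tally above can be written out as a quick back-of-the-envelope calculation (all the rawPerf figures are the assumptions from this comment, not benchmarks):

```python
# Back-of-the-envelope tally of "rawPerf" units for the two topologies
# described above (assumed figures, not measurements).
data_nodes  = 6 * 0.5   # two 3-member replica sets for the sharded data
config_rs   = 0.5       # config replica set, low end of the 0.5-1.0 estimate
mongos      = 2 * 0.5   # two mongos routers for fault tolerance
mongo_total = data_nodes + config_rs + mongos

rac_total = 2 * 1.0     # two full-performance Oracle RAC nodes

print(mongo_total, rac_total)  # 4.5 2.0
```

Taking the high end of the config-RS estimate (1.0) pushes the MongoDB side to 5.0, i.e. 2.25-2.5x the RAC footprint under these assumptions.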

As for the schema flexibility: from the Mongo documentation I take it that once the shard key is created, it can't be changed arbitrarily. It can only be refined by adding more fields to it, and that only from version 4.4 onward. To change it fundamentally, I would have to recreate the collection.
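For reference, the 4.4 refinement is done with the `refineCollectionShardKey` admin command. A hedged sketch below: the namespace (`shop.orders`) and field names are hypothetical, and the new key must keep the existing key as its prefix, so you can only append suffix fields, not replace the key:

```python
# Sketch of the MongoDB 4.4 refineCollectionShardKey admin command.
# "shop.orders", customer_id and order_id are hypothetical; the old
# shard key here is assumed to have been {customer_id: 1}.
cmd = {
    "refineCollectionShardKey": "shop.orders",
    "key": {"customer_id": 1, "order_id": 1},  # existing key + new suffix field
}

# Against a live sharded cluster this would be sent as, roughly:
#   from pymongo import MongoClient
#   MongoClient("mongodb://router:27017").admin.command(cmd)

print(list(cmd["key"]))  # ['customer_id', 'order_id']
```

Note that refining the key does not move existing chunks by itself; it only changes how future splits and migrations can partition the data.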

I'm sure you are well aware of all this, however I'm just stating my experience with MongoDB so far.


Ah, I see. When we talk about nodes in MongoDB, they are the members of a replica set; that is what confused me. Yes, you are right: to get the equivalent performance of a two-node database server you would have to run a sharded cluster with two shards. That is definitely more nodes than the two-node design you have outlined for the RAC-like setup.

I am not familiar with the Oracle RAC architecture, but the casual mention of a SAN fails to recognise the cost and complexity of even the smallest SAN. MongoDB, on the other hand, can run on dumb disks, with no SAN or RAID required (though RAID can help with performance).

With your two-node system, the loss of a node results in a 50% loss of performance, whereas with MongoDB the loss of a node results in a secondary taking over the primary role with no loss of performance. You get what you pay for.

Setting up such clusters used to be a PITA until the advent of MongoDB Atlas. Now you can set them up with a few mouse clicks.

As regards shard keys. Yes, they are immutable, but the documents they index are not. So dynamic schema properties are preserved in a sharded cluster.


While a SAN/NAS is expensive hardware in both price and manageability, there are entry-level devices; you can even get shared SCSI devices in some cloud services (Azure has "shared disks"). For Oracle RAC, the data is accessed from the same location; it is not distributed/replicated across nodes. It is literally the same disks attached to all nodes. Also, ASM works with raw block devices, which bypasses the need for a file system abstraction and all the latency that comes with it. You could also deploy a cluster on a cluster/network file system, but this would generally not be a good idea, since you get a lot of overhead.

Postgres clusters can be configured in similar fashion.

I'm not saying that Mongo doesn't have its strengths. However, using the full potential of your environment requires much more planning and has a lot more pitfalls.

I was particularly discussing the implications that the relative immutability of the shard key has for data distribution, and the fact that it is not that easy to change.

