We're 16 years away from the overflow. In other words, some of the systems being put in production right now are going to be around, untouched, unchanged, when it happens.
Given this article, it seems we're still doing it wrong, and that means... 2038 will be "fun" (either the remediation, or the consequences of the lack thereof).
At least most of us in the industry now have a good retirement plan: Fixing the legacy systems in 16 years...
> We're 16 years away from the overflow. In other words, some of the systems being put in production right now are going to be around, untouched, unchanged, when it happens.
Yes. It's getting close. Windows XP, released in 2001, still had a significant presence in 2017, 16 years after introduction.[1] 42% of companies still had some XP systems running back then. 11% of machines were still running it.
TFA is about how GNU/Linux systems being deployed right now are not Y2038-safe.
Windows applications have been safe from this issue by default since VC8.0 (VS 2005), and macOS applications since 10.7.
And to be fair Linux itself has used 64b time_t on 64b machines from the start (and since 5.6 — last year — has also migrated i386 to 64b time_t[0], though a few issues will remain forever).
There are many who don’t think like this. Microsoft has dedicated teams that ensure backwards compatibility for legacy applications, and even then people stay on XP. If you have an inherited Ubuntu system that has been happily running a proprietary binary for years, there’s even less guarantee (and certainly no marketing organization making that guarantee) that an upgrade won’t break things. And so you fall far behind LTS.
Ironic that so many people are hoping to retire before they get asked to deal with this mess since so much medical equipment is built on ancient and poorly maintained microcontroller environments.
> In other words, some of the systems being put in production right now are going to be around, untouched, unchanged, when it happens.
Most of these systems being put in production right now will be using 64-bit distributions, which have always used 64-bit time_t. Most of the rest will be using embedded distributions, which often use something other than glibc. So only a fraction of these systems "being put in production right now [which] are going to be around" will be affected.
I don't know of any embedded systems still using glibc.
Dietlibc, uclibc (used by openwrt and the other consumer router replacement firmwares), musl libc, and bionic (since Android is technically an embedded Linux, so should be mentioned as well) dominate the embedded Linux space.
I do embedded development and I use Glibc on all of our products. It's as simple as checking the box in Buildroot, so why not? The version of Glibc I am using has special accelerated memcpy and string functions for my CPU (e6500 PPC, Altivec).
I'm no expert, but at least STM32's default is newlib or newlib-nano, which it inherits from the ARM GCC toolchain[1] from what I gather. I just checked and here it's defined as the following by default:
    ldd --version
    ldd (Debian GLIBC 2.28-10) 2.28
    Copyright (C) 2018 Free Software Foundation, Inc.
    This is free software; see the source for copying conditions. There is NO
    warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
    Written by Roland McGrath and Ulrich Drepper.
And of course, on actual microcontrollers newlib-nano is what you're likely using with a GNU toolchain.
I have no clue how widely implemented it is, but as far as NTP is concerned we're currently in era 0 which spans from 1900 to 2036. When we reach 2036, we just move to era 1 and have another 136 years until we reach era 2. NTP itself should be resilient to this kind of issue by design.
We've seen time and time again that it doesn't matter what the spec says, if it works now, noncompliance will be there. Heck we're stuck with TCP & UDP because routers and firewalls will break anything else. If it works long enough, ossification makes it the new reality.
How many NTP implementations don't consider era or contain bugs if era != 0?
There will almost certainly be failures in devices without battery-backed clocks (eg, Raspberry Pi) that boot in 1970 and rely on NTP to get the correct time, because NTP can’t tell them they are in Era 1 not Era 0.
To fix this you need a persistent low water mark on the time, compiled into the NTP program and/or stored in the filesystem (eg the timestamp on the NTP drift file). Then NTP timestamps can be interpreted as spanning the 136 years after the low water mark, using modular arithmetic.
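Roughly, in C (just a sketch; the function, the 64-bit NTP second count, and the constant handling are my own illustration, not any real NTP implementation):

    #include <stdint.h>

    /* NTP's seconds field is unsigned 32-bit and wraps every 2^32 s (~136 years).
       Given a trusted lower bound (build date, drift-file mtime, ...) expressed as
       64-bit seconds since the NTP prime epoch (1900), pick the unique 64-bit value
       that matches the received 32-bit seconds and is not older than that bound. */
    static uint64_t ntp_widen(uint32_t wire_seconds, uint64_t low_water_mark)
    {
        uint64_t era_base = low_water_mark & ~(uint64_t)0xffffffff; /* era containing the bound */
        uint64_t t = era_base | wire_seconds;
        if (t < low_water_mark)
            t += (uint64_t)1 << 32; /* the wire value already rolled into the next era */
        return t;
    }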
> The NTP protocol will synchronize correctly, regardless of era, as long as the system clock is set initially within 68 years (a half-era) of the correct time
Which means 2036 will appear to work just fine, as 1970 is only 66 years away. Now jump forward to 2038, and this would not be the case.
Chances are, based on how poorly leap seconds and TLS 1.3 (and, I predict, http3/udp) are handled in actual reality by deployed systems, that a variety of software and hardware will fail inside infrastructure worldwide when NTP ticks over to era 1.
I've implemented the 1588 protocol a couple of times, and I have less optimism.
This "era" thing seems like the exact type of thing that someone would ignore when implementing the protocol, or if it is supported, probably doesn't get tested much. It's like leap seconds. Yeah, the leap second info is conveyed in PTP, but most of the implementations I've seen simply ignore it and just jam the clock when the time jumps.
Infinitely many IIRC. Era isn't sent in the protocol, it's determined implicitly - if you know the time correctly to within a few dozen years you know the right era
A device made in 2021 could assume that the current time is at least 2021. Barring the invention of time travel and (a lot more risky) bugs assuming that NTP time stamps can be ordered by comparing them as 64-bit unsigned integers, that would make it OK for up to 2157 or so.
Working with that, conceptually, isn’t difficult. You can use any datetime as the epoch by subtracting the chosen epoch’s NTP timestamp from the one you received, with wraparound.
Making a datetime library use that epoch for formatting dates and times isn’t hard, either, but could be a lot of work.
Of course there are ways to make it work. I'm just assuming the vast majority of devs are not aware of this issue and mostly everyone's just solving their time problems by throwing NTP at it. Lots of things will break.
Absolutely. The only way we can be sure that this will be fixed by 2036 is by introducing a new NTP version that doesn't have this problem. Anything running the current version (which could be seen on the wire) would be considered broken.
NTPv4 introduces a 128-bit date format: 64 bits for the second and 64 bits for the fractional-second. The most-significant 32-bits of this format is the Era Number which resolves rollover ambiguity in most cases. According to Mills, "The 64-bit value for the fraction is enough to resolve the amount of time it takes a photon to pass an electron at the speed of light. The 64-bit second value is enough to provide unambiguous time representation until the universe goes dim."
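Laid out as a struct, that date format looks roughly like this (struct and field names are mine, purely for illustration; the layout follows RFC 5905):

    #include <stdint.h>

    struct ntp_date {
        int32_t  era;        /* signed era number: era 0 runs 1900..2036, era 1 the next 136 years */
        uint32_t era_offset; /* seconds since the start of the era */
        uint64_t fraction;   /* fractional second, ~5e-20 s resolution */
    };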
Luckily remediating this is relatively easy: Use some indicator for "certified 2036 proof" NTP clients that's visible on the network (a different port, an extension, ...), then observe the network to find legacy traffic.
Won't catch absolutely everything (e.g. local containers), but will probably get pretty close. Especially if modern NTP client versions have some way to resolve the ambiguity, e.g. a hardcoded "it's after 2020".
> A device made in 2021 could assume that the current time is at least 2021.
As long as the device knows it was made in 2021. But the device can't know that by magic; the manufacturer has to store that information in it somehow. Do all device manufacturers do that?
Probably a bunch of devices will just jump to 1970 tbh.
Though if you do it properly you add some fixed known min. time (like manufacturing time, or time of the last system software/OS update), and if the NTP time is noticeably below the min. time you increase the epoch by 1.
E.g. in your case the device could know its revision had a min. time of, idk, 2015. Then if it receives 1973 it knows it's 3 years into the next epoch.
But as anyone can guess there will be devices which didn't implement that at all and will end up in 1970. Or which require setting the time by hand and will end up in 1970 because they truncate to 32 bits at some point or similar.
Though some embedded devices might happen to avoid it. Like due to special reasons they might use a non-unix epoch.
RFC5905 uses 32 bits for era number and 32 bits for era offset in the 128 bit timestamp type. Though I don't think the 128 bit timestamp is used much...I can't find support for it in either ntpd or chrony.
They and everybody else should avoid using timestamp representations for any dates that far in the future, because it is a nightmare to translate timestamps when the timezone database is updated. Which you would have to do if, e.g., the EU finally manages to get rid of summer time and you care that dates past that change stay the same.
wait, why? unix timestamps are safe from this kind of mess as they are UTC seconds since the epoch, all the tz-aware conversion happens when you need to display localized dates but the timestamp you save should be as neutral and tz-unaware as possible. I've seen way more bugs from people assuming the system clock to be UTC while it was localized. Like missing and overwritten data on daylight saving changes.
You lose context. Say you have 2100-01-01T10:00:00 in PST. That's 4102509600, so you store it.
A few decades later, timezones are changed. PST should add 30 minutes, all others stay the same. You only have 4102509600; what should you convert it into? You lost the time zone information, so you can't make the correct decision.
Timezone changes can be quite abrupt too; this one was announced about 3 months before it took effect. Developers scramble to update software.
if you need context, e.g. location, you can still save it in another field and you will be able to correctly display localized time at any point given the universal timestamp and the current location. Or you can save properly localized and tz-aware time, I just find timestamps safer from pitfalls.
> if you need context, e.g. location, you can still save it in another field
No. Say I have a database with a bunch of timestamps for future events. I calculate the time of those events with my current idea of local time (because my customer or because legal requirements require me to do something at a specific local time), convert to GMT, and store as a timestamp in my database.
In 2024, California decides to no longer observe Daylight Savings Time. All of those current timestamps no longer happen at the proper time in the newer version of America/Los_Angeles.
So there's a big pitfall here, in that the relation between localized time and UTC timestamp changes.
Sure, you and others like you make pretty valid points. I believe we are talking about different use cases, hence the confusion. I was thinking of my most common use case of archiving events, usually experimental data, with an unambiguous time reference.
If you need to schedule future events it gets obviously a lot more complicated, you don't need a timestamp you need local time and you need it to be robust to future changes in timezones, DSTs, politics.
Basically, it's important to recognize what you're actually measuring/capturing and store something maximally equivalent to that. Conversions may render figuring things out later impossible or very difficult.
If you care about a past moment in time as reported by GPS or a computer clock-- use a UTC timestamp.
If you care about a future time delineated by a precise interval from now-- use a UTC timestamp.
If you care about a future time expected to be delineated in a specific time zone, use a timestamp in local time and note the zone.
If you care about a future time in a customer's time zone no matter where they may be, store the time in an unspecified local time and the customer for whom the time applies. Etc.
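As a rough sketch of the difference between the first two cases and the last two (field names and the split are mine, not any standard schema):

    #include <stdint.h>

    /* A past instant, or a future one defined as "exactly N seconds from now":
       safe to store as a plain UTC timestamp. */
    struct utc_instant {
        int64_t unix_seconds; /* seconds since 1970-01-01T00:00:00Z */
    };

    /* A future civil time: keep the wall-clock fields plus the zone (or customer),
       and only resolve to UTC at the last moment, with whatever tzdata is current then. */
    struct civil_future {
        int32_t year;                     /* e.g. 2100 */
        uint8_t month, day, hour, minute; /* local wall-clock values as given */
        char    zone[64];                 /* IANA zone, e.g. "America/New_York" */
    };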
You seem to be contradicting what you wrote in the sibling comment, no? Timezone information is influenced by political means, whereas localization (lat/long) information is not. UTC timestamp + localization seems more correct to me.
The semantics of "time X at point Y" are different from "time X in time zone Z". Sometimes the former may indeed be what you want, but it's uncommon for the user to be providing precise location information for a point in time far in the future. If you're told "time X, Eastern standard time", then the semantically correct thing to do is to not guess a point in space, but instead preserve the time zone as provided.
This line of thinking is how we end up with nonsensical UX like having to select "America/Los Angeles" to get US Pacific time. There are many use cases for datetimes and timezones, not all of them need a location and in some cases it's incorrect or misleading to include a location. You could certainly use a UTC timestamp and coordinates in your application but that doesn't work for many other use cases.
Your example is valid and the replies are ill-informed. There is a difference between a "date" and a "time interval". Timestamps are good for storing intervals, but if you need to plan something on a _date_, you need to store the date. This means you care not about a specific number of seconds having passed since now, but about what the calendar (and the clock) says when the event has to happen.
Example: nobody cares if a concert planned on November 19, 2024 at 19:00 in Honolulu starts in 12345600 seconds or 12349200 seconds since now. But everyone cares that everyone's calendars and watches are in sync by that time and show specifically that date and that time, regardless of how many times people switched DST or timezones in the years in-between.
Just store the time stamp and time zone separately and update the time stamp to reflect any changes to a time zone's offset (if that actually happens)...
It takes a bit of housekeeping, but isn't especially difficult.
> Just store the time stamp and time zone separately and update the time stamp to reflect any changes to a time zone's offset
Thus reinventing zoned datetimes badly
> if that actually happens
It happens literally all the time, sometimes with very little heads up, e.g. the decision to apply DST or not can be taken mere weeks before it actually occurs[0], and something as impactful as an entire calendar day disappearing can happen with 6 months notice[1].
Thanks for this - I've learned something new today. I did not realise timezone updates were so frequent.
I do find that having the timestamp at hand makes math easier in a lot of the use cases I've come across. I wonder if there is an easy hybrid solution where you can capture timezone info, timestamp info and have it adjust without a full DB update, even if timezone offsets change. Maybe a mapping table of some kind, with versioned timezone codes.
As far as I'm aware, you should only store the location next to the timestamp and let the IANA[1] do the maintenance of the tz database. Please avoid the housekeeping on your side.
It actually is especially difficult at scale for non-trivial applications. Large databases won't be able to atomically change every instance which may lead to very frustrating bugs where collisions cannot be allowed. For example, on a calendar app. I agree with dtech, if you're storing in the future don't use a timestamp unless the input value is specifically intended to be timezone unaware, which is typically only useful for short term things like control systems or alarms. For longterm human uses a datetime+timezone field or an ISO datetime+timezone string is safer.
For many purposes the correct specification of a future time is in terms of local time at a particular location on that date. This is true for financial contracts (including trillions of dollars in options markets -- 10am in New York means NY time) but also of work schedules (scheduled to arrive at 9am? That is on the workplace clock, not UTC), local transit schedules, and many other things.
That means when daylight savings time rules change or timezone lines are moved the UTC time of these events change.
Unfortunately I don't believe anyone has standardized a format for storing times which are specified local time at a specified location on a specified date. So it is all roll-your-own or use UTC or zoned datetime and be burned when things change.
The problem is that the contract normally doesn't specify epoch seconds. So if the translation of UTC since epoch to local time changes, the local time specified in the contract does not, so your UTC since epoch timestamp has to.
I don't think so. Imagine you have date saved somewhere at 10am 1 July 2031. If the EU abolishes DST in between, do you want the hour changed to 9am or keep it at 10am? I'd say both could be correct.
You convert it to the local time at that location at that point in history. That's something that belongs to time localization, timestamp should be universal and immutable against those changes.
So I'd say you are arguing against using timestamp representations for any dates far in the future since they can change depending on what happens with timezones and DST, but timestamps cannot.
My point is that either you save a properly localized tz-aware time in unambiguous standard format (iso 8601?) or you save a universal timestamp (and location if needed) and defer all the localization process to the moment you need to display localized time.
From my experience the latter is safer because people tend to save datetime strings in a format that's neither standard nor unambiguous.
I don't think you're answering the question here though. It's completely plausible there are situations where the constant is local time, not UTC (or TAI, if you want to get really obnoxious). Let's say I made an appointment to get my hair done in Berlin 10 years in advance, at 14:00 local time 29 Dec 2031, for whatever reason. Let's say someone goes crazy and moves Berlin to +1:30 this decade. I would still expect to show up at 14:00 local, but that's not the same universal time it was 10 years ago. How do you represent that in your DB where everything is in UTC?
Frankly, time doesn't work that way...
If it's currently CET and I make an appointment for sometime in summer 2022 (CEST), then I don't expect the hair salon to make the appointment in CET.
Honestly I'm not sure; in ten years the Berlin timezone might not exist anymore. In your example you need a special time representation that says "this time in local time, whatever local time will mean in the future". I'm not sure we have that in current date time representation standards, do we? You could still save tz-unaware local time and a location, and figure out in the future what local time means at that point of time and space.
I’ve worked on systems where data has a 100-126 year retention objective. We had the “fun” of dealing with this looking backward as far as 1968!
When we reimplemented the system from an ancient mainframe, we stored the value as a Unix timestamp and a standard representation of local time. It was derived from an old format going backwards and recorded in the new format going forward.
Having both is critical for our successors to figure out wtf is going on. UTC 1/1/70 is an anchor point, and the local representation is canonical at a point in time. I don’t recall how 1968-1970 was handled. Not only do you have to plan for “Berlin time” going away, but for daylight (or double daylight) times changing. Having both entries allows you to reconcile, so the poor bastard figuring out what to purge in 2090 has a fighting chance!
A string, ISO 8601, https://en.wikipedia.org/wiki/ISO_8601
Preferably RFC3339 (which is a specific profile of ISO 8601) but, as noted in a sibling comment, it isn't always appropriate for future dates.
Finance/insurance generally sticks to dates instead of timestamps for most of its data, especially since the effective date might be different from the timestamp date (e.g. a weekend or night transaction having the value date for interest calculations set to the next business day; but you also might have backdated transactions caused by various corrections). Timestamps and timezones are used for logs and auditing, but for settling money it's generally considered that the thing that matters is at the granularity of a whole date.
Of course there are things like HFT where precise time matters a lot, but there you don't schedule things years in advance, all the long-term things like mortgage schedules and insurance terms simply ignore time and timezones.
There are also applications which might reasonably have to process dates 16 years in the future right now. I suspect most of finance, for instance, is on 64-bit systems at this point, or we'd be hearing a little bit more about this.
Perhaps. But things like Log4j might help us: while in the previous century (the Y2K problem) making a potential problem largely go away was a massive undertaking, in 10 years such an endeavor may be a simple task.
There are a lot of people, even in this very thread, who will say "don't break backward compatibility!" right up until 03:14:07 UTC on 19 January 2038. At SOME point we'll have to let saner people prevail, and actually break backward compatibility. I'm just hoping it happens, you know, sooner rather than later.
I'm exactly of this opinion, it's better for us (programmers/admins/etc) to deal with some level of breakage now (gigantic as it may be), than for actual users to deal with breakage with their future dates.
Bingo. We have the option of recompiling with an older version of glibc while we work out the bugs. Our users won't have the option of running our code in 2037, as the rollover approaches.
> If _TIME_BITS is undefined, the bit size of time_t is architecture dependent. Currently it defaults to 64 bits on most architectures. Although it defaults to 32 bits on some traditional architectures (i686, ARM), this is planned to change and applications should not rely on this.
So it sounds like glibc plans to change the default, just like musl has done.
I think the best way of doing this is a 4 step approach:
1. Old situation, time_t is 32-bit
2. Migration starts, time_t is 32-bit by default, -D_TIME_BITS=64 to get 64-bits
3. Migration continues, time_t is 64-bit by default, -D_TIME_BITS=32 to get 32-bits
4. New situation, time_t is 64-bit
And with a long time between these steps. So glibc is at step 2 while musl skipped step 2 and is at step 3. It might make sense to stay at step 3 until 2037 or so for very restrictive embedded platforms etc.
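A quick way to check which step a given 32-bit toolchain is actually at (a sketch; assumes glibc, which also requires -D_FILE_OFFSET_BITS=64 whenever -D_TIME_BITS=64 is set):

    /* time_bits_check.c
       gcc -m32 time_bits_check.c                                        -> 4 bytes today on i686
       gcc -m32 -D_FILE_OFFSET_BITS=64 -D_TIME_BITS=64 time_bits_check.c -> 8 bytes on glibc >= 2.34 */
    #include <stdio.h>
    #include <time.h>

    int main(void)
    {
        printf("time_t is %zu bytes (%zu bits)\n", sizeof(time_t), 8 * sizeof(time_t));
        return 0;
    }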
It may be an idea to have a step between 2 and 3 with NO default, to force everyone who compiles their code against your header files to make their choice explicit.
That would be untenable - I wouldn't be able to cleanly `gcc -o blah blah.c` without making an explicit decision, and by extension wouldn't be able to (continue to) compile existing code either.
Rebuilds on every codebase everywhere ever would promptly blow up.
IMHO the revolts would come from two camps, a) bureaucracies for whom a build system change would normally take months, and b) individual devs who would be all like "compilation flags?? in MY defaults?! that's LESS likely than you think!".
Worst case scenario, someone forks glibc, removes the offending requirement, proffers commercial support for their fork... and ends up making bank.
Painful, yes, but isn't that a win? If it's breaking at build time, that means someone's actually /building/ their app. And the fix is easy enough (as long as they don't set it to 32 bit...).
The bigger issue is all the systems that won't be rebuilt (per several sibling comments).
There is no "no default", your distro will ship with one or the other. Very few people compile and ship glibc, upstream defaults only matter in the way they affect distro defaults.
The macros aren't set when building the libc, they are set when programs using the libc are built. So every program built on a distribution either uses the default or sets one of the macros.
> [H]ow does glibc maintain ABI compatibility? Alias attributes?
Symbol versioning[1,2]. Kind of like aliases, but more awkward. In theory it’s not specific to glibc, in practice hardly anyone else bothers being so careful about their ABI.
Note that dynamically linked musl, which Alpine uses (I think..?), doesn’t understand symbol versioning, though I expect this will only last until the first ABI break in musl.
I know that glibc uses symbol versioning to remain ABI-compatible within versions. What I meant was ABI compatibility between 32-bit and 64-bit time_t with the same glibc library. Looks like it uses something like aliases for that (but self-implemented using asm):
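In sketch form, the general mechanism looks something like this (names and version nodes are made up; this is not glibc's actual source):

    /* demo.c: export the same function under two ABI version nodes.
       Built roughly as: gcc -shared -fPIC demo.c -Wl,--version-script=demo.map
       where demo.map declares the (made-up) DEMO_1.0 and DEMO_2.0 nodes. */
    #include <stdint.h>
    #include <time.h>

    /* Old entry point, kept so binaries linked against DEMO_1.0 keep working. */
    int demo_time32(int32_t *out) { *out = (int32_t)time(NULL); return 0; }
    __asm__(".symver demo_time32, demo_time@DEMO_1.0");

    /* New entry point; the double @@ makes it the default for newly linked binaries. */
    int demo_time64(int64_t *out) { *out = (int64_t)time(NULL); return 0; }
    __asm__(".symver demo_time64, demo_time@@DEMO_2.0");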
musl has lasted almost one-third as long as glibc, with significantly less than one-third the number of ABI breaks. as far as I know, the only musl ABI break has been time64; despite glibc's supposedly superior version handling, musl has completed time64 support with headers only, whereas glibc is still in progress.
I think that would be a bad idea in this specific case.
Why? Because for the majority of codebases, the time_t change won't require any work.
You'd be forcing a compiler error, and it would effectively say "Add -D_TIME_BITS=64. If you're doing something really weird with time_t then you may have to update your code too, but in 95% of the cases, you can just add that flag."
I think the compiler error might be warranted for something like "You are using a function that is 95% of the time insecure or wrong, please add -DGAPING_SECURITY_HOLE if you know what you're doing", but if the error really is just "add a flag, you do not need to think about your code probably", the library authors themselves might as well default it for you.
On transition from step 2 to step 3 there will be a great mess: some libraries in the OS distro are built with sizeof(time_t)=4 assumption and other libraries + application are built with sizeof(time_t)=8 assumption. ABI will break at random boundaries, and not just functions: structures will have incompatible layout etc.
If step 2 is omitted, then similar great mess will happen on transition from step 1 to step 3.
We need a better plan, like modifying compilers (for i686 and other 32-bit targets) to emit '32-bit time_t code' and '64-bit time_t code' simultaneously, and then resolve to proper functions later, at the link time.
Why would you -D_TIME_BITS to something other than the default? It's a hard thing to do safely, since you're choosing to be ABI incompatible (in a way that's not caught at compile or even invocation time) with the system's default.
And then, all you get is a binary that will potentially handle times past 2038 safely, when the whole rest of the system won't.
The thing is, not too much can be expected to break from a 64 bit time_t. Only stuff that directly shoves time_t's over the network or to disk -- a bad practice -- will.
Note anything that builds on x86-64 or other 64 bit architectures has already figured out how to tolerate 64 bit time_t's.
The major reason it's not safe to build with a 64 bit time_t is because you don't know whether other libraries are. If the C library cuts over, when people/distributions move to the new library version, they know that's not a concern.
Defaults change in tooling all the time, requiring code changes for distributions bundling newer tooling/libraries. This one would require less change than most.
Of course, it's possible to survive a 64 bit time_t and still not be 2038-safe. But at least you can be correct if you have a C library and other libraries that will tolerate 64 bit time_t's.
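For the code that does persist or transmit times, the usual fix is to serialize an explicit fixed-width integer rather than a raw time_t (a sketch; the helpers are mine):

    #include <stdint.h>
    #include <time.h>

    /* Write the time as a little-endian int64 so the on-disk/on-wire format is the
       same whether the producing system's time_t is 32 or 64 bits. */
    static void put_time_le64(uint8_t out[8], time_t t)
    {
        uint64_t wide = (uint64_t)(int64_t)t;
        for (int i = 0; i < 8; i++)
            out[i] = (uint8_t)(wide >> (8 * i));
    }

    static int64_t get_time_le64(const uint8_t in[8])
    {
        uint64_t wide = 0;
        for (int i = 0; i < 8; i++)
            wide |= (uint64_t)in[i] << (8 * i);
        return (int64_t)wide; /* caller decides whether it still fits its time_t */
    }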
So the fix is to list all libraries in popular distros that break with time_t being 64 bits, file issues for all of them, and track progress somehow. Messing with the defaults is only useful if the ensuing breakage is promptly fixed.
> So the fix is to list all libraries in popular distros that break with time_t being 64 bits
All libraries in popular distros build on x86-64, which in turn means they use a 64 bit time_t there.
It's time for 32 bit architectures to join 64 bit architectures in having a 64 bit time_t.
Anything that ships today in distributions will ship in embedded systems for 5+ years. And then a lot of those embedded systems (too many) will last a long time. 2026 is getting pretty uncomfortably close to 2038.
In practice, all the code in distributions has built in environments with 64 bit time_t. Some end-user legacy code may not be, but it probably isn't much: most things just don't care about a field getting wider behind the scenes.
The big pain, IMO, at this point is the big cutover where all libraries need to move to the new ABI.
This still doesn't prove things as 2038-safe, but it at least means things reasonably can be 2038-safe on 32 bit if they choose... while today it's impossible for practical purposes.
Unfortunately you probably daily rely on multiple 32-bit linux machines, most of the time not realizing you do. I agree this should be mentioned in the title though.
I was feeling the same, about the only thing I have using 32bit userspace these days are a few rPIs, little bit early to be panicking they may suffer from the Y2038 bug, most companies waited to about 6 months before the Y2K bug was going to hit before they started caring.
> I was feeling the same, about the only thing I have using 32bit userspace these days are a few rPIs, little bit early to be panicking they may suffer from the Y2038 bug
Quite the opposite if you're talking about rPIs: the small/embedded space is where things can live for a long time. They're the primary place to worrying about. Especially if you're talking about industrial processes.
I think glibc is right to not change the defaults out from under the feet of users. "This is the worst possible way to do this. Consider libraries: what if a dependency is built with -D_TIME_BITS=64, and another dependency is built without, and they need to exchange struct timespec or similar with each other?" - this argument makes no sense to me. If you need new features and new values, update your flags rather than breaking any program that depends on your CLI API.
> If you need new features and new values update your flags rather than breaking any program that depends on your CLI API
The thing is, 2038 gets closer with each day that goes by. Especially since real programs often have to use future time values (and often further in the future than one would think).
Stuff has to change at some point. A glibc major version which changes the structures and typedefs for all 32 bit code could handle it.
But as it stands now, someone who wants to make a 32 bit program y2038-compliant will have a hard time doing it: they can't safely hand time values to any library that is possibly compiled differently.
This punts it to whom, distro maintainers? They have to try and rebuild everything with the correct define? And the lone end-user who does gcc -o myfile myfile.c gets code that's broken against the system libraries? Or they patch glibc itself and ship a glibc that's ABI incompatible with everyone else. Meh.
It is only changed from under their feet if it is done without communication. Most important is to communicate a plan. Tell the users that from date X, all new releases will have the default changed.
> Sure. And the ABI needs to break sometime before 2038.
That really highlights the benefit of the BSD model, where the OS, libc and all the base packages are shipped as a complete system. OpenBSD switched to 64 bit time_t on all supported platforms back in 2014. We all talked about Y2038 back then, and most agreed that something should be done as quickly as possible. Then the world just sort of forgot. I mean it is solved, but if you're a new developer, you might not know that you need to do something special.
The GNU and Linux world is really good at not breaking ABI compatibility and mostly that's great, but in this case it's a problem, and pushing it much further is going to be a problem. It's also going to be weird if the default never changes; then we just have a C library that just keeps producing defective programs unless you add certain flags.
There are a large number of 32bit devices still being designed and built. Not for servers, desktops or laptops, but embedded devices, controllers and so on. These are actually the worst devices to have the issue in, because they will last longer than 2038. If you ship a 32bit embedded device today, there's a reasonable chance that that device will be in service 20 years from now.
Isn't glibc, specifically, very careful about ABI compatibility?
You can still run Linux programs linked against decades-old versions of glibc on current systems as they have kept ABI compatibility with the use of symbol versioning, without changing their SONAME ("libc.so.6").
Sure, that does not extend to being able to run programs linked against new glibc symbols on old systems, but if you consider that ABI-breaking then the Linux kernel would also be ABI-breaking, as you cannot run binaries relying on new kernel syscalls (etc.) on older Linux kernels.
The problem doesn’t start in 16 years, it’s happening now. Try to do a date/time calculation past the overflow today. This can cause major issues for insurance companies right now.
> Consider libraries: what if a dependency is built with -D_TIME_BITS=64, and another dependency is built without, and they need to exchange struct timespec or similar with each other?
What happens in the reverse case on Alpine (or anything using the other approach)? If I build a new program but link against a dependency that predates the switch (say, I upgraded my workstation to a new Alpine release but didn't `make clean`), will I get the same breakage?
> If I build a new program but link against a dependency that predates the switch (say, I upgraded my workstation to a new Alpine release but didn't `make clean`), will I get the same breakage?
If you jump between incompatible versions of a C library without rebuilding, you can expect nothing to work.
> If you jump between incompatible versions of a C library without rebuilding, you can expect nothing to work.
Is there a reason musl introduced this in a non-major version? Your point here is perfectly valid, but to me the next obvious question is "what qualifies as an incompatible version?", and after doing some Googling I'm surprised it's not very clear to me. I would have assumed that going from 1.1.X to 1.2.0 was compatible, i.e. two programs compiled against 1.1.0 and 1.2.0 will work fine together, but clearly that's not actually the case.
musl created new symbols for all time_t related functions. This means that an 32-bit time_t application can link against any version, because the 32-bit time_t symbols did not change. The problem is when code passes around time_t to non-musl code - you can't mix 32-bit and 64-bit time_t then, because those are different types.
Right, I understand that, but this just gets back to my question of "what qualifies as an incompatible version?". Clearly 1.1.X and 1.2.0 are compatible in the sense that they still expose the same ABI for 32-bit time_t, and thus programs compiled against 1.1.X will still work with 1.2.0. But as the musl devs identified, time_t is commonly used in lots of other places besides libc, so in practice this change requires recompiling all code against 1.2.0 to ensure it is using the right size of time_t, else you might have programs attempt to communicate with each other using different sizes.
My question is then is this an "incompatible" release or not? From the version number alone I would have assumed 1.2.0 doesn't require a full recompile of everything, but the commenter I responded to suggested it is 'incompatible' and I don't understand why that is the case with feature releases. Do you need to recompile the world for every musl update and ensure everything is compiled against exactly the same version? Or just the feature version has to match?
The tricky thing with library versioning that you get into is when types change.
Let's say my app uses libc and libfoo, both from the underlying operating system / distribution.
libc has a 32 bit time_t, and libfoo on my system also has a 32 bit time_t. I upgrade to a new operating system where both libc and libfoo have 64 bit time_t. libc was kind enough to include symbol versioning which makes time(NULL) return a 32 bit number to my binary.
Now, libfoo either has to be patched to handle symbol versioning and have a new version inside so that my app can get the old 32 bit time_t, or it will have a subtly broken ABI.
The thing is-- who does this? Libfoo doesn't know when each distro will make the change: it won't correspond to a specific upstream version. Does each distro have to recognize the issue and introduce symbol versions? Etc.
Right, I get that point, I'm simply asking why musl introduced this change in 1.2.0 - why wasn't this 2.0.0? The fact that the 1.1.0 and 1.2.0 types are incompatible in a significant way seems counter-intuitive to me, and even after looking into it I don't see much in the way of describing how you're supposed to handle musl upgrades. Is the expectation that you recompile everything for every new feature release of musl? I couldn't find that spelled out anywhere but it seems like that is the case, or else stuff will be broken via changes like this one.
I don't know, building reliable software is a lot of boring work. Most software the industry develops is not going to space or safety critical, it's okay if they have bugs like this one.
If I were to write this post, I would take the time to: Check _why_ glibc maintainers have made this decision, and ask them why it is reasonable for this to be the case right now.
And they dropped the approach because few to none were going to rewrite existing code to add support for non-standard extensions to fix issues 40 years out. Instead they moved to 64b time_t in 10.7, I think?
And it’s not a new call, dozens of functions touch time_t, plus a few syscalls.
64b code does not necessarily imply 64b time_t though. If your time_t is an alias for int, it's 32b on anything short of (S)ILP64, which is anything but common. I'm not sure there's been any other than Cray's UNICOS and HAL's Solaris port.
This is exactly what glibc did. If you want you can use the 64-bit types and symbols directly. The _TIME_BITS=64 macro redefines the standard C & POSIX time things to point to the 64-bit variants, so you can just recompile code to use them.
For open source software, it’s a simple recompile. Most OSS compiles are 64-bit these days, where time_t has always been 64-bit. In the case of compiling a new 32-bit application, -D_TIME_BITS=64 apparently needs to be a compile time flag.
For binary software, Windows has had a Y2038 compliant proprietary API since Windows NT in the 1990s; most Windows applications use this API so Y2038 is generally not an issue.
The issue only affects a subset of 32-bit binary-only applications where we don’t have the source code any more. Hopefully, any and all applications like that will be decommissioned within the next couple of years.
I think you misread the root comment, it suggests a new function call that no one will use. Apple made that mistake, then they just switched the size and dealt with the fallout.
Looks like I lost the context. In terms of the context:
The issue is, besides having to rewrite code, it’s not just one function. It’s time_64(), but now we need gmtime_64(), strftime_64(), stat_64(), and so on for any and all functions which use timestamps.
The thinking in Linux land is that we won’t have 32-bit applications come 2038 where this matters, because everything will be 64-bit by then.
What’s the alternative to rewriting? Just recompiling?
Assuming the code on the receiving end stores the value in 32 bits (and not a time_t, which can magically change meaning, but a 32 bit integer) then it’s still doomed without a rewrite?
I mean even with time_t use you can’t just recompile and make it work because there could be subsequent assumptions that the size of a struct containing a time_t will be a specific size or allocations will be too small or misaligned and so on.
But the problem isn't really changing the OS to return a 64bit value where it used to return 32, the problem is all the applications that assumed it would be 32.
By "pulling it off" you mean, they changed it and planes didn't crash around them?
That still seems like software under the “control” of the OS maintainers. But most apps running on an OS are never seen by the people maintaining the OS.
That’s why it’s such a tricky move to break compat with those apps, because you can’t know what they are doing and how.
Lots of code would break because they assume they can do signed math with time_t, though. It's a less invasive change to make it wider: largely, it's just a recompile, except for code that persists a time_t directly or sends it directly over the network (and both of these should be considered harmful for other reasons).
The other problem is that more unsafe code in libraries, etc, will happily cooperate with 2038-safe unsigned time_t code, but will start to do bad things shortly before 2038.
A) will be fine until 2038. I assume we have a lot of things that mash time into an int or long. But such code is no worse off by virtue of the C library types being fixed.
B) -- manual calculation of size of structs instead of sizeof() -- yah, maybe it'll happen. I don't see much code this bad. If it's ever compiled on a different word length it's already been fixed.
C) Perhaps. For the most part alignment improves when you have a wider time_t, but you could have people counting the number of 32 bit fields and then needing 16 byte alignment for SSE later. Again, for the most part this penalty has been paid by code compiling on amd64, etc.
Honestly, for network code it makes more sense when the data is received to add the time to the current era. A 64 bit timestamp is an extra 4 bytes of overhead. However, the biggest issue with a network protocol is you just can't force everyone to update everything.
Now if it's some in-house protocol, sure, you can just update everything instead of making the timestamp a value relative to the current era.
For code running locally 64 bits is less of an issue, just mainly a problem of ABI breaks.
Honestly, one thing I think people overlook is file formats... A lot of them have 32 bit time fields. Also unlike a network packet, the file could actually be from a previous 32 bit era. So those timestamps are ambiguous after 2038.
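A reader of such a format can at least apply the same low-water-mark trick as with NTP above: assume the file cannot predate some known date (say, the format's release) and pick the 32-bit era that matches (a sketch, nothing standard about it):

    #include <stdint.h>

    /* Widen a stored unsigned 32-bit Unix timestamp, given a bound the file
       cannot possibly be older than (e.g. when the format was introduced). */
    static int64_t widen_file_time(uint32_t stored, int64_t not_before)
    {
        int64_t t = (not_before & ~(int64_t)0xffffffff) | stored;
        if (t < not_before)
            t += (int64_t)1 << 32;
        return t;
    }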
Half measures just create even more headaches down the road. Migrating to 64 bit time_t basically solves the problem once and for all. If you're going to make a change, make it the last change you'll ever need.
I'm also in favour of adopting IPv6 ASAP, but so far that has been a much harder sell.
Optimistically assume we as a species manage to survive to the point where distance, or relativistic speed differences, cause sufficiently frequent change in the observation of time passing that a single number, of any size, is no longer sufficient.
time_t is sufficient within bounds. It is expedient and quite correct in many computer science use cases. It can be extended with small additions for many other use cases.
However those bounds are a set of assumptions and simplifications that shouldn't be forgotten. I agree that the problem would be solved until the next paradigm shift in our understanding of time and the universe, and maybe forever if it turns out that the rules are cruel or we're too stupid to reach a more complex situation. I just wouldn't say once and for all, there's far too much uncertainty there.
Ruler of great size, less useful when measured aspects are a pile of disconnected threads rather than a canvas that is mostly shared and mostly distorted the same way.
Distance won't be a problem. 2^64 seconds takes us past the point where the expansion of the universe is such that anything you are not gravitationally bound to is outside your cosmological horizon.
You'll be in a much bigger universe, but it will be empty except for your local galaxy group.
The tradeoff there is that you would be unable to use time_t to express times before 1 Jan 1970 (iinm). That may or may not be important depending on use case.
Yes. We're talking about growing time_t from int32_t to int64_t, instead of uint32_t. If you change it to uint32_t behind the scenes, some code will silently fail while compiling OK, because it was not expecting unsigned math.
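A toy example of the kind of code that keeps compiling but silently changes behaviour if time_t quietly becomes unsigned (illustrative only, not from any real project):

    #include <inttypes.h>
    #include <stdio.h>

    int main(void)
    {
        /* Pretend these came from time(): the deadline is 1000 seconds in the future. */
        int32_t  now_s = 1000, deadline_s = 2000; /* signed, like today's 32-bit time_t */
        uint32_t now_u = 1000, deadline_u = 2000; /* what a silent switch to unsigned gives */

        int32_t  ds = now_s - deadline_s; /* -1000: "not expired", as intended */
        uint32_t du = now_u - deadline_u; /* wraps to 4294966296: looks long expired */

        printf("signed:   now - deadline = %" PRId32 "\n", ds);
        printf("unsigned: now - deadline = %" PRIu32 "\n", du);
        return 0;
    }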
> This can be observed with the large file support extension, which is needed to handle files larger than 2GiB, you must always build your code with -D_FILE_OFFSET_BITS=64. And, similarly, if you’re on a 32-bit system, and you do not build your app with -D_TIME_BITS=64, it will not be built using an ABI that is Y2038-compliant.
_TIME_BITS=64 is not working for me on an Ubuntu 18 system based on glibc 2.27 (three plus years old), and I see nothing in the header files that switches time_t.
This must be something new?
_FILE_OFFSET_BITS is old, on the other hand.
Edit: I see in the git log that this has 2021 all over it:
Anyway, it's too early to see if this is "bad". The unknown quantity is the behavior of distro people. Distro people have the ability to override this default, such that all the packages have 64 bit time_t, and the toolchain is configured to build that. I have some faith in distro people.
From my experience, the libc is not even the most scary step… It's the first, necessary step, but not sufficient. Converting a code base that has serialized data structures with 32 bits timestamps that are not even time_t (which is a good choice since you want serialization stability), now that makes for complex upgrade paths.
I know this is a joke but it highlights an issue with the 2038 bug - people think it'll be a problem in 2038. It's very badly named. It should be called something like "the future date problem", because it affects any date that gets converted to a timestamp that represents a time after 19 Jan 2038. I wrote code for calculating the payments on 40 year mortgages back in the early 2000s and I had to consider this problem then.
If you're on a GNU/Linux system that is not actually relevant unless you've also opted into glibc's 64b time_t, and recompiled everything locally: by default, glibc uses 32b time_t even on 64b machines.
> Currently it defaults to 64 bits on most architectures. Although it defaults to 32 bits on some traditional architectures (i686, ARM), this is planned to change and applications should not rely on this.
I test for these things when building open source software.
• 64-bit compiles and applications have a 64-bit time_t in Linux. If your distro and binaries are 64-bit, there isn’t a problem.
• 32-bit compiles and applications still have a 32-bit time_t in mainstream Linux libraries, so that old binaries still run. The timestamp is a rolling one; once Y2038 is hit, the timestamp will be a negative one, but one which is updated and is off by 136 years. The workaround is code like this (this is real production code in my Lua fork, Lunacy[1]):
    time_t t;
    int64_t tt;
    if (lua_isnoneornil(L, 1)) /* called without args? */
      t = time(NULL); /* get current time */
    else {
      // Lunacy only supports getting the current time
      lua_pushnil(L);
      return 1;
    }
    if (t < -1) {
      /* 32-bit time_t has wrapped negative after Y2038: shift into the next era */
      tt = (int64_t)t + 4294967296ULL;
    } else {
      tt = (int64_t)t;
    }
    if (t == (time_t)(-1))
      lua_pushnil(L);
    else
      lua_pushnumber(L, (lua_Number)tt);
    return 1;
This gives us accurate timestamps until 2106. If post 2106 compatibility is needed, we can add 2 ^ 32 again for timestamps with a positive value once the Y2038 rollover happens.
• Legacy 32-bit Windows applications using the Posix compatibility layer are not Y2038 compliant. Once Y2038 hits us, the timestamp will always return -1 (in Windows XP, the behavior was to return a number off by 136 years, but this changed in Windows 10). The workaround is to use Windows’ proprietary API for post-Y2038 timestamps. Again, from Lunacy which has a Win32 port:
    /* Convert Windows "filetime" into a Lua number */
    uint64_t t;
    FILETIME win_time = { 0, 0 };
    GetSystemTimeAsFileTime(&win_time);
    t = win_time.dwHighDateTime & 0xffffffff;
    t <<= 32;
    t |= (win_time.dwLowDateTime & 0xffffffff);
    t /= 10000000;      /* 100-nanosecond ticks -> seconds */
    t -= 11644473600LL; /* shift epoch: 1601-01-01 (FILETIME) -> 1970-01-01 (Unix) */
    lua_pushnumber(L, (lua_Number)t);
    return 1;
Now, last time I researched this, there wasn’t an “int64_t time_64bit()” style system call in the Linux API so that newly compiled 32-bit binaries could be Y2038 compliant without breaking the ABI, by using “time_64bit()” instead of “time()”. This was based on some digging around Stack Overflow just last year, and simple Google searches are still not returning pages saying, in big bold letters, “-D_TIME_BITS=64 when building 32-bit apps”.
[3] To ruin a classic sadistic interview question for sys admin roles, Linux these days returns both the modification time and the mostly useless “status change” timestamp. Facebook once decided to not move forward because I said that file timestamp was “modification time” and not “status change”; if Facebook is still asking that question, their knowledge is out of date.
But then Y2K was presented as the end of the world, with supposedly nothing being ready, and the world at large was going to experience catastrophic supply chain failures, people would be dying in hospitals, etc.
Then Y2K turned out to be a nothing burger (I'm not saying some systems didn't misbehave left and right but in the grand scheme of things, it was a non event).
I'm pretty sure it's going to be the same with Y2038.
Y2K was not a disaster precisely because there was panic and concern, and programmers spent much effort fixing it, and it was fixed due to prioritization of resources & effort.
"it's going to the same with Y2038" is like saying we can fold our umbrella in a rainstorm because we haven't gotten wet yet.
When Y2k happened, it also seemed a little ridiculous to laypeople because they thought, "Surely we don't depend on computers that much," and there was still an institutional memory of doing important business on paper. Nobody is under that illusion now.
The solution here is to simply not use dynamic linking for anything outside glibc and openssl. If your program is statically linked, or if each program ships all the dynamic libraries it links against, this compatibility problem disappears.
I'm assuming that you're also building all your dependent libraries as part of your build process, as is common with languages like Rust. Closed source libraries are somewhat out of scope when talking about Linux distributions, though you can always ask the vendor to provide you a 64-bit compatible library.
Then it's not a distinction between dynamic linking and static linking. It's a distinction of "only link against things you yourself build and ship".
It's nothing to do with closed or open source. I don't want to build a very big world, and ship a huge amount of binary code that can be found in distributions, to make a 2038-safe app.
> though you can always ask the vendor to provide you a 64-bit compatible library
This is not always true, as the vendor may have disappeared and can no longer provide new versions of the library for any reason. Though, if you knowingly have such a black box in your code and willingly keep running with it until 2038 then you kinda deserve what is coming.
But then you have to audit every single application and its libraries for latent Y2038 problems. It doesn't really save you anything except making it more difficult to track down problem applications. They'll continue pretending nothing is wrong until the drop dead date.
Even if upgraded, it is so flawed that it is practically universally replaced anyway.
There are two severe issues which must be addressed for it to be saved. First, the type should be float, not int. The former has an intuitive precision, while the latter does not. Not to mention the sign issues. It is very rare to see a timestamp int which is not used as a kind of fixed point float somewhere in practice, but doing so manually gives rise to endless bugs. A float, on the other hand, can always be in seconds, which is much more intuitive, and means the fix in downstream apps if precision changes is much easier to get right. For these reasons, upgrading to float64 would be substantially better than upgrading to int64, while being better for most use cases, almost always as fast, and requiring less thought downstream.
However, I think we can do even better. Int64 is also too small, as the common nanosecond clocks would only give a century of range, which requires adjusting pretty much every algorithm, even if it only ever deals with a few times. And of course, if I don't care about precision, I want the floating point to deal with it for me.
It almost always makes sense to use a time precision exceeding the most common clock with which it will be used, and with a range covering the most common use cases. Most regular CPUs give nanosecond precision, but then float64 would only give a few years of range exactly, which is a bit too short for many applications which want exact precision and use the standard 1970 start. The long awaited float128, however, would be sufficient for billions of years at pico precision, while maintaining low memory and compute costs. Notably cheaper than int32 was originally. float128 simply makes for a fantastic timestamp format for almost every application. It's extremely convenient compared to the clunky std::chrono, works in C and interop, and is very simple to use.
The only downside is that compiler support, while getting better, remains bad.
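For what it's worth, this is roughly what using it looks like today with GCC's __float128 and libquadmath (a sketch; needs -lquadmath and a target that provides __float128 at all, which is exactly the weak point):

    #include <quadmath.h>
    #include <stdio.h>

    int main(void)
    {
        /* Seconds since 1970 with sub-nanosecond detail: ~33 significant decimal
           digits, so nanoseconds survive comfortably at today's magnitudes. */
        __float128 t = 1.7e9Q + 123.456789012e-9Q;

        char buf[64];
        quadmath_snprintf(buf, sizeof buf, "%.15Qf", t);
        printf("timestamp = %s s\n", buf);
        return 0;
    }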
Timestamp ints constantly get cast back and forth between seconds, microseconds, and nanoseconds. It's an absolute mess everywhere. These are fixed point floats.
Everything sensor related, countless logging and file uses, and anywhere wait_for or timeouts are used are typical use cases, and indirectly so is anywhere people need to do simple things like basic arithmetic on timestamps.
The code will either contain some precision change or an unfixed bug related to overflow of one or more kinds.
So int64 is half, and I was off by a factor of three doing it in my head... I was wrong, no doubt about that. But the order of magnitude is the problem: it should be millennia, or millions of years, not a few centuries.
Wonderful, now we'll have NaN issues, and precision issues where 0.3 can't be truly represented. Did I mention floating point trap, and hard/soft float ABI compatibility? Linux apps like calling gettimeofday() frequently, and now it will be slower, esp. when the processor doesn't do floating point natively.
float128 is even worse - it doesn't exist on some ABIs or exist in some other ways, e.g. old Power/PowerPCs used an IBM format. Do we limit them to their own ABI (extra burden on distros), or use double in which case apps expecting 128bits will have quite a surprise.
How are you going to represent 0.3 using a non fixed point integer? And if you do use fixed point, you suddenly have other just as common numbers you can't represent. You really think NaN is a less intuitive result than what int64(1)/int64(0) gives? You really think the latter won't result in a bug as well?
gettimeofday is deprecated as of 2008, so even using it is a bug. Using it more than a few times per second is almost certainly a bug. The function also uses an ambiguous representation, and as the docs don't specify, I would have to check the code to see if it relies on microseconds being less than 10^6, and even if I check this for standard glibc, I would not trust other implementations to do the same. However, as we can assume that the pseudo integer type used does not need to have multiple encodings for basic numbers, it's easy to see that the range is just under 52 bits. gettimeofday is so old that they didn't even expect that emulated int64 was available.
That is, if they used a double instead, it would perfectly cover the same range, require much less code, while simultaneously being easier to get right, support all standard operations you would expect, behave more intuitively in many cases, and be faster on most platforms, though as you say, perhaps slightly slower on soft float platforms. Though I wouldn't be so sure: almost every project notices float performance, almost no project notices gettimeofday performance.
The ABI would break by changing the type regardless, numeric types should never be used without explicitly specifying precision, and no, the burden is primarily on compilers. With float128 as a core language feature, as it should be (noting that while it should be IEEE... compliant, it may be emulated), the burden on the distros would be small, and likely net beneficial, as timestamps are one of those things that cause very rare and hard to replicate bugs absolutely everywhere.
>How are you going to represent 0.3 using a non fixed point integer?
I won't try to. The alternative to your proposal is fixed point integer.
> if you do use fixed point, you suddenly have other just as common numbers you can't represent. You really think NaN is a less intuitive result than what int64(1)/int64(0) gives?
I think it's less of a problem than float. Most programmers (myself included) don't understand all the nuances of floating point.
>gettimeofday is deprecated as of 2008
The reason I mentioned it is because I recall it turned out to be a problem when porting/emulating Linux apps under Windows, since the equivalent Windows call used to be way more expensive. Apparently it was a significant problem since the call was in wide use. I don't recall any widespread effort to remove gettimeofday (or equivalents), so I suspect it's still is in use?
>With float128 as a core language feature as it should be
Unfortunately it isn't a core language feature - e.g. it can have 80bit precision on some x86s/software combinations. Now, that's worth fixing regardless.
It's not standards compliant for C++17 to use the 80bit precision version, if I remember correctly. So it is in the language; the language is just badly supported.
In practice, floating point timestamps have a constant precision throughout their life. That is, the exponent has the same value for a really long time, and you don't get any benefit from the "floating" aspect.
As another poster points out, overflow/NaN/infinity is the one nice thing you get. But you pick up all the idiosyncrasies of floats, too.
As you point out, float64 is worse than int64_t at attaining a given level of precision for a reasonable amount of time.
You do point out that floats "seamlessly upgrade" to increased precision, and this is kind of nice. But if we're picking a 128 bit type, there should be no upgrading needed ever.
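To put a number on the float64 part of that: the spacing between adjacent doubles near today's Unix time is already about a quarter of a microsecond, and it only gets coarser as the values grow (a quick check, link with -lm):

    #include <math.h>
    #include <stdio.h>

    int main(void)
    {
        double now = 1.7e9;                           /* roughly late 2023 in Unix seconds */
        double step = nextafter(now, INFINITY) - now; /* gap to the next representable double */
        printf("resolution near %.0f s: %g s (~%.0f ns)\n", now, step, step * 1e9);
        return 0;
    }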
Because in every situation I have ever run into, a floating point timestamp behaves more intuitively, and when I started converting my programs to use floating point timestamps, almost every function that used timestamps lost dozens of hard to understand special cases that accounted for int overflow, unsignedness, etc. Problems every single one of which had bitten me in the ass at least once before I wrote the code managing the special case, and of which nearly every single one had at least one bug in it. See the revisions to std::midpoint for great examples of how hard this is to do right. However, because float128 overrepresents all the older basic type variants, it becomes trivial. If the code could possibly have been made to work for any of them, it will automatically always work for float128.
It's got more to do with how intuitively the timestamp behaves, and that you can use seconds as the unit everywhere and no longer need to keep track of whether this wait function took seconds or milliseconds. Using infinity to indicate that a timeout function should block indefinitely is also much more intuitive than using 0.
Nothing wrong with fixed point. It's only a problem because people end up hand-rolling their arithmetic (and botching it) instead of using library functions that do it right.
Fixed point is great, but floating point behaves more like how people appear to expect timestamps to work. That said, I, like so many others, have written my own damned fixed point function for various reasons, and like everyone else I got it wrong several times on the way. If only there was a simple template class that could take care of it for you, but well, such a simple template class would really just be a basic numeric type anyway, so why not include it in the specification, ensuring that a high performance, extremely well tested variant was available everywhere from the start.
Imagine how many millions of hours of coder time could have been saved if the C specification included fixedpoint<20,12> (or however you wanna write it), and thus had created a single standard implementation for all of them. Having not just basic arithmetic, but decent sin, cos, sqrt, atan2 etc., all part of cmath. The difference now would be minor with soft float being ubiquitous, but back when it was faster... Damn.
Fixed point could work, and they are certainly better than ints, but floating point behaves more like people expect timestamps to work, and the specific fixed-point I need never seems to be available.