Leap second bedevils Web systems over weekend
Reddit, LinkedIn and other sites were knocked offline by an extra second added to the official time
IDG News Service - Despite precautions by system providers, an extra second added to the official timekeeping record Saturday caused several popular Internet services to crash over the weekend, including LinkedIn, Reddit and Qantas airline's reservation system.
Among systems that reportedly experienced difficulty with the extra second were unpatched Linux OS kernels, Hadoop instances, Cassandra databases and Java-based programs. Enough problems were documented around the globe that some even likened the leap second change to another Y2K.
Leap seconds were introduced in 1971 as a way to keep official digital timekeeping, called Coordinated Universal Time (UTC), synchronized with mean solar time, namely by adding or removing a second from the official time. Overseen by the International Telecommunication Union (ITU), UTC is defined by the accumulation of seconds, which are measured with great precision by atomic clocks. Solar time, in contrast, measures the day by the time it takes the Earth to complete one rotation, which can fluctuate slightly due to tidal effects, the slowing of the Earth's rotation and other factors.
On Saturday, the ITU added a second to the end of June 30 UTC, at midnight Greenwich Mean Time, or 8 p.m. Eastern Daylight Time. It was the 25th adjustment to UTC since 1971.
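The timing arithmetic above can be checked with a short Python sketch. It is illustrative only: the fixed UTC-4 offset for EDT and the helper name `can_represent_leap_second` are assumptions made for this example, not anything used by the affected sites. It also shows one reason leap seconds are awkward for software: Python's standard `datetime` type has no way to express the inserted second 23:59:60, and POSIX-style timestamps count only one second across a span in which two actually elapsed.

```python
from datetime import datetime, timedelta, timezone

# Assumption for this sketch: EDT modeled as a fixed UTC-4 offset,
# so the example does not depend on a time zone database.
EDT = timezone(timedelta(hours=-4), "EDT")

# The inserted second (23:59:60 UTC, June 30, 2012) sits between these instants.
before_leap = datetime(2012, 6, 30, 23, 59, 59, tzinfo=timezone.utc)
after_leap = datetime(2012, 7, 1, 0, 0, 0, tzinfo=timezone.utc)

# Midnight UTC expressed in Eastern Daylight Time.
after_leap_edt = after_leap.astimezone(EDT)

# POSIX-style timestamps ignore leap seconds: the difference is 1.0,
# even though two SI seconds elapsed between the two instants.
posix_gap = after_leap.timestamp() - before_leap.timestamp()


def can_represent_leap_second() -> bool:
    """Return True if datetime accepts a seconds value of 60 (hypothetical helper)."""
    try:
        datetime(2012, 6, 30, 23, 59, 60, tzinfo=timezone.utc)
    except ValueError:
        return False
    return True


if __name__ == "__main__":
    print(after_leap_edt.isoformat())
    print(posix_gap)
    print(can_represent_leap_second())
```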
Typically, UTC is coordinated across the Internet using the Network Time Protocol (NTP), in which the official time, supplied by the U.S. Navy and other sources, is passed among Internet servers. The client software for NTP can be configured to accommodate the leap seconds when they occur. Additional solutions have been offered to better arm servers against possible leap second failures, including those introduced by Opera and Google engineers.
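One family of workarounds avoids a sudden one-second step by spreading the extra second over a longer window, so clients never see a repeated or 61st second. The Python sketch below illustrates that general idea with a simple linear ramp. It is a minimal illustration under stated assumptions, not the method used by Google, Opera or any NTP implementation: the function name `smeared_offset`, the linear shape and the 24-hour default window are all hypothetical choices made for this example.

```python
def smeared_offset(elapsed: float, window: float = 86400.0) -> float:
    """Return how much of the extra second has been absorbed so far.

    elapsed: seconds since the smear window began (hypothetical input).
    window:  total length of the smear window in seconds; the 24-hour
             default is an assumption made for this sketch.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    # Clamp so the offset is 0 before the window and 1 after it.
    clamped = min(max(elapsed, 0.0), window)
    # Linear ramp: the clock falls behind by a constant tiny amount per second.
    return clamped / window


if __name__ == "__main__":
    for hours in (0, 6, 12, 24):
        print(hours, smeared_offset(hours * 3600.0))
```

With a 24-hour window, each reported second is stretched by roughly 11.6 microseconds, which is small enough that most applications would not notice the adjustment.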
For this leap second, however, numerous problems were reported. Administrators running Debian Linux reported several servers going offline after the leap second occurred. The servers were restored after NTP was temporarily disabled.
Reddit reported failures stemming from its use of the Cassandra database and programs written in Java (although Reddit may also have been affected by storm-related power outages that took place Saturday in the Northern Virginia area). Mozilla also reported leap second issues stemming from Java. Mozilla's deployments of Hadoop, which relies heavily on Java, needed to be restarted. DataStax, which manages development of the open source Cassandra database, did not immediately respond to a request for comment.
The ITU has considered eliminating leap seconds because of the problems they can cause IT systems.