Posterous theme by Cory Watilo

Filed under: status

More on today's Gmail issue

Gmail's web interface had a widespread outage earlier today, lasting about 100 minutes. We know how many people rely on Gmail for personal and professional communications, and we take it very seriously when there's a problem with the service. Thus, right up front, I'd like to apologize to all of you — today's outage was a Big Deal, and we're treating it as such. We've already thoroughly investigated what happened, and we're currently compiling a list of things we intend to fix or improve as a result of the investigation.

Here's what happened: This morning (Pacific Time) we took a small fraction of Gmail's servers offline to perform routine upgrades. This isn't in itself a problem — we do this all the time, and Gmail's web interface runs in many locations and just sends traffic to other locations when one is offline.

However, as we now know, we had slightly underestimated the load which some recent changes (ironically, some designed to improve service availability) placed on the request routers — servers which direct web queries to the appropriate Gmail server for response. At about 12:30 pm Pacific a few of the request routers became overloaded and in effect told the rest of the system "stop sending us traffic, we're too slow!". This transferred the load onto the remaining request routers, causing a few more of them to also become overloaded, and within minutes nearly all of the request routers were overloaded. As a result, people couldn't access Gmail via the web interface because their requests couldn't be routed to a Gmail server. IMAP/POP access and mail processing continued to work normally because these requests don't use the same routers.

The Gmail engineering team was alerted to the failures within seconds (we take monitoring very seriously). After establishing that the core problem was insufficient available capacity, the team brought a LOT of additional request routers online (flexible capacity is one of the advantages of Google's architecture), distributed the traffic across the request routers, and the Gmail web interface came back online.

What's next: We've turned our full attention to helping ensure this kind of event doesn't happen again. Some of the actions are straightforward and are already done — for example, increasing request router capacity well beyond peak demand to provide headroom. Some of the actions are more subtle — for example, we have concluded that request routers don't have sufficient failure isolation (i.e. if there's a problem in one datacenter, it shouldn't affect servers in another datacenter) and do not degrade gracefully (e.g. if many request routers are overloaded simultaneously, they all should just get slower instead of refusing to accept traffic and shifting their load). We'll be hard at work over the next few weeks implementing these and other Gmail reliability improvements — Gmail remains more than 99.9% available to all users, and we're committed to keeping events like today's notable for their rarity.

Posted by Ben Treynor, VP Engineering and Site Reliability Czar

I'm glad to see that this problem is not too difficult and some configuration changes should prevent it from happening again.

Carnival cruise

This was my first time on a boat in the ocean! It was so big I couldn’t really feel the motion and didn’t end up needing my motion sickness pills, which is good because they make me sleepy.

The ship was huge! It had about 10 different levels and took a long time to walk from bow to stern. It was a cross between a floating mall/casino/all-you-can-eat buffet!

The food was pretty good and available everywhere you go. There were three big dining halls plus many little places serving more food with lots of variety to suit everyone’s tastes.

We left from Long Beach, and the second day was spent at Catalina Island. It was nice and sunny there and the people get around town on golf carts instead of cars. We spent the day walking all over Avalon, visiting the tourist shops and walking through the non-tourist part of town too.

The rest of the trip was spent on the boat. I really enjoyed the Las Vegas style shows with dancers and a live band. The highlight in my opinion were the evening comedy shows. They put on a show for kids and then a late show for adults only. It was hilarious and very entertaining.

Watch out for alcohol and the casino while onboard or you’ll end up with a huge bill at the end of the cruise. I avoided them both like the plague and was probably the only person onboard they had to write a check to in the end. (They gave us a credit for not stopping in Mexico due to H1N1 and I never fully used it up.)

Photos: http://bit.ly/19Rodx

San Diego day 4

We had a lunchtime get together at Megan's house the morning after the wedding.

Toured Balboa Park again, this time with my parents. It’s an incredible place. I can’t believe the variety of flora and fauna. Some of the trees remind me of Dr. Seuss.

Also toured Old Town and had dinner at an authentic Mexican restaurant with a mariachi band! The weather was even colder and more wet today, however, there is so much to see and do we still had a blast.

It was a dream wedding!

Who could ask for a more picture perfect wedding! My cousin Megan, the professional dancer and Pilates instructor, was married today at a beautiful venue. Right on the water, view of downtown San Diego across the way, sail boats going by, lots of friends and family, warm weather (after the cool ocean breeze died down), great music, great food, and fun dancing…who could ask for more. They even had the Star Wars theme as the bride and groom walked away after being married! It was awesome.

Cold, wet San Diego

Drove from Sacramento to San Diego today. After two days of straight driving, I’ve got a major case of TB…tired butt!

Northern California was scorching hot and dry, but when we got to San Diego it was cold (in the low 60s) and wet. Apparently it hasn’t rained for months until the day we arrived! Go figure.

Hopefully the weather will clear up for the wedding tomorrow which will be mostly outdoors at Point Loma.


View Larger Map

We plan to tour Balboa Park tomorrow morning, walk around and get some exercise!


View Larger Map

Long day of driving finally over but...

Drove from Portland to Sacramento today! Got about 50 mpg in our Honda Civic hybrid, nice. It was a beautiful drive with lots of sunshine, scenic mountains, and also ran into a swarm of bugs that covered the whole windshield with bug guts near the rice fields North of Sacramento. Yuck!

One more day of driving in store for tomorrow. Driving straight through to San Diego. We hope to get through LA by 1 pm before the traffic hits.