Surf forecast model not updating (Jan 2021)

thermalben's picture
thermalben started the topic in Saturday, 23 Jan 2021 at 6:07am

Just a quick heads up - we seem to be experiencing some issues with our wave model, which hasn't updated for a few cycles (normally every 6 hours). We're investigating the issue this morning and hope to have everything back up and running ASAP. Apologies for the inconvenience. 

thermalben's picture
thermalben's picture
thermalben Saturday, 23 Jan 2021 at 7:40am

OK, turns out it's related to maintenance at NOAA. As issued 15th Jan:

"On January 21, 2021, the General Services Administration (GSA) will perform emergency maintenance to the power infrastructure at the David Skaggs Research Center in Boulder, CO. This will impact the Boulder IDP data center causing a total outage of the NOAA Operational Model Archive and Distribution System (NOMADS) website for at least nineteen (19) hours from 1200Z to 0700z the following day, January 22."

We received an email overnight (around 1am) saying that the "maintenance window needed to be extended which has prolonged the outage", though it should be restored in the coming hours.

So, hopefully our next model update should have the latest data.

Thanks for your patience. 

thermalben's picture
thermalben's picture
thermalben Saturday, 23 Jan 2021 at 9:10pm

Latest update (0800Z, i.e. 2 hours ago): The NOAA data source "remains degraded.  We continue to work on making it 100 percent viable again"

thermalben's picture
thermalben's picture
thermalben Sunday, 24 Jan 2021 at 6:30am

Another update (1752Z, i.e. 90 mins ago):

"NCO continues working to fully restore the Boulder data center so applications can be failed over to it.  Once that is done, the plan is to reboot the firewall in College Park to fix the Internet degradation issue.  Thank you for your patience."

thermalben's picture
thermalben's picture
thermalben Sunday, 24 Jan 2021 at 7:40am

Yet another update (2032Z, i.e. 8 mins ago): The NOAA data source "remains in degraded mode. NCO continues to work to restore it to an optimal state."

bigredcouch's picture
bigredcouch's picture
bigredcouch Sunday, 24 Jan 2021 at 9:41am

Thanks for keeping us updated Ben. It's an uncomfortable feeling not having an accurate reading on the week and weekend ahead. Makes it hard to plan. Thanks for the transparency and hopefully she's back up asap

thermalben's picture
thermalben's picture
thermalben Sunday, 24 Jan 2021 at 11:29am

Thanks mate. FWIW, the Forecaster Notes still stand as is.

thermalben's picture
thermalben's picture
thermalben Sunday, 24 Jan 2021 at 1:33pm

Latest advice - from 45 mins ago - is that NOAA have fixed the problem.

Our next model run is due around 4-5pm today so let's check back around 6pm to see how it's looking.

thermalben's picture
thermalben's picture
thermalben Sunday, 24 Jan 2021 at 1:43pm

Actually, our surf model has already picked up the latest data, so we are back up and running. Probably need to give it a day or two to get up to speed with things, as  there's no way to input the missing data (though because of the current synoptic setup, most locations should be Ok).

donweather's picture
donweather's picture
donweather Sunday, 24 Jan 2021 at 8:41pm

It’s not updating for me on the forecast page?

Craig's picture
Craig's picture
Craig Sunday, 24 Jan 2021 at 9:30pm

Yeah, it's not gone through yet, let's hope the next model run kicks off.

donweather's picture
donweather's picture
donweather Sunday, 24 Jan 2021 at 11:09pm

Weird as other websites have updated. How come not yours?

thermalben's picture
thermalben's picture
thermalben Monday, 25 Jan 2021 at 6:45am

Argh. Still down.

Don - it's a complex story. As it is, our wave model developers aren't on standby 24/7, because there's very little ongoing work required once the models are up and running (over the last twelve years, there's been less than a dozen incidents requiring immediate intervention at our end). 

When the data feed went down on Friday - it wasn't until the evening that we worked out what had happened - it was confirmed that the problem was a temporary NOAA outage (which does happen every now and then), and was therefore outside of our control. So, we took the view that it would probably come back sooner rather than later, and given the current synoptic setup, one day offline wouldn't greatly affect the data output.

Unfortunately, NOAA's expectations blew way out of proportion and it took another couple of days to fix.

Some (but not all) of the larger websites appear to have switched feeds over the weekend - for example, those with mission critical clients, such as energy, mining etc - though changing feeds is a not-insignificant amount of work, and could have only been possible for companies with full time model devs who were able to come in on a Saturday (unfortunately, this all kicked off on the cusp of the Australia Day Long Weekend.. terrible timing!).

Anyway, we've been unable to do anything until NOAA confirmed the feed was back - and even once that was done, we had to wait for another one of the six-hourly model cycles to pass before we can determine whether everything's OK at our end. 

I checked one of the forecast pages yesterday afternoon and erroneously concluded that our systems were back up and running (not sure how that happened, I think it's because I was viewing it on an iPad, which displays things a little differently). Unfortunately, it wasn't until the evening that we realised that wasn't the case. I then rebooted a few servers, and then had to wait another six-hour cycle to see if it came back. I've woken up this morning to find it's still not updating properly.

FWIW, NOAA have since been in contact with all end users (i.e. weather services) as it appears there are still some websites unable to get back up and running, as is the case with Swellnet.

Anyway, I've now asked our developers to jump in ASAP and try to diagnose the issue. We'll prioritise this today over everything else, though availability could be an issue as a lot of people are on holiday. 

Apologies again for the inconvenience - we're doing the best we can.

donweather's picture
donweather's picture
donweather Monday, 25 Jan 2021 at 7:44am

Thanks Ben

thermalben's picture
thermalben's picture
thermalben Monday, 25 Jan 2021 at 7:57am

OK, a little more progress.

Seems we managed to get a NOAA feed of yesterday's 12Z run overnight (that was the first complete download since the outage). But, our system isn't updating. I suspect one of the processing tasks has hung, and needs a kick in the balls (non-technical description).

Hopefully our devs can source that issue and get things up and running ASAP.

But, I'm confident we're over the NOAA hurdle.

thermalben's picture
thermalben's picture
thermalben Monday, 25 Jan 2021 at 8:36am

And we're back! Looks like it was a hung process on one of the servers.

bigredcouch's picture
bigredcouch's picture
bigredcouch Monday, 25 Jan 2021 at 4:52pm

Great news, thanks Ben. How long until the forecasts are up to speed/perfect?

indo-dreaming's picture
indo-dreaming's picture
indo-dreaming Monday, 25 Jan 2021 at 4:55pm

Weird day today, forecast was for 1ft low period waves, so didn't bother checking the beach this morning, checked the report about 11:00 and they said 3 to 5 ft, had a look at the beach and it was more like 2ft maybe 3 ft.

Didn't matter anyway i had to get some work done, but my lesson was learned still got to do a drive by.

thermalben's picture
thermalben's picture
thermalben Monday, 25 Jan 2021 at 5:01pm

BRC - end of the week to be safe. Depends on which coast you are though.

ID - the models are totally out of whack for Vicco for the next few days, no long period energy at all (as a result of the weekend's malarky).

bigredcouch's picture
bigredcouch's picture
bigredcouch Monday, 25 Jan 2021 at 5:34pm

Hey Ben, does that mean that the longer term forecast, say days 4-8 of the period should be pretty accurate but days 1-4 will be a bit off? Cheers. I'm in SA.

thermalben's picture
thermalben's picture
thermalben Friday, 29 Jan 2021 at 6:50am

Argh! Another alert from NOAA: "SYSTEM/SERVICE problems started around 1630Z. NCO support is investigating."

That was a couple of hours ago, and we've missed one model run as a result (though, that shouldn't affect the forecasts very much, if at all). We'll keep an eye on it.

thermalben's picture
thermalben's picture
thermalben Friday, 29 Jan 2021 at 9:40am

Latest update about 90 mins ago:

"As we continue to bring the Boulder data center back to full health, there will be brief interruptions to data feeds today. We will update users again when service has been fully restored. We apologize for the inconvenience."

Craig's picture
Craig's picture
Craig Friday, 29 Jan 2021 at 11:09am

With the intermittent outages, we've got 18z data through which should be seen across most forecasts in the coming half an hour if not already.

thermalben's picture
thermalben's picture
thermalben Friday, 12 Feb 2021 at 9:05am

More outages over the last 24 hours, due to the same problem as last time from NOAA ("Boulder Data Center system issues"). We've just restarted our servers, so hopefully it'll be back to normal for the next model run in a few hours time.

thermalben's picture
thermalben's picture
thermalben Friday, 12 Feb 2021 at 10:29am

Model run seems to be working OK now.

carpetman's picture
carpetman's picture
carpetman Friday, 12 Feb 2021 at 1:08pm

Surfcoast one is still broken :-/

jks

thermalben's picture
thermalben's picture
thermalben Thursday, 25 Mar 2021 at 5:59am

Looks like we've got another temporary glitch in the wave model, thanks to NOAA implementing a significant upgrade of the service a few days back. They've gone and changed their directory structure (first time I can recall in the last 10+ years), but we're having a few issues at our end for some reason. Will hopefully be sorted this AM.

lomah's picture
lomah's picture
lomah Saturday, 8 Jan 2022 at 1:08pm

Anyone having issues with the WAMS not updating? Seems to be stuck on wed.

Craig's picture
Craig's picture
Craig Saturday, 8 Jan 2022 at 2:37pm

Yep we've had issues getting the data from NCEP as they've had issues with their servers. We're trying to remedy it. Thanks for your patience.

lomah's picture
lomah's picture
lomah Saturday, 8 Jan 2022 at 5:18pm

No problem. Thanks for the heads up.