RSS Feed
News
May
4
System issues on legacy systems [Updated]
Posted by Helpdesk Admin on 04 May 2023 03:40 PM

We are experiencing an issue with our legacy systems since about 09:00

This will affect mailbox access and incoming messages and some websites.

Outgoing messages  may also be affected for IMAP accounts.

We apologize for the inconvenience, we are working on this currently and will update as we have more information.

Update 10:00

ETA for fix is currently 11:30, however this could change.

Update 12:30

Unfortunately this will take some hours longer to resolve, we sincerely apologize to customers, please bear with us while we work to bring legacy services back online.

Update 15:30

We are still working on the issue in conjunction with our storage system provider Dell/EMC. At this stage it looks like it could be several hours yet, we'll update again around 6pm.

Update: 18:35

We're experiencing a very unusual condition on our storage array. Redundant storage processors are performing rolling reboots when both are present and not starting all services. Individually,  shares are appearing online for short periods of time before the processor restarts. Dell engineers working to stabilize access, some time to go yet, unfortunately.

Update: 5/5 06:30

We've worked overnight with Dell Support and Engineering teams who have been able to stabilize the processors on our arrays in order to be able to work safely on the file systems. Cause of previous erratic behavior was a corruption in some internal databases that are shared between service processors and were causing the reboots of both processors.

Hardware is now stable and we are waiting on an analysis from Dell engineering to resolve a panic issue that has been happening on access to certain file stores, once that issue is resolved we should be able to bring everything back online.

Update: 08:30

Dell Engineering are running a recovery process on the main file system. In the past, this took about 90-120 mins after which stores were brought back online, however this can vary according to usage and depth of scan.

Update: 09:00

We'll bring web shares up shortly so legacy websites should be back online shortly.

Mail stores are still recovery processing. 

Update: 12:30

Mounting of recovered Mailstore is still causing panic on our storage systems (which in turn affects websites also).

We've dismounted the webstore to allow websites function again, and will be moving data off the array to different hardware.

Unfortunately this does mean that Mailstore data will be unavailable an extended period, we are looking at contingencies to allow customers send and receive mail in advance of data migration.

Update: 23:55 

We are working with Dell on this on an on going basis.

We will be performing some software updates and also some system configuration changes.

This will cause some intermittent outages on websites, mainly after 7am but we'll try to minimize the impact of this during normal hours.

Again, we sincerely apologize for the inconvenience caused to customers still on our legacy systems and will update again as soon as we have  more information.

Update 6/5 12:00

This is proving to be a very persistent issue, unfortunately software updates did not help with the situation.

In the event this drags on much longer, we will be implementing a contingency plan to re-establish email communications for customers on a different system.

Update 6/5 23:50

We have applied some additional software updates that will allow us perform a deeper recovery process on the mailstore.

The recovery process is running overnight and we are monitoring it's progress.

Update 7/5 06:00

The mailstore is recovered and storage system is stable - we are conducting tests on this currently. 

Once this is completed, we'll begin to enable services on an incremental basis, staring with incoming email delivery and processing, then POP3 access, and finally IMAP.

Update 7/5 08:30

SMTP/IMAP & POP3 services are enabled and running normally. We are monitoring the system to ensure stability. 

We sincerely apologize for the inconvenience caused to customers still on our legacy systems.