Categories
Outages

[Resolved] Large Inbound DDoS

We are currently facing a 5GB/s inbound DDoS attack, we are working to mitigate this and update will be provided here.

Update: We are still working on mitigating this attack, appologies for any inconvenience. Further updates will be provided as soon as possible.

Update 2: Connectivity has been restored to all but 1 server at the moment.

Update 3: Connectivity has now been restored to all servers. The remaining malicious traffic has terminated, and traffic through our network has returned to normal levels. We are very sorry for the amount of time it has taken to mitigate this, the largest timekiller was isolating multiple targets. Some customers will have had a longer amount of downtime/packet loss than others.

Following this outage we will continue to monitor the network vigilently for the next 48 hours, and we are also going to draw up our options for implementing additional filtering into our network to improve our resilienece to malicious traffic.

Categories
News

System Status

We are currently not experiencing any issues.

Categories
Outages

[Resolved] xn7 down

xn7 appears to be down although its responding to SSH. We are currently waiting for a KVM to be attached.

Update: The datacentre appears to be non-responsive at the moment. Appologies for the delay.

Update 2: This was due to the first disk in the RAID array failing, the server has now been rebooted and VPS are starting up. We will hot-swap the disk out shortly.

Update 3: We have hot-swapped out the bad disk and the array has now almost finished rebuilding.

Categories
Outages

[Resolved]Packet Loss

We have been alerted by our monitoring systems that there is currently a high amount of packet loss in our UK Data Centre, we are in contact with the Data Centre and will update here once we have any more information.

Update 2: This has now been resolved and all traffic is flowing normally.

Update: This is a LINX (London Internet Exchange) Issue, it has just dropped over 500GB/s of traffic:

Categories
Outages

[Resolved]Temporary Network Issue

During a maintenance window, one of the DWDM units lost power, this caused temporary packet loss for approx 2-3 Minutes however everything is now online and normal service has been resumed.

Categories
Outages

[Resolved] xn9 Issue

xn9 is currently having issues, although the system & VPS are responding to ping it looks like I/O or RAID related issues. Just waiting on the DC to attach KVM now

Update 1: We have confirmed this is a RAID failure and we are currently working to restore the array.

Update 2: Waiting on DC again at the moment, sorry for the delay.

Update 3: Unfortunatley it appears the RAID array has collapsed and is unusable instead of just being degraded due to a bad disk. We are working through our options to restore the array.

Update 4: We have now re-assembled both sides of the RAID set and the system is now booted off a degraded RAID-10 volume, VPS are currently starting up. We will now proceed to inspect the integrity of the array and hot-swap out any suspect disks.

Update 5: Disks p1 and p2 make up the first half of the RAID-10 array (Or together  a RAID-1 set), disk p2 is bad and a read error caused the system to hang. The system is currently running off the bad disk, p2 while p1 rebuilds itself. Once this has been completed we will immediatley take p2 offline and replace the disk. At this point the array is fragile however we have no reason to believe the current rebuild will not complete successfully. Many thanks for your patience.

Update 6: Full redundancy and performance has now been restored to the array.

Categories
Outages

[Resolved]DDoS Attack

We are currently facing a 1+GB/s DDoS Attack on our systems, we are working to mitigate this as quickly as possible and all update will be provided here.

Update 2: Traffic is now flowing normally again and now consider this resolved.

Update: This appears to be under control, we have nulled the IPs in question and are monitoring the situation very closely.

Categories
Planned Maintainance

CP1 MySQL Issue

We are currently aware of a MysQL issue on CP1 our shared/reseller server which occurs when the daily backups run at around 4AM in the morning. It is normal for MySQL databse tables to lock while a particular database is being backed up, however we have recieved reports of MySQL being inaccessible for extended amounts of time during the backup cycle.

We are working diligently to resolve this, and have a Senior Admin monitoring the situation again tonight around the time this has been occuring.

Thanks

Categories
Planned Maintainance

[Complete] Rolling Restarts 11/01/2011

Tonight at 8PM we are doing maintainence on our last server affected by the kernel driver bug, XN1.

Updates will be provided here as usual.

XN1 (Complete)
8.00PM – Going down for reboot now
8.04PM – System is back
8.16PM – All VPS have been restarted and are booting up now

The maintainence has now been completed. Many thanks for your patience.

Categories
Planned Maintainance

[Complete] Rolling Restarts 10/01/2011

Tonight we are continuing with maintainence on XN12 and XN3.

Updates will be provided here throughout.

XN12 (Complete)
8.00PM – Going down for reboot now
8.03PM – System is up and starting rolling restarts of VPS
8.10PM – All VPS have been restarted

XN3 (Complete)
8.24PM – Going down for reboot now
8.26PM – System has failed to boot into the new kernel, investigating now
8.37PM – This server appears to have a bootloader (grub) issue which we are working to resolve
8.53PM – Looks like this isn’t such a simple issue. We are still working on it and will restore service ASAP.
9.04PM – Server is now booting up on the old kernel and we are investigating why the new kernel failed to boot. It’s the same hardware as our other servers.
9.22PM – Issue resolved and the system is now up on the new kernel. Restarts of VPS to follow
9.38PM – All VPS have been restarted

Today’s maintainence is now complete. Appologies for the delay on XN3, however this was absolutly mandatory to ensure the machine is not vulnerable to a driver bug which has affected two other systems earlier this week.