[Fault report] /gs/hs0 - occurred on Jun. 15, 2018

2018.6.18

A fault occurred; service has been temporarily restored.

1. Summary

Part of /gs/hs0 was inaccessible. Access has been temporarily restored, but performance may be degraded.

2. Period

From 21:42 to 21:57 on Jun. 15

3. Details

Around 21:42, a panic occurred on ossa0, which manages an OST of the Lustre file system (/gs/hs0), and access to /gs/hs0 was lost. Around 21:57, service was taken over by ossa1. /gs/hs0 is accessible at present.
File I/O to the Lustre file system probably stalled temporarily during the period above.
The OST normally managed by ossa0 is currently mounted on ossa1. For that reason, I/O bandwidth to /gs/hs0 may be reduced.

This fault is thought to be of the same kind as the faults that occurred on May 24 and Jun. 4.