HP StorageWorks XP24000 Continuous Access Journal Software User and Reference Guide, v01 (T5278-96001, June 2007)


Table 28 Resolving Continuous Access Journal Pair Suspension

Columns: Classification | Causes of Suspension | SIM | Recovery Procedure

Classification: Primary storage system hardware or secondary storage system hardware

Causes of suspension:
  Hardware redundancy has been lost due to a blockade condition. As a result, one of the following could not complete: primary-secondary storage system communications, journal creation, copy operation, restore operation, staging process, or de-staging process.
  Journals cannot be retained because some portion of the cache memory or the shared memory has been blocked due to hardware failure.
  The primary storage system failed to create and transfer journals due to unrecoverable hardware failure.
  The secondary storage system failed to receive and restore journals due to unrecoverable hardware failure.
  The drive parity group was in correction-access status while the Continuous Access Journal pair was in COPY status.

SIM: DC0x, DC1x, DC2x

Recovery procedure:
  According to the SIM, remove the hardware blockade or failure.
  Restore the failed volume pairs (Pairresync).
  If a failure occurs during execution of the RAID Manager horctakeover command, secondary volumes in SSWS pair status may remain in the master journal group. If these volumes remain, execute the pairresync -swaps command on the secondary volumes whose pair status is SSWS (pairresync is the RAID Manager command for resynchronizing pairs, and -swaps is a swap option). This operation changes all volumes in the master journal group to primary volumes. After this operation, restore the volume pairs (Pairresync).

Classification: Communications between the primary and secondary storage systems

Causes of suspension:
  Communications between the storage systems failed because the secondary storage system or network relay devices were not running.
  Journal volumes remained full even after the timeout period elapsed.

SIM: DC0x, DC1x

Recovery procedure:
  Remove the failure from the primary and secondary storage systems or the network relay devices.
  If necessary, increase resources (for example, the amount of cache, the number of paths between primary and secondary storage systems, the parity groups for journal volumes).
  Restore the failed pairs (Pairresync).

Classification: RIO overload or RIO failure

Causes of suspension:
  An unrecoverable RIO (remote I/O) timeout occurred because the storage system or network relay devices were overloaded. Or, the RIO could not be finished due to a failure in the storage system.

SIM: DC2x

Recovery procedure:
  Release the failed pairs (pairsplit -S).
  If necessary, increase resources (for example, the amount of cache, the number of paths between primary and secondary storage systems, the parity groups for journal volumes).
  Re-establish failed pairs (Paircreate).

Classification: Planned power outage to the primary storage system

Causes of suspension:
  The Continuous Access Journal pairs were temporarily suspended due to a planned power outage to the primary storage system.

SIM: DC8x

Recovery procedure:
  No recovery procedure is required. The primary storage system will automatically remove the suspension condition when the storage system is powered on.
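
Several SIM codes in Table 28 appear under more than one classification (DC0x and DC1x twice, DC2x twice), so a SIM alone narrows the cause but does not always identify it. The following is a minimal sketch, not part of any HP tool, that encodes the table as a lookup. It assumes that the first three characters of a reported SIM reference code (for example, DC2 in a hypothetical code DC21) correspond to the table entry DC2x. The names SuspensionRow, TABLE_28, and rows_for_sim are illustrative only, and the SSWS note for horctakeover failures is omitted for brevity.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class SuspensionRow:
    classification: str
    sim_prefixes: Tuple[str, ...]  # "DC0" stands for the table entry "DC0x"
    recovery: Tuple[str, ...]      # recovery steps, summarized from Table 28


# Illustrative encoding of Table 28 (summary text, not the full wording).
TABLE_28 = (
    SuspensionRow(
        "Primary or secondary storage system hardware",
        ("DC0", "DC1", "DC2"),
        ("Remove the hardware blockade or failure indicated by the SIM.",
         "Restore the failed volume pairs (Pairresync)."),
    ),
    SuspensionRow(
        "Communications between the primary and secondary storage systems",
        ("DC0", "DC1"),
        ("Remove the failure from the storage systems or network relay devices.",
         "If necessary, increase resources (cache, paths, parity groups).",
         "Restore the failed pairs (Pairresync)."),
    ),
    SuspensionRow(
        "RIO overload or RIO failure",
        ("DC2",),
        ("Release the failed pairs (pairsplit -S).",
         "If necessary, increase resources (cache, paths, parity groups).",
         "Re-establish the failed pairs (Paircreate)."),
    ),
    SuspensionRow(
        "Planned power outage to the primary storage system",
        ("DC8",),
        ("No recovery procedure is required.",),
    ),
)


def rows_for_sim(sim_code: str) -> List[SuspensionRow]:
    """Return every Table 28 row whose SIM column matches the given code.

    Assumption: only the first three characters are significant.
    """
    prefix = sim_code.strip().upper()[:3]
    return [row for row in TABLE_28 if prefix in row.sim_prefixes]


if __name__ == "__main__":
    # Hypothetical SIM code; prints each candidate classification and its steps.
    for row in rows_for_sim("DC21"):
        print(row.classification)
        for step in row.recovery:
            print("  -", step)
```

When a code matches more than one row, the Causes of Suspension column still has to be checked against the actual state of the hardware, paths, and journal volumes to pick the correct recovery procedure.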

Checking Continuous Access Journal Error Codes

Remote Web Console computers display an error message when an error occurs during Continuous Access Journal operations. The error message describes the error and displays an error code consisting of four digits. The error message may also include a storage system SVP error code. If you need to call HP technical support for assistance, report the error code(s). Please see HP StorageWorks XP24000 Remote Web Console Error Codes for a list of error codes displayed on the Remote Web Console computers.
