|
|
"By using the Freeze FlashCopy Consistency Group option, the disk subsystem will hold off I/O activity to a volume for a brief time period by putting the source volume in an extended long busy state. In this way, a window is created during which dependent write updates will not occur and FlashCopy will use that window to obtain a consistent point-in-time copy of the related volumes. I/O activity resumes when a FlashCopy consistency group is created"
Well, certain level of IO suspension is still expected, although it is not at the DB level, and this is different from Wangzhonnew's original question.
The quoted description of FlashCopy Consistency Group is correct. But the I/O activity holder is disk subsystem, NOT database subsystem and operating system. In other words, it is transparent to OS and DBMS. So-called, "certain level of IO suspension is still expected" is also absolutely correct, because I/O is always discrete from the time-slot point of view, this kind of IO suspension is always existed.
Can you confirm this is the only requirement to get a consistent DB image, but no other action required?
We have been using this method to get a full set of replicated data for years, from production OLTP, never with an I/O suspension outage. The recovery process is one part of DR solutions, what need to do is to restart DB2. This process has been standardized in our site, but different site leads to different process, the reason "why IBM will not just give an firm answer".
Another question about this backup method, can such image be used to "roll forward" the database? Or like in Wangzhonew's original question, can this be used as a base and some delta is applied to get the base to next stage of consistent db image?
My answer is YES. You can use RECOVER LOGONLY option to apply LOG after restore the copy. I have mentioned previously.
And, if this is a valid and working solution, I don't see why IBM will not just give an firm answer. We have been implementing Global Mirroring for DB2 for LUW, Global Mirroring for Oracle for DR site, and we have been cloning DB2 database using Global Mirroring all the time without affecting the primary database. And of course, we are also using "write suspend" and FlashCopy to take our daily DB backup.
FlashCopy, also known as IBM TotalStorage FlashCopy.
Asynchronous PPRC, also known as IBM TotalStorage Global Mirror.
XRC, also known as IBM TotalStorage z/OS Global Mirror.
So, I think what you are using is Asynchronous PPRC(PPRC-ASYN, or Global Mirror PPRC). It applies to both open platform and z/OS, based on PPRC-XD and FlashCopy and periodic creation of Consistency Groups across one or more ESS boxes. It is designed to provide a long-distance remote copy solution across two sites using asynchronous technology, and provide a consistent and restartable copy of the data at the remote site, created with minimal impact to applications at the local site. Asynchronous PPRC eliminates the requirement to do a manual and periodic suspend at the local site order to create a consistent and restartable copy at the remote site. The lags between the primary and secondary site is typically less than 10 seconds, minimizing the amount of data exposure in the event of an unplanned outage/disaster.
A typical Global Mirror PPRC DR solution is:
Your DBMS running in your local site, the disks (the first set) are PPRC Primary.
Thru PPRC-XD, the data in the local site (PPRC Primary) is asynchronously copied to PPRC Secondary disks(the second set), which residing in the remote site, and PPRC Secondary is also used as FlashCopy source.
And, in the remote site, you have FlashCopy taget disks(the third set).
The key point for DR structure is:
1.Automatically and periodically create Consistency Group of volumes in the local site.
2.Send incremental of consistent data to the remote site.
3.FlashCopy in the remote site.
4.Recycle from step 1 after a user-defined time period.
When unplanned outage/disaster happens:
1. TERMINATE SESSION to stop creating Consistency Groups.
2. FAILOVER to the PPRC Secondary disks at the remote site.
3. Establish a FlashCopy from the PPRC Secondary to the FlashCopy target disks. This is a safe copy.
4. Put the PPRC Secondary disks online at the remote site, start applications.
If "write suspend" dose not impact your SLA and you can tolerate a short time outage, write suspend+FlashCopy is OK for daily DB backup, and this copy should be offline (or clean) copy, assume you ensure the data in buffers is also copied. If not, this copy is still a fuzzy copy, and no difference from direct FlashCopy without write suspend. We still have methods to recover this fuzzy copy to a consisten point, i.e. the fuzzy copy made by FlashCopy is still recoverable.
[ 本帖最后由 Pythagoras 于 2008-12-2 14:54 编辑 ] |
|