WO2000052577A1 - System and method for relocation of compressed data tracks - Google Patents

System and method for relocation of compressed data tracks Download PDF

Info

Publication number
WO2000052577A1
WO2000052577A1 PCT/US2000/005529 US0005529W WO0052577A1 WO 2000052577 A1 WO2000052577 A1 WO 2000052577A1 US 0005529 W US0005529 W US 0005529W WO 0052577 A1 WO0052577 A1 WO 0052577A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
host
compressed
track image
compressed track
Prior art date
Application number
PCT/US2000/005529
Other languages
French (fr)
Inventor
Michael S. Milillo
Christopher J. West
Original Assignee
Storage Technology Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Storage Technology Corporation filed Critical Storage Technology Corporation
Publication of WO2000052577A1 publication Critical patent/WO2000052577A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1469Backup restoration techniques

Definitions

  • the invention relates to data compression, and particularly to relocating compressed data tracks to and from storage devices.
  • One object of the invention to provide methods for relocating compressed data tracks without the above-described problems.
  • One specific object of the invention is to provide methods and systems for relocating compressed data tracks to and from Direct Access Storage Devices ("DASD").
  • a further object of the invention is to provide methods and systems to relocate compressed tracks, selectively, to compatible storage volumes.
  • U.S. Patent Nos. 5,615,329, 5,072,378 and 5,193,184 relate to storage systems, remote data duplex and/ or virtual data storage, and provide useful background information for the invention.
  • U.S. Patent Nos. 5,615,329, 5,072,378 and 5,193,184 are thus herein incorporated by reference.
  • the systems and methods of the invention transfer compressed track images to the host with support information that provides for relocating the tracks from the source disk location to the new location, if desired, with a different track, a different cylinder, and/ or a different disk in the format of the source track.
  • data stored in compressed form is read by the host using a read protocol to read the compressed data; and that data is preceded by metadata that self-defines the track.
  • the metadata solves the relocation problem of the prior art by creating a description of a compressed track that is invariant with its currently assigned location on a virtual volume.
  • the metadata is used to restore the track in its newly assigned location on a different virtual volume, if desired.
  • a seed value in the metadata provides an identification tag for the data. This identification tag remains valid across relocation for data that is stored to a new location and subsequently decompressed.
  • the invention provides for relocating compressed tracks of different sizes so long as the new location has a compatible format and size with the original location at the time host-compressed track was read.
  • This aspect is particularly useful in supporting IBM DASD systems such as model 3380-J and 3390- 3, as these storage systems provide for different track sizes.
  • the systems and methods of the invention also permit compression and alternatively decompression of data to and from the host, similar to storage systems in existence in the prior art.
  • sending and receiving compressed data images from a storage volume with the host for purposes of backup and restore exemplifies one important advantage of the invention.
  • Backup and restore data can then be moved quickly between the backup drive, e.g., a tape drive, and the storage volume.
  • compressed data tracks from the backup device can be relocated to other locations in the virtual volume - also an advantage over the prior art.
  • the invention provides a method for relocating data to a storage volume, including the steps of: storing a compressed track image of the data at a first location in a first storage volume; tagging metadata to the compressed track image; transmitting the compressed track image to a host; and storing the compressed track image at a second location in a second storage volume according to the metadata.
  • the method can also include the step of backing up the compressed track image in a storage device after transmitting the compressed track image to a host.
  • the method can further include the step of compressing the data prior to storing the compressed track image at the first location.
  • the first and second locations can be identical; and the first and second storage volumes can be the same.
  • the method includes decompressing the compressed track image and assigning a seed value to the metadata, so that the data remains identifiable across relocation.
  • the method includes the step of automatically modifying a count field of the compressed track image to match the second location.
  • the step of transmitting is preceded by the step of reading the compressed track image by a Read Protocol.
  • the Read Protocol can include executing one or more of Define Extent, Locate Record and Read Track Commands.
  • the step of storing includes the step of writing the compressed track by a Write Protocol.
  • the Write Protocol can include executing one or more of Define Extent, Locate Record, Write, and Write Next Commands.
  • the step of writing by a Write Protocol includes the step of specifying the second location through a count field preceding the compressed track image.
  • the invention includes a system for backup and relocation of compressed track images.
  • a host connects to a control unit, which stores the data as a compressed track image in a storage volume.
  • the control unit responds to a read command from the host to transmit the compressed track image to the host; and the control unit assigns metadata to the image.
  • the control unit responds to a read command from the host to restore the compressed track image into a location corresponding to the metadata. In restore, therefore, the compressed track is relocated, if desired.
  • the system includes a backup storage device to backup data from the storage volume.
  • the host includes read protocol logic to read the compressed track image from the control unit; and write protocol logic to restore the compressed track image to the storage volume.
  • FIG. 1 schematically illustrates a prior art data storage and backup system
  • FIG. 2 schematically illustrates a prior art distributed memory arrangement
  • FIGs. 3, 3A and 3B schematically show a system, constructed according to the invention and in modes of operation, for storing and relocating compressed data tracks;
  • FIG.4 illustrates a track image restored to a storage volume and preceded by metadata and a seed value, in accord with the invention.
  • FIG. 1 schematically illustrates a prior art data storage and backup system 10, including a host 12, a control unit ("CU") 14, and a tape drive backup device 16.
  • CU 14 has a virtual volume 14a which is, for example, representative of data storage cylinders, known in the art, or complex mappings of distributed memory, such as illustratively shown in FIG. 2.
  • virtual volume 14a is commonly designed according to IBM standards.
  • Virtual volume 14a can for example be model 3380s and 3390s from IBM.
  • Tape storage device 16 is used to backup data from volume 14a.
  • host 12 writes data to CU 14 and into virtual volume 14a.
  • host 12 can write data in virtual volume 14a at disk X, cylinder X', track X", illustrated in FIG. 1 as data extent 18.
  • CU 14 compresses such data prior to storage in storage volume 14a to improve storage capacity and data transfer efficiency.
  • host 12 makes a backup of data extent 18, CU 14 decompresses the compressed track image and sends the uncompressed data to host 12; and host 12 stores the data in tape drive 16.
  • host 12 captures data from tape drive 16 and transfers the data to CU 14, which then compresses and restores data extent 18 to its original location within virtual volume 14a, i.e., at disk X, cylinder X', track X". Host 12 and/ or CU 14 do not relocate data extent 18 to a new location within virtual volume 14a.
  • FIG. 2 illustrates one prior art distributed memory arrangement 20.
  • a virtual volume e.g., virtual volume 14a
  • a control unit 22 e.g., control unit 14
  • distributed memory 26 can for example include disks or cylinders 26a with tracks for storage therein.
  • data Z entering control unit 22 is compressed and mapped to Z' within the distributed memory 26 via complex mapping detailed at the cache 24.
  • Data bus 25 provides a data routing conduit between the cache 24 and the disks 26a.
  • Data Z' can be decompressed, read and recovered to a connected host by known methods, as described in FIG. 1.
  • system 50 provides for relocating compressed data tracks to user-selected storage volumes, such as during data backup and restore.
  • FIG. 3 shows system 50 writing a compressed track to a storage volume in a first mode of operation.
  • FIG. 3A shows system 50 in a backup mode of operation.
  • system 50 includes a host 52 coupled to CU 54 via data bus 53.
  • Data bus 53 can include a plurality of data paths, as known in the art.
  • Host 52 stores data into storage volume 56 through CU 54.
  • CU 54 has a cache 55 coupled to storage volume 56 via data bus 57; and cache 55 maps data into volume 56 according to known methods.
  • Storage volume 56 illustratively shows one track 58 representative of compressed data assuming the extent within volume 56.
  • storage volume 56 can be, and preferably is, a distributed memory arrangement of cylinders and tracks.
  • Tape device 60 connects to host 52 through data bus 61 and is used by system 50 to backup data within storage volume 56. As illustrated in FIG. 3, system 50 is shown writing compressed track image 58 into storage volume 56.
  • CU 54 preferably includes an application specific integrated circuit (“ASIC") 54a, which compresses and alternatively decompresses data to and from CU 54.
  • ASIC application specific integrated circuit
  • CU 54 can decompress stored data, via ASIC 54a, so that host 52 receives uncompressed data.
  • CU 54 also sends compressed data to host 52, when commanded, for purposes of backup, restore and/ or relocation.
  • ASIC 54a is not used to decompress data sent from CU 54 to host 52; and data backed up in tape drive 60 is conveniently stored in compressed form.
  • system 50 is shown illustrating a backup of data, e.g., track 58, from storage volume 56 to tape device 60.
  • Data backup is typically performed to safeguard data track 58 in a redundant, second storage volume; though data backup also occurs to manipulate or use storage volume 56 for other purposes while maintaining a copy of the original data track 58.
  • host 52 reads data track 58 from storage volume 56 with read protocol 52a.
  • Read protocol 52a for example takes the form of software that provides host 52 with capability to read the compressed track image 58.
  • CU 54 assigns and tags metadata to image 58. Metadata is shown illustratively in FIG. 4 and is described in more detail below.
  • the backup version of data track 58 is shown as data 60a in tape device 60.
  • CU 54 does not decompress the data prior to transmission to host 52. Rather, compressed track image 58 is sent to host 52 in compressed form with metadata attached. Nevertheless, CU 54 has capability (i.e., at ASIC 54a) to decompress data for host 52, when commanded.
  • FIG. 3B shows system 50 restoring data 60a to storage volume 56; except that host commands change the location of the stored track image to location 62.
  • a user of system 50 has the choice to relocate compressed data 60a to track location 62 through the metadata assigned to the track image 58.
  • Host 52 writes data 60a into storage volume 56 with write protocol 52b.
  • Write protocol 52a for example takes the form of software that provides host 52 with capability to write - and hence "restore" - compressed track images into storage volume 56.
  • system 50 restores compressed data 60a to location 62 via the metadata assigned to data 60a.
  • Location 62 is specified at host 52 by user commands.
  • FIG. 3B also shows an alternative storage volume 70 connected to CU 54 via data bus 72. If for example original data track 58 was stored in cylinder y in volume 56, compressed data associated with track 58 can for example be restored to track 74, cylinder v, in volume 70. Cylinders y and v represent any one cylinder within the associated storage volume 56, 70. As above, restoring data 60a to volume 56 or 70, and at selected cylinders and tracks within these volumes, is preferably determined by commands at host 52.
  • FIG. 4 illustrates data extent 58 and metadata 82 transferred to host 52 in the above operational mode of FIG. 3A.
  • data extent 58 is preceded by metadata 82 which "self-defines" track image 58.
  • Metadata 82 provides for relocation of compressed tracks by creating a description of the compressed track that is invariant with its currently assigned location within volume 56.
  • write protocol 52a uses the metadata to "restore" the track to the newly assigned location (e.g., location 62 or 74), if desired,
  • CU 54 Whenever a compressed track is decompressed by ASIC 54a, CU 54 appends a seed value 84 to the metadata as an identification tag for data extent 58. After the compressed track is written to a new location - e.g., to location 62 or 74, FIG. 4 - the identification tag remains valid across the relocation even during the decompression process.
  • CU 54 understands when host 52 transmits compressed data with metadata 82. Even if host 52 originally transmits compressed images to CU 54, relocation such as described above can still occur though the metadata 82; and relocation can be accomplished through a single read and write operation by host 52. Seed value 84 establishes that the track is consistent such as through cyclic redundancy checking ("CRC”), known in the art.
  • CRC cyclic redundancy checking
  • the invention utilizes the count field (known in the art), which specifies information such as cylinder, head and record numbers.
  • the count field can be specified to identify the origination of each record.
  • the CU modifies the count field, as appropriate, to specify the data extent as cylinder and head numbers for the redirected compressed track in each count field. Accordingly, the CU translates the data extents and modifies the count field based upon what is transferred by the CU. For example, if the count field describes the cylinder and head numbers of the extent within the tape drive 60, the CU moves the data into that extent and modifies the cylinder and head numbers, if needed, based upon the appropriate mapping.
  • the count field can also be used in rerouting tracks to and from the CU.
  • host read protocol 52a sends a series of commands that are chained together during a read process.
  • the first command (Define Extent) specifies the extent (i.e., the range of tracks) over which the read pertains.
  • the second command (Locate record) specifies the start track and the mask of which tracks in a cylinder are read. The second command thus sets the orientation of which track is read first; and the mask dictates which subsequent tracks are read by skipping tracks that do not have the mask bit set.
  • the next read protocol commands are the Read Track Commands. There can, for example, be up to fifteen read track commands after the Locate Record Command, since there typically are fifteen tracks per cylinder.
  • the Write Next Track Command writes data to the track that is incremental to the prior oriented track (i.e., the 'next' track ).
  • This track and subsequent tracks are written to the orientation specified by their location in the chain relative to the Locate Record Command, without regard to where they were read from or to the count field specified within the write data.
  • An alternative is to use the count field to place the tracks, without regard to where they were read from or to the orientation specified by a location relative to the Locate Record domain.

Abstract

Systems and methods are provided to backup, restore and relocate compressed data images, e.g., DASD compressed tracks, associated with virtual storage volumes. A compressed track image of the data is first stored in a storage volume. The data derives from a connected host connected with a control unit that manages the storage volume. When the host data is uncompressed, the control unit compresses the data. In backup, the host reads the compressed data with a read protocol and the control unit assigns metadata to the compressed track. The compressed track is then stored in a backup device, e.g., a tape drive. In restore, the host writes the compressed track to the control unit with a write protocol. If desired, the track is relocated to a different location or to a different storage volume through the associated metadata. In decompression, a seed value is assigned to the data to ensure identification across relocation.

Description

SYSTEM AND METHOD FOR RELOCATION OF COMPRESSED DATA TRACKS
Related Applications
This application is a continuation-in-part of commonly-owned and co- pending U.S. Application No.09/243,085 (Attorney Docket No. 98-056-DSK), filed on February 2, 1999, entitled "Selective Remote Storage Copy System and Methods," and herein incorporated by reference.
Field of the Invention
The invention relates to data compression, and particularly to relocating compressed data tracks to and from storage devices.
Background of the Invention
Methods of sending compressed data from a disk subsystem to a host are well known in the prior art. Nevertheless, such methods are currently limited in that the compressed track is restored, i.e., written by the host, at the same storage location from which the track was read. While this may be acceptable for disk volume backup and restore, it is unacceptable for the backup and restore of application data sets that are backed up from one location and restored to another location.
More particularly, application data sets are seldom restored to the disk location from where they were originally backed up. By way of one example, data sets are sometimes deleted after backup and the corresponding disk space is used for another data set; and it can be undesirable to over-write the new data with restored data. By way of another example, utilities that compact volumes to make contiguous areas of free space move data sets around; and it can be undesirable to perform an additional backup simply because the data set was moved. It is, accordingly, one object of the invention to provide methods for relocating compressed data tracks without the above-described problems. One specific object of the invention is to provide methods and systems for relocating compressed data tracks to and from Direct Access Storage Devices ("DASD"). A further object of the invention is to provide methods and systems to relocate compressed tracks, selectively, to compatible storage volumes. These and other objects will become apparent in the description that follows.
Summary of the Invention
U.S. Patent Nos. 5,615,329, 5,072,378 and 5,193,184 relate to storage systems, remote data duplex and/ or virtual data storage, and provide useful background information for the invention. U.S. Patent Nos. 5,615,329, 5,072,378 and 5,193,184 are thus herein incorporated by reference.
In one aspect, the systems and methods of the invention transfer compressed track images to the host with support information that provides for relocating the tracks from the source disk location to the new location, if desired, with a different track, a different cylinder, and/ or a different disk in the format of the source track. In one specific aspect, data stored in compressed form is read by the host using a read protocol to read the compressed data; and that data is preceded by metadata that self-defines the track. The metadata solves the relocation problem of the prior art by creating a description of a compressed track that is invariant with its currently assigned location on a virtual volume. When the compressed track is written using a write protocol to write a compressed track, the metadata is used to restore the track in its newly assigned location on a different virtual volume, if desired.
In another aspect, when a compressed track is decompressed, a seed value in the metadata provides an identification tag for the data. This identification tag remains valid across relocation for data that is stored to a new location and subsequently decompressed.
In one aspect, the invention provides for relocating compressed tracks of different sizes so long as the new location has a compatible format and size with the original location at the time host-compressed track was read. This aspect is particularly useful in supporting IBM DASD systems such as model 3380-J and 3390- 3, as these storage systems provide for different track sizes.
In certain aspects, the systems and methods of the invention also permit compression and alternatively decompression of data to and from the host, similar to storage systems in existence in the prior art. However, sending and receiving compressed data images from a storage volume with the host for purposes of backup and restore exemplifies one important advantage of the invention. Backup and restore data can then be moved quickly between the backup drive, e.g., a tape drive, and the storage volume. Further, with the invention, compressed data tracks from the backup device can be relocated to other locations in the virtual volume - also an advantage over the prior art.
In still another aspect, the invention provides a method for relocating data to a storage volume, including the steps of: storing a compressed track image of the data at a first location in a first storage volume; tagging metadata to the compressed track image; transmitting the compressed track image to a host; and storing the compressed track image at a second location in a second storage volume according to the metadata.
The method can also include the step of backing up the compressed track image in a storage device after transmitting the compressed track image to a host.
The method can further include the step of compressing the data prior to storing the compressed track image at the first location. In other aspects, the first and second locations can be identical; and the first and second storage volumes can be the same.
In yet another aspect, the method includes decompressing the compressed track image and assigning a seed value to the metadata, so that the data remains identifiable across relocation.
In one aspect, the method includes the step of automatically modifying a count field of the compressed track image to match the second location.
In another aspect, the step of transmitting is preceded by the step of reading the compressed track image by a Read Protocol. The Read Protocol can include executing one or more of Define Extent, Locate Record and Read Track Commands.
In still another aspect, the step of storing includes the step of writing the compressed track by a Write Protocol. The Write Protocol can include executing one or more of Define Extent, Locate Record, Write, and Write Next Commands. In a preferred aspect, the step of writing by a Write Protocol includes the step of specifying the second location through a count field preceding the compressed track image.
In another aspect, the invention includes a system for backup and relocation of compressed track images. A host connects to a control unit, which stores the data as a compressed track image in a storage volume. The control unit responds to a read command from the host to transmit the compressed track image to the host; and the control unit assigns metadata to the image. The control unit responds to a read command from the host to restore the compressed track image into a location corresponding to the metadata. In restore, therefore, the compressed track is relocated, if desired. In another aspect, the system includes a backup storage device to backup data from the storage volume.
In other aspects, the host includes read protocol logic to read the compressed track image from the control unit; and write protocol logic to restore the compressed track image to the storage volume.
The invention is next described further in connection with preferred embodiments, and it will become apparent that various additions, subtractions, and modifications can be made by those skilled in the art without departing from the scope of the invention.
Brief Description of the Drawings
A more complete understanding of the invention may be obtained by reference to the drawings, in which:
FIG. 1 schematically illustrates a prior art data storage and backup system;
FIG. 2 schematically illustrates a prior art distributed memory arrangement;
FIGs. 3, 3A and 3B schematically show a system, constructed according to the invention and in modes of operation, for storing and relocating compressed data tracks; and
FIG.4 illustrates a track image restored to a storage volume and preceded by metadata and a seed value, in accord with the invention.
Detailed Description of the Drawings FIG. 1 schematically illustrates a prior art data storage and backup system 10, including a host 12, a control unit ("CU") 14, and a tape drive backup device 16. CU 14 has a virtual volume 14a which is, for example, representative of data storage cylinders, known in the art, or complex mappings of distributed memory, such as illustratively shown in FIG. 2. Those skilled in the art understand that virtual volume 14a is commonly designed according to IBM standards. Virtual volume 14a can for example be model 3380s and 3390s from IBM. Tape storage device 16 is used to backup data from volume 14a.
In operation, host 12 writes data to CU 14 and into virtual volume 14a. By way of example, host 12 can write data in virtual volume 14a at disk X, cylinder X', track X", illustrated in FIG. 1 as data extent 18. CU 14 compresses such data prior to storage in storage volume 14a to improve storage capacity and data transfer efficiency. When host 12 makes a backup of data extent 18, CU 14 decompresses the compressed track image and sends the uncompressed data to host 12; and host 12 stores the data in tape drive 16. To restore data to virtual volume 14a, host 12 captures data from tape drive 16 and transfers the data to CU 14, which then compresses and restores data extent 18 to its original location within virtual volume 14a, i.e., at disk X, cylinder X', track X". Host 12 and/ or CU 14 do not relocate data extent 18 to a new location within virtual volume 14a.
FIG. 2 illustrates one prior art distributed memory arrangement 20. Specifically, FIG. 2 shows that a virtual volume (e.g., virtual volume 14a) within a control unit 22 (e.g., control unit 14) can be represented as a cache 24 coupled to distributed memory 26. Distributed memory 26 can for example include disks or cylinders 26a with tracks for storage therein. As known in the art, data Z entering control unit 22 is compressed and mapped to Z' within the distributed memory 26 via complex mapping detailed at the cache 24. Data bus 25 provides a data routing conduit between the cache 24 and the disks 26a. Data Z' can be decompressed, read and recovered to a connected host by known methods, as described in FIG. 1.
FIGs. 3-3B illustrate a system 50 constructed according to the invention.
Among other functions, system 50 provides for relocating compressed data tracks to user-selected storage volumes, such as during data backup and restore.
FIG. 3 shows system 50 writing a compressed track to a storage volume in a first mode of operation. FIG. 3A shows system 50 in a backup mode of operation.
FIG. 3B shows system 50 in a restore mode of operation.
More particularly, system 50 includes a host 52 coupled to CU 54 via data bus 53. Data bus 53 can include a plurality of data paths, as known in the art. Host 52 stores data into storage volume 56 through CU 54. CU 54 has a cache 55 coupled to storage volume 56 via data bus 57; and cache 55 maps data into volume 56 according to known methods. Storage volume 56 illustratively shows one track 58 representative of compressed data assuming the extent within volume 56. Those skilled in the art should appreciate that storage volume 56 can be, and preferably is, a distributed memory arrangement of cylinders and tracks. Tape device 60 connects to host 52 through data bus 61 and is used by system 50 to backup data within storage volume 56. As illustrated in FIG. 3, system 50 is shown writing compressed track image 58 into storage volume 56.
As known in the art of virtual volume systems such as DASD, storage volume 56 can have many disks, cylinders, records and tracks. Each disk can have several cylinders; and each cylinder can have several tracks, each track having storage capacity. Records can fill entire tracks or portions of tracks. Data is spread among tracks and records through the associated cache and according to distributed memory methods known in the art.
CU 54 preferably includes an application specific integrated circuit ("ASIC") 54a, which compresses and alternatively decompresses data to and from CU 54. Specifically, if host 52 sends uncompressed data to CU 54 along bus 53, then ASIC 54a can compress the data for storage within volume 56, saving space. Likewise, if commanded by the host, CU 54 can decompress stored data, via ASIC 54a, so that host 52 receives uncompressed data. As described herein, CU 54 also sends compressed data to host 52, when commanded, for purposes of backup, restore and/ or relocation. In this case, ASIC 54a is not used to decompress data sent from CU 54 to host 52; and data backed up in tape drive 60 is conveniently stored in compressed form.
Note that ASIC 54a is also not used to compress data when host 52 sends compressed data to CU 54. Rather, compressed data from host 52 to CU 54 is transmitted along bus 53 for mapping within storage volume 56 without further compression.
With further reference to FIG. 3A, system 50 is shown illustrating a backup of data, e.g., track 58, from storage volume 56 to tape device 60. Data backup is typically performed to safeguard data track 58 in a redundant, second storage volume; though data backup also occurs to manipulate or use storage volume 56 for other purposes while maintaining a copy of the original data track 58. In the data backup operational mode of FIG. 3A, host 52 reads data track 58 from storage volume 56 with read protocol 52a. Read protocol 52a for example takes the form of software that provides host 52 with capability to read the compressed track image 58. When transferring track image data 58 to host 52, CU 54 assigns and tags metadata to image 58. Metadata is shown illustratively in FIG. 4 and is described in more detail below. For purpose of illustration, the backup version of data track 58 is shown as data 60a in tape device 60.
Note that in the mode of operation shown in FIG. 3A, CU 54 does not decompress the data prior to transmission to host 52. Rather, compressed track image 58 is sent to host 52 in compressed form with metadata attached. Nevertheless, CU 54 has capability (i.e., at ASIC 54a) to decompress data for host 52, when commanded.
FIG. 3B shows system 50 restoring data 60a to storage volume 56; except that host commands change the location of the stored track image to location 62. Specifically, in achieving one object of the invention, a user of system 50 has the choice to relocate compressed data 60a to track location 62 through the metadata assigned to the track image 58. Host 52 writes data 60a into storage volume 56 with write protocol 52b. Write protocol 52a for example takes the form of software that provides host 52 with capability to write - and hence "restore" - compressed track images into storage volume 56. Specifically, system 50 restores compressed data 60a to location 62 via the metadata assigned to data 60a. Location 62 is specified at host 52 by user commands.
Those skilled in the art should appreciate that the restore operation need not occur within the original storage volume 56. For purpose of illustration, FIG. 3B also shows an alternative storage volume 70 connected to CU 54 via data bus 72. If for example original data track 58 was stored in cylinder y in volume 56, compressed data associated with track 58 can for example be restored to track 74, cylinder v, in volume 70. Cylinders y and v represent any one cylinder within the associated storage volume 56, 70. As above, restoring data 60a to volume 56 or 70, and at selected cylinders and tracks within these volumes, is preferably determined by commands at host 52.
FIG. 4 illustrates data extent 58 and metadata 82 transferred to host 52 in the above operational mode of FIG. 3A. In that process, data extent 58 is preceded by metadata 82 which "self-defines" track image 58. Metadata 82 provides for relocation of compressed tracks by creating a description of the compressed track that is invariant with its currently assigned location within volume 56. When host 52 writes the compressed track back into storage volume 54, write protocol 52a uses the metadata to "restore" the track to the newly assigned location (e.g., location 62 or 74), if desired,
Whenever a compressed track is decompressed by ASIC 54a, CU 54 appends a seed value 84 to the metadata as an identification tag for data extent 58. After the compressed track is written to a new location - e.g., to location 62 or 74, FIG. 4 - the identification tag remains valid across the relocation even during the decompression process.
CU 54 understands when host 52 transmits compressed data with metadata 82. Even if host 52 originally transmits compressed images to CU 54, relocation such as described above can still occur though the metadata 82; and relocation can be accomplished through a single read and write operation by host 52. Seed value 84 establishes that the track is consistent such as through cyclic redundancy checking ("CRC"), known in the art.
In one embodiment, the invention utilizes the count field (known in the art), which specifies information such as cylinder, head and record numbers. In multi-virtual storage ("MVS") environments, the count field can be specified to identify the origination of each record. In accord with this embodiment, the CU modifies the count field, as appropriate, to specify the data extent as cylinder and head numbers for the redirected compressed track in each count field. Accordingly, the CU translates the data extents and modifies the count field based upon what is transferred by the CU. For example, if the count field describes the cylinder and head numbers of the extent within the tape drive 60, the CU moves the data into that extent and modifies the cylinder and head numbers, if needed, based upon the appropriate mapping. The count field can also be used in rerouting tracks to and from the CU.
This is significantly different from prior art systems which simply make a one-to-one copy of data extents, and count fields, into memory. Those skilled in the art should appreciate that certain systems do not utilize the count field and that other identifiers can be used to locate the origination of the data extent. In this case, when the count field is not used, the count field is not changed.
Read and Write Host Protocols
In the preferred embodiment of the invention, and with further reference to FIGs. 3-3B, host read protocol 52a sends a series of commands that are chained together during a read process. The first command (Define Extent) specifies the extent (i.e., the range of tracks) over which the read pertains. The second command (Locate record) specifies the start track and the mask of which tracks in a cylinder are read. The second command thus sets the orientation of which track is read first; and the mask dictates which subsequent tracks are read by skipping tracks that do not have the mask bit set.
The next read protocol commands are the Read Track Commands. There can, for example, be up to fifteen read track commands after the Locate Record Command, since there typically are fifteen tracks per cylinder.
Once these tracks are read into host 52, they can be written back to CU 54 (or into tape device 60) in a selected location by write protocol 52b. The write chain of commands starts with a Define Extent command, specifying the extent (i.e., the range of tracks) over which the write pertains. The second command (Locate Record Command) specifies the start track. A mask is not used in the write protocol process; and the chain of write commands may be as long as the extent. The next commands are a Write Command followed by one or more Write Next Track Commands. The first track that is sent by the Write Command is written to the location specified by the Locate Record Command. It does not matter where the track was read from, nor what its count field is, as tne track is written to the location according to the Locate Record Command.
The Write Next Track Command writes data to the track that is incremental to the prior oriented track (i.e., the 'next' track ). This track and subsequent tracks are written to the orientation specified by their location in the chain relative to the Locate Record Command, without regard to where they were read from or to the count field specified within the write data. An alternative is to use the count field to place the tracks, without regard to where they were read from or to the orientation specified by a location relative to the Locate Record domain.
The invention thus attains the objects set forth above, among those apparent from preceding description. Since certain changes may be made in the above systems and methods without departing from the scope of the invention, it is intended that all matter contained in the above description or shown in the accompanying drawing be interpreted as illustrative and not in a limiting sense.
In view of the foregoing, what is claimed is:

Claims

1. A method for relocating data to a storage volume, comprising the steps of:
storing a compressed track image of the data at a first location in a first storage volume;
tagging metadata to the compressed track image;
transmitting the compressed track image to a host; and
storing the compressed track image at a second location in a second storage volume according to the metadata.
2. A method of claim 1, further comprising the step of backing up the compressed track image in a storage device after transmitting the compressed track image to a host.
3. A method of claim 1, further comprising compressing the data prior to storing the compressed track image at the first location.
4. A method of claim 1, wherein the step of storing the compressed track image comprises storing the compressed track image to a location corresponding to the first location.
5. A method of claim 1, wherein the second storage volume corresponds to the first storage volume.
6. A method of claim 1, further comprising the steps of decompressing the compressed track image and assigning a seed value to the metadata, wherein the data remains identifiable across relocation.
7. A method of claim 1, wherein the step of storing comprises storing the compressed track image at a second location with a compatible format size as the compressed track image at the first location.
8. A method of claim 1, further comprising decompressing the compressed track image.
9. A method of claim 1, further comprising the step of automatically modifying a count field of the compressed track image to match the second location.
10. A method of claim 1, wherein the step of transmitting is preceded by the step of reading the compressed track image by a Read Protocol.
11. A method of claim 10, wherein the Read Protocol comprises executing one or more of Define Extent, Locate Record and Read Track Commands.
12. A method of claim 1, wherein the step of storing comprises the step of writing the compressed track by a Write Protocol.
13. A method of claim 12, wherein the Write Protocol comprises executing one or more of Define Extent, Locate Record, Write, and Write Next Commands.
14. A method of claim 12, wherein the step of writing by a Write Protocol comprises the step of specifying the second location through a count field preceding the compressed track image.
15. A system for backup and relocation of compressed track images, comprising:
a host; and
a control unit for storing data, as a compressed track image, from the host and into a first location of a first storage volume, the control unit responding to a read command from the host to transmit the compressed track image to the host and assigning metadata to the image, the control unit responding to a read command from the host to store the compressed track image into a second location of a second storage volume.
16. A system according to claim 15, further comprising a compression and decompression circuit for compressing and alternatively decompressing the data, the control unit compressing uncompressed data from the host and responding to host commands to decompress data selectively.
17. A system according to claim 15, further comprising a compression and decompression circuit for compressing and alternatively decompressing the data.
18. A system according to claim 15, further comprising a backup storage device, the host transmitting the compressed track image to the device for backup of the data.
19. A system according to claim 15, wherein the host comprises read protocol for reading the compressed track image from the control unit.
20. A system according to claim 15, wherein the host comprises write protocol for writing the compressed track image to the control unit.
21. A system according to claim 15, wherein the control unit comprises means for assigning a seed value to the metadata, wherein the data remains identifiable across relocation.
PCT/US2000/005529 1999-03-03 2000-03-02 System and method for relocation of compressed data tracks WO2000052577A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US26182699A 1999-03-03 1999-03-03
US09/261,826 1999-03-03

Publications (1)

Publication Number Publication Date
WO2000052577A1 true WO2000052577A1 (en) 2000-09-08

Family

ID=22995052

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/005529 WO2000052577A1 (en) 1999-03-03 2000-03-02 System and method for relocation of compressed data tracks

Country Status (1)

Country Link
WO (1) WO2000052577A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110347535A (en) * 2019-07-16 2019-10-18 北京谷数科技有限公司 A method of promoting dual controller write performance

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5555371A (en) * 1992-12-17 1996-09-10 International Business Machines Corporation Data backup copying with delayed directory updating and reduced numbers of DASD accesses at a back up site using a log structured array data storage

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5555371A (en) * 1992-12-17 1996-09-10 International Business Machines Corporation Data backup copying with delayed directory updating and reduced numbers of DASD accesses at a back up site using a log structured array data storage

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110347535A (en) * 2019-07-16 2019-10-18 北京谷数科技有限公司 A method of promoting dual controller write performance

Similar Documents

Publication Publication Date Title
US11199978B2 (en) Data protection with multiple site replication
US8285898B2 (en) Method and system for compression of files for storage and operation on compressed files
US7970965B2 (en) Method and system for compression of data for block mode access storage
US5630092A (en) System and method for transferring compressed and uncompressed data between storage systems
KR101804034B1 (en) Apparatus, system, and method for converting a storage request into an append data storage command
US5394534A (en) Data compression/decompression and storage of compressed and uncompressed data on a same removable data storage medium
US7716438B2 (en) Storage system and back-up method for storage system
US6751714B2 (en) Systems and methods for relocation of compressed data tracks
KR100216146B1 (en) Data compression method and structure for a direct access storage device
US7877567B2 (en) Transporting image files
US20060190643A1 (en) Method and system for compression of data for block mode access storage
US6351792B1 (en) Selective remote storage copy system and methods
US6460123B1 (en) Mirroring computer data
WO2007099636A1 (en) File system migration method, program and apparatus
US20050289309A1 (en) Storage control system and storage control method
US7062603B2 (en) External storage device for selectively storing data between first and second recording media
JP2001125815A (en) Back-up data management system
WO2000052577A1 (en) System and method for relocation of compressed data tracks
US6910214B1 (en) Method, system, and program for converting an input parameter list into an output parameter list
JP2000339855A (en) Data storage system
JP2004199711A (en) Semiconductor disk device
JP2004158047A (en) Semiconductor disk device
JPH10257577A (en) Controller with function memory

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase