WO2012071719A1 - Method, apparatus and system for accessing data in cloud storage - Google Patents

Method, apparatus and system for accessing data in cloud storage Download PDF

Info

Publication number
WO2012071719A1
WO2012071719A1 PCT/CN2010/079327 CN2010079327W WO2012071719A1 WO 2012071719 A1 WO2012071719 A1 WO 2012071719A1 CN 2010079327 W CN2010079327 W CN 2010079327W WO 2012071719 A1 WO2012071719 A1 WO 2012071719A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
cloud storage
original
check value
original data
Prior art date
Application number
PCT/CN2010/079327
Other languages
French (fr)
Chinese (zh)
Inventor
刘慧�
Original Assignee
北京卓微天成科技咨询有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京卓微天成科技咨询有限公司 filed Critical 北京卓微天成科技咨询有限公司
Priority to US13/000,575 priority Critical patent/US8347184B2/en
Publication of WO2012071719A1 publication Critical patent/WO2012071719A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/20Network architectures or network communication protocols for network security for managing network security; network security policies in general

Definitions

  • the present invention relates to the field of cloud storage technologies, and in particular, to a method, device and system for data access of cloud storage.
  • DAS direct attached storage, direct attached storage
  • SAN Storage Area Network
  • NAS network acces s server
  • the other is a multi-party shared architecture, the cloud storage architecture. According to the scope of its services, it is divided into private cloud (public cloud) and public cloud (public cloud).
  • the cloud storage architecture is based on network technologies (internet and intranet) that provide storage space for on-demand purchases, leases, and on-demand provisioning services.
  • This service is typically provided by third parties (or third-party departments within the enterprise) including storage devices and specialized Maintenance personnel.
  • third parties or third-party departments within the enterprise
  • enterprises or departments within the enterprise
  • the users of the above mentioned storage may be individuals, enterprises, or even departments or branches within the enterprise.
  • Public cloud storage services can reduce the cost of enterprise and personal storage, increase the availability of data, and even the data in the cloud storage service provider's data center is lost or tombed. This risk allows businesses or individuals to use public cloud storage services. In particular, key data related to its business data is prohibitive. For example, the cloud storage service provider is bankrupt or other reasons, so that the cloud storage service provided by it is no longer available, which leads to the risk of data being stored on it; or the enterprise chooses some smaller cloud storage service due to the price of the service.
  • cloud storage service providers' data center data is lost; or for some reason, such as power outages, cloud storage service providers Data is not available (generally cloud storage service providers promise 99.99% or 99.999% availability of their services); or data center data loss due to human error due to corporate mergers or other reasons; and viruses or hackers The resulting cloud storage data center data is lost or altered by the tomb.
  • the RAI D mechanism is used for data protection of the data receiving end for network storage, and data redundancy is created by the data check value at the data receiving end to protect data loss caused by storage hardware damage; the RA ID mechanism is limited to the interconnected, usually in one The storage medium in the data center creates data redundancy; the RAI D protection scope has limitations.
  • the traditional RAI D technology is usually deployed on the storage medium of the cloud storage data center, and can only protect the data loss caused by the physical damage of the storage medium. It can be seen that the RA ID mechanism is mainly used to prevent data loss caused by damage of storage hardware levels such as hard disks, tapes, and the like.
  • the object of the present invention is to provide a method, device and system for data access of cloud storage, which are to solve the problem that the existing cloud storage method cannot effectively resist the data caused by the short-term unavailability of the cloud storage service caused by the disaster of the non-storage hardware level. Temporarily unable to retrieve, data loss and the risk of being altered by the tomb.
  • the invention provides a method for data access of cloud storage, the method comprising data storage and data retrieval,
  • the storing step of the data includes:
  • the step of retrieving the data includes:
  • the original data is restored based on the data check value of the original data and the data used to create the data check value of the original data.
  • the present invention provides an apparatus for data access of a cloud storage, the apparatus comprising a storage module of data and a retrieval module of data, the storage module of the data comprising:
  • a data check value creating unit configured to create a data check value of the original data to be stored at the sending end of the data
  • a data transmission unit configured to separately transmit the data verification values created by the original data and the data verification value creation unit to different cloud storage data center storage according to a preset redundancy storage policy
  • the data retrieval module includes:
  • the access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
  • a data retrieval unit configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
  • the present invention also provides a system for data access of a cloud storage, the system comprising a data storage module, a data retrieval module, and a cloud storage data center, the data storage module comprising:
  • a data check value creating unit configured to create a data check value of the original data to be stored at the sending end of the data
  • a data transmission unit configured to convert the original data and data according to a preset redundant storage policy
  • the data check values created by the check value creation unit are respectively transmitted to different cloud storage data center storages
  • the data retrieval module includes:
  • the access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
  • a data retrieval unit configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
  • the invention creates a data check value of the original data to be stored by using a logical operation on the data sending end, and stores the original data to be stored and the data check value according to a preset redundant storage policy in different cloud storage data centers, in the cloud.
  • Storage data center level creates redundancy of data, greatly increasing the availability and fault tolerance of cloud storage data.
  • FIG. 1 is a flowchart of a method for data access of a cloud storage according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a data redundancy storage method according to an embodiment of the present invention
  • FIG. 3 is another data provided by an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of a device for data access of a cloud storage according to an embodiment of the present invention
  • FIG. 5 is a schematic structural diagram of a system for data access of a cloud storage according to an embodiment of the present invention.
  • the data check value of the original data to be stored is created on the data sending end, and the original data and the data check value to be stored are respectively stored in different cloud storage data centers according to the preset redundant storage policies. Store data center level to create redundancy of data.
  • an embodiment of the present invention provides a method for data access of a cloud storage, where the method includes a data storage step S1 00 and a data retrieval step S200;
  • the data storage step S 1 00 specifically includes:
  • Step S 1 01 creating a data check value of the original data to be stored at the data sending end; in actual applications, the original data may include part of data of the file or the file; the number of the number of data can be stored in the cloud by the number of data centers Divisible, if it cannot be divisible, it needs to be complemented by other data; the data check value of the original data can be, but is not limited to, created by XOR logical operation, and correspondingly, the original data is restored by XOR logical operation;
  • the data used to store the data check value of the original data includes the original data to be stored and at least one fixed-length binary number, for example: X-OR logical operation can be created using the original data and a binary number 1 001 having 4 bits Data check value, or use the original data to perform XOR logical operations with two binary digits of 1 001 00, 001 000 with 6 bits to create a data check value, etc.
  • the user can create one or more according to actual needs. Data check value of raw data; input
  • the original data and the data check value are respectively stored in different cloud storage data centers.
  • the user can formulate a corresponding redundancy storage policy according to his actual needs, and the original data and the data check value are determined according to the redundancy.
  • Store the policy to store; the created data can be stored in the same cloud storage data center or stored in different cloud storage data centers; usually, the user will store the created data check values in different Cloud storage data center to prevent two or more cloud storage services from being unavailable at the same time;
  • the data retrieval step S200 specifically includes:
  • Step S201 accessing the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request;
  • Step S202 determining whether the original data stored in the cloud storage data center is available, if not, executing step S203, if yes, executing step S205;
  • Step S203 Retrieving data check values of the original data and data used to create data check values of the original data from other cloud storage data centers; Step S204: Restore the original data according to the data check value and the data used to create the data check value of the original data;
  • Step S205 Retrieving the original data from the cloud storage data center
  • the exclusive OR logic is also used.
  • the operation method restores the original data
  • an embodiment of the present invention provides an apparatus for data access of a cloud storage, where the apparatus includes: a data storage module 20 and a data retrieval module 30;
  • the data storage module 20 includes a data check value creation unit 21 and a data transfer unit 22;
  • the data retrieval module 30 includes an access determination unit 33, a data retrieval unit 31, and a data restoration unit 32;
  • the data check value creating unit 21 creates the data check value of the original data to be stored at the transmitting end of the data, and the data transfer unit 22 sets the original data and the data check value according to the preset redundant storage policy.
  • the data check values created by the creating unit 21 are respectively transmitted to different cloud storage data centers for storage;
  • the access judging unit 33 accesses the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request, and determines whether the original data stored in the cloud storage data center is available; when the cloud storage When the original data of the data center is unavailable, the data retrieval unit 31 retrieves the data of the original data from other cloud storage data centers. The value used and the data used to create the data check value of the original data; the data restore unit 32 uses the data check value of the original data retrieved by the data retrieval unit 31 and the data check value used to create the original data. The data restores the raw data.
  • an embodiment of the present invention further provides a system for data access of a cloud storage, where the system includes a data storage module 20, a data retrieval module 30, and a cloud storage data center 40.
  • the data storage module 20 includes The data check value creation unit 21 and the data transfer unit 22;
  • the data retrieval module 30 includes an access determination unit 33, a data retrieval unit 31, and a data restoration unit 32;
  • the data check value creating unit 21 creates the data check value of the original data to be stored at the transmitting end of the data, and the data transfer unit 22 sets the original data and the data check value according to the preset redundant storage policy.
  • the data check values created by the creating unit 21 are respectively transmitted to different cloud storage data centers 40 for storage;
  • the access judging unit 33 accesses the cloud storage data center 40 storing the original data according to the preset redundant storage policy according to the access request, and determines whether the original data stored in the cloud storage data center 40 is available;
  • the data retrieval unit 31 retrieves the data check value of the original data and the data used to create the data check value of the original data from the other cloud storage data center 40;
  • the restoration unit 32 restores the original data based on the data check value of the original data retrieved by the data retrieval unit 31 and the data used to create the data check value of the original data.
  • This example mainly generates data check values by data bit XOR (X0R) logic operations.
  • D3 if Dl, D2, and D3 are not equal in length, they need to be equal to 0 to make them equal.
  • cloud storage data centers There are four different cloud storage data centers available.
  • an exclusive OR logical operation is performed between the original data according to the number of available cloud storage data centers, thereby generating a data check value P1 of the original data to be stored.
  • the above method is recycled until all data check values of the original data to be stored are generated. If the number of the original data to be stored and the corresponding data check value cannot be divisible by the number of the specified cloud storage data center, other data may be used or a certain length of 0, 1 string may be used to complement the generated data to be stored. The data check value of the original data.
  • the method for generating the data check value is mature.
  • the above is only a typical data check value generation method.
  • the user can also generate two or more data check values as needed and store them in different cloud storage data centers. Prevent raw data from two or more cloud storage data centers from being unavailable at the same time.
  • the original data to be stored and the data check value thereof are redundantly stored in different cloud storage data centers according to the number of available cloud storage data centers, thereby achieving resistance.
  • the raw data of any cloud storage data center is not available or the risk of data loss or tomb modification.
  • the user can use the data check value of the original data to assist in restoring the original data.
  • the user can ensure the storage of two or more cloud storage data centers by saving two or more data check values.
  • the initial data is not available, the original data can still be restored.
  • Just saving multiple copies of the data check value will increase the need for storage space, which will increase the cost. Because the data restoration method is similar, it will not be described again.
  • the embodiment of the present invention creates the check value of the original data to be stored on the data sending end, and stores the original data to be stored and the check value thereof in different cloud storage data centers according to preset policies, in different cloud storages. Redundancy of data is created between data centers, thereby realizing risks and improving the availability of public cloud storage service data and the fault tolerance of data.

Abstract

A method, apparatus and system for accessing data in cloud storage are disclosed. Steps of storing data include: creating data check values of original data to be stored in a data transmitting terminal (S101); transmitting the original data and the data check values to different cloud storage data centres for storing in accordance with pre-configured redundancy storing policy (S102). Steps of retrieving data include: based on an access request, accessing the cloud storage data centre storing the original data in accordance with the pre-configured redundancy storing policy (S201); determining whether the original data stored in the cloud storage data centre is available (S202); if no, retrieving the data check value of the original data and data used for creating the data check value from the other cloud storage data centre (S203); recovering the original data according to the data check value and the data used for creating the data check value (S204). This technical solution improves availability and fault tolerance of data in cloud storage.

Description

一种云存储的数据存取的方法、 装置及系统  Method, device and system for data access of cloud storage
技术领域 Technical field
本发明属于云存储技术领域, 特别涉及一种云存储的数据存取的方 法、 装置及系统。  The present invention relates to the field of cloud storage technologies, and in particular, to a method, device and system for data access of cloud storage.
背景技术 Background technique
数据已经被证明是企业重要资产之一, 数据的高速增长使企业面临 前所未有的挑战。 同时, 瞬息万变的世界经济形势和激烈竟争带来的成 本压力, 使企业不得不考虑如何以降低 IT成本, 应对企业不断增长的存 储需求。  Data has proven to be one of the company's key assets, and the rapid growth of data has made companies face unprecedented challenges. At the same time, the ever-changing world economic situation and the cost pressures of fierce competition have forced companies to consider how to reduce IT costs and respond to the growing storage needs of enterprises.
现有的存储架构可以划分为两种: 一种是由一方专有的架构, 比如 Existing storage architectures can be divided into two types: one is a proprietary architecture, such as
DAS (direct attached storage, 直接附力口存储), SAN (Storage Area Network, 存者区网给) 和网给接入存者月良务器( Network acces s server, NAS)等。 这类存储体系由一方独占使用, 可以提供给用户很好的控制权, 更好的可靠性和性能, 但是其扩展性较差, 不适用于大规模部署; 用户 在这种模式下也很难灵活使用存储预算 (需要一次性地投入购买存储设 备) ; 随着存储容量的增加, 成本控制也将面临挑战。 DAS (direct attached storage, direct attached storage), SAN (Storage Area Network) and network access to the network acces s server (NAS). This kind of storage system is used exclusively by one party, which can provide users with good control, better reliability and performance, but its scalability is poor, and it is not suitable for large-scale deployment; users are also difficult in this mode. Flexible use of storage budgets (requires one-time purchase of storage devices); as storage capacity increases, cost control will also face challenges.
另一种是多方共享架构, 即云存储架构。 按照其服务范畴的不同, 分 为私有云 ( rivate cloud ) 和公共云 (public cloud ) 。 云存储的体系 结构基于网络技术( internet和 intranet ) , 为用户提供存储空间按需 购买、 租赁和按需配置服务, 该服务通常由第三方 (或企业内第三方部 门)提供包括存储设备和专门维护人员。 通过该存储服务, 企业 (或企 业内各部门) 可以大幅降低其内部存储器的需求和相应的管理成本, 以 平衡急剧上升的存储需求和企业成本压力。 以上所提存储的使用方可以 是个人、 企业, 甚至企业内的部门或者分支机构等。 除了服务对象和服务范畴不同, 公共云存储系统和私有云存储, 及 企业内传统的网络备份系统不同之处在于, 云存储服务商 (数据中心) 之间相对独立, 限于安全性考虑及数据兼容性问题, 它们之间很少有信 息交换和通讯的需求。 The other is a multi-party shared architecture, the cloud storage architecture. According to the scope of its services, it is divided into private cloud (public cloud) and public cloud (public cloud). The cloud storage architecture is based on network technologies (internet and intranet) that provide storage space for on-demand purchases, leases, and on-demand provisioning services. This service is typically provided by third parties (or third-party departments within the enterprise) including storage devices and specialized Maintenance personnel. Through this storage service, enterprises (or departments within the enterprise) can significantly reduce their internal storage requirements and corresponding management costs to balance the sharply rising storage requirements and enterprise cost pressures. The users of the above mentioned storage may be individuals, enterprises, or even departments or branches within the enterprise. In addition to different service objects and service categories, public cloud storage systems and private cloud storage, and traditional network backup systems in the enterprise, are different in that cloud storage service providers (data centers) are relatively independent, limited to security considerations and data compatibility. Sexual problems, there is little need for information exchange and communication between them.
公共云存储服务虽然可以降低企业及个人存储的成本, 提高了数据 可用, 甚至云存储服务商的数据中心中数据丟失或者被墓改, 这种风险 使得企业或个人对使用公共云存储服务时, 特别是涉及其业务数据等关 键数据, 望而却步。 比如云存储服务提供商破产或者其他原因导致其提 供的云存储服务不再可用, 进而导致存放在其上的数据存在丟失的风险; 或企业因服务价格的原因选择某些较小的云存储服务提供商, 但其因经 营不善而破产; 或由于不可抗拒的原因, 如地震或者重大灾害, 云存储 服务商的数据中心的数据丟失; 或由于某种原因如电力中断等, 云存储 服务商中的数据不可用 (一般云存储服务商承诺其服务有 99. 99%或者 99. 999%的可用性) ; 或由于企业并购或其他原因, 人为操作失误导致的 数据中心数据丟失; 以及病毒或者黑客入侵, 导致的云存储数据中心数 据丟失或者被墓改。  Public cloud storage services can reduce the cost of enterprise and personal storage, increase the availability of data, and even the data in the cloud storage service provider's data center is lost or tombed. This risk allows businesses or individuals to use public cloud storage services. In particular, key data related to its business data is prohibitive. For example, the cloud storage service provider is bankrupt or other reasons, so that the cloud storage service provided by it is no longer available, which leads to the risk of data being stored on it; or the enterprise chooses some smaller cloud storage service due to the price of the service. Provider, but it is bankrupt due to poor management; or due to irresistible causes, such as earthquakes or major disasters, cloud storage service providers' data center data is lost; or for some reason, such as power outages, cloud storage service providers Data is not available (generally cloud storage service providers promise 99.99% or 99.999% availability of their services); or data center data loss due to human error due to corporate mergers or other reasons; and viruses or hackers The resulting cloud storage data center data is lost or altered by the tomb.
鉴于数据对企业的重要性或由于法律的要求, 增加公共云存储系统 数据的容错性是很必要的, 特别是企业或者个人的关键数据。  Given the importance of data to the enterprise or due to legal requirements, it is necessary to increase the fault tolerance of public cloud storage system data, especially for companies or individuals.
传统用于提高公共云存储服务的可用性和数据的容错性, 通常依赖 于企业与云存储服务提供商签订服务协议(SLA )及在云存储数据中心端 的 RAID机制(主要是 RAI D 2 - RAID6)。 参考文献见 Dav id A. Pa t t er son, Gar th G i bson, and Randy H. Ka t z: A Ca se for Redundant Arrays of Inexpens ive Di sks (RAID) . ACM 1988。 RAID 2 - RAID6通过存放(传输 到存储介质)数据的校验值, 使数据保存有充分冗余, 进而保证了在存 储介质硬件损害时, 不会导致数据丟失。 Traditionally used to improve the availability of public cloud storage services and the fault tolerance of data, usually rely on the enterprise to sign a service agreement (SLA) with the cloud storage service provider and the RAID mechanism on the cloud storage data center (mainly RAI D 2 - RAID6) . For references, see Dav id A. Pa tt er son, Gar th G i bson, and Randy H. Ka tz: A Ca se for Redundant Arrays of Inexpensive Di sks (RAID) . ACM 1988. RAID 2 - RAID6 ensures sufficient storage by storing the checksum value of the data transferred to the storage medium. When the storage medium hardware is damaged, it will not cause data loss.
RAI D机制对于网络存储多用于数据接收端的数据保护, 在数据接收 端通过数据校验值创建数据冗余, 保护因存储硬件损害导致的数据丟失; RA I D机制仅限于对互联的、 通常在一个数据中心内的存储介质创建数据 冗余; RAI D保护范围有局限性, 传统的 RAI D技术因通常部署在云存储数 据中心的存储介质上, 只能保护因存储介质的物理损坏导致的数据丟失, 可见 RA I D机制主要是用来防范存储硬件级别如硬盘、 磁带等的损害导致 的数据丟失。  The RAI D mechanism is used for data protection of the data receiving end for network storage, and data redundancy is created by the data check value at the data receiving end to protect data loss caused by storage hardware damage; the RA ID mechanism is limited to the interconnected, usually in one The storage medium in the data center creates data redundancy; the RAI D protection scope has limitations. The traditional RAI D technology is usually deployed on the storage medium of the cloud storage data center, and can only protect the data loss caused by the physical damage of the storage medium. It can be seen that the RA ID mechanism is mainly used to prevent data loss caused by damage of storage hardware levels such as hard disks, tapes, and the like.
为了抵御非存储硬件级别的灾害引起的云存储服务短暂不可用时造 成的数据短暂无法取回, 用户数据丟失及被墓改的风险, 有必要创立一 种新的提高公共云存储服务的可用性和数据的容错性的方法。  In order to protect against the short-term inaccessibility of data storage services caused by disasters caused by non-storage hardware-level disasters, user data loss and the risk of being modified, it is necessary to create a new public cloud storage service availability and data. The method of fault tolerance.
发明内容 Summary of the invention
本发明的目的在于提供一种云存储的数据存取的方法、 装置及系统, 旨在解决现有云存储方法无法有效抵御因非存储硬件级别的灾害引起的 云存储服务短暂不可用时造成的数据暂时无法取回, 数据丟失及被墓改 的风险的问题。  The object of the present invention is to provide a method, device and system for data access of cloud storage, which are to solve the problem that the existing cloud storage method cannot effectively resist the data caused by the short-term unavailability of the cloud storage service caused by the disaster of the non-storage hardware level. Temporarily unable to retrieve, data loss and the risk of being altered by the tomb.
本发明提供了一种云存储的数据存取的方法, 所述方法包括数据的 存储与数据的取回,  The invention provides a method for data access of cloud storage, the method comprising data storage and data retrieval,
所述数据的存储步骤包括:  The storing step of the data includes:
在数据的发送端创建待存储原始数据的数据校验值; 同的云存储数据中心存储;  Creating a data check value of the original data to be stored at the sending end of the data; the same cloud storage data center storage;
所述数据的取回步骤包括:  The step of retrieving the data includes:
根据访问请求按预置的冗余存放策略访问存储原始数据的云存储数 据中心;  Accessing the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request;
判断所述存储原始数据的云存储数据中心是否可用, 如果不可用, 则从其它云存储数据中心取回所述原始数据的数据校验值和创建所述原 始数据的数据校验值所使用的数据; Determining whether the cloud storage data center storing the original data is available, if not available, Retrieving data check values of the original data and data used to create data check values of the original data from other cloud storage data centers;
根据所述原始数据的数据校验值及创建所述原始数据的数据校验值 所使用的数据还原所述原始数据。  The original data is restored based on the data check value of the original data and the data used to create the data check value of the original data.
本发明提供了一种云存储的数据存取的装置, 所述装置包括数据的 存储模块及数据的取回模块, 所述数据的存储模块包括:  The present invention provides an apparatus for data access of a cloud storage, the apparatus comprising a storage module of data and a retrieval module of data, the storage module of the data comprising:
数据校验值创建单元, 用于在数据的发送端创建待存储原始数据的 数据校验值;  a data check value creating unit, configured to create a data check value of the original data to be stored at the sending end of the data;
数据传输单元, 用于按预置的冗余存放策略将所述原始数据及数据 校验值创建单元创建的数据校验值分别传输至不同的云存储数据中心存 储;  a data transmission unit, configured to separately transmit the data verification values created by the original data and the data verification value creation unit to different cloud storage data center storage according to a preset redundancy storage policy;
所述数据的取回模块包括:  The data retrieval module includes:
访问判断单元, 用于根据访问请求按预置的冗余存放策略访问存储 原始数据的云存储数据中心, 以及判断云存储数据中心存储的原始数据 是否可用, 发送判断结果;  The access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
数据取回单元, 用于根据所述访问判断单元发送的所述云存储数据 中心存储的原始数据不可用的判断结果, 从其它云存储数据中心取回所 述原始数据的数据校验值及创建所述原始数据的数据校验值所使用的数 据; 校验值及创建所述原始数据的数据校验值所使用的数据还原所述原始数 据。  a data retrieval unit, configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
本发明还提供了一种云存储的数据存取的系统, 所述系统包括数据 的存储模块、 数据的取回模块和云存储数据中心, 所述数据的存储模块 包括:  The present invention also provides a system for data access of a cloud storage, the system comprising a data storage module, a data retrieval module, and a cloud storage data center, the data storage module comprising:
数据校验值创建单元, 用于在数据的发送端创建待存储原始数据的 数据校验值;  a data check value creating unit, configured to create a data check value of the original data to be stored at the sending end of the data;
数据传输单元, 用于按预置的冗余存放策略将所述原始数据及数据 校验值创建单元创建的数据校验值分别传输至不同的云存储数据中心存 储; a data transmission unit, configured to convert the original data and data according to a preset redundant storage policy The data check values created by the check value creation unit are respectively transmitted to different cloud storage data center storages;
所述数据的取回模块包括:  The data retrieval module includes:
访问判断单元, 用于根据访问请求按预置的冗余存放策略访问存储 原始数据的云存储数据中心, 以及判断云存储数据中心存储的原始数据 是否可用, 发送判断结果;  The access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
数据取回单元, 用于根据所述访问判断单元发送的所述云存储数据 中心存储的原始数据不可用的判断结果, 从其它云存储数据中心取回所 述原始数据的数据校验值及创建所述原始数据的数据校验值所使用的数 据; 校验值及创建所述原始数据的数据校验值所使用的数据还原所述原始数 据。  a data retrieval unit, configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
本发明通过在数据发送端运用逻辑运算创建待存储原始数据的数据 校验值, 将待存储原始数据及数据校验值按预置的冗余存放策略存储于 不同的云存储数据中心, 在云存储数据中心级别创建数据的冗余, 从而 大大地提高了云存储数据的可用性和容错性。  The invention creates a data check value of the original data to be stored by using a logical operation on the data sending end, and stores the original data to be stored and the data check value according to a preset redundant storage policy in different cloud storage data centers, in the cloud. Storage data center level creates redundancy of data, greatly increasing the availability and fault tolerance of cloud storage data.
附图说明 DRAWINGS
图 1是本发明实施例提供的云存储的数据存取的方法流程图; 图 2是本发明实施例提供的数据冗余存放方法的原理示意图; 图 3是本发明实施例提供的另一数据冗余存放方法的原理示意图; 图 4是本发明实施例提供的云存储的数据存取的装置结构示意图; 图 5是本发明实施例提供的云存储的数据存取的系统结构示意图。 具体实施方式  1 is a flowchart of a method for data access of a cloud storage according to an embodiment of the present invention; FIG. 2 is a schematic diagram of a data redundancy storage method according to an embodiment of the present invention; FIG. 3 is another data provided by an embodiment of the present invention. FIG. 4 is a schematic structural diagram of a device for data access of a cloud storage according to an embodiment of the present invention; and FIG. 5 is a schematic structural diagram of a system for data access of a cloud storage according to an embodiment of the present invention. detailed description
为了深入了解本发明, 下面结合附图及具体实施例对本发明进行详细说 明。 In order to further understand the present invention, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments. Bright.
本发明实施例通过在数据发送端创建待存储原始数据的数据校验 值, 将待存储原始数据及数据校验值按预置的冗余存放策略分别存储于 不同的云存储数据中心, 在云存储数据中心级别创建数据的冗余。  In the embodiment of the present invention, the data check value of the original data to be stored is created on the data sending end, and the original data and the data check value to be stored are respectively stored in different cloud storage data centers according to the preset redundant storage policies. Store data center level to create redundancy of data.
参见图 1 , 本发明实施例提供了一种云存储的数据存取的方法, 该方 法包括数据的存储步骤 S 1 00及数据的取回步骤 S 200 ;  Referring to FIG. 1, an embodiment of the present invention provides a method for data access of a cloud storage, where the method includes a data storage step S1 00 and a data retrieval step S200;
其中, 数据的存储步骤 S 1 00具体包括:  The data storage step S 1 00 specifically includes:
步骤 S 1 01 : 在数据发送端创建待存储原始数据的数据校验值; 在实际应用中, 原始数据可以包括文件或者文件的部分数据; 在数 的个数能被云存储数据中心的个数整除, 如果不能被整除, 则需要用其 它数据补足; 原始数据的数据校验值可以但不限于通过异或逻辑运算创 建, 相应地, 通过异或逻辑运算还原原始数据; 通常情况下, 创建待存 储原始数据的数据校验值所使用的数据包括待存储的原始数据以及至少 一个固定长度的二进制数, 例如: 可以使用原始数据与一个具有 4 个比 特的二进制数 1 001进行异或逻辑运算创建数据校验值, 或者使用原始数 据与二个具有 6个比特位的二进制数 1 001 00、 001 000进行异或逻辑运算 创建数据校验值等; 另外, 用户可以根据实际需要创建一个或多个原始 数据的数据校验值; 输至不同的云存储数据中心存储;  Step S 1 01: creating a data check value of the original data to be stored at the data sending end; in actual applications, the original data may include part of data of the file or the file; the number of the number of data can be stored in the cloud by the number of data centers Divisible, if it cannot be divisible, it needs to be complemented by other data; the data check value of the original data can be, but is not limited to, created by XOR logical operation, and correspondingly, the original data is restored by XOR logical operation; The data used to store the data check value of the original data includes the original data to be stored and at least one fixed-length binary number, for example: X-OR logical operation can be created using the original data and a binary number 1 001 having 4 bits Data check value, or use the original data to perform XOR logical operations with two binary digits of 1 001 00, 001 000 with 6 bits to create a data check value, etc. In addition, the user can create one or more according to actual needs. Data check value of raw data; input to different cloud storage data centers Storage;
原始数据及数据校验值分别存储于不同的多个云存储数据中心, 具 体地, 用户可以依据自己的实际需要制定相应的冗余存放策略, 将原始 数据及数据校验值按制定的冗余存放策略来存放; 创建的数据经校验值 可以存储于同一个云存储数据中心, 也可以存储于不同的云存储数据中 心; 通常情况下, 用户会将创建的数据校验值分别存储于不同的云存储 数据中心, 以防止两个或者多个云存储服务同时不可用;  The original data and the data check value are respectively stored in different cloud storage data centers. Specifically, the user can formulate a corresponding redundancy storage policy according to his actual needs, and the original data and the data check value are determined according to the redundancy. Store the policy to store; the created data can be stored in the same cloud storage data center or stored in different cloud storage data centers; usually, the user will store the created data check values in different Cloud storage data center to prevent two or more cloud storage services from being unavailable at the same time;
原始数据及其校验值的冗余存放策略有很多种, 以下举两个实例: 1、 将所有待存储原始数据的校验值统一存放在一个云存储数据中 心, 如图 2 所示, 假如有四个不同的云存储数据中心可用时, 将第一部 分原始数据 D1存放在云存储服务提供商 1的云存储数据中心中, 依次类 推, 将第二部分原始数据 D2、 第三部分原始数据 D3 分别存放在云存储 服务提供商 2、 3 的云存储数据中心, 最后将所有原始数据的校验值 P1 存放在云存储服务提供商 4 的云存储数据中心; 同理将其它的原始数据 及对应的数据校验值按上述方法依次存储于上述四个云存储数据中心 中; There are many redundant storage strategies for raw data and its checksums. Here are two examples: 1. Store the checksum values of all the original data to be stored in a cloud storage data center. As shown in Figure 2, if there are four different cloud storage data centers available, store the first part of the original data D1 in the cloud storage. In the cloud storage data center of service provider 1, and so on, the second part of the original data D2 and the third part of the original data D3 are respectively stored in the cloud storage data center of the cloud storage service provider 2, 3, and finally all the original data The check value P1 is stored in the cloud storage data center of the cloud storage service provider 4; similarly, other original data and corresponding data check values are sequentially stored in the above four cloud storage data centers according to the above method;
2、 将所有原始数据和对应的数据校验值均勾地混合地存放在各个云 存储数据中心, 如图 3所示, 假如有四个不同的云存储数据中心可用时, 将第一部分原始数据 D1 存放在云存储服务提供商 1 的云存储数据中心 中, 将第二部分原始数据 D2、 第三部分原始数据 D3 分别存放在云存储 服务提供商 2、 3的云存储数据中心中, 最后将所有原始数据的校验值 P1 存放在云存储服务提供商 4的云存储数据中心中; 将原始数据 D4、 D5、 D6及它们的校验值 P2 , 分别存放在云存储数据中心 1、 2、 4和 3; 类似 地, 将原始数据 D7、 D8、 D9及它们的校验值 P3 , 分别存放在云存储数据 中心 1、 3、 4和 2 ; 同理, 其它的原始数据及对应的数据校验值按上述方 法依次存储于上述四个云存储数据中心中;  2. Store all the original data and the corresponding data check values in each cloud storage data center. As shown in Figure 3, if there are four different cloud storage data centers available, the first part of the original data will be The D1 is stored in the cloud storage data center of the cloud storage service provider 1, and the second part of the original data D2 and the third part of the original data D3 are respectively stored in the cloud storage data center of the cloud storage service provider 2, 3, and finally The check value P1 of all the original data is stored in the cloud storage data center of the cloud storage service provider 4; the original data D4, D5, D6 and their check values P2 are respectively stored in the cloud storage data center 1, 2. 4 and 3; similarly, the original data D7, D8, D9 and their check value P3 are respectively stored in the cloud storage data centers 1, 3, 4 and 2; similarly, other original data and corresponding data are corrected. The verification values are sequentially stored in the above four cloud storage data centers according to the above method;
以上仅举出了两种原始数据和数据校验值冗余存放策略, 真实实现 中用户可以根据自己的实际需要采用其他冗余数据存放策略, 例如: 原 始数据和数据校验值交替存放。  The above only cites two original data and data check value redundancy storage strategies. In the real implementation, users can adopt other redundant data storage strategies according to their actual needs. For example, the original data and data check values are stored alternately.
其中, 数据的取回步骤 S200具体包括:  The data retrieval step S200 specifically includes:
步骤 S201 : 根据访问请求按预置的冗余存放策略访问存储原始数据 的云存储数据中心;  Step S201: accessing the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request;
步骤 S202 : 判断云存储数据中心存储的原始数据是否可用, 如果不 可用, 则执行步骤 S203 , 如果可用, 则执行步骤 S205 ;  Step S202: determining whether the original data stored in the cloud storage data center is available, if not, executing step S203, if yes, executing step S205;
步骤 S203 : 从其它云存储数据中心取回原始数据的数据校验值及创 建原始数据的数据校验值所使用的数据; 步骤 S 204 : 根据数据校验值及创建原始数据的数据校验值所使用的 数据还原原始数据; Step S203: Retrieving data check values of the original data and data used to create data check values of the original data from other cloud storage data centers; Step S204: Restore the original data according to the data check value and the data used to create the data check value of the original data;
步骤 S 205 : 从云存储数据中心取回原始数据;  Step S205: Retrieving the original data from the cloud storage data center;
同创建原始数据的数据校验值所使用的计算方法相对应, 本实施例 在根据数据校验值还原因云存储数据中心的原始数据不可用而不能取回 原始数据时, 同样采用异或逻辑运算方法进行原始数据的还原;  Corresponding to the calculation method used to create the data check value of the original data, in this embodiment, when the original data of the cloud storage data center is unavailable according to the data check value, and the original data cannot be retrieved, the exclusive OR logic is also used. The operation method restores the original data;
下面举例说明, 如果数据冗余存放策略采用上述第二种方法, 参见 图 3 , 假定第二个云存储服务商的云存储数据中心的原始数据不可用, 现 在需要根据从另外三个云存储数据中心取回的其它原始数据及数据校验 值来还原存储于第二个云存储数据中心的原始数据, 比如需要恢复原始 数据 D2 , 那么根据从其它云存储数据中心取回的原始数据 Dl、 D3和该原 始数据 D2的校验值 P1 , 按照异或运算法则得到该原始数据 D2 , 即: D2 = D1 θ D 3 θ P 1  The following example shows that if the data redundancy storage strategy adopts the second method described above, refer to Figure 3, assuming that the original data of the cloud storage data center of the second cloud storage service provider is unavailable, it is now necessary to store data from the other three clouds. The other raw data and data check values retrieved by the center are used to restore the original data stored in the second cloud storage data center, for example, the original data D2 needs to be restored, then the original data D1, D3 retrieved from other cloud storage data centers. And the check value P1 of the original data D2, the original data D2 is obtained according to an exclusive OR algorithm, namely: D2 = D1 θ D 3 θ P 1
同理, 也可以还原其他原存放在第二个云存储数据中心的其他原始 数据。  Similarly, other raw data originally stored in the second cloud storage data center can be restored.
参见图 4 , 本发明实施例提供了一种云存储的数据存取的装置, 该装 置包括: 数据的存储模块 20及数据的取回模块 30 ;  Referring to FIG. 4, an embodiment of the present invention provides an apparatus for data access of a cloud storage, where the apparatus includes: a data storage module 20 and a data retrieval module 30;
数据的存储模块 20包括数据校验值创建单元 21及数据传输单元 22 ; 数据的取回模块 30包括访问判断单元 33、 数据取回单元 31及数据还原 单元 32 ;  The data storage module 20 includes a data check value creation unit 21 and a data transfer unit 22; the data retrieval module 30 includes an access determination unit 33, a data retrieval unit 31, and a data restoration unit 32;
在存储数据时, 数据校验值创建单元 21在数据的发送端创建待存储 的原始数据的数据校验值后, 数据传输单元 22按预置的冗余存放策略将 原始数据及数据校验值创建单元 21创建的数据校验值分别传输至不同的 云存储数据中心进行存储;  When the data is stored, the data check value creating unit 21 creates the data check value of the original data to be stored at the transmitting end of the data, and the data transfer unit 22 sets the original data and the data check value according to the preset redundant storage policy. The data check values created by the creating unit 21 are respectively transmitted to different cloud storage data centers for storage;
当有原始数据访问请求时, 访问判断单元 33根据访问请求按预置的 冗余存放策略访问存储原始数据的云存储数据中心, 并判断云存储数据 中心存储的原始数据是否可用; 当该云存储数据中心的该原始数据不可 用时, 数据取回单元 31从其它云存储数据中心取回该原始数据的数据校 验值及创建该原始数据的数据校验值所使用的数据; 数据还原单元 32根 据数据取回单元 31取回的原始数据的数据校验值及创建该原始数据的数 据校验值所使用的数据还原该原始数据。 When there is a raw data access request, the access judging unit 33 accesses the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request, and determines whether the original data stored in the cloud storage data center is available; when the cloud storage When the original data of the data center is unavailable, the data retrieval unit 31 retrieves the data of the original data from other cloud storage data centers. The value used and the data used to create the data check value of the original data; the data restore unit 32 uses the data check value of the original data retrieved by the data retrieval unit 31 and the data check value used to create the original data. The data restores the raw data.
参见图 5 , 本发明实施例还提供了一种云存储的数据存取的系统, 该 系统包括数据的存储模块 20、 数据的取回模块 30和云存储数据中心 40 ; 数据的存储模块 20包括数据校验值创建单元 21及数据传输单元 22 ; 数据的取回模块 30包括访问判断单元 33、 数据取回单元 31及数据还原 单元 32 ;  Referring to FIG. 5, an embodiment of the present invention further provides a system for data access of a cloud storage, where the system includes a data storage module 20, a data retrieval module 30, and a cloud storage data center 40. The data storage module 20 includes The data check value creation unit 21 and the data transfer unit 22; the data retrieval module 30 includes an access determination unit 33, a data retrieval unit 31, and a data restoration unit 32;
在存储数据时, 数据校验值创建单元 21在数据的发送端创建待存储 的原始数据的数据校验值后, 数据传输单元 22按预置的冗余存放策略将 原始数据及数据校验值创建单元 21创建的数据校验值分别传输至不同的 云存储数据中心 40进行存储;  When the data is stored, the data check value creating unit 21 creates the data check value of the original data to be stored at the transmitting end of the data, and the data transfer unit 22 sets the original data and the data check value according to the preset redundant storage policy. The data check values created by the creating unit 21 are respectively transmitted to different cloud storage data centers 40 for storage;
当有原始数据访问请求时, 访问判断单元 33根据访问请求按预置的 冗余存放策略访问存储原始数据的云存储数据中心 40 , 并判断云存储数 据中心 40存储的原始数据是否可用; 当该云存储数据中心 40的该原始 数据不可用时, 数据取回单元 31从其它云存储数据中心 40取回该原始 数据的数据校验值及创建该原始数据的数据校验值所使用的数据; 数据 还原单元 32根据数据取回单元 31取回的原始数据的数据校验值及创建 该原始数据的数据校验值所使用的数据还原该原始数据。  When there is a raw data access request, the access judging unit 33 accesses the cloud storage data center 40 storing the original data according to the preset redundant storage policy according to the access request, and determines whether the original data stored in the cloud storage data center 40 is available; When the original data of the cloud storage data center 40 is unavailable, the data retrieval unit 31 retrieves the data check value of the original data and the data used to create the data check value of the original data from the other cloud storage data center 40; The restoration unit 32 restores the original data based on the data check value of the original data retrieved by the data retrieval unit 31 and the data used to create the data check value of the original data.
为了阐述本发明可行性, 下面举一典型的数据校验值生成实例, 该 例主要通过数据位异或 (X0R ) 逻辑运算生成数据校验值。  In order to illustrate the feasibility of the present invention, a typical example of data check value generation is shown below. This example mainly generates data check values by data bit XOR (X0R) logic operations.
记 a ㊉ b为 a与 b的异或运算, a、 = 0或者 1。 异或运算遵循以 下规则:  Note a ten b is the exclusive OR of a and b, a, = 0 or 1. The XOR operation follows the following rules:
1、 如果 a、 b 相同 (即同为 0 , 或者同为 1 ) , 则 a e b=0; 反之 a θ b=l ;  1. If a and b are the same (that is, both are 0 or the same is 1), then a e b=0; otherwise a θ b=l ;
2、 a θ b θ c = a θ (b θ c) = (a θ b) θ c;  2. a θ b θ c = a θ (b θ c) = (a θ b) θ c;
3、 d=a θ b θ c 可以推出 a=d @ b @ c 在使用公共云存储服务时, 用户存放在第三方云存储数据中心的数 据无论是文件或数据块, 它们在物理上均表现为一串 0、 1值。 3, d = a θ b θ c can be launched a = d @ b @ c When using a public cloud storage service, the data that the user stores in the third-party cloud storage data center, whether it is a file or a data block, physically appears as a string of 0, 1 values.
假定现有三段待备份存储到指定云存储数据中心的原始数据 Dl、 D2、 Assume that there are three existing segments of data to be backed up to the specified cloud storage data center, Dl, D2.
D3 , 如果 Dl、 D2、 D3不等长, 需要通过补足 0位使之等长, 有四个不同 的云存储数据中心可用。 D3, if Dl, D2, and D3 are not equal in length, they need to be equal to 0 to make them equal. There are four different cloud storage data centers available.
在传输到指定的云存储数据中心前, 根据可用的云存储数据中心的 数量在上述原始数据之间做异或逻辑运算, 进而产生该待存储原始数据 的数据校验值 Pl。  Before being transmitted to the designated cloud storage data center, an exclusive OR logical operation is performed between the original data according to the number of available cloud storage data centers, thereby generating a data check value P1 of the original data to be stored.
数据校验值 PI = D1 θ D2 θ D3。  Data check value PI = D1 θ D2 θ D3.
循环使用上述方法直到生成所有待存储原始数据的数据校验值。 如果待存储原始数据及其对应的数据校验值的个数不能被所指定的 云存储数据中心的个数整除时, 可以使用其它的数据或指定一定长度的 0、 1 串补足, 产生待存储原始数据的数据校验值。  The above method is recycled until all data check values of the original data to be stored are generated. If the number of the original data to be stored and the corresponding data check value cannot be divisible by the number of the specified cloud storage data center, other data may be used or a certain length of 0, 1 string may be used to complement the generated data to be stored. The data check value of the original data.
数据校验值的生成方法已经成熟,以上仅为一种典型数据校验值生 成方法, 用户还可以根据需要生成两份甚至多份数据校验值并分别存放 于不同的云存储数据中心, 以防止两个或者多个云存储数据中心的原始 数据同时不可用。  The method for generating the data check value is mature. The above is only a typical data check value generation method. The user can also generate two or more data check values as needed and store them in different cloud storage data centers. Prevent raw data from two or more cloud storage data centers from being unavailable at the same time.
待所有待存储原始数据的校验值生成后, 根据可用的云存储数据中 心的数量, 将待存储原始数据及其数据校验值冗余地存放在不同的云存 储数据中心, 从而可以达到抵御任何一个云存储数据中心的原始数据不 可用或数据丟失或被墓改的风险。  After the checksum values of all the original data to be stored are generated, the original data to be stored and the data check value thereof are redundantly stored in different cloud storage data centers according to the number of available cloud storage data centers, thereby achieving resistance. The raw data of any cloud storage data center is not available or the risk of data loss or tomb modification.
当云存储数据中心的原始数据短暂不可用或者长期不可用时, 用户 就可以通过原始数据的数据校验值辅助以恢复原始数据。  When the raw data of the cloud storage data center is temporarily unavailable or unavailable for a long time, the user can use the data check value of the original data to assist in restoring the original data.
以上所述仅是保存一份数据校验值, 具体实施中用户可以通过保存 两份或者多份数据校验值确保存储在两个或者多个云存储数据中心的原 始数据不可用的时候, 原始数据仍然可以还原。 只是保存多份数据校验 值将增加对存储空间的需求, 从而增加成本。 因数据还原方法类似, 不 再赘述。 The above description only saves a data check value. In the specific implementation, the user can ensure the storage of two or more cloud storage data centers by saving two or more data check values. When the initial data is not available, the original data can still be restored. Just saving multiple copies of the data check value will increase the need for storage space, which will increase the cost. Because the data restoration method is similar, it will not be described again.
本发明实施例通过在数据发送端创建待存储原始数据的校验值, 将 待存储原始数据及其校验值, 按照预置的策略分别存放在不同的云存储 数据中心, 在不同的云存储数据中心之间创建数据的冗余, 从而实现了 风险, 提高了公共云存储服务数据的可用性和数据的容错性。  The embodiment of the present invention creates the check value of the original data to be stored on the data sending end, and stores the original data to be stored and the check value thereof in different cloud storage data centers according to preset policies, in different cloud storages. Redundancy of data is created between data centers, thereby realizing risks and improving the availability of public cloud storage service data and the fault tolerance of data.
以上所述仅为本发明的较佳实施例而已, 并不用以限制本发明, 凡 在本发明的精神和原则之内所作的任何修改、 等同替换和改进等, 均应 包含在本发明的保护范围之内。  The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims

权 利 要 求 书 Claim
1、 一种云存储的数据存取的方法, 所述方法包括数据的存储与数据 的取回, 其特征在于, A method for data access of a cloud storage, the method comprising storing data and retrieving data, wherein:
所述数据的存储步骤包括:  The storing step of the data includes:
在数据的发送端创建待存储原始数据的数据校验值; 同的云存储数据中心存储;  Creating a data check value of the original data to be stored at the sending end of the data; the same cloud storage data center storage;
所述数据的取回步骤包括:  The step of retrieving the data includes:
根据访问请求按预置的冗余存放策略访问存储原始数据的云存储数 据中心;  Accessing the cloud storage data center storing the original data according to the preset redundant storage policy according to the access request;
判断所述云存储数据中心存储的原始数据是否可用, 如果不可用, 则从其它云存储数据中心取回所述原始数据的数据校验值和创建所述原 始数据的数据校验值所使用的数据;  Determining whether the original data stored in the cloud storage data center is available, and if not available, retrieving the data check value of the original data from other cloud storage data centers and the data check value used to create the original data Data
根据所述原始数据的数据校验值及创建所述原始数据的数据校验值 所使用的数据还原所述原始数据。  The original data is restored based on the data check value of the original data and the data used to create the data check value of the original data.
2、 如权利要求 1所述的云存储的数据存取的方法, 其特征在于, 所 述创建待存储原始数据的数据校验值是通过异或逻辑运算创建原始数据 的数据校验值; 相应地, 所述还原所述原始数据是通过异或逻辑运算还 原原始数据。  The method for data access of the cloud storage according to claim 1, wherein the data check value for creating the original data to be stored is a data check value for creating original data by an exclusive OR logical operation; The restoring the original data is to restore the original data by an exclusive OR logical operation.
3、 如权利要求 1所述的云存储的数据存取的方法, 其特征在于, 所 个数能被所述云存储数据中心的个数整除。  3. The method of data access for cloud storage according to claim 1, wherein the number is divisible by the number of the cloud storage data centers.
4、 如权利要求 1所述的云存储的数据存取的方法, 其特征在于, 所 述创建所述原始数据的数据校验值所使用的数据包括所述原始数据和至 少一个固定长度的二进制数。  4. The method for data access of a cloud storage according to claim 1, wherein the data used to create the data check value of the original data comprises the original data and at least one fixed length binary. number.
5、 如权利要求 1所述的云存储的数据存取的方法, 其特征在于, 所 述数据校验值存储于同一个云存储数据中心, 或存储于不同的云存储数 据中心。 The method for data access of the cloud storage according to claim 1, wherein the data check value is stored in the same cloud storage data center or in a different cloud storage data center.
6、 一种云存储的数据存取的装置, 所述装置包括数据的存储模块及 数据的取回模块, 其特征在于, 6. A device for data access of a cloud storage, the device comprising a data storage module and a data retrieval module, wherein
所述数据的存储模块包括:  The storage module of the data includes:
数据校验值创建单元, 用于在数据的发送端创建待存储原始数据的 数据校验值;  a data check value creating unit, configured to create a data check value of the original data to be stored at the sending end of the data;
数据传输单元, 用于按预置的冗余存放策略将所述原始数据及数据 校验值创建单元创建的数据校验值分别传输至不同的云存储数据中心存 储;  a data transmission unit, configured to separately transmit the data verification values created by the original data and the data verification value creation unit to different cloud storage data center storage according to a preset redundancy storage policy;
所述数据的取回模块包括:  The data retrieval module includes:
访问判断单元, 用于根据访问请求按预置的冗余存放策略访问存储 原始数据的云存储数据中心, 以及判断云存储数据中心存储的原始数据 是否可用, 发送判断结果;  The access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
数据取回单元, 用于根据所述访问判断单元发送的所述云存储数据 中心存储的原始数据不可用的判断结果, 从其它云存储数据中心取回所 述原始数据的数据校验值及创建所述原始数据的数据校验值所使用的数 据; 校验值及创建所述原始数据的数据校验值所使用的数据还原所述原始数 据。  a data retrieval unit, configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
7、 一种云存储的数据存取的系统, 所述系统包括数据的存储模块、 数据的取回模块和云存储数据中心, 其特征在于,  7. A system for data access of a cloud storage, the system comprising a data storage module, a data retrieval module, and a cloud storage data center, wherein
所述数据的存储模块包括:  The storage module of the data includes:
数据校验值创建单元, 用于在数据的发送端创建待存储原始数据的 数据校验值;  a data check value creating unit, configured to create a data check value of the original data to be stored at the sending end of the data;
数据传输单元, 用于按预置的冗余存放策略将所述原始数据及数据 校验值创建单元创建的数据校验值分别传输至不同的云存储数据中心存 储;  a data transmission unit, configured to separately transmit the data verification values created by the original data and the data verification value creation unit to different cloud storage data center storage according to a preset redundancy storage policy;
所述数据的取回模块包括: 访问判断单元, 用于根据访问请求按预置的冗余存放策略访问存储 原始数据的云存储数据中心, 以及判断云存储数据中心存储的原始数据 是否可用, 发送判断结果; The data retrieval module includes: The access judging unit is configured to access the cloud storage data center storing the original data according to the preset redundancy storage policy according to the access request, and determine whether the original data stored in the cloud storage data center is available, and send the determination result;
数据取回单元, 用于根据所述访问判断单元发送的所述云存储数据 中心存储的原始数据不可用的判断结果, 从其它云存储数据中心取回所 述原始数据的数据校验值及创建所述原始数据的数据校验值所使用的数 据; 校验值及创建所述原始数据的数据校验值所使用的数据还原所述原始数 据。  a data retrieval unit, configured to retrieve, according to the determination result that the original data stored in the cloud storage data center is not available, by the access determination unit, to retrieve the data verification value of the original data from other cloud storage data centers and create The data used by the data check value of the original data; the check value and the data used to create the data check value of the original data restore the original data.
PCT/CN2010/079327 2010-11-29 2010-12-01 Method, apparatus and system for accessing data in cloud storage WO2012071719A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/000,575 US8347184B2 (en) 2010-11-29 2010-12-01 Cloud storage data access method, apparatus and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010566265.X 2010-11-29
CN201010566265XA CN102055797A (en) 2010-11-29 2010-11-29 Method, device and system for accessing cloud storage data

Publications (1)

Publication Number Publication Date
WO2012071719A1 true WO2012071719A1 (en) 2012-06-07

Family

ID=43959716

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/079327 WO2012071719A1 (en) 2010-11-29 2010-12-01 Method, apparatus and system for accessing data in cloud storage

Country Status (2)

Country Link
CN (1) CN102055797A (en)
WO (1) WO2012071719A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9514145B2 (en) 2011-12-02 2016-12-06 International Business Machines Corporation Managing redundant immutable files using deduplication in storage clouds

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102752402A (en) * 2012-07-20 2012-10-24 广东威创视讯科技股份有限公司 Cloud storage method and cloud storage system
CN102882885B (en) * 2012-10-17 2015-07-01 北京卓微天成科技咨询有限公司 Method and system for improving cloud computing data security
CN106027653B (en) * 2016-05-23 2019-04-12 华中科技大学 A kind of cloudy storage system extended method based on RAID4
CN106060094B (en) * 2016-07-29 2019-04-16 成都轻车快马网络科技有限公司 Cloud computing method for mobile client
CN110276213B (en) * 2019-06-24 2021-03-16 恒宝股份有限公司 Smart card and storage and verification method for sensitive result thereof
CN111782623A (en) * 2020-05-21 2020-10-16 北京交通大学 File checking and repairing method in HDFS storage platform

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605148A (en) * 2009-05-21 2009-12-16 何吴迪 The framework method of the parallel system of cloud storage
CN101888405A (en) * 2010-06-07 2010-11-17 北京高森明晨信息科技有限公司 Cloud computing file system and data processing method
US20100299313A1 (en) * 2009-05-19 2010-11-25 Security First Corp. Systems and methods for securing data in the cloud

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1258921C (en) * 2002-07-30 2006-06-07 中兴通讯股份有限公司 Distributive video interactive system and its data recording and accessing method
JP4754852B2 (en) * 2005-03-15 2011-08-24 富士通株式会社 Storage control apparatus and method
CN101571820B (en) * 2009-05-27 2012-07-04 成都市华为赛门铁克科技有限公司 Redundancy protected method, device and system
CN101834898B (en) * 2010-04-29 2013-01-30 中科院成都信息技术有限公司 Method for storing network distributed codes

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100299313A1 (en) * 2009-05-19 2010-11-25 Security First Corp. Systems and methods for securing data in the cloud
CN101605148A (en) * 2009-05-21 2009-12-16 何吴迪 The framework method of the parallel system of cloud storage
CN101888405A (en) * 2010-06-07 2010-11-17 北京高森明晨信息科技有限公司 Cloud computing file system and data processing method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9514145B2 (en) 2011-12-02 2016-12-06 International Business Machines Corporation Managing redundant immutable files using deduplication in storage clouds

Also Published As

Publication number Publication date
CN102055797A (en) 2011-05-11

Similar Documents

Publication Publication Date Title
US10613776B2 (en) Appyling multiple hash functions to generate multiple masked keys in a secure slice implementation
US8347184B2 (en) Cloud storage data access method, apparatus and system
US10387249B2 (en) Migrating data slices within a dispersed storage network
WO2012071719A1 (en) Method, apparatus and system for accessing data in cloud storage
US9009575B2 (en) Rebuilding a data revision in a dispersed storage network
US9323603B2 (en) Storage of sensitive data in a dispersed storage network
US9128870B2 (en) Sending a zero information gain formatted encoded data slice
US10693640B2 (en) Use of key metadata during write and read operations in a dispersed storage network memory
US10142115B2 (en) Distributed storage network data revision control
CN102667727B (en) For realizing the N road parity technique from nearly N number of memory device failure recovery
US20130297947A1 (en) Storing data integrity information utilizing dispersed storage
US9632722B2 (en) Balancing storage unit utilization within a dispersed storage network
US10255135B2 (en) Method and apparatus for non-interactive information dispersal
WO2012071712A1 (en) Method, device and system for accessing cloud storage data
US20190004727A1 (en) Using a namespace to augment de-duplication
US20120134494A1 (en) Data Control Method of Cloud Storage
US10642992B2 (en) Password augmented all-or-nothin transform
US10324657B2 (en) Accounting for data whose rebuilding is deferred
US20190026025A1 (en) Multi-phase dispersed storage write process
Sengupta et al. Data vaporizer-towards a configurable enterprise data storage framework in public cloud
US10360391B2 (en) Verifiable keyed all-or-nothing transform
US20190007380A1 (en) De-duplication of data streams
Kiani et al. A non-mds erasure code scheme for storage applications
CN117093397A (en) Data output fault-tolerant device and fault-tolerant method in high-performance computing environment
Jaikar et al. Securing Cloud Data Storage

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 13000575

Country of ref document: US

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10860290

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10860290

Country of ref document: EP

Kind code of ref document: A1