Professional Documents
Culture Documents
Contents
1. Defining Virtualization 2. Characteristics for an Ideal Storage Virtualization Solution 3. A Stroll Down Memory Lane 4. Storage Virtualization in 3 layers 5. Key Concepts 6. Reasons Why to Virtualize
Defining Virtualization
A general definition of virtualization is: The act of integrating one or more (back-end) services or functions with additional (front-end) functionality for the purpose of providing useful abstractions. Storage virtualization is the aggregation of multiple physical storage devices (such as RAID arrays, ATA/SATA disks, high speed disks (HSDs), or tape drives) of various interface protocols (such as SCSI, iSCSI, or Fibre Channel) into a single virtual storage pool from which the creation and provisioning of virtual storage volumes, appearing as locally attached logical devices to their host server, can be performed as needed, when needed from a centralized console. An advanced virtualization solution provides IT administrators the freedom of choice to provision the available storage in the virtual pool as SAN and/or NAS volumes.
Enhance the storage resources it is virtualizing through the aggregation of services to increase the return of existing assets. Not add another level of complexity in configuration and management. Improve performance rather than act as a bottleneck in order for it to be scalable. Scalability is the capability of a system to maintain performance linearly as new resources (typically hardware) are added. Provide secure multi-tenancy so that users and data can share virtual resources without exposure to other users bad behavior or mistakes.
What is a SAN?
A storage area network (SAN) is an architecture to connect detached computer storage devices, such as disk arrays, tape libraries, and optical jukeboxes, to servers in a way that the devices appear as local resources. SANs deliver storage to servers at a block level, and feature mapping and security capabilities to ensure only one server can access the allocated storage at any particular time. The protocol, or language, used to communicate between storage devices and servers is SCSI (small computer system interface). Traditionally, SANs have used optical connectivity called fibre channel (FC) due to its gigabit transfer speeds and ability to work over long distances. However, more recently ethernet based storage networks have become popular due to the ubiquity (and therefore lower cost) of the technology in the TCP/IP world (transmission control protocol/Internet protocol) and the adoption of the SCSI protocol, called iSCSI (Internet small computer system interface) to the medium.
What is a NAS?
Network attached storage (NAS) has traditionally been a single device that contains disk storage and computer components (CPU, memory, network ports) and whose sole purpose is to deliver storage to servers. NAS is sometimes referred to as a file server due to its ability to share a common pool of files with multiple servers. By managing the disks, file systems, and volumes, it optimizes and simplifies the delivery of storage to servers in a network environment, and can provide services such as redundancy and replication, relieving servers of these tasks.
Out-of-band, also known as asymmetric, virtualization devices are sometimes called meta-data servers. These devices only perform the metadata mapping functions. This requires additional software in the host which knows to first request the location of the actual data. Therefore an I/O request from the host is intercepted before it leaves the host, a meta-data lookup is requested from the meta-data server (this may be through an interface other than the SAN) which returns the physical location of the data to the host. The information is then retrieved through an actual I/O request to the storage. Caching is not possible as the data never passes through the device. Both in-band and out-of-band approaches provide storage virtualization with the ability to: . Pool heterogeneous vendor storage products in a seamless accessible pool. . Perform replication between non-like devices. . Provide a single management interface. However, only the in-band approach can cache data for increased performance.
A primary storage controller provides the virtualization services and allows the direct attachment of other storage controllers. Depending on the implementation these may be from the same or different vendors.
The primary controller will provide the pooling and meta-data management services. It may also provide replication and migration services across those controllers which it is virtualizing.
Supports remote data access, backup, mirroring, replication: A value-adding storage virtualization solution also enables IT administrators to create backups and mirror space on disks configured virtually anywhere in a system, or across IP networks. It enables disaster recovery services such as replication of data across IP to a remote site and rapid data and system recovery services. It also ensures that a business can continue operating in the event of damage or loss of a primary data set.