Scale Up Your Performance Density with HP IO Accelerators and ioTurbine Software

August 8, 2012

As an update to a previous posting, HP has just published a whitepaper highlighting dramatic TPS performance and VM density using HP IO Accelerators and Fusion’s ioTurbine caching software with virtualized HP DL980 servers.

Benchmark testing demonstrated a 2.8x increase in performance achieved by adding just two 1.28TB IO Accelerators and ioTurbine software to a pair of HP ProLiant DL980 servers running Microsoft Windows 2008 and VMware ESX 4.1, connecting with HP P4800 storage. The configuration supports an aggregated 7120 transactions-per-second (TPS) across 16 VMs running SQL workloads, in parallel. This result compares to 2521 TPS for the same baseline but uncached configuration.

The testing illustrates that this solution not only adds scale to the HP DL980 server but also dramatically increases VM densities to support almost any application, regardless of workload, in a virtual server environment. Having more and larger size VMs makes more efficient use of infrastructure, lowers costs and provides the opportunity to run databases in a virtualized environment, something that was previously not possible or cost prohibitive, at least until now. The solution is an ideal VMware consolidation platform and preserves VMware vMotion and high availability capabilities, important customer requirements.

Of course, this same solution approach can also be attractive for customers interested in using HP IO Accelerators and ioTurbine with other HP servers such as the HP DL580, HP DL380 and the BL460 for bladed environments.

Note also that testing was accomplished with HP’s Gen1 IO Accelerators, so performance results would be expected to be even better using HP’s recently announced ioDrive2 IO Accelerators.


Ten Things to Consider When Building a Storage System for Virtualization

July 5, 2012

Virtualization is quickly taking over data centers. Gone are the days when IT admins worried about managing operating systems running directly on physical server hardware. The manageability and cumulative performance advantages of virtualization has led to a growing trend where consumer operating systems like Microsoft Windows are run within virtual machines. These virtual machines are managed by a hypervisor (such as VMware’s vSphere) that mediates access to the physical hardware in the server node. Clusters of such server nodes are being put together to host several hundred to even thousands of virtual machines. Such clusters afford high availability and load balance by permitting migration of virtual machines between server nodes.

Just like the rest of the physical hardware, the hypervisor also virtualizes the underlying storage for the virtual machines. Thus, each virtual machine may present a virtualized SCSI disk to the guest operating system running inside it. The data written to this virtualized disk cannot be simplistically mapped to an underlying physical disk. This is because this data needs to remain accessible once the virtual machine is migrated to another server node (e.g., upon a hardware failure). A sophisticated storage subsystem is therefore needed, one that can keep the data accessible despite the movement of virtual machines across server nodes.

The Nutanix Complete Cluster offers such a sophisticated storage subsystem that was designed specifically for virtualization workloads. This storage subsystem can be accessed by the hypervisor through industry standard iSCSI/NFS protocols. This blog talks about 10 key considerations that went into the design of this storage subsystem.

Elegance of Nutanix Design

1. Converged and distributed: Hardware trends in the past ten years indicate that disk capacities and speeds are growing at a much faster pace than network speeds. A cost-effective solution needs to be converged to leverage these trends – i.e., the storage needs to be placed close to the computation that accesses that storage and not across an expensive network fabric. The Nutanix offering epitomizes this by building a distributed storage subsystem using the local disks in the server nodes themselves. This is in sharp contrast to the single-headed SAN/NAS solutions that require expensive networking to deliver the high performance required by server clusters running virtual machines.

30 year old legacy design, the linchpin of our datacenters

2. Incremental scalability: As compute/storage needs grow, it should be possible to grow the system incrementally rather than requiring a complete hardware refresh as is typical with centralized SAN/NAS solutions. The Nutanix Complete Cluster is designed to be incrementally scalable, with no single point of bottleneck. While near linear scalability has been demonstrated in a cluster of 50 nodes, the design affords limitless scalability.

3. Performance: A storage system that considers performance as an after-thought opens itself up for one or more expensive architectural redesigns. The Nutanix Complete Cluster was designed for delivering high performance from the very outset. It combines traditional wisdom in distributed system design with new techniques to deliver high performance. These include a pipelined architecture, asynchronous request handling, extensive caching, and judicious use of Fusion ioMemory to keep frequently accessed data as well as metadata. The design specifically caters to virtualization workloads. For example, the NFS server implementation in the Nutanix Complete Cluster was designed to deliver high data IOPS (both random and sequential) rather than high namespace IOPS (which is what outdated benchmarks like SpecFS primarily measure). This is specially suited for virtualization as the bulk of the IO requests from guest VMs are converted into NFS read/write requests by the hypervisor when accessing the underlying storage subsystem through the NFS protocol.

Nutanix Direct Data Path

4. Random IO: With potentially hundreds of virtual machines simultaneously issuing IO requests, the data access patterns appear random by the time they are incident on the underlying storage system. In contrast to traditional storage subsystem designs, Nutanix was designed with the intent of delivering high random IO performance from the very start. It uses techniques such as a distributed operation log to absorb random writes, careful placement of metadata indexes in high performance SSDs for quick lookups, and extensive use of caching and deduplication to absorb boot/login storms. Recently, a 40-node Nutanix cluster successfully ran VMware’s RAWC benchmark with a record-breaking 3000 virtual machines. More details on this VDI reference architecture can be found at http://bit.ly/yN9S01.

5. Fine-grained tiering: Gone are the days when the predominant form of persistent storage were magnetic disks with similar performance characteristics. Today the data can be stored on a wide variety of media e.g., SSDs, SAS/SATA drives etc, each affording different capacities and performance at a given price point. The storage subsystem in Nutanix recognizes these as separate tiers of storage and places data on them based on its temperature. Thus, hot data is placed on the faster SSDs while colder data might be placed on the slower SATA drives. As the temperature of data changes, the Nutanix complete cluster supports water-falling of data between tiers. To avoid polluting the SSDs with cold data, data is divided up into fine-grained units of a few megabytes that form the basis of data placement and migration. Such fine-grained management of data across tiers also enables Nutanix to quickly adapt to changing workloads.

Information Lifecycle Management

6. Consistency model: The Nutanix Complete Cluster can manage petabytes of data written by guest VMs. Just like other storage subsytems, metadata is maintained to enable the quick location of any data. Since losing data or returning stale data is not an acceptable option, a strict consistency model is supported. While relational database abstractions such as transactions can be used to implement strict consistency, this approach is known to be unscalable and slow. On the other hand, typical noSQL approaches that maintain structured information as a set of key/value pairs are know to be highly performant, but typically only afford eventual consistency. The Nutanix Complete Cluster adopts a novel two-fold approach for delivering high performance despite supporting strict consistency. First, the metadata is kept in a noSQL key/value store that was enhanced with the Paxos algorithm to provide strict consistency for updates of any given key’s value. Second, all metadata operations involving multiple keys are carefully sequenced in way so as to always keep the overall metadata tree completely consistent at all times. This approach provides the best of both worlds – delivering high performance while supporting strict consistency.

7. Congestion management: Every major function in the Nutanix Complete Cluster is handled by a different component. A key aspect of the design is that flow/congestion control is built into each of these components. Without proper congestion management, a distributed system can come to a grinding halt by entering situations where useful work can no longer be done. As an example, the component that manages writes to a disk might become clogged with requests. As a result, a remote sender may timeout its outstanding requests to the congested component and re-send them – causing further continued congestion. To avoid such situations, every component in the Nutanix Complete Cluster exerts appropriate flow control to ensure it accepts only as many requests as it can reasonably execute. In addition, stale or low priority requests are quickly dropped when congestion is detected.

8. Designed for high-availability: A highly available storage subsystem does not have the luxury of going offline when a few of its components fail. These components might be either software components, or hardware ones. The storage subsystem in the Nutanix complete cluster was designed for fault-tolerance. There is no single-point of failure and any component can fail and stay down for extended periods of time. Thus, any disk, node, network card etc may fail without affecting availability. All data is both replicated as well as checksummed to protect against faults. The number of replicas kept for the data is configurable – thus permitting simultaneous failure of one or more components without sacrificing availability.

Anatomy of A Write IO; 10,000 ft. view

9. Replication fan-out: Distributed storage subsystems are often designed by mirroring one disk onto another. With disk capacities running into terabytes, this implies that failure of one disk would require reading all the data from the other healthy disk in order to restore replication. Not only does this create a hot-spot in the system by making one disk the bottleneck while others might be idle, it also increases the chances of data loss because the intense workload on the healthy disk might also cause it to fail. The Nutanix Complete Cluster avoids this by replicating each unit of data (comprising a few megabytes) on a disk to a random disk in the rest of the cluster. On a disk failure, the corresponding replicas can be read to restore replication – the restored second copy can also be placed on any disk in the cluster. Thus, recovering from a failed disk utilizes all of the cluster’s resources and avoids the formation of any hot-spots.

10. Continuous healing: Nutanix’s highly available storage subsystem cannot freeze to run a data consistency check (akin to the fsck found in Unix filesystems). The distributed nature of the system coupled with the petabytes of data it can potentially manage implies that faults will happen sooner or later – for example due to failed components. To discover and recover from such problems, the Nutanix Complete Cluster continuously heals itself by running a MapReduce over its metadata and taking appropriate corrective measures based on the issues found. For example, if a data unit is found to be under-replicated due to a failed component, a replication will be kicked for that component. The MapReduce computation runs as a low-priority background job so as to not affect the performance of higher-priority IO requests emanating from the guest VMs. The use of MapReduce, whose use is predominant in Big Data analytics today, lends the Nutanix Complete Cluster the scalability to manage large amounts of data, while affording high availability at the same time.

To summarize, the Nutanix Complete Cluster bridges the gap between computation and storage by converging these in a compact rackable unit, one or more of which can be stacked together to build a powerful virtualization appliance. The new demands imposed by virtualization workloads required an architecture that was built ground-up to specifically meet these requirements. The yardsticks of availability, performance, and scalability indicate that the Nutanix Complete Cluster is delivering on its promise, and is stretching the horizons of what was earlier possible in the realm of virtualization. Despite everything that has been delivered so far, there are lot of more exciting things that are in the pipeline. So stay tuned.


Backup Restore and Recovery Considerations in Virtual Environments

May 23, 2011

Great article below reference backing up virtual machines. C24 have always been known for our expertise in the delivery of business applications at speed across the globe. However, recently we have again invested significantly in our hosting infrastructure which has enabled us to now offer the solution suite for one of the worlds best back-up and recovery company’s Asigra.

The signing with Asigra again signals that C24 intends to offer only best of breed solutions. Please enjoy the article below.

It is no secret that large and small businesses alike, are rapidly adopting server virtualization in their data centers and most indications are that this trend will continue.  When architecting virtual infrastructures, one of the first issues that business face is “What should I do for backup and recovery in a virtual environment?”

The most common approach, at least when starting out, is to ignore the fact that servers are now running on Virtual Machines (VMs) and backup the servers through the guest Operating Systems (Oss) just like you do when the OS is running on a physical server.  While this approach will work, it does have some drawbacks.  This approach typically requires you to load a backup agent on the guest OS in order to backup that server.  If the server is running an application such as Exchange, SQL or SharePoint, then you need to load a separate agent for each application.  Some backup applications also require separate agents to backup the Windows System State or Services Data Base.

When you load backup agents on a physical server, these agents are processes running on the OS, which require CPU resources.  Depending upon the agents, each agent might use less than 1% of the CPU resources or more than 15%.   Regardless of the resources required by the agents, this CPU overhead usually goes unnoticed on a physical server.

However, in a virtual environment, you could easily have 10 VMs running on a single physical host.  Each VM might have several agents on the server to accommodate backing up the file system, services database and applications.   Assuming a very conservative average of two agents per VM (each using 1% of the host’s CPU cycles), in a virtual environment, you would be wasting 20% of your available CPU resources on backup agents that don’t do anything during normal business hours. 

Once businesses realize the overhead in terms of wasted CPU resources, as well as man-hours required to manage all those agents, they typically look for a solution that will allow them to back up their VMs from the physical host side.  VMWare has the largest server virtualization market share so most of the major backup applications now support backing up VMs from the VMWare host side. 

Backing up VMs from the host side has advantages over backing up servers from the guest OS side.  First, there is no need to load or manage agents on each of the guest OSs.  This saves on both CPU resources and management overhead.

The next advantage is that it is typically much faster to backup and restore VMs from the host side, since you are backing up and restoring a single large VMDK file rather than backing up and restoring thousands of small individual OS, application and data files.   In a Disaster Recovery (DR) situation, where a VM’s OS becomes corrupted and you need to restore from a backup, it is very easy to point and click, and restore that system to another VM.  The disadvantage with many backup applications is they don’t support individual file restores.  If an end user deletes a single file, you need to restore the entire VM, find the file and give it to the end user, then delete the VM. 

When moving to a virtual infrastructure, it is a good time to evaluate your current backup application and to see if it meets all your needs.  If you determine that you need to invest in a new backup solution, you will want to choose one that will meet all your needs, now and in the future.  You should look for a solution that will allow you to restore the entire VM in a DR situation or to restore applications and databases like Exchange and SQL without having to restore the entire VM.  You should also consider a solution that allows you to restore individual Exchange messages or individual SharePoint items, without having to restore the entire database.

Finally, you should seriously consider a backup recovery solution that supports both physical servers and virtual environments. And the BUR solution should support more than just VMware.   While VMware may have the lion’s share of the virtualization market share today, they are starting to face significant competition from other sources such as MS Hyper-V, XenServer and Parallels to name a few.  Whenever a technology vendor thinks that a customer has no alternatives and is locked into their solution, they have very little incentive to reduce the cost of their solution.  Bringing in an alternative virtualization solution may provide VMware an incentive to reduce their price.  But you shouldn’t have to invest time and money in a new backup solution just because you want to try an alternative to VMware.

Blog original from Scott Lakso @Asigra


Follow

Get every new post delivered to your Inbox.

Join 752 other followers