How to Double VM Density with ioTurbine

May 23, 2013

Slow disk I/O limits virtual machine (VM) density in servers, and that restricts customers from realizing one of the greatest benefits of virtualization: consolidation. When disk I/O is too slow, you have to use more server hardware and buy more software licenses for virtualization to perform well enough.

ioTurbine is an innovative caching software approach that allows users to double the number of VMs on industry standard servers and virtualize mission critical applications, while integrating seamlessly into existing SAN or NAS storage.

Watch this video to see the benefits of ioTurbine in action.

A new white paper illustrates these benefits, comparing the performance of virtual machines running off a storage array to performance of virtual machines using ioTurbine caching software on ioMemory.

A single virtual machine’s performance on ioMemory is five times that of a VM on disk. This makes ioTurbine an ideal solution for virtualizing I/O-intensive applications like Microsoft SQL Server or Oracle databases, while supporting VM mobility features such as vMotion and HA. See Virtualizing Critical Applications with ioTurbine.

As more VMs are added, the ioTurbine performance scales, enabling 20 high-performance VMs to run on a server that could support just 10 VMs without caching.

Users looking to increase VM density—either by virtualizing performance-sensitive applications currently running on bare metal or by increasing the number of VMs on servers— should consider ioTurbine intelligent caching with ioMemory.


Defensible Disposal with Automation

September 13, 2012

It’s no secret that the data on corporate servers is growing exponentially. Documents, presentations, media, spreadsheets, and other files are constantly being created and moved onto servers, and after a while, most of it is rarely used, if at all. However, much of this stale data also must be retained in order to comply with regulatory compliance, or to maintain business continuity.

Many IT departments are faced with the reality of having to either continually expand their storage infrastructure or try to accurately determine which data can be safely disposed. The first option is costly and results in basically paying for information you’ll never use, while the latter can be costly in terms of man-hours and brainpower, especially without an automated process in place.

Let’s examine the options a bit closer.

Do Nothing

While it seems like a simpler solution to keep expanding your hardware and try to hold onto every bit just in case it is needed some time in the future, this sort of inaction with regards to defensible disposal is simply not a viable option. Allowing vast amounts of data to accumulate will make it increasingly difficult for users to find relevant data, slow down e-discovery, cause servers to perform poorly, and possibly even crash them, costing your business precious time and money.

Do Anything

Taking the wrong action can be just as damaging. Deleting your CEO’s old email archive might result in a very uncomfortable conversation; disposing of files that you are legally obligated to retain (for HIPAA, HITECH, SOX, etc.) can cost people their jobs, and possibly result in legal action. That’s something no IT professional ever wants to have to deal with.

Do the Right Thing

It should be clear by now exactly why proper defensible disposal techniques are integral to the survival of any business, especially those with sensitive data. Proper disposal techniques can save money and time by streamlining the process of deleting useless data and allowing for admins to focus on other more pressing needs.

If you’re finding the process itself takes quite a bit of planning and/or some sophisticated technology to do most of the heavy lifting, consider automating with technology like the Varonis Data Transport Engine. Varonis DTE simplifies the process of defensible disposal by leveraging our Metadata Framework, allowing admins to automatically and continually delete or migrate data based on a wide array of criteria, such as the content of the file or the date it was last accessed by a human user. This ensures that information that needs to be retained isn’t disposed of by accident and the data that can be safely deleted proceeds safely to bit-heaven, or bit bucket, or /dev/null.


Scale Up Your Performance Density with HP IO Accelerators and ioTurbine Software

August 8, 2012

As an update to a previous posting, HP has just published a whitepaper highlighting dramatic TPS performance and VM density using HP IO Accelerators and Fusion’s ioTurbine caching software with virtualized HP DL980 servers.

Benchmark testing demonstrated a 2.8x increase in performance achieved by adding just two 1.28TB IO Accelerators and ioTurbine software to a pair of HP ProLiant DL980 servers running Microsoft Windows 2008 and VMware ESX 4.1, connecting with HP P4800 storage. The configuration supports an aggregated 7120 transactions-per-second (TPS) across 16 VMs running SQL workloads, in parallel. This result compares to 2521 TPS for the same baseline but uncached configuration.

The testing illustrates that this solution not only adds scale to the HP DL980 server but also dramatically increases VM densities to support almost any application, regardless of workload, in a virtual server environment. Having more and larger size VMs makes more efficient use of infrastructure, lowers costs and provides the opportunity to run databases in a virtualized environment, something that was previously not possible or cost prohibitive, at least until now. The solution is an ideal VMware consolidation platform and preserves VMware vMotion and high availability capabilities, important customer requirements.

Of course, this same solution approach can also be attractive for customers interested in using HP IO Accelerators and ioTurbine with other HP servers such as the HP DL580, HP DL380 and the BL460 for bladed environments.

Note also that testing was accomplished with HP’s Gen1 IO Accelerators, so performance results would be expected to be even better using HP’s recently announced ioDrive2 IO Accelerators.


Ten Things to Consider When Building a Storage System for Virtualization

July 5, 2012

Virtualization is quickly taking over data centers. Gone are the days when IT admins worried about managing operating systems running directly on physical server hardware. The manageability and cumulative performance advantages of virtualization has led to a growing trend where consumer operating systems like Microsoft Windows are run within virtual machines. These virtual machines are managed by a hypervisor (such as VMware’s vSphere) that mediates access to the physical hardware in the server node. Clusters of such server nodes are being put together to host several hundred to even thousands of virtual machines. Such clusters afford high availability and load balance by permitting migration of virtual machines between server nodes.

Just like the rest of the physical hardware, the hypervisor also virtualizes the underlying storage for the virtual machines. Thus, each virtual machine may present a virtualized SCSI disk to the guest operating system running inside it. The data written to this virtualized disk cannot be simplistically mapped to an underlying physical disk. This is because this data needs to remain accessible once the virtual machine is migrated to another server node (e.g., upon a hardware failure). A sophisticated storage subsystem is therefore needed, one that can keep the data accessible despite the movement of virtual machines across server nodes.

The Nutanix Complete Cluster offers such a sophisticated storage subsystem that was designed specifically for virtualization workloads. This storage subsystem can be accessed by the hypervisor through industry standard iSCSI/NFS protocols. This blog talks about 10 key considerations that went into the design of this storage subsystem.

Elegance of Nutanix Design

1. Converged and distributed: Hardware trends in the past ten years indicate that disk capacities and speeds are growing at a much faster pace than network speeds. A cost-effective solution needs to be converged to leverage these trends – i.e., the storage needs to be placed close to the computation that accesses that storage and not across an expensive network fabric. The Nutanix offering epitomizes this by building a distributed storage subsystem using the local disks in the server nodes themselves. This is in sharp contrast to the single-headed SAN/NAS solutions that require expensive networking to deliver the high performance required by server clusters running virtual machines.

30 year old legacy design, the linchpin of our datacenters

2. Incremental scalability: As compute/storage needs grow, it should be possible to grow the system incrementally rather than requiring a complete hardware refresh as is typical with centralized SAN/NAS solutions. The Nutanix Complete Cluster is designed to be incrementally scalable, with no single point of bottleneck. While near linear scalability has been demonstrated in a cluster of 50 nodes, the design affords limitless scalability.

3. Performance: A storage system that considers performance as an after-thought opens itself up for one or more expensive architectural redesigns. The Nutanix Complete Cluster was designed for delivering high performance from the very outset. It combines traditional wisdom in distributed system design with new techniques to deliver high performance. These include a pipelined architecture, asynchronous request handling, extensive caching, and judicious use of Fusion ioMemory to keep frequently accessed data as well as metadata. The design specifically caters to virtualization workloads. For example, the NFS server implementation in the Nutanix Complete Cluster was designed to deliver high data IOPS (both random and sequential) rather than high namespace IOPS (which is what outdated benchmarks like SpecFS primarily measure). This is specially suited for virtualization as the bulk of the IO requests from guest VMs are converted into NFS read/write requests by the hypervisor when accessing the underlying storage subsystem through the NFS protocol.

Nutanix Direct Data Path

4. Random IO: With potentially hundreds of virtual machines simultaneously issuing IO requests, the data access patterns appear random by the time they are incident on the underlying storage system. In contrast to traditional storage subsystem designs, Nutanix was designed with the intent of delivering high random IO performance from the very start. It uses techniques such as a distributed operation log to absorb random writes, careful placement of metadata indexes in high performance SSDs for quick lookups, and extensive use of caching and deduplication to absorb boot/login storms. Recently, a 40-node Nutanix cluster successfully ran VMware’s RAWC benchmark with a record-breaking 3000 virtual machines. More details on this VDI reference architecture can be found at http://bit.ly/yN9S01.

5. Fine-grained tiering: Gone are the days when the predominant form of persistent storage were magnetic disks with similar performance characteristics. Today the data can be stored on a wide variety of media e.g., SSDs, SAS/SATA drives etc, each affording different capacities and performance at a given price point. The storage subsystem in Nutanix recognizes these as separate tiers of storage and places data on them based on its temperature. Thus, hot data is placed on the faster SSDs while colder data might be placed on the slower SATA drives. As the temperature of data changes, the Nutanix complete cluster supports water-falling of data between tiers. To avoid polluting the SSDs with cold data, data is divided up into fine-grained units of a few megabytes that form the basis of data placement and migration. Such fine-grained management of data across tiers also enables Nutanix to quickly adapt to changing workloads.

Information Lifecycle Management

6. Consistency model: The Nutanix Complete Cluster can manage petabytes of data written by guest VMs. Just like other storage subsytems, metadata is maintained to enable the quick location of any data. Since losing data or returning stale data is not an acceptable option, a strict consistency model is supported. While relational database abstractions such as transactions can be used to implement strict consistency, this approach is known to be unscalable and slow. On the other hand, typical noSQL approaches that maintain structured information as a set of key/value pairs are know to be highly performant, but typically only afford eventual consistency. The Nutanix Complete Cluster adopts a novel two-fold approach for delivering high performance despite supporting strict consistency. First, the metadata is kept in a noSQL key/value store that was enhanced with the Paxos algorithm to provide strict consistency for updates of any given key’s value. Second, all metadata operations involving multiple keys are carefully sequenced in way so as to always keep the overall metadata tree completely consistent at all times. This approach provides the best of both worlds – delivering high performance while supporting strict consistency.

7. Congestion management: Every major function in the Nutanix Complete Cluster is handled by a different component. A key aspect of the design is that flow/congestion control is built into each of these components. Without proper congestion management, a distributed system can come to a grinding halt by entering situations where useful work can no longer be done. As an example, the component that manages writes to a disk might become clogged with requests. As a result, a remote sender may timeout its outstanding requests to the congested component and re-send them – causing further continued congestion. To avoid such situations, every component in the Nutanix Complete Cluster exerts appropriate flow control to ensure it accepts only as many requests as it can reasonably execute. In addition, stale or low priority requests are quickly dropped when congestion is detected.

8. Designed for high-availability: A highly available storage subsystem does not have the luxury of going offline when a few of its components fail. These components might be either software components, or hardware ones. The storage subsystem in the Nutanix complete cluster was designed for fault-tolerance. There is no single-point of failure and any component can fail and stay down for extended periods of time. Thus, any disk, node, network card etc may fail without affecting availability. All data is both replicated as well as checksummed to protect against faults. The number of replicas kept for the data is configurable – thus permitting simultaneous failure of one or more components without sacrificing availability.

Anatomy of A Write IO; 10,000 ft. view

9. Replication fan-out: Distributed storage subsystems are often designed by mirroring one disk onto another. With disk capacities running into terabytes, this implies that failure of one disk would require reading all the data from the other healthy disk in order to restore replication. Not only does this create a hot-spot in the system by making one disk the bottleneck while others might be idle, it also increases the chances of data loss because the intense workload on the healthy disk might also cause it to fail. The Nutanix Complete Cluster avoids this by replicating each unit of data (comprising a few megabytes) on a disk to a random disk in the rest of the cluster. On a disk failure, the corresponding replicas can be read to restore replication – the restored second copy can also be placed on any disk in the cluster. Thus, recovering from a failed disk utilizes all of the cluster’s resources and avoids the formation of any hot-spots.

10. Continuous healing: Nutanix’s highly available storage subsystem cannot freeze to run a data consistency check (akin to the fsck found in Unix filesystems). The distributed nature of the system coupled with the petabytes of data it can potentially manage implies that faults will happen sooner or later – for example due to failed components. To discover and recover from such problems, the Nutanix Complete Cluster continuously heals itself by running a MapReduce over its metadata and taking appropriate corrective measures based on the issues found. For example, if a data unit is found to be under-replicated due to a failed component, a replication will be kicked for that component. The MapReduce computation runs as a low-priority background job so as to not affect the performance of higher-priority IO requests emanating from the guest VMs. The use of MapReduce, whose use is predominant in Big Data analytics today, lends the Nutanix Complete Cluster the scalability to manage large amounts of data, while affording high availability at the same time.

To summarize, the Nutanix Complete Cluster bridges the gap between computation and storage by converging these in a compact rackable unit, one or more of which can be stacked together to build a powerful virtualization appliance. The new demands imposed by virtualization workloads required an architecture that was built ground-up to specifically meet these requirements. The yardsticks of availability, performance, and scalability indicate that the Nutanix Complete Cluster is delivering on its promise, and is stretching the horizons of what was earlier possible in the realm of virtualization. Despite everything that has been delivered so far, there are lot of more exciting things that are in the pipeline. So stay tuned.


Death of the Desktop

January 11, 2011

Image representing iPad as depicted in CrunchBase

Image via CrunchBase

Yesterday we met with a potential business partner yesterday, and I must say that their business intelligence software was fantastic. Below is a blog that they wrote reference death of a PC which we felt was interesting. Please enjoy and thanks to Richard Lewis

When I thought about writing this blog it was going to be called “death of a PC”. This was going to be about what happened to me on a recent sales trip to Dubai when a demo machine we had set up remotely from the UK died before my eyes a few hours before an important presentation.

I ended up doing the demo on my laptop and averted what could have been a total disaster; however it made me think a bit about how we can avoid this kind of thing in the future… don’t use standalone desktops for demos and make sure you have a backup plan for starters…

On my way home I read an article in “Computer Weekly” titled “Keep taking the tablets” about the projected 198% increase in tablet computers in 2011. This made me wonder about the future of the humble desktop and whether or not this was the beginning of the end…

When my son and daughter last complained about being left behind by their mates on the hardware front because their machines were now 12 months old, they didn’t want new desktops, they wanted to replace their PC’s with laptops. Things are changing…

Over the past couple of weeks my son has started to use an iPad to operate his laptop or his old creaking PC via WIFI using some really impressive remote control software that cost just 57 pence!

The next time I’m dragged down to PC world or trawling the internet for hardware, am I really going to be interested in buying a desktop?

What seems an age ago now, Bill Gates and Larry Ellison took opposite sides on where hardware was going (remember the thin client debate?).

Larry Ellison believed that the personal computer was about to be eclipsed by the internet. Bill Gates believing that every man and his dog would own a PC (with windows on it).

Larry Ellison’s vision was around the use of light weight network computers which would use the capacity and grunt of large servers hosted on the internet to process and run the application software, leaving the clients doing little more than sending and receiving messages and rendering output to a screen. With less demand to host heavy weight applications on the client there would be less need for client licences – a scary thought for Mr Gates at the time.

Away from the big hitting business visionaries, what does the average man on the street mainly use their desktops for today?

…Games, word processing, social networking and music?

These days, most gamers get their kicks from their XBOX, Play Station or Nintendo so the desktop is not quite as important as it used to be.

Mobility and portability are becoming increasingly important too. The weight and size of the machine are becoming as big a factor as a machines power, memory and storage capacity.

We see Google writing more and more free applications that run over the Internet. When we all start to do our word processing over the internet too, where does this leave the humble desktop?

Looking from a business perspective there is an even bigger shift going on as the demand for making things available over the net and 100% reliable makes more and more people look towards hosting and outsourcing.

Server hardware is getting better and better. Whilst power is increasing, price is decreasing, while the internet is getting bigger, our connection to it is getting ever faster.

We now have virtualised servers, we have the cloud. The barriers to usage are being knocked down one by one.

As I sat in despair watching the desktop blue screen and crash, I knew that the software was not going to be moved to a new desktop.

Two weeks later and it was moved to a blade. In fact it’s on a VM on a blade, and soon to be moved to the cloud. When it’s it in the cloud it could get moved from machine to machine at the flick of a switch or a click of the mouse. To access the application you need a browser, so an iPad, Smart phone or dusty old desktop will do.

Things are definitely changing. It’s all pretty exciting as long as you’re not a desktop manufacturer! I’ll be run my demos on hosted servers from now on – with my laptop as a backup just in case access to the Internet’s down!

Mon, 08/11/2010 – 18:27 — Richard Lewis


Follow

Get every new post delivered to your Inbox.

Join 752 other followers