EMC ScaleIO

ScaleIO
Original author(s) ScaleIO (israeli start-up)
Developer(s) EMC Corporation
Development status active
Operating system Microsoft Windows (2008 R2, 2012 or 2012 R2), Red Hat Enterprise Linux or CentOS (6 or 7), SUSE Linux 12, Ubuntu 14.04[1]
Platform IA-32, AMD64
Type System_software
License Freeware
Website www.emc.com/storage/scaleio/

EMC ScaleIO is a software-defined storage product from EMC Corporation that creates a server-based storage area network (SAN) from local application server storage, converting direct-attached storage into shared block storage.[2] [3] It uses existing host-based internal storage to create a scalable, high-performance, low-cost server SAN. EMC promotes its ScaleIO server storage-area network software as a way to converge computing resources and commodity storage into a “single-layer architecture.”[4]

ScaleIO can scale from three compute/storage nodes to over 1,000 nodes that can drive up to 240 million IOPS of performance. Developers can deploy the ScaleIO software on-prem commodity infrastructure or in the cloud and then port their applications back into a production ScaleIO instance.[5] As of September, 2015, ScaleIO is also available from the company bundled on EMC commodity computing servers (officially called EMC ScaleIO Node).[6][7]

ScaleIO can be deployed as storage only or as a converged infrastructure combining storage, computational and networking resources into a single block. Capacity and performance of all available resources are aggregated and made available to every participating ScaleIO server and application. Storage tiers can be created with media types and drive types that match the ideal performance or capacity characteristics to best suit the application needs. It is available for free for testing (with community support[8]) or as a paid-for EMC-supported option.

History

ScaleIO was founded in 2011 by Boaz Palgi, Erez Webman, Lior Bahat, Eran Borovik, and Erez Ungar in Israel.[9] The software was designed for high performance and large systems. [10] The company was backed by venture capital firms including Greylock Partners and Norwest Venture Partners.[11] A product was announced in November 2012.[12]

EMC Corporation bought ScaleIO in June 2013 for about $200 million, only about six months after the company emerged from stealth mode.[13][9] EMC began promoting ScaleIO in 2014 and 2015, marketing it in competition with EMC’s own data storage arrays. Also in 2015, EMC introduced a model of its VCE (company) hyper-converged infrastructure hardware that supported ScaleIO storage.[14]

At its 2015 trade show, EMC announced that ScaleIO would be made freely available to developers for testing. By May, 2015, developers could download the ScaleIO software.[5] In September 2015, EMC announced the availability of the previously software-only ScaleIO pre-bundled on EMC commodity hardware, called EMC ScaleIO Node.[15]

Architecture

EMC ScaleIO uses storage and compute resources of commodity hardware. It combines HDDs, SSDs, and PCIe flash cards to create a virtual pool of block storage with varying performance tiers. It features on-demand performance and storage scalability, as well as enterprise-grade data protection, multi-tenant capabilities, and add-on enterprise features such as QoS, thin provisioning and snapshots. ScaleIO operates on multiple hardware platforms and supports physical and/or virtual application servers.[3]

ScaleIO works by installing software components on application hosts. Application hosts contribute internal disks and any other direct attached storage resources to the ScaleIO cluster by installing the SDS software. Hosts can then be presented volumes from the ScaleIO cluster by leveraging the SDC software. These components can run alongside other applications on any server (physical, virtual, or cloud) using any type of storage media (disk drives, flash drives, PCIe flash cards, or cloud storage).[16]

The ScaleIO architecture is built on two components: a data client and a data server. The ScaleIO Data Client (SDC) is a lightweight device driver situated in each host whose application or file system requires access to the ScaleIO virtual SAN block devices. The SDC exposes block devices representing the ScaleIO volumes that are currently mapped to that host. The SDCs maintain a small in-memory map, being able to maintain mapping of petabytes of data with just megabytes of RAM. The inter-node protocol used by SDCs is simpler than iSCSI and uses fewer network resources.[10]

The ScaleIO Data Server (SDS) is situated in each host and contributes local storage to the central ScaleIO virtual SAN. Each node is part of a loosely coupled cluster.[10]

Performance is expected to increase as servers and storage devices are added to the cluster.[17] [16] Additional storage and compute resources (i.e., additional servers and drives) can be added modularly. Every server in the ScaleIO cluster is used in the processing of I/O operations, making all I/O and throughput accessible to any application within the cluster. Any needed rebuilds and rebalances are processed in the background. Workloads are evenly shared with a parallel I/O architecture.[18]

ScaleIO can be deployed in either a “two-layer” multi-server cluster in which the application and storage are installed in separate servers, or as “hyper-converged” option where the application and storage are installed on the same servers in the ScaleIO cluster, creating a low-footprint, low-cost scalable single-layer architecture. Capacity and performance of all available resources are aggregated and made available to every participating ScaleIO server and application. Storage tiers can be created with media types and drive types that match the ideal performance or capacity characteristics to best suit the application needs.[16]

Storage and compute resources can be added to or removed from the ScaleIO cluster as needed, with no downtime and minimal impact to application performance. The self-healing, auto-balancing capability of the ScaleIO cluster ensures that data is automatically rebuilt and rebalanced across resources when components are added, removed, or failed. Because every server and local storage device in the cluster is used in parallel to process I/O operations and protect data, system performance scales linearly as additional servers and storage devices are added to the configuration.[16]

ScaleIO software takes each data chunk to be written and spreads it across many nodes, mirroring it as well. This makes data rebuilds from disk loss very fast as several nodes contribute their own smaller, faster and parallel rebuild efforts to the whole. ScaleIO supports VMware, Hyper-V, Xen and KVM hypervisors. It also supports OpenStack, Windows, Red Hat, SLES, CentOS, and CoreOS (docker). Any app needing block storage can use it, including Oracle and other databases. While it is not as closely integrated with VMware as Virtual SAN, the SDC functionality has moved into the VMware kernel.[10]

Promotional reports were commissioned from the Enterprise Strategy Group in 2014 and 2015 including performance measurements.[18][16]

References

  1. "System requirements". EMC Corporation.
  2. "Introducing ScaleIO Node". EMC Corporation.
  3. 1 2 "EMC ScaleIO: Software-Defined, Scale-Out SAN" (PDF). EMC Corporation.
  4. "EMC Bundles ScaleIO Software, Servers". EnterpriseTech. 2015-09-16.
  5. 1 2 "ESG Blog: EMC Embracing Freemium and Open Source with ScaleIo and ViPR". Enterprise Strategy Group, Inc. 2015-05-05.
  6. "Introducing EMC ScaleIO Node: Loving what's INSIDE the Box". EMC Corporation. 2015-09-16.
  7. "ScaleIO Node - what's the scoop, and what's up?". Virtual Geek. October 29, 2015. Archived from the original on March 30, 2016. Retrieved August 30, 2016.
  8. "ScaleIO Community web site". EMC Corporation. Retrieved August 30, 2016.
  9. 1 2 Shmulik Shelach (July 11, 2013). "EMC Buys Israeli Start-Up ScaleIO For About $200 Million In Cash". Jewish Business News. Retrieved August 30, 2016.
  10. 1 2 3 4 "How to storm the virtual heights of SAN". The Register. 2015-06-03.
  11. "ScaleIO". TechCrunch. 2015-11-04.
  12. Chris Mellor (December 6, 2012). "ScaleIO uncloaks, offers glimpses of mighty SANitisation plans: Virtual storage will be a pocket miracle - if it works". The Register. Retrieved August 30, 2016.
  13. "ScaleIO joins the pack of pooled storage startups with $12M". GigaOM. 2012-12-04.
  14. "Server SAN 2012-2026" (PDF). Wikibon. 2015-07-15.
  15. "EMC Introduces ScaleIO Nodes". StorageReview.com, Inc. 2015-09-15.
  16. 1 2 3 4 5 "EMC ScaleIO: Proven Performance and Scalability". Enterprise Strategy Group, Inc. 2015-09-01.
  17. "EMC Announces "No Restrictions" Download For ScaleIO And Future Enhancements". 2015-05-05.
  18. 1 2 "EMC ScaleIO: Transforming Commodity Hardware into Simple, Scalable, High-performance, Shared Storage". Enterprise Strategy Group. April 1, 2014. Retrieved August 30, 2016.
This article is issued from Wikipedia - version of the 9/22/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.