Amazon Web Services now accommodates big data storage
Amazon's new High Storage EC2 package is customized for jobs with large volumes of data and high throughput
IDG News Service - Eyeing the growing market for big data analysis, Amazon Web Services (AWS) has introduced a storage package, called High Storage, that can offer fast access to large amounts of data.
High Storage, an Amazon Elastic Compute Cloud (EC2) package, is designed to run data intensive analysis jobs, such as seismic analysis, log processing and data warehousing, according to the company. It is built on a parallel file system architecture that allows data to be moved on and off multiple disks at once, speeding throughput times.
"Instances of this family provide proportionally higher storage density per instance, and are ideally suited for applications that benefit from high sequential I/O performance across very large data sets," AWS states in the online marketing literature for this service. The company is pitching the service as a complement to its Elastic MapReduce service, which provides a platform for Hadoop big data analysis. AWS itself is using the High Storage instances to power its Redshift data warehouse service.
An AWS instance is a bundle of compute units, memory, storage and other services configured to the characteristics of a particular type of workload. High Storage is the ninth type of compute instance that AWS has introduced. It joins other instant types customized for particular workloads, such as instances optimized for using GPUs (graphics processing units) or for HPC (high performance computing) jobs.
The High Storage instance offers 35 EC2 compute units (ECUs) of compute capacity and 117GB of working memory. Up to 48TB of storage is spread across 24 direct attached storage (DAS) hard disk drives. Spreading data across multiple disks can speed data transfers because the read-and-write speed of a single disk is no longer a bottleneck. The system can offer more than 2.4GB per second of sequential I/O performance.
Customers can evoke High Storage instances from the AWS Management Console, from the EC2 or Elastic MapReduce command lines, or from the AWS SDK (software development kit) or third-party libraries. The High Storage instance is currently available on the U.S. east coast and will be available in other parts of the world in the next few months. High Storage instances can be purchased ether on-demand or be reserved ahead of time at reduced cost.
Further helping potential big data-minded customers, Amazon has also turned on its data pipeline for general use, which the company announced last month.
Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
- The 20 Best iPhone/iPad Games of 2013 So Far
- 9 Steps to Build Your Personal Brand (and Your Career)
- 7 Consumer Technologies Coming to an Enterprise Near You
- 11 Signs Your IT Project is Doomed
- A walking tour: 33 questions to ask about your company's security
- 15 social media scams
- The 7 elements of a successful security awareness program
- IT Certification Study Tips
- Register for this Computerworld Insider Study Tip guide and gain access to hundreds of premium content articles, cheat sheets, product reviews and more.
- The Total Cost of Email In this white paper, we'll explore the true costs of fragmented email management and uncover how to reduce those costs with a cloud-based...
- Best Practices for Cloud-based Information Governance This paper explores the latest ideas on evaluating cloud deployment: public or private clouds, data location and privacy, data ownership and access, and...
- Manage Virtualized and Cloud Environments and the New Software-defined Data Center Analyst report by Enterprise Management Associates on the newly announced EMC Service Assurance Suite, and how well it addresses operational challenges and market...
- Reduction in deployment time of a service development environment at GMO Media using a private cloud Read this case study to learn how GMO Media achieved a significant reduction in the implementation period of a service development environment using...
- B2B Integration on Cloud: Real World Solutions and Technology Advances Watch the webcast with IBM experts to learn about the advancing capabilities and strategic direction for B2B Integration on Cloud.
- How The Cloud Threatens Midsize Enterprises...And What To Do About It A recent study showed 92% of IT pros recognize that moving to the cloud provides a competitive edge, but only 20% plan to... All Cloud Computing White Papers | Webcasts
Rising salaries boost IT optimism, though not everyone is feeling upbeat. Our survey of 4,000+ IT workers shows who's riding the wave and why. Use our interactive tool and compare your own paycheck. Read more...