As 'big data' grows, IT job roles, technology must change
IT managers say 'big data' movement continues to be a problem for many companies
Computerworld - LAS VEGAS -- As corporate data stores continue to grow, in some cases by more than 50% annually, the expanding task of managing and mining them for information is forcing a change in how IT workers are trained.
That was the consensus of IT managers from 24 companies who gathered at the EMC World user conference here this week to discuss so-called big data issues.
The discussion was sponsored by EMC's Isilon division, which manufactures clustered network-attached storage (NAS) systems used to house massive data warehouses under single domain-name spaces.
Chris McNally, a storage architect at IT hosting company Sungard, said he is helping to cross-train employees to learn how various systems fit into a larger IT ecosystem.
For example, McNally said, AIX and backup administrators have volunteered to undergo storage area network (SAN) and cloud storage training at Sungard.
"So instead of me being the storage guy who has to argue with these other guys about how this works and what we do, they're now able make intelligent requests with regard to storage," he said. "It creates a better product in the end."
James Lowey, director of network and computer systems at genome sequencer company Translational Genomics Institute (TGen), said workers in his company's traditional IT shop are required to learn how networks, operating systems and storage interact.
Lowey said mapping human genomes currently creates 2TB of new data every week, and he expects that to grow to 10TB per week by year's end. The genomic data is used to tailor drug compounds to treat diseases based on a person's specific genetic profile.
He noted that coming up with the best way to mine that data for important information continues to be a sticking point.
Looking to come up with a solution to that problem, EMC last year spent more than $3 billion to acquire companies like Isilon and data warehousing and analytics company Greenplum.
" 'Do I keep [data] or do I not keep it?' has been an age-old question," said Paul Rutherford, CTO of Isilon.
For Lowey, keeping all the data produced is an issue his company is struggling with. There's nothing more personal than genetic data, so keeping everything means keeping everything secure for as long as the company has it. But the data store also continues to be a valuable source of information that can be mined for creating custom drug treatments.
"The reason we keep everything forever is that we're not sure about what it is we have," Lowey said. "In life sciences, there's so much to learn and so much unknown."
EMC announced here that it has ramped up programs to train and certify "data scientists." A data scientist spends his or her time determining the value of a corporation's data.
Nick Mehta, CEO of cloud storage provider LiveOffice, said data persists whether it's properly stored or not.
"For us, the issue is, how do you enable a world where you can keep everything cost effectively? We want a way to keep everything and then make it valuable. Having all that data helps us do our jobs better," he said.
LiveOffice currently stores some 4 petabytes of data on disk and adds another 5TB to that pool each day. LiveOffice encrypts all of the data for customer safety.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- The Total Economic Impact of the HP 3PAR Storage
- Forrester Consulting provides an analysis of four HP 3PAR storage customer implementations to quantify the efficiency and cost savings achieved over legacy storage...
- Using HP's Converged Storage to Develop/Enhance Business Resiliency in VMware Environments
- In this report, Enterprise Strategy Group reviews how HP's portfolio of hardware, software, and services can provide the foundational support for VMware environments....
- Converged Storage: Utility Storage - The Ideal Platform for Virtual and Cloud Computing
- Server virtualization has transformed corporate IT -- companies have enjoyed major cost savings and have gained flexibility and efficiency. But this has also...
- Defining Tier One Storage in the Modern Data Center
- This report defines "tier-1" storage in the modern IT world and in the data centers and services that support it. What was a...
- The Best Way to Build a Cloud -- HP CloudSystem Matrix and HP 3PAR Utility Storage provide solid, flexible foundation
- Learn how HP CloudSystem Matrix and HP 3PAR Utility Storage provide a solid, flexible foundation for your cloud environment.
Intel and the Intel logo...
All Storage White Papers
- Live Webcast
Today's NAS: A Solution Beyond Old Limits - Date: Tuesday, July 17, 2012 2:00 PM EDT
Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management... - Today's NAS: A Solution Beyond Old Limits
- Date: Tuesday, July 17, 2012 2:00 PM EDT
Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management... - Distributed Database Security with Real-time Monitoring
- View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific...
All Storage Webcasts