"The people who do the best are those that have an intense curiosity," says Patil, whom Forbes magazine credited, along with Cloudera founder Jeff Hammerbacher, with inventing the term data scientist. Previously Patil worked at LinkedIn -- his titles included head of data products, chief scientist and chief security officer -- helping develop that company's data science team and strategy.
Patil has a Ph.D. in applied mathematics. Sacheti has a Ph.D. in agricultural and resource economics. And yet, the qualities of curiosity and creativity matter more than the level and type of academic credential, Patil says. "These are people who fit at the intersection of multiple domains," he says. "They have to take ideas from one field and apply them to another field, and they have to be comfortable with ambiguity."
Cloudera's Wills, for example, took a circuitous path to become a data scientist. After graduating from Duke University with a bachelor's degree in math, he pursued a graduate degree in operations research at the University of Texas on and off while working for a series of companies, then dropped out to take a job at Google in 2007. (He did eventually complete that master's degree, he points out.) Wills worked at Google as a statistician and then as a software engineer before moving to Cloudera and assuming his data science title.
In short, big data folks seem to be jacks of all trades and masters of none, Wills says. "You can take someone who maybe is not the world's greatest software engineer, [nor] the world's greatest statistician -- but they have the communications skills to talk to people on both sides" as well as to the marketing team and the C-level executives. Their biggest skill is in serving as the "glue" in an organization, and most organizations have them, he says.
"These are people who cut across IT, software development, app development and analytics." Wills thinks such people are rising in prominence at companies. "I'm seeing a shift in value that companies are assigning to these people."
Sacheti, too, keeps his eye out for such people internally. "We are finding there are a lot more who are flexible in learning new skills, willing to do iterative design and agile thinking," he says.
In an attempt to home in on the career paths of big data professionals, IIA and Talent Analytics recently completed an online poll that aims to quantify not only the skills and academic degrees of current data professionals, but also their emotional and personal characteristics. Results are expected by year's end and will be available to HR professionals for a fee.
"In some cases the innate characteristics of people, like a predisposition to curiosity, can be more predictive of someone's performance in a role than them having a degree in, say, IT or IS or CS," says Talent Analytics' Roberts.
Wanted: A relentless, scientific temperament
Until recently, creativity, curiosity and communications skills were not typically emphasized in IT departments, which may be why most sources said they weren't looking to their operations IT staff to spearhead big data projects.
IIA sees data science as resting on three legs: technological (IT, systems, hardware and software), quantitative (statistics, math, modeling, algorithms) and business (domain knowledge), according to Phillips. "The professionals we see that are successful come from the quantitative side," he says. "They know enough about the technology but they aren't running the technology. They rely on IT to give them the tools."