Strong knowledge of data mining and statistics, with experience analyzing large-scale datasets,
Education: Master's or PhD in Computer Science or Quantitative Techniques.
Strong knowledge of Hadoop and the Apache family of tools,
Strong problem-solving skills and knowledge of data structures (trees, graphs, etc.) and algorithms,
Experience in handling Big Data,
Exposure to modeling software such as SAS, R, KNIME, Weka, MATLAB, SPSS, etc.,
Understanding of databases such as SQL, Teradata, Oracle, etc., and experience with query languages,
At least basic programming skills in any programming language, viz. Visual Basic, Java, C, C++ (helpful in exploring new areas and in Big Data architecture development),
Experience with high-volume Data Warehousing systems and a related appreciation of data,
Preferred skills: Mahout, RHIPE, R, Python, UNIX/Linux shell programming experience, knowledge of Microsoft Excel for data manipulation, experience with high-volume Data Warehousing systems.