Ablility to benchmark systems, analyse system bottlenecks and propose solutions to eliminate them;
Clearly articulate pros and cons of various technologies and platforms;
Guide the full lifecycle of a big data solution, including requirements analysis, platform selection, technical architecture design, application design and development, testing, and deployment,
Provide technical and managerial leadership in a team that designs and develops path breaking large-scale cluster data processing systems"
Needs to have experience with the major big data solutions like Hadoop, MapReduce, Hive, HBase, MongoDB, Cassandra; Experience in big data solutions like Impala, Oozie, Mahout, Flume, ZooKeeper and/or Sqoop is desirable;
Needs to have a firm understanding of major programming/scripting languages like Java, Linux, PHP, Ruby, Phyton and/or R