Hands on Experience in Hadoop Ecosystem i.e. Yarn, Map Reduce, Hive, Pig, Sqoop, Oozie----- A MUST
Good knowledge of Big Data querying tools, such as Pig, Hive.
Ability to process large sets of structured, semi-structured dataset and supporting systems application architecture.
Experience in integration of multiple data sources like RDBMS with HDFS.
Good working experience in any of Cloudera & Hortonworks distributions.
Good Knowledge of Unix Commands
Knowledge of core Java/ Scala
Good to have knowledge on spark, Kafka streaming (optional)
Experience with Cloudera/MapR/Hortonworks