Job description
5 years of experience in software design and development, preferably in Pyspark or Scala. Hands-on experience building data pipelines using Hadoop ecosystem components (Hive, Spark, Spark SQL). Experience with scheduling tools such as Airflow. Strong knowledge of Unix/Linux platforms. Experience with big data frameworks: Apache Hadoop, Apache Spark, YARN, Hive, Python, ETL frameworks, MapReduce, SQL, RESTful services. Familiarity with version control (Git/GitHub), automated deployment tools (An…