1375 E Buena Vista Lake Buena Vista, FL 32830
- Define the model input requirements to produce a series of historical aggregated datasets utilizing Python, PySpark and SparkSQL on data provided by technology residing on Hadoop/HDFS/Hive.
- Validate the aggregation results and develop an incremental daily update process to the aggregated datasets.
- Skilled in SQL, Python, Spark, Hive.
- Skilled in PySpark, SparkSQL, AWS, Hadoop.
- Experience in the development and support of batch processing.