I have experience in setting up Hadoop cluster in distribution like Cloudera, Horton works, Google Proc in environments like Amazon AWS, Google Cloud platform, VM images, Stand alone servers, and Dockers.
Created data pipelines using Apache Spark, Kafka, HIVE, Mongo DB.
Have experience in Data cleaning, enriching, validation using apache Apark.
created machine learning model to analyse the data.
I will be available in Frankfurt in February 2018.