Opening for PySpark Developer (GCP) - Remote
Contract
Work Authorization: USC, GC, or EAD
Why Open: Need to refactor the current code so that data is ingested in near real time
Team/Role: Within the network and provider group under the Healthcare business/technology umbrella. They have a Tableau dashboard that currently does data processing and batching every night. They are looking to run Hadoop on GCP, update GCP throughout, and make the dashboard more "real time" so that it feeds Tableau dynamically. The main aspect of the job is refactoring code to run PySpark in GCP. There will be specific deliverables due by end of year.
Must haves:
• 3+ years of PySpark development
• GCP: BigQuery, Dataproc, Cloud Composer
• Running PySpark in GCP
• Fortune 500 experience
Nice to haves:
• Tableau dashboard understanding