Project Description:
Our Customer is among the world's leading creators and distributors of award-winning still imagery, video, and multimedia products, as well as other forms of premium digital content, available through its trusted house of brands, including Image.net©, iStock©, and others. The project aims to enhance and further develop an advanced media management platform that helps our Customer serve business customers in more than 100 countries and remain the first place media professionals go to discover, purchase, and manage digital content.

Responsibilities:
- Develop, maintain, and support high-performing ETL/ELT processes, including data quality and testing
- Own the data infrastructure, including provisioning, monitoring, and automation of infrastructure and application deployments
- Design and build data models for the data warehouse based on the enterprise data lake
- Create and maintain infrastructure and application documentation
- Develop dashboards, reports, and visualizations

Mandatory Skills Description:
- Strong SQL and NoSQL skills, plus Python, Java, or Scala.
- Experienced in Spark (at least 2 years).
- Experienced in data orchestration technologies such as Airflow or similar.
- Experienced in AWS services such as EMR, ECS, and S3.

Nice-to-Have Skills Description:
- Experienced in Databricks (experience performing migrations from AWS EMR to Databricks is a plus).
- Experienced in data warehousing tools such as Snowflake, Redshift, and/or Databricks SQL warehouse.
- Experienced in implementing modern architecture solutions such as Lakehouse, event streaming, microservices, and domain-driven design patterns.
- Understanding of Hadoop, Hive, data lakes, and similar big data technology stacks.
- Passionate about DataOps on AWS and Databricks: creating CI/CD pipelines and building automated test and deployment suites in GitLab. Experienced in infrastructure-as-code tools such as Terraform. Sound understanding of containerization technologies such as Docker and Kubernetes.
- Team player who believes in collaboration, documentation, presenting solutions, and representing the team; does not shy away from taking an unpopular approach if it's the correct one, and has the ability to disagree and commit. Has a "get things done" and an "empathy, no ego" mindset.
- Willing to participate in the team's on-call rotation. Has operational experience with tools such as PagerDuty and Slack, and can participate in a Scrum/Kanban-style SDLC.

Languages:
- English: C1 Advanced