Years of Experience: 5+
Max notice: 30 days
Role Description
This is a full-time remote role for a PySpark Developer. As a PySpark Developer, you will develop and maintain efficient, scalable data processing solutions using PySpark. You will collaborate with cross-functional teams to gather requirements, design data pipelines, and optimize data processing workflows. You will also perform data analysis and troubleshooting to ensure data quality and reliability. This role requires strong programming skills, knowledge of distributed computing principles, and experience working with large datasets.
Qualifications
Proficient in PySpark and Python programming
Experience with big data technologies, such as Hadoop and Spark
Strong understanding of distributed computing principles
Experience with data ingestion, data processing, and data transformation
Knowledge of SQL and database systems
Familiarity with cloud platforms and services, such as AWS Glue
Experience with data visualization tools, such as Tableau or Power BI
Excellent problem-solving and debugging skills
Strong communication and collaboration abilities
Bachelor's degree in Computer Science or a related field
Relevant certifications in big data or data engineering are a plus