You’ll work with project managers, quantitative scientists, clinical data analysts, andresearch oncologists on CSS (Customer Solutions and Support Team) to curate the delivery of datasets that meet contractuallyspecified requirements in support of our life science partners’ research. Your role as anengineer is to design, build, and manage our data pipelines using an in-house ETL toolcoded primarily in SQL and Python. In this role, you’ll leverage internal webapps,systems, and services, with support from the teams who own these tools. You’ll workclosely with other engineers who lead similar data deliveries, while being accountable tocross-functional CSS teams for technical support, guidance, and on-time delivery.
● Create ETL pipelines in code and link them up to internal tooling● Monitor CI/CD processes to ensure ETL pipelines continue to run smoothly● Coordinate with clinical data analysts to queue patient-based data extraction● Collaborate with clinical data analysts to ensure the data meets quality standards● Handle ad-hoc requests from CSS folks about the patient-level data● Package and deliver the outputs of ETL pipelines to AWS S3 for client access● Context switch between multiple ETL pipelines and CSS stakeholders● Make recommendations on code and process efficiency improvementsWith your help, CSS will deliver dozens of datasets every quarter to our life sciencepartners who use our data to:● Make decisions about which drugs/diseases to target● Submit evidence as part of FDA filings for approvals● Publish research papers (on which you may appear as co-author)
Who You AreYou're a kind, passionate and collaborative problem-solver who is laser-focused on ontime delivery and calm under pressure. In addition, you’re an empathetic communicatorwith 4+ years of experience managing data pipelines or working on data intensivesoftware applications.
● You have experience with Python 3.9+● You have experience with SQL (any variety / Postgres preferred)● You collaborate effectively with other engineers through the use of Git and mergerequests (or similar version control tools)● You have experience with at least one ETL tool● You are comfortable introspecting data to debug ETL pipelines● You communicate effectively with non-technical stakeholders● You enter into discussions with curiosity and an urge to Do the Right ThingExtra credit:● You are familiar with data engineering best practices● You have a proven record of identifying inefficiencies in code or processes andproposing or working with others toward solutions