Job Description:
• Maintain and enhance Spotify’s business-critical reporting pipelines, ensuring they remain reliable
• Build large-scale batch and real-time data pipelines to efficiently collect, process, and store reporting-related data
• Make enhancements to data models and ETL code to improve pipeline efficiency and data quality
• Design, develop, and maintain high-performance backend services and APIs to power Spotify’s advertising platform
• Work closely with stakeholders to understand and translate business requirements into technical solutions
• Write scalable, testable, maintainable, and high-quality code
• Collaborate with engineers, product managers, and other stakeholders, embracing learning and leadership opportunities daily
• Monitor daily execution, diagnose issues, and ensure SLAs are met by maintaining and fixing business-critical pipelines
Requirements:
• Experience in data engineering with high-volume, heterogeneous data, ideally within distributed systems or cloud platforms such as Hadoop, Bigtable, Cassandra, GCP, AWS, or Azure
• Familiarity with data processing frameworks like Scio, GCP Dataflow, AWS EMR, and Apache Beam
• Familiarity with API design, microservice architecture, and frameworks like Spring Boot or gRPC
• Programming skills in Java or Scala, plus experience with analytics and data extraction tools (SQL, Python, Pandas, R)
• Commitment to agile software processes, data-driven development, reliability, and responsible experimentation
• Hands-on experience with orchestration tools like Airflow or similar
• Experience with database systems, both relational (e.g., PostgreSQL, MySQL) and non-relational (e.g., Cassandra, DynamoDB)
• Excellent problem-solving and communication skills
Benefits:
• Health insurance
• Six months of paid parental leave
• 401(k) retirement plan
• Monthly meal allowance
• 23 paid days off
• 13 paid flexible holidays
Apply Now