Skip navigation EPAM

Lead Data Software Engineer (Spark/Scala/Databricks) Malaga, Spain

  • hot

Lead Data Software Engineer (Spark/Scala/Databricks) Description

Job #: 87040
EPAM is the foremost global digital transformation services provider with over 59,000 EPAMers in more than 50 countries. Since 1993, our multidisciplinary teams have been helping make the future real for our clients and communities around the world. In 2018, we opened an office in Spain that quickly grew to over 1,000 EPAMers distributed between the Málaga office and remotely across the country. Here you will collaborate with multinational teams, contribute to numerous innovative projects, and have an opportunity to learn and grow continuously.


Are you an open-minded professional fluent with Scala programming language? If it sounds like you, this could be the perfect opportunity to join EPAM as a Lead Data Engineer. We are looking for a team player with excellent communication and organizational skills, mastery in engineering and B2+/C1 level of English to communicate fluently with English-speaking stakeholders, share ideas and provide reasoning.

As part of this project, our Data teams are working on migration of Data Products pipelines from Oracle workloads to Databricks. The team is composed of 3 engineers, 2 analysts and a Product Manager working in a highly agile environment, following XP practices and best CI/CD practices.

What You’ll Do

  • Develop, monitor, and operate the most used and most critical curated data pipeline at client’s - Sales Order Data (incl. post-order information, e.g. shipment, return, payment). This pipeline is processing hundreds of millions of records to provide high-quality datasets for analytical and machine learning use-cases
  • Consulting with analysts, data scientists, and product managers to build and continuously improve "Single Source of Truth" KPI for business steering such as the central Profit Contribution measurement (PC II)
  • Redevelop old legacy pipelines to new, advanced, and standard versions that are easy to maintain and scalable for future demands
  • Leverage and improve a cloud-based tech stack that includes AWS, Databricks, Kubernetes, Spark, Airflow, Python, and Scala

What You Have

  • Expertise in Apache Spark along with Spark streaming
  • Good hands-on experience with Databricks and delta-lake
  • Fluency in Scala programming language
  • Expertise in SQL
  • Good understanding & hands-on experience with CI/CD
  • Rich working experience with GitHub
  • Fluency working with AWS landscape
  • Ability to build Apache Airflow pipelines

Nice to have

  • Presto
  • Superset
  • Starburst
  • Oracle & Exasol

We offer

  • WORK & LIFE BALANCE. Enjoy more of your personal time with flexible & remote work options, 24 working days of annual leave and paid time off for numerous public holidays
  • CONTINUOUS LEARNING CULTURE. Develop your hard & soft skills with internal training and mentorship opportunities, sponsored professional certification, and access to 18,000+ LinkedIn courses
  • CLEAR & DIFFERENT CAREER PATHS. Grow in engineering or managerial direction to become a People Manager, in-depth technical specialist, Solution Architect, or Project/Delivery Manager
  • GLOBAL RELOCATION OPPORTUNITIES. EPAM has presence in more than 50 countries globally. Explore opportunities to relocate to a new country, and EPAM will provide relocation support for you and your family
  • COMPETITIVE BENEFITS. Benefit from a competitive salary, private health insurance, employee stock purchase plan, special discount programs, plus, internal wellbeing programs to take your career to the next level
  • STRONG PROFESSIONAL COMMUNITY. Join a global EPAM community of highly skilled experts and connect with them to solve challenges, exchange ideas, share expertise and make friends