Skip navigation EPAM

Network Reliability Engineer Cluj, Romania or Remote

  • hot

Network Reliability Engineer Description

Job #: 92007
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.


We are looking for a highly skilled Network Reliability Engineer to join us on a journey to transform Global Network Operations. We are a global team across US, UK, Bucharest, India and Sri Lanka made up of a diverse range of people from varied backgrounds who each bring unique skillsets and perspectives. The team is responsible for building a suite of observability tools and developing our self-healing capabilities while working closely with other members of the Global Network Services team to ensure our many network services remain highly available, resilient, and secure.

What You’ll Do

  • Maintaining and develop network monitoring, orchestration and automation solutions including inventory reconciliation and remediate, workflow automation, network configuration validation, network health monitoring, alert handling and incident remediation, all with a focus on using automated tools
  • Design and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for networks
  • You will perform audits on Network Infrastructure to ensure best practice and standards
  • You will collaborate with teams to troubleshoot and resolve network issues
  • You will build API-driven services for seamless integration with other services
  • You will automate the mundane and design and implement tools for streamlining internal processes and automate end-to-end workflows within the network infrastructure
  • You will develop automated test frameworks and maintain excellent documentation
  • You will be personally accountable for implementing build and release pipelines for deployment scheduling and management of issues, risk and impediments
  • You will collaborate with stakeholders to prioritise and deliver solutions and ensure project success
  • You will plan and execute releases, for multiple SD-Net products, including non-functional testing
  • You will collaborate with the team to innovate and improve processes
  • You will develop reports to identify and remediate network inventory gaps and ensure network devices comply with standards and best practices
  • You will identify vulnerabilities and ensure a secure environment
  • Participate in on-call service to provide out of hours support when required (via rota)

What You Have

  • Programming skills with hands-on Python experience
  • Hands-on experience with automation and orchestration tools such as Ansible or similar tools
  • Hands-on experience with network monitoring and observability tools (Entuity, HPNA, Datadog & BigPanda)
  • Ability to build API based services
  • Good understanding of Network Domain fundamentals, good knowledge in Network Asset and Configuration management processes
  • Good understanding of the Software Development Life Cycle (SDLC) and experienced in using Agile methodologies and tools such as, JIRA, Ansible and GitLab
  • Good knowledge and experience with Software-Defined Networking (SDN)
  • Analytical skills and problem-solving skills needed to manage multiple factors on a project simultaneously
  • Education: Bachelor’s Degree in Information Technology, Engineering or Computer Science is preferred

We Offer

  • We believe that the greatest strength of the company is its people. EPAM is fully committed to help its employees to reach their full potential and achieve their professional goals through continues learning. With this in mind, we would like to introduce to you few of the many opportunities and services which we believe will help you expand your current knowledge:
  • Full access to cutting-edge tools and technologies
  • Competitive compensation depending on experience and skills
  • All-around Social package: professional & soft skills training, medical & family care programs, sports
  • Relocation opportunities
  • Free English classes
  • Unlimited access to LinkedIn learning solutions
  • Continuous experience exchange with experts and professionals worldwide
  • Friendly team and comfortable working environment
  • Engineering, corporate, and social events within and outside the Company
  • Flexible working schedule
  • Opportunities for self-realization