Network Engineer

jobsnear.org

Overview

We are looking for a Network Engineer seeking to apply their technical skills in a fast-paced and complex HPC environment. Working knowledge of server and network hardware and software and the desire to participate in projects at a large-scale data center is central to this role. This position will work with remote engineering teams to resolve and diagnose network issues at scale. Adaptability and flexibility within the environment will be key to the candidate’s success. Penguin supports HPC environments where network performance is critical.

Responsibilities

  • Monitor and perform on-going maintenance and upgrades to network equipment.
  • Provide support to staff, as well as respond to server and network issues.
  • Manage and maintain both InfiniBand and Ethernet networks.
  • Run hardware diagnostics and replace failing parts in a timely manner.
  • Monitor all network processes to ensure the smooth flow of data across the network.
  • Collaborate with software and network engineering teams on cybersecurity and network efficiency.
  • Support a Linux-based, high-performance computing (HPC) and artificial intelligence (AI) environment, featuring a wide range of technologies.
  • Maintain meticulous documentation for internal and external build instructions – such as configuration examples, guidance on technical details, and best practices.
  • Develop automation and other tools to improve operations.
  • Supervise on-site staff in updating cards and other components in the environment.

Qualifications

  • 5+ years of hands-on experience with enterprise scale networks.
  • In-depth knowledge of data center environments, servers, and network equipment.
  • Exceptional ability to work as part of a remote team, provide IT support, and resolve errors.
  • Ability to keep up with advancements in data center infrastructure and technologies.
  • Experience administering both InfiniBand (Mellanox/NVIDIA) and Ethernet (Cumulus) networks.
  • Proficiency in documenting network processes and diagrams.
  • Thorough understanding of L2 and L3 network protocols.
  • Exceptional written and verbal communication skills.
  • Willingness to respond to network and server errors after hours.
  • Participate in an on-call rotation to provide critical support for AI and HPC operations.
  • US Citizenship is required for this role.

Preferred Qualifications

  • Extensive experience in installing, monitoring, and maintaining data center networks.
  • Hands-on experience configuring and supporting large scale Ethernet and InfiniBand networks.
  • Demonstrated practice with low-latency/high-bandwidth networking performance optimization.
  • Knowledge of communication libraries such as NCCL, UCX and MPI.
  • Familiarity with UFM or OpenSM network management software.

Location

This is a remote position in the United States.

Travel

Minimal travel may be required.

Compensation & Benefits

The base pay range that the Company reasonably expects to pay for this position in the United States is $97,000 – $114,000; the pay ultimately offered may vary based on business considerations, including job-related knowledge, skills, experience, and education. The position is bonus-eligible, and there are medical, dental, and vision benefits available. There is a 401k saving plan and other benefits, such as Paid Time Off, Life Insurance, and an Employee Assistance Plan.

Inclusion & Belonging Statement

We are committed to creating an inclusive environment that embraces differences and fosters belonging for all.

Equal Opportunity Statement

We are an Affirmative Action/Equal Opportunity Employer and strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.

Read Full Description

Apply
To help us track our recruitment effort, please indicate in your cover//motivation letter where (jobsnear.org) you saw this job posting.

Share

Junior Consultant

Job title: Junior Consultant Company Roland Berger Job description Company DescriptionRoland Berger, founded in 1967,…

29 minutes ago

Assistant Manager – SOX and Risk Business Partner

Job title: Assistant Manager - SOX and Risk Business Partner Company Lloyds Banking Group Job…

41 minutes ago

Project Administrator/Receptionist – CE Scheme – Athleague/Castlecoote Community Dev. Co. Ltd

Job title: Project Administrator/Receptionist - CE Scheme - Athleague/Castlecoote Community Dev. Co. Ltd Company Athleague/Castlecoote…

58 minutes ago

Senior Director, Operations IT – Business Partner, Sourcing

Job title: Senior Director, Operations IT - Business Partner, Sourcing Company AstraZeneca Job description SCRUM…

2 hours ago

Clinical and Translational Medicine Lead

Job title: Clinical and Translational Medicine Lead Company Sanofi Job description Clinical and Translational Medicine…

2 hours ago

Laboratory Systems Lead

Job title: Laboratory Systems Lead Company PSC Biotech Job description About PSC BiotechWho are we?PSC…

2 hours ago
For Apply Button. Please use Non-Amp Version