Network Operations Engineer (InfiniBand) – Austin, TX –

- Advertisement -professional ultrasonic cavitation machineprofessional ultrasonic cavitation machine

Job title: Network Operations Engineer (InfiniBand)


Job description: WHO WE ARE: EOS IT Management Solutions Inc is a leading Global IT and Video Collaboration company. We specialize in innovative IT and video conferencing solutions, which empower businesses and organizations throughout the world. We have an immediate opening for a highly motivated, collaborative and committed individual to fill our Network Operations Engineer position with InfiniBand experience.THE POSITION: The successful candidate will be enthusiastic and passionate about IT and Networking. The role specifically involves working as a Network Operations Engineer as a first/second level of contact, supporting InfiniBand fabrics for high performance compute clusters for a R group.Hours: 5 days a week Mon-Fri 9:00am to 6:00pm. On-call will be required as needed.WHAT YOU’LL DO: Maintain and support InfiniBand fabrics infrastructure that will include routers, switches, servers, network operating system software, network management software and other related hardware Will provide day-to-day operations of the Linux HPC clusters and network support in the areas of I/O connectivity, IP over InfiniBand Proactively monitor, analyze and correct system issues. Develop scripts to automate repetitive tasks or tools to enhance support of HPC systems System performance analysis and tuning Build, install and support user requested software that includes upgrades, patch fixes Support HPC technology evaluations and assessments Communicate with end users via tasking tools, group chats and phone Multi-tasking various user issue’s effectively and efficiently, while documenting trouble shooting and triage steps Escalating tasks to vendor while documenting summary of problem and troubleshooting steps that have been taken. Perform queue management for user tasks, alarms, alert tasks, incidents and trouble shoot/and or triage as necessary Document runbooks and procedures to assist in trouble shooting and completion of tasks Must have excellent customer service skills and the ability to deal with end users / management during times of pressure WHAT YOU NEED TO SUCCEED: The ideal candidate should have minimum 3+ years’ experience with InfiniBand Should have strong background in maintaining (and building) InfiniBand fabrics for high performance compute clusters Experience with HDR (200Gbps) and SHARPv2 (Scalable Hierarchical Aggregation and Reduction Protocol). Knowledge of key I/O technologies such as Smart NIC’s, 200GigE, RNIC’s, Infiniband, Fibre Channel, SAS. Experience with routing engines like OpenSM is mandatory. Understanding of the internals of a Router/Switch hardware, NPU/data planes and Optics Understanding of the design principles and troubleshooting of distributed systems Solid understanding of high-performance computing, IB fabrics and operational best practices Demonstrated knowledge of different routing algorithms including UPDN, LASH, DOR etc. Working experience with Mellanox vendor to troubleshoot issues Demonstrated ability to analyze complex situations and utilize troubleshooting skills, systems and tools, and creative problem-solving abilities under pressure Proficiency in Linux operating system, scripting experience a plus. Ability to work with system configuration management tools (e.g., Puppet, Ansible) and revision control software such as Git Experience with scripting and programming languages such as Bash Shell, Python, etc. Ability to work in fast-paced and dynamic environments with limited supervision Strong attention to detail with excellent time management and organization skills Team player, excellent written and verbal communication Self-motivated, strong analytical thinker who enjoys problem solving Capable of working/using own initiative with minimal supervision. EDUCATION: Associates or Bachelor’s degree in computer science or related discipline Certification such as InfiniBand Professional a plus. Experience with High Performance Computing or Linux #indeedhp

Job Information

  • Job ID: ba5122c0-15100297354
  • Location:

Austin, Texas, United States

Jobs You May Like


Network Security Engineer


Austin, TX, United States


Senior Principal Engineer, Lab Support (Lab…


Austin, TX, United States

Senior Cybersecurity Engineer- Vulnerability…

Home Depot

Austin, TX, United States

Senior Cybersecurity Engineer – DLP – Data…

Home Depot

Austin, TX, United States

No content

No content

Austin Jobs! Career Opportunities in the Greater Austin, Texas area.

Copyright 2021. All Rights Reserved

{Error Message Title}

Insert additional messaging here.

We use cookies on this site to enhance your experience. By using our website you accept our use of cookies. Yes, I agree More Information


YourMembership uses cookies for your convenience and security. Cookies are text files stored on the browser of your computer and are used to make your experience on web sites more personal and less cumbersome. You may choose to decline cookies if your browser permits, but doing so may affect your ability to access or use certain features of this site. Please refer to your web browser’s help function for assistance on how to change your preferences.

Expected salary:

Location: Austin, TX

Job date: Thu, 25 Feb 2021 04:45:35 GMT

Apply for the job now!

- Advertisement -microcurrent machines for sale

Latest news

- Advertisement -oxygen facial machine

Related Videos

- Advertisement -facials near mefacials near me