Company OverviewKLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world's leading technology providers to accelerate the delivery of tomorrow's electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.
Job Description/Preferred Qualifications
This role provides senior technical leadership for the architecture, deployment, and longterm scalability of largescale HPC storage and compute platforms! It owns systems endtoend-from early architectural definition through full production-partnering across engineering, manufacturing, and strategic vendors to deliver highly available, highperformance infrastructure at scale.
The scope emphasizes deep technical ownership, architectural decisionmaking, and solving sophisticated infrastructure challenges in live production environments! This work directly develops critically important HPC platforms built for adaptability, scale, and operational excellence, driving realworld impact across core products and technologies.Job Duties, but not limited to:
Lead the design, implementation, and ongoing support of highperformance compute (HPC) clusters, taking accountability for system performance, reliability, and scalability
Serve as a technical authority for HPC storage, with deep handson expertise in parallel file systems such as Lustre, GPFS, and BeeGFS
Apply sophisticated systems knowledge across CPU and GPU architectures, highbandwidth interconnects, and robust storage subsystems to deliver balanced, highperformance solutions
Lead the creation of hardware BOMs for HPC clusters, working directly with vendors and coordinating hardware release activities
Design, configure, and optimize Linux operating systems for HPC environments.
Translate project specifications and performance requirements into subsystem and systemlevel designs, driving execution while meeting technical and schedule commitments
Support the design, release, and transition of new systems to manufacturing and customers, providing highquality golden images, procedures, scripts, and documentation
Lead EOL part requalification activities to ensure longterm system viability and supportabilityQualifications, but not limited to:
Proven experience with HPC systems and Linux platform.
Strong, distroagnostic Linux experience (Rocky, RHEL, SuSE, Ubuntu)
Strong scripting skills in Shell and Python
Strong understanding of HPC hardware platforms (servers, GPUs, networking, storage, BIOS/BMC)
Advanced Linux systems knowledge (PXE/netboot, systemd, HA concepts)
Solid networking fundamentals (TCP/IP, DNS, DHCP, LDAP, HTTP)
Experience with configuration management and automation (Salt, Ansible, Puppet, Chef, etc.)
Interest in HPC storagePreferred Qualifications:
Strong DevOps and automation mentality (CI/CD pipelines, Git, infrastructure as code)
Experience with containers for HPC (Singularity, Docker)
Monitoring and observability experience (Prometheus, Grafana)
Familiarity with Apache/Nginx and supporting infrastructure services
Minimum Qualifications
Requires minimum of 8 years of related experience with a Bachelor's degree; or 6 years and a Master's degree; or a PhD with 3 years experience; or equivalent experience.
Base Pay Range: $129,600.00 - $220,300.00 Annually
Primary Location: USA-MI-Ann Arbor-KLA
KLA's total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.
Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum pay wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements where applicable. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring process.
KLA is proud to be an Equal Opportunity Employer. We will ensure that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us... For full info follow application link.
KLA-Tencor is an Equal Opportunity Employer. Applicants will be considered for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, or any other characteristics protected by applicable law.