Company Overview
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world's leading technology providers to accelerate the delivery of tomorrow's electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.
Job Description/Preferred Qualifications
HPC Hardware & Rack Engineering
Design, assemble, and integrate rack-level HPC systems, including compute, storage, networking, power distribution, and cooling.
Perform hands-on server bring-up, including component installation, cable routing, labeling, and rack documentation.
Evaluate and select HPC components (CPUs, GPUs, memory, NICs, storage, PCIe cards) based on performance, reliability, and cost.
Partner with vendors and internal teams on BOM definition, hardware qualification, and lifecycle management (NPI, sustaining, and EOL transitions).
System Bring-Up & Management
Configure and manage systems using IPMI/BMC tools for remote access, monitoring, firmware updates, and diagnostics.
Configure, validate, and document BIOS, firmware, and BMC settings.
Debug hardware issues across power, thermals, PCIe, memory, storage, and interconnects.
HPC Software & OS Integration
Install, configure, and maintain Linux OS environments (e.g., SUSE, Rocky) on HPC systems.
Tune OS settings for HPC workloads (CPU pinning, memory, I/O, networking).
Collaborate with software teams to ensure hardware platforms meet application and algorithm requirements.
Networking, Storage & Interconnects
Work with high-speed networking technologies (Ethernet, InfiniBand, RoCE).
Integrate and validate high-bandwidth NICs, switches, and cabling at rack scale.
Support local and shared storage solutions (NVMe, DAS, RAID, and NFS/SMB where applicable).
Reliability, Validation & Sustaining
Participate in system validation, stress testing, and reliability characterization.
Perform root-cause analysis for field and lab issues involving hardware or HW/SW interactions.
Develop and maintain system documentation, rack layouts, and bring-up procedures.
Required Qualifications
Bachelor's or master's degree in Electrical Engineering, Computer Engineering, Computer Science, or equivalent practical experience.
5+ years of experience with HPC hardware systems or large-scale compute platforms.
Strong hands-on experience with:
Server assembly and rack integration
CPUs, GPUs, memory, PCIe, NICs, and storage devices
IPMI/BMC and out-of-band management
Linux system administration in HPC or server environments
Solid understanding of system-level characteristics (performance, thermals, power, reliability).
Ability to debug issues spanning hardware, firmware, OS, and system configuration.
Preferred Qualifications
Experience with rack-scale system design in production or lab environments.
Familiarity with HPC networking (100G/200G/400G Ethernet, InfiniBand).
Experience with GPU-accelerated HPC systems.
Exposure to HPC software stacks, workload characteristics, and performance tuning.
Experience in semiconductor, advanced manufacturing, or inspection systems is a plus.
Key Skills
HPC hardware architecture
Rack-level system integration
IPMI/BMC management
Linux system administration
Hardware bring-up and debugging
Networking and storage integration
Cross-functional collaboration (hardware, software, algorithms, manufacturing)
Minimum Qualifications
Doctorate (Academic) Degree and 0 years related work experience; Master's Level Degree and related work experience of 3 years; Bachelor's Level Degree and related work experience of 5 years
Base Pay Range: $105,900.00 - $180,000.00 Annually
Primary Location: USA-MI-Ann Arbor-KLA
KLA's total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.
Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring...
KLA-Tencor is an Equal Opportunity Employer. Applicants will be considered for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, or any other characteristics protected by applicable law.