Company OverviewKLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world's leading technology providers to accelerate the delivery of tomorrow's electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.
Group/DivisionWith over 40 years of semiconductor process control experience, chipmakers around the globe rely on KLA to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Enabling the movement towards advanced chip design, KLA's Global Products Group (GPG), which is responsible for creating all of KLA's metrology and inspection products, is looking for the best and the brightest research scientist, software engineers, application development engineers, and senior product technology process engineers. Central Engineering is KLA's largest engineering organization comprised of 9 Centers-of-Excellence (CoE) in various disciplines applied across all product groups in the company. These CoE include Handling & Automation, Precision Motion Control, Sensors & Image Acquisition, Platform Design, and Packaging Engineering, among others. Talent includes over 500 engineers across global centers in Israel, China, India, and the US. Each CoE contributes not just talent and deliverables per discipline toward product programs, but also subject matter expertise, best practices, roadmaps, specialized facilities, apparatus, models, and analytics. These differentiate KLA not only in WHAT we do, but also in HOW we do it.
Job Description/Preferred Qualifications
Crafting, deploying, and supporting an HPC cluster from infancy to enterprise is exciting because it involves crafting robust, scalable systems that push the boundaries of computational power! This process offers the satisfaction of overcoming sophisticated challenges and seeing your work enable groundbreaking research and innovation.Responsibilities for this exciting role will include:
Design, implementation & support of high-performance compute clusters
Solid understanding on HPC systems, including CPU/GPU architecture, scalable/robust storage, high-bandwidth inter-connects, and a knowledge of cloud based computing architectures
Apply their attention to detail to generate HW BOMs for HPC Clusters, provide vendor management and coordinate HW release activities.
Use their strong skills with the Linux OS to configure appropriate operating systems for the HPC system
Understand and assemble the project specifications and performance requirements at the subsystem and system levels. Adhere and strive to project timelines to ensure program achievements complete on time.
Support design and release of new products to manufacturing and ultimately the customer, providing quality golden images, procedures, scripts and documentation to the manufacturing team and customer support team.
Lead EOL Parts Re-Qualification for long term system deployments
Support in-house as well as in-field critical issuesRequired Qualifications:
Validated in-depth and flavor agnostic knowledge of Linux systems (SuSE, RedHat, Rocky, Ubuntu)
Experience of crafting and maintaining robust storage
Strong HPC HW knowledge especially in the Server, GPU, Networking, Storage, Scheduler, BIOS & BMC arenas.
Experience in System-D, Net boot/PXE, Linux HA.
Strong understanding of TCP/IP fundamentals and knowledge of protocols, DNS, DHCP, HTTP, LDAP, SMTP.
Strong with Storage File Shares: NFS/CIFS
Ability to code and develop Shell and Python scripts.
Experience with one or more of the listed Configuration Mgmt utilities. (Ansible, Salt, Chef, Puppet etc).Preferred Qualifications:
Possess a strong DevOps focus: Knowledge of setting up a continuous development pipelines, Repository software (Git-based).
Hypervisor Knowledge: VMWare, Proxmox, or XCP-ng
Knowledge of Apache/Nginx, Setting up proxy/reverse proxy, application server routing, load balancing (HA Proxy)
HPC Schedulers: SGE/SLURM
Monitoring tools: Prometheus, Grafana, Nagios
Database Technologies: MySQL
BS or MS degree 5+ years validated experience
Computer Engineering or Electrical Engineer related fieldsSkills and Abilities:
Team Orientation & Interpersonal - Highly motivated teammate with ability to develop and maintain collaborative relationships with all levels within and external to the organization.
Organization & Time Management - Able to plan, schedule, prioritize, and follow up on tasks related to the job to achieve goals within or ahead of established time frames.
Multi-task - Ability to expeditiously organize, coordinate, manage, prioritize, and perform multiple tasks simultaneously to swiftly assess a situation, determine a logical course of action, and apply the appropriate response.
Adaptability to Change - Able to be flexible and encouraging, and able to assimilate change positively and proactively in... For full info follow application link.
KLA-Tencor is an Equal Opportunity Employer. Applicants will be considered for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, or any other characteristics protected by applicable law.