The role is responsible for the design, integration, and management of high performance computing (HPC) systems that encompass both hardware and software components into the organization’s network infrastructure. This individual will be responsible for all activities related to handling and supporting the Business and platforms including system administration, as well as incorporating new technologies under the challenge of a sophisticated and constantly evolving technology landscape. This role involves ensuring that all parts of a system work together seamlessly to meet the organization’s requirements.
Roles & Responsibilities:
Implement, and manage cloud-based infrastructure that supports HPC environments that support data science (e.g. AI/ML workflows, Image Analysis).
Collaborate with data scientists and ML engineers to deploy scalable machine learning models into production.
Ensure the security, scalability, and reliability of HPC systems in the cloud.
Optimize cloud resources for cost-effective and efficient use.
Keep abreast of the latest in cloud services and industry standard processes.
Provide technical leadership and guidance in cloud and HPC systems management.
Develop and maintain CI/CD pipelines for deploying resources to multi-cloud environments.
Monitor and fix cluster operations/applications and cloud environments.
Document system design and operational procedures.
Basic Qualifications and Experience:
Masters or Bachelor’s degree with 8 - 12 years of experience in Computer Science, IT or related field with hands-on HPC administration OR
Functional Skills:
Must-Have Skills:
Demonstrable experience in cloud computing (preferably AWS) and cloud architecture.
Experience with containerization technologies (Singularity, Docker) and cloud-based HPC solutions.
Experience with infrastructure-as-code (IaC) tools such as Terraform, CloudFormation, Packer, Ansible and Git.
Expert with scripting (Python or Bash) and Linux/Unix system administration (preferably Red Hat or Ubuntu).
Proficiency with job scheduling and resource management tools (SLURM, PBS, LSF, etc.).
Knowledge of storage architectures and distributed file systems (Lustre, GPFS, Ceph).
Understanding of networking architecture and security best practices.
Good-to-Have Skills:
Experience supporting research in healthcare life sciences.
Experience with Kubernetes (EKS) and service mesh architectures.
Knowledge of AWS Lambda and event-driven architectures.
Exposure to multi-cloud environments (Azure, GCP).
Familiarity with machine learning frameworks (TensorFlow, PyTorch) and data pipelines.
Certifications in cloud architecture (AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect, etc.).
Experience in an Agile development environment.
Prior work with distributed computing and big data technologies (Hadoop, Spark).
Professional Certifications (please mention if the certification is preferred or mandatory for the role):
Red Hat Certified Engineer (RHCE) or Linux Professional Institute Certification (LPIC)
AWS Certified Solutions Architect – Associate or Professional
Soft Skills:
Strong analytical and problem-solving skills.
Ability to work effectively with global, virtual teams
Effective communication and collaboration with cross-functional teams.
Ability to work in a fast-paced, cloud-first environment.
Shift Information:
This position requires you to work a Standard Shift with Flexibility to Work Early or Late for Overlap
EQUAL OPPORTUNITY STATEMENT
Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.
We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.
Biotechnology Research and Pharmaceutical Manufacturing
What We Offer
About the Company
Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.
Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together.
Applicants
are
advised to research the bonafides of the prospective employer independently. We do NOT
endorse any
requests for money payments and strictly advice against sharing personal or bank related
information. We
also recommend you visit Security Advice for more information. If you suspect any fraud
or
malpractice,
email us at abuse@talentmate.com.
You have successfully saved for this job. Please check
saved
jobs
list
Applied
You have successfully applied for this job. Please check
applied
jobs list
Do you want to share the
link?
Please click any of the below options to share the job
details.
Report this job
Success
Successfully updated
Success
Successfully updated
Thank you
Reported Successfully.
Copied
This job link has been copied to clipboard!
Apply Job
Upload your Profile Picture
Accepted Formats: jpg, png
Upto 2MB in size
Your application for Senior High Performance Computing Engineer
has been successfully submitted!
To increase your chances of getting shortlisted, we recommend completing your profile.
Employers prioritize candidates with full profiles, and a completed profile could set you apart in the
selection process.
Why complete your profile?
Higher Visibility: Complete profiles are more likely to be viewed by employers.
Better Match: Showcase your skills and experience to improve your fit.
Stand Out: Highlight your full potential to make a stronger impression.
Complete your profile now to give your application the best chance!