Sort by
Refine Your Search
-
Listed
-
Category
-
Program
-
Employer
- Oak Ridge National Laboratory
- University of Texas at Austin
- Harvard University
- Princeton University
- The University of Chicago
- University of California
- University of California Davis
- University of Vermont
- California Institute of Technology
- Carnegie Mellon University
- Cold Spring Harbor Laboratory
- Georgia State University
- Johns Hopkins University
- Nature Careers
- Northeastern University
- Temple University
- University of Colorado
- University of Maryland, Baltimore
- University of North Carolina at Chapel Hill
- University of North Texas at Dallas
- University of Texas at Dallas
- Virginia Tech
- 12 more »
- « less
-
Field
-
preferred). Essential Functions of Position: Manage and maintain multiple GPU clusters and networked storage systems. Monitor system performance, troubleshoot hardware issues, and coordinate repairs
-
to manage support tickets and prioritize, considering varied scope, scale, and technical requirements. Ability to define, deliver, and optimize HPC or scientific support services. Know multiple programming
-
that address real-world challenges and deliver positive business outcomes. The Institute for Insight is equipped with a computer cluster that includes multiple GPUs, designed for big data analytics for both
-
environment. This includes a HPC cluster with NVIDIA H100 nodes, parallel storage and fast interconnects, dedicated workstations with multiple A6000 GPUs, and web-based services via Microsoft Azure
-
. Experience with multiple deployment mechanisms like Diskless, Warewulf, and traditional deployment (cobbler, PXEboot, and/or Bright). Experience managing systems utilizing GPU (NVIDIA and AMD) clusters for AI
-
and GPUs) or cloud. Proficiency with high performance computing and system architecture. Advanced skills and experience associated multiple of the following: artificial intelligence; method and machine
-
, e.g. HPC cluster (CPUs and GPUs) or cloud. Proficiency with high performance computing and system architecture. Advanced skills and experience associated multiple of the following: artificial
-
, and tooling support across multiple clustered infrastructures, we facilitate Lab-wide R&D projects. Our HPC clusters range in scope from just a handful of nodes to over fifty-thousand cores. We partner
-
for all system configurations, processes, and procedures. Prioritize and manage multiple tasks and projects efficiently. Keep all required documentation updated Other duties as assigned to complete
-
well as an integral member of the AI Lab and the research community it supports. You will work with a diverse group of faculty, postdocs, and students from multiple disciplines. If you are passionate about advancing AI