3 parallel-and-distributed-computing-"UNIS" positions at Oak Ridge National Laboratory
-
for Science @ Scale: Pretraining, instruction tuning, continued pretraining, Mixture-of-Experts; distributed training/inference (FSDP, DeepSpeed, Megatron-LM, tensor/sequence parallelism); scalable evaluation
-
for Science @ Scale: Pretraining, instruction tuning, continued pretraining, Mixture-of-Experts; distributed training/inference (FSDP, DeepSpeed, Megatron-LM, tensor/sequence parallelism); scalable evaluation
-
of relevant experience in Linux systems administration or HPC systems engineering. Preferred Qualifications Demonstrated experience leading the design and deployment of HPC or large-scale distributed computing
Enter an email to receive alerts for parallel-and-distributed-computing-"UNIS" positions