Sort by
Refine Your Search
-
Country
-
Program
-
Employer
- Forschungszentrum Jülich
- NEW YORK UNIVERSITY ABU DHABI
- Northeastern University
- Oak Ridge National Laboratory
- University of California
- Lawrence Berkeley National Laboratory
- New York University
- The University of North Carolina at Chapel Hill
- University of Innsbruck, Institute of Computer Science
- University of Utah
- University of Washington
- Brookhaven Lab
- Brookhaven National Laboratory
- CNRS
- Ecole Centrale de Lyon
- FCiências.ID
- INSTITUTO DE ASTROFISICA DE CANARIAS (IAC) RESEARCH DIVISION
- Japan Agency for Marine-Earth Science and Technology
- Luleå University of Technology
- Monash University
- National Renewable Energy Laboratory NREL
- The University of Chicago
- UNIVERSITY OF VIENNA
- University of A Coruña
- University of California, Merced
- University of Dayton
- Washington University in St. Louis
- 17 more »
- « less
-
Field
-
the Computer Science program at New York University Abu Dhabi, seeks to recruit a research assistant to work on the intersection of compilers and deep learning. Many companies, such as Google, Facebook, and Amazon
-
. Demonstrated experience developing and running computational tools for high-performance computing environment, including distributed parallelism for GPUs. Demonstrated experience in common scientific programming
-
programming; Experience programming distributed systems; Experience with parallel and distributed File Systems (e.g., Lustre, GPFS, Ceph) development. Advanced experience with high-performance computing and/or
-
The University of North Carolina at Chapel Hill | Chapel Hill, North Carolina | United States | about 1 month ago
and Experience: Distributed parallel training and parameter-efficient tuning. Familiarity with multi-modal foundation models, HITL techniques, and prompt engineering. Experience with LLM fine-tuning
-
) for a given Tiramisu program, many code optimizations should be applied. Optimizations include vectorization (using hardware vector instructions), parallelization (running loop iterations in parallel
-
programming; Experience programming distributed systems; Experience with parallel and distributed File Systems (e.g., Lustre, GPFS, Ceph) development. Advanced experience with high-performance computing and/or
-
, AMD uProf, or Omniperf. Debugging experience with distributed-memory parallel applications. Experience with containers (Docker, Podman, Shifter or similar) and modern software practices such as Git
-
willingness to learn: High-performance computing (distributed systems, profiling, performance optimization), Training large AI models (PyTorch/JAX/TensorFlow, parallelization, mixed precision), Data analysis
-
Deadline 7 Nov 2025 - 23:59 (UTC) Type of Contract Temporary Job Status Full-time Hours Per Week 35 Offer Starting Date 1 Jan 2026 Is the job funded through the EU Research Framework Programme? Not funded by
-
for an accurate simulation of time-dependent flows, enabling sensitive applications such as aeroacoustics. Furthermore, the high scalability on massively parallel computers can lead to advantageous turn-around