Top Tech Jobs & Startup Jobs in Los Angeles, CA

12 Hours AgoSaved
In-Office or Remote
United States
Mid level
Mid level
Artificial Intelligence • Information Technology • Software
Design and optimize high-performance kernels and custom operators for attention, MoE, GEMM, quantization and collective communication across NVIDIA, AMD and AWS Trainium. Improve LLM inference runtimes, develop distributed training/inference solutions at scale, use compilers and SDKs (Neuron, Torch Dynamo, PyTorch/XLA), contribute to open-source, and publish technical findings.
Top Skills: Amd GpusAws NeuronAws TrainiumC++CudaFlashattentionLmcacheMoeNeuron CompilerNeuron ProfilerNeuron SdkNsightNvidia GpusPagedattentionPythonPytorch/XlaRlhfRocm ProfilerRocm/HipSglangTensorrt-LlmTorch DynamoTritonVllm
Reposted 7 Days AgoSaved
In-Office or Remote
United States
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
The GPU Cloud Platform Engineer designs and operates multi-cluster GPU infrastructures for AI workloads, ensuring performance and efficiency across cloud environments.
Top Skills: AWSAzureCudaDockerGCPGoGrafanaKubernetesPrometheusPython
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account