AI Inference Engineer Job at Signify Technology, Santa Clara, CA

  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
