SDK Engineer
Company details
Company: Nexa AI
Job type: Remote
Country: United States
City: Los Angeles
Region: California
Experience: 2 years or more
Description of the offer
NEXA AI
Nexa AI is an on-device AI research and deployment company. We specialize in tiny, multimodal models (e.g. Octopus v2, OmniVLM, OmniAudio), local on-device inference frameworks (e.g. nexa-sdk), and model optimization techniques (e.g. NexaQuant). Our work has been recognized by industry leaders such as Google, Hugging Face, and AMD. We partner with enterprises and SMBs to bring local intelligence to every device.
Responsibilities:
- Specialize in Google Cloud / AWS tech stacks
- Familiarity with LLM technologies, particularly with the Transformers library
- Experience with model compression is a plus
- Knowledge of model deployment on edge devices is a plus
- Contribute to the development of our SDKs across multiple platforms, including Android, iOS, and Linux
You may be a good fit if you have:
- 2+ years of experience
- A BS or MS in Computer Science, at minimum
- Excellent CS fundamentals (data structures, algorithms, coding)
- Knowledge of OS internals, compilers, and low-power/mobile optimization
- Experience with low-level code in C and frameworks like CUDA and OpenCL
- Proficiency in multithreading and performance optimization
Logistics:
- Full Time: Cupertino, California
