Performance Library Engineer

Efficient is developing the world’s most energy-efficient general-purpose computer processor. Efficient’s patented technology uses 100x less energy than state of the art commercially available ultra-low-power processors and is programmable using standard high-level programming languages and AI/ML frameworks. This level of efficiency makes perpetual, pervasive intelligence possible: run AI/ML continuously on a AA battery for 5-10 years. Our platform’s unprecedented level of efficiency enables IoT devices to intelligently capture and curate first-party data to drive the next major computing revolution

About the Role

We are seeking Performance Library Engineers to join our growing team. Efficient’s Performance Library Engineers create and optimize libraries and frameworks that are used by customers of the world’s most energy-efficient programmable processor (the “Fabric”). In this role, you will take ownership of complex, performance-critical, library development while working closely with Efficient’s compiler and embedded teams providing direct feedback on compiler enhancements and performance related tools. In other words, you will push the performance limit of compute on our platform.

This position is a unique opportunity to work on cutting-edge hardware/software co-design, while making an immediate impact on building the next generation of performance libraries and applications used by our customers.

Required Qualifications & Experience

  • Hands-on software development experience working closely with hardware, including exposure to at least two RISC, DSP or GPU platforms.
  • A passion for analyzing and maximizing software performance on an unique dataflow hardware architecture.
  • Experience with framework and library design, particularly within resource constrained and realtime environments.
  • Experience with CUDA, HIP and/or other parallel programming models.
  • A collaborative spirit, with the ability to work with and influence multiple engineering teams.
  • Demonstrated ability to write, debug, and maintain low-level, C/C++, systems-level code as well as design clean interfaces and modular code.
  • Actively uses AI tools to generate, optimize, and debug code.
  • Background in performance profiling, benchmark design, or comparative hardware analysis.
  • Excellent written, verbal, analytical and technical communication skills, with the ability to clearly document complex systems, and lead discussions across teams.
  • Minimum Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field. 5+ years of experience.

Desired Qualifications & Experience

  • Some exposure to low-level programming interfaces, e.g. PTX, LLVM IR and/or MLIR.
  • Some domain expertise in two or more of the following areas: Linear Algebra, ML, Image Processing, Signal Processing, Audio Processing, SDR, realtime programming, or Robotics.

We offer a competitive salary for this role, generally ranging from $130,000 to $230,000, along with meaningful equity and comprehensive benefits. The final compensation package will be based on your experience and location, with some flexibility to ensure we align with the right candidate.

Why Join Efficient?

Efficient offers a competitive compensation and benefits package, including 401K match, company-paid benefits, equity program, paid parental leave, and flexibility. We are committed to personal and professional development and strive to grow together as people and as a company.