{"jobs":[{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4115953009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4078802009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4115953009,"updated_at":"2026-03-25T17:43:16-04:00","requisition_id":"1","title":"Compiler Code Gen Engineer","company_name":"Lemurian Labs","first_published":"2026-02-12T21:46:51-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;h3\u0026gt;\u0026amp;nbsp;\u0026lt;/h3\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re looking for a Compiler Code Generation Engineer to design and build the core code generation capabilities inside our high-performance, portable AI compiler. This is a foundational engineering role where you\u0026#39;ll work directly on the subsystems that translate high-level ML computations into optimized machine code across heterogeneous hardware targets.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;If you love digging into instruction selection, register allocation, and low-level target-specific optimization and you want that work to matter at the scale of the next era of AI infrastructure, this role is for you.\u0026lt;/p\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design, develop, maintain, and improve our heterogeneous AI compiler.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Design and implement new code generation capabilities based on our novel compiler architecture.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Propose improvements and extensions to our compiler architecture in response to advances in ML model design and hardware.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Apply the latest techniques in parallelization and partitioning to automate kernel generation and exploit highly optimized execution paths.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Use performance data to identify optimization opportunities and drive measurable improvements.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate with our product team to understand the evolving needs of ML engineers and translate those needs into architectural improvements.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;BS degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience working with compilers.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep knowledge of compiler algorithms and data structures.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with and genuine interest in low-level code generation, object file manipulation, and target-specific optimizations.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience with C/C++.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong written and verbal communication skills; ability to write clear and concise technical documentation.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Master\u0026#39;s or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Knowledge of traditional compiler techniques: instruction selection, register allocation, dominance analysis, def-use chains.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with calling conventions, APIs, linking, and relocations.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Working knowledge of LLVM.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with loop optimizations, vectorization, unrolling, fusion, and parallelization.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with machine learning workloads and their hardware demands.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Work on compiler infrastructure that runs AI at scale across every major hardware platform.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Tackle deep, unsolved technical problems alongside a team that prizes craft and rigor.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Be part of building sustainable AI infrastructure that genuinely moves the needle.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;/div\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023102009,"name":"Back End (Proteus)","child_ids":[],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4118628009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4080545009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4118628009,"updated_at":"2026-03-30T12:35:52-04:00","requisition_id":"7","title":"Compiler Optimization Engineer","company_name":"Lemurian Labs","first_published":"2026-02-12T21:46:56-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re looking for a Graph Optimization Compiler Engineer to own the middle tier of our AI compiler stack — the layer where high-level model graphs are transformed, simplified, and made ready for efficient code generation. You\u0026#39;ll design and implement the optimization passes that make the difference between a model that runs and a model that flies.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;This role sits between our compiler front end and code generation backend. You\u0026#39;ll work on graph-level transformations — fusion, layout optimization, dead code elimination, constant folding, and more — with a direct line of sight to the performance outcomes your work produces. If you think in data flow graphs and optimization passes, and you want that thinking to power the next generation of AI infrastructure, we\u0026#39;d love to talk.\u0026lt;/p\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design, develop, and maintain the graph optimization layer of our heterogeneous AI compiler\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Implement and extend graph-level transformation passes including operator fusion, layout propagation, dead code elimination, constant folding, and algebraic simplification\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Define and evolve our intermediate representation (IR) to support new optimization opportunities as ML model architectures advance\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Analyze performance data to identify optimization gaps and drive measurable improvements in throughput and latency\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate with front end and code generation teams to ensure clean IR interfaces and well-structured optimization pipelines\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Propose and prototype new optimization strategies in response to advances in model design and hardware capabilities\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Contribute to testing and validation infrastructure to ensure optimization correctness across model types and hardware targets\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;BS degree in Computer Science, Computer Engineering, or equivalent practical experience\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience working with compilers, with a focus on intermediate representation design or optimization passes\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep knowledge of graph-level compiler optimization techniques — fusion, tiling, layout transformations, and related methods\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience with C/C++\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong written and verbal communication skills; ability to write clear and concise technical documentation\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Master\u0026#39;s or PhD in Computer Science, Computer Engineering, or equivalent\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with polyhedral models or affine analysis for loop and tensor optimization\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with hardware memory hierarchies and how layout decisions impact performance on GPUs or accelerators\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience working with MLIR, XLA, or similar graph-level IR frameworks\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with ML framework internals — PyTorch eager/compile mode, JAX/XLA, or TensorRT\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong understanding of ML model architectures and their computational patterns (attention, convolution, normalization, etc.)\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Knowledge of quantization, sparsity, or other model-level optimization techniques\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Contributions to open-source compiler or ML infrastructure projects\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Own a critical layer of our compiler stack where optimization decisions have direct, measurable impact on model performance\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work on the hardest graph-level problems in AI infrastructure — across diverse hardware targets and model architectures\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate with a team that treats infrastructure as a canvas and optimization as a craft\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;br\u0026gt;\u0026lt;br\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;/div\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023104009,"name":"Graph Optimizer (Nemo)","child_ids":[],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4118625009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4080543009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4118625009,"updated_at":"2026-03-25T17:43:50-04:00","requisition_id":"6","title":"Front End Compiler","company_name":"Lemurian Labs","first_published":"2026-02-12T21:46:54-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;We\u0026#39;re looking for a Front End Compiler Engineer to own the ingestion layer of our high-performance, portable AI compiler. This is where the journey begins — you\u0026#39;ll build the systems that parse, validate, and lower representations from frameworks like PyTorch, StableHLO, ONNX, and MLIR dialects into our internal compiler IR, setting the stage for everything that follows.\u0026lt;/span\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;If you\u0026#39;re drawn to the elegance of well-designed language frontends, care deeply about correctness and coverage, and want your work to directly enable next-generation AI models to run anywhere — this is the role for you.\u0026lt;/span\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Design, develop, and maintain the front end of our heterogeneous AI compiler, including parsing, validation, and IR lowering stages\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Build and extend ingestion pipelines for ML frameworks and representations including PyTorch, StableHLO, ONNX, and MLIR-based dialects\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Define and evolve the interface between external model representations and our internal compiler IR\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Ensure correctness and completeness of operator coverage across supported frameworks and hardware targets\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Collaborate with graph optimization and code generation teams to ensure clean, well-structured IR that enables downstream transformations\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Use performance and correctness data to identify gaps in coverage and drive improvements\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Contribute to documentation and tooling that helps ML engineers understand and debug the ingestion process\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;BS degree in Computer Science or equivalent practical experience\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;2+ years of experienceWorking with ML optimization tools/libraries\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Experience working with or ingesting from ML frameworks such as PyTorch, TensorFlow/JAX, or ONNX\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;2+ years of experience with C/C++ and 2+ years working with Python\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Strong written and verbal communication skills; ability to write clear and concise technical documentation\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Knowledge of quantization, sparsity, or other model-level optimization techniques\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Master\u0026#39;s or PhD in Computer Science or equivalent\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Experience with StableHLO, XLA, or other ML-specific IRs\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Knowledge of operator semantics across ML frameworks and the challenges of cross-framework compatibility\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Familiarity with Python bindings and tooling for compiler front ends\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Experience with testing infrastructure for compiler correctness — fuzzing, differential testing, or model-level validation\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Contributions to open-source compiler or ML framework projects\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Familiarity with MLIR, including defining and working with custom dialects\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Build the entry point for an AI compiler that targets every major hardware platform\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Work at the boundary of ML frameworks and compiler infrastructure — a rare and high-leverage intersection\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Tackle deep correctness and coverage problems that directly impact what models can run and where\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;li style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;\u0026lt;span style=\u0026quot;font-family: helvetica, arial, sans-serif;\u0026quot;\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits\u0026lt;/span\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023103009,"name":"Front End (Iron)","child_ids":[],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4130990009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4086287009,"location":{"name":"Santa Clara, CA"},"metadata":null,"id":4130990009,"updated_at":"2026-03-25T17:26:18-04:00","requisition_id":"10","title":"Product Growth Lead","company_name":"Lemurian Labs","first_published":"2026-02-12T21:46:57-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re looking for a Product Growth Lead who lives at the intersection of deep tech, community strategy, and product adoption. You\u0026#39;ll define how the market experiences our product — serving as the primary bridge between our AI stack and the external developer and partner ecosystem, translating technical value into developer adoption at scale.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;This is a rare hybrid role that demands both technical fluency and go-to-market instincts. You\u0026#39;ll architect the developer journey, own the feedback loop between the market and our product backlog, and build the relationships that matter in the open-source and cloud ecosystem.\u0026lt;/p\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Drive Developer Adoption: \u0026lt;/strong\u0026gt;Lead the zero-to-one experience for our AI stack. Architect the developer journey from discovery to deployment, ensuring our SDKs and libraries are not just powerful, but accessible.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Own the Feedback Loop: \u0026lt;/strong\u0026gt;Act as Customer Zero. Bring insights from hackathons, partner integrations, neoclouds, and open-source communities directly into the product backlog.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Technical Storytelling: \u0026lt;/strong\u0026gt;Partner closely with Communications Team to translate complexity into compelling narratives — whitepapers, blog posts, and documentation that resonate with ML engineers and external audiences, where applicable\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Ecosystem Presence: \u0026lt;/strong\u0026gt;Build high-trust relationships with developers and maintainers of key open-source and cloud-native projects (e.g., vLLM, LangChain, Triton).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Growth Engineering: \u0026lt;/strong\u0026gt;Define and track key product adoption metrics. Work with technical teams to build the telemetry that informs growth strategy.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;8+ years of experience blending Technical Product Management with Developer Relations, Solutions Engineering, or Growth roles in the AI/HPC space.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Technical fluency: you can read and write Python, and understand lower-level concepts (GPU memory hierarchy, kernel fusion, latency vs. throughput) well enough to debate trade-offs with architects.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;A portfolio of technical writing (blogs, documentation, whitepapers) or public speaking that simplifies complex systems without dumbing them down.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Proven track record of launching developer-facing products (APIs, SDKs, CLI tools) and measuring success through adoption metrics.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;BS/MS in Computer Science, Engineering, or equivalent practical experience.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Ways to Stand Out\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Open source credibility: you have managed or contributed to high-growth repositories (1k+ stars) or active Discord communities in the GenAI space.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Builder DNA: you have personally built and deployed RAG pipelines or custom model serving endpoints in the last year.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Network: you bring existing relationships within the PyTorch, Hugging Face, Neocloud, or CNCF communities.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Personal Attributes\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Community-native: \u0026lt;/strong\u0026gt;you understand how developer trust is built and have done it before.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Self-directed: \u0026lt;/strong\u0026gt;you take ownership and don\u0026#39;t wait for permission to move fast.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Technically credible: \u0026lt;/strong\u0026gt;engineers take you seriously because you\u0026#39;ve done the work.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Storyteller: \u0026lt;/strong\u0026gt;you can make infrastructure feel exciting — and back it up with data.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;/div\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023108009,"name":"Product","child_ids":[],"parent_id":4023106009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4116502009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4079213009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4116502009,"updated_at":"2026-03-25T17:44:53-04:00","requisition_id":"3","title":"Runtime Engineer","company_name":"Lemurian Labs","first_published":"2026-02-12T21:46:59-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re looking for a Runtime Engineer to design and build the multi-target runtime that sits at the heart of our AI compiler stack. This is a systems-level role where you\u0026#39;ll take the output of our optimizing compiler and make it execute — efficiently, correctly, and at scale — across a diverse landscape of hardware targets.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;You\u0026#39;ll work on low-level parallelization, kernel scheduling, and performance analysis, and collaborate closely with our compiler and product teams to push the boundaries of what\u0026#39;s possible on modern AI hardware.\u0026lt;/p\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design, develop, maintain, and improve our multi-target runtime.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Apply the latest techniques in parallelization and partitioning to automate kernel generation and exploit highly optimized execution paths.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Rapidly prototype and data-drive exploration of new runtime ideas.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Benchmark and analyze the outputs produced by our optimizing compiler on target hardware.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Build tools to collect and analyze performance bottlenecks.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in runtime architecture.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;BS degree in Computer Science, Computer Engineering, or equivalent practical experience.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience working with compilers or runtime systems.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep understanding of asynchronous and concurrent programming.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;4+ years of experience with C/C++ (C++14 or newer).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Understanding of hardware architecture: vector vs. scalar registers and instructions, memory hierarchies.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Knowledge of operating system kernel development or hypervisor development.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Master\u0026#39;s or PhD in Computer Science, Computer Engineering, or equivalent.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience developing or maintaining GPU compute libraries such as CUDA or ROCm.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with GPU programming and optimization.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Background in high-performance computing (HPC).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Knowledge of deep learning frameworks such as PyTorch, JAX, or Triton.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience programming large compute clusters.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Build the runtime that makes next-generation AI infrastructure actually go fast.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work across the full stack — from hardware intrinsics to compiler output to distributed execution.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Join a team that approaches infrastructure as a canvas, not a constraint.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023105009,"name":"Runtime (Hydra)","child_ids":[],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4118706009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4080619009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4118706009,"updated_at":"2026-03-25T17:45:17-04:00","requisition_id":"8","title":"Senior Developer Tools Engineer","company_name":"Lemurian Labs","first_published":"2026-02-12T21:47:00-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;div\u0026gt;\u0026amp;nbsp;\u0026lt;/div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;div\u0026gt;As the founding member of our Developer Experience (DevX) team, you will be instrumental in shaping how engineers interact with our compiler infrastructure. You\u0026#39;ll build the tools that give developers deep visibility into system performance—from profiling and debugging capabilities to hardware introspection interfaces. Your work will bridge the gap between our core compiler technology and the engineers who use it, transforming complex system data into actionable insights.\u0026lt;/div\u0026gt;\n\u0026lt;div\u0026gt;\u0026lt;br\u0026gt;This role sits at the intersection of systems programming and developer tooling. You\u0026#39;ll work closely with our compiler engineers to surface server-side telemetry through intuitive client-side interfaces, ultimately creating a best-in-class development experience for our users.\u0026lt;strong\u0026gt;\u0026lt;br\u0026gt;\u0026lt;/strong\u0026gt;\u0026lt;/div\u0026gt;\n\u0026lt;div\u0026gt;\u0026amp;nbsp;\u0026lt;/div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design and build developer tools for profiling, debugging, and performance introspection across our compiler stack.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Create client-side tooling that transforms server-side compiler telemetry into clear, actionable information for engineers.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Develop interfaces that expose hardware performance metrics, and interrupt data in meaningful ways.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Build GPU debugging capabilities and visualization tools to help engineers understand execution on heterogeneous hardware.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Define formats and protocols for debug information exchange, working with standard debugger formats (DWARF, JTAG) and object file formats (ELF, COFF).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate with internal engineering teams to understand their needs and iterate on tooling, with a path toward external customer-facing tools.\u0026lt;strong\u0026gt;\u0026lt;br\u0026gt;\u0026lt;/strong\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;/div\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience:\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;3+ years of professional experience in systems-level software development.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong proficiency in C++ with experience writing performance-critical code.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Working knowledge of assembly language and low-level debugging techniques.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with debugger formats (DWARF, JTAG) and object file formats (ELF, COFF).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Understanding of profiling methodologies and performance analysis tools.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Ability to work on-site at our Toronto or Santa Clara office.\u0026lt;strong\u0026gt;\u0026lt;br\u0026gt;\u0026lt;/strong\u0026gt;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience:\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Experience with GPU programming and debugging (CUDA, ROCm, or similar).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with OS-level interfaces including I/O subsystems and interrupt handling.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Background in compiler development or toolchain infrastructure.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience building developer-facing tools or IDEs.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Contributions to open-source debugging or profiling tools.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Work at the intersection of compilers, hardware, and developer experience — a genuinely rare combination.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Build tools that developers will rely on every day, with real ownership from day one.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;/div\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023100009,"name":"AI Infrastructure","child_ids":[4028882009,4028883009,4028884009,4028885009,4028886009,4028887009,4028888009],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4200595009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4117655009,"location":{"name":"Santa Clara, California, United States, Toronto, Ontario, Canada, United States - Remote"},"metadata":null,"id":4200595009,"updated_at":"2026-04-14T15:20:29-04:00","requisition_id":"12","title":"Senior DSL Engineer","company_name":"Lemurian Labs","first_published":"2026-04-14T15:20:29-04:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We are building a domain-specific language and compiler toolchain for programming machine learning models.\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;As a Senior DSL Compiler Engineer, you will focus on the compiler frontend: scanning, parsing, AST design and construction, compiler passes, type and shape inference, and error and warning reporting. You should be deeply comfortable reasoning about object ownership and lifecycle management in C++, and be prepared to work within a custom ARC system with semantics similar to the standard smart pointer types.\u0026lt;/p\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design and implement compiler frontend components including the lexer, parser, abstract syntax tree, and compiler passes.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Design and implement type inference and shape inference systems for the DSL.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Design clear, actionable error and warning diagnostics that help users understand and resolve problems in their programs.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work within and extend a proprietary automatic reference counting system that governs memory management across the frontend.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Participate in code reviews to maintain code quality and ensure sound design decisions.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate through pair programming sessions.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Contribute to the full software engineering lifecycle: product specification, requirements gathering, high-level design, low-level design, implementation, and testing.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Help inform the design of future DSLs as the platform expands to other scientific computing domains.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Requirements\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;BS degree in Computer Science, Computer Engineering, or equivalent practical experience\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Extensive experience designing and implementing domain-specific languages.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep expertise in compiler frontend engineering: lexical analysis, parsing, AST design, semantic analysis, and compiler passes.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong experience with type inference and shape inference systems.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong professional C++ background with modern C++ standards.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep understanding of automatic reference counting concepts and object lifetime management in C++, including thorough familiarity with the semantics of shared_ptr, weak_ptr, and unique_ptr. The frontend uses a proprietary ARC implementation with similar semantics, and you must be comfortable reasoning about ownership, reference cycles, and clean teardown in this kind of system.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience designing compiler diagnostics (errors and warnings) that are clear and useful to end users.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience across the full software engineering lifecycle: product specification, requirements gathering, high-level design, low-level design, implementation, and testing.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;General familiarity with GPUs or other accelerator devices and their role in high-performance computing and machine learning workloads.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Master\u0026#39;s or PhD in Computer Science, Computer Engineering, or equivalent\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with or willingness to use AI-assisted code generation tools in day-to-day development.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with PyTorch or similar machine learning frameworks.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with Python language internals or strategies for subsetting Python-like languages.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Own a critical layer of our compiler stack where optimization decisions have direct, measurable impact on model performance\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work on the hardest graph-level problems in AI infrastructure — across diverse hardware targets and model architectures\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate with a team that treats infrastructure as a canvas and optimization as a craft\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023099009,"name":"Engineering","child_ids":[4023100009,4028396009,4028889009,4023102009,4023103009,4023104009,4023105009,4028882009,4028883009,4028884009,4028885009,4028886009,4028887009,4028888009],"parent_id":null}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/lemurianlabs/jobs/4116510009","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":4079218009,"location":{"name":"Santa Clara, CA - Toronto, Canada"},"metadata":null,"id":4116510009,"updated_at":"2026-03-25T17:45:58-04:00","requisition_id":"4","title":"Senior ML Performance Engineer","company_name":"Lemurian Labs","first_published":"2026-02-12T21:47:02-05:00","language":"en","application_deadline":null,"content":"\u0026lt;div class=\u0026quot;content-intro\u0026quot;\u0026gt;\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;About Us\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;At Lemurian Labs, we\u0026#39;re reimagining the foundations of computing to make AI accessible to everyone. Our mission is to remove the limits of scale, hardware, and cost that hold back innovation, so the people solving humanity\u0026#39;s hardest problems can move faster.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re building a new kind of software stack: a hardware-agnostic platform that makes every system — from a laptop to a supercomputer — feel like one seamless engine. Developers can write once, run anywhere, and get state-of-the-art performance across any chip, any cloud, at any scale. It\u0026#39;s a complete rethink of how software and hardware interact — designed for the era beyond Moore\u0026#39;s Law.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re not looking for the comfortable or the conventional; we\u0026#39;re looking for the bold. The engineers who crave frontier problems, who want to bend the limits of what\u0026#39;s possible, who see infrastructure not as a constraint but as a canvas. If you want to build the foundation for the next era of AI and change what humanity can achieve in the process, join us.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;\u0026lt;div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;About the Role\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;p\u0026gt;We\u0026#39;re looking for a Senior ML Performance Engineer to architect and lead our Performance Testing Platform from the ground up. You\u0026#39;ll be the technical authority on how we measure, validate, and optimize the performance of large language models — including Llama 3.2 70B, DeepSeek, and others — before and after compiler optimization on modern GPU architectures.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;This is a high-impact role at the intersection of ML systems, GPU architecture, and performance engineering. You\u0026#39;ll build the infrastructure that proves our compiler delivers real, measurable value — and you\u0026#39;ll work directly with compiler and ML engineers to drive the optimizations that get us there.\u0026lt;/p\u0026gt;\n\u0026lt;/div\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;What You\u0026#39;ll Do\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;/div\u0026gt;\n\u0026lt;div\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Design and build a comprehensive performance testing platform for evaluating LLM inference workloads across GPU clusters\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Define and implement the benchmarking methodology, metrics, and test suites that measure latency, throughput, memory utilization, power consumption, and model accuracy\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Establish baseline performance for unoptimized models (Llama 3.2 70B, DeepSeek, etc.) and validate post-optimization improvements\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Develop automated testing pipelines for continuous performance validation across compiler releases and model updates\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Investigate performance bottlenecks using profiling tools (ROCm profilers, GPU traces, system-level monitoring) and work with the compiler team to drive optimizations\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Create dashboards and reporting that provide clear visibility into performance trends, regressions, and wins\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate cross-functionally with compiler engineers, ML engineers, and DevOps to ensure performance testing is integrated into our development workflow\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Document best practices for performance testing and optimization of ML workloads on GPU hardware\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Essential Skills and Experience:\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;BS degree in computer science, computer engineering, electrical engineering, or equivalent practical experience\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;7+ years of experience in performance engineering, benchmarking, or systems engineering roles\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep understanding of ML inference workloads, particularly transformer-based models and LLMs\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Hands-on experience with GPU programming and optimization (CUDA, ROCm, or similar)\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong programming skills in Python and C/C++\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Proven track record of building performance testing infrastructure or benchmarking platforms from scratch\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with ML frameworks (PyTorch, TensorFlow, ONNX Runtime, vLLM, TensorRT-LLM, etc.)\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Proficiency with profiling and debugging tools for GPU workloads\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong analytical skills with the ability to design experiments, analyze results, and communicate findings clearly\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with CI/CD systems and test automation frameworks\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h3\u0026gt;\u0026lt;strong\u0026gt;Preferred Skills and Experience:\u0026lt;/strong\u0026gt;\u0026lt;/h3\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Masters or PhD degree in computer science, computer engineering, electrical engineering, or equivalent practical experience.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with AMD GPUs (Mi200/Mi300 series) and ROCm ecosystem\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Knowledge of compiler optimization techniques and their impact on performance\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with distributed inference and multi-GPU workloads\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with ML model quantization, pruning, and other optimization techniques\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Background in high-performance computing or systems-level optimization\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with infrastructure-as-code (Kubernetes, Docker, Terraform)\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Contributions to open-source ML or systems projects\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;strong\u0026gt;Personal Attributes\u0026lt;/strong\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Precision-driven: \u0026lt;/strong\u0026gt;you catch the 2% regression that others miss.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Self-directed: \u0026lt;/strong\u0026gt;you take ownership and don\u0026#39;t wait for permission to solve problems.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Collaborative: \u0026lt;/strong\u0026gt;you work well across teams and actively help others succeed.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;\u0026lt;strong\u0026gt;Clear communicator: \u0026lt;/strong\u0026gt;you can explain complex technical concepts to engineers and stakeholders alike.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;h2\u0026gt;\u0026lt;span style=\u0026quot;text-decoration: underline;\u0026quot;\u0026gt;\u0026lt;strong\u0026gt;Why Join Lemurian Labs\u0026lt;/strong\u0026gt;\u0026lt;/span\u0026gt;\u0026lt;/h2\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Build the performance testing infrastructure that validates the future of efficient AI.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Own a high-visibility platform that directly influences product quality and customer success.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Work with cutting-edge GPU hardware and next-generation LLMs.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;/div\u0026gt;\u0026lt;div class=\u0026quot;content-conclusion\u0026quot;\u0026gt;\u0026lt;p\u0026gt;Lemurian Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of gender identity, race, ethnicity, sexual orientation, disability status, age, or background.\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Compensation depends on experience and geographic location and will be narrowed during the interview process. Additional benefits include equity, company bonus opportunities, medical, dental, and vision coverage, a retirement savings plan, and supplemental wellness benefits.\u0026lt;/p\u0026gt;\u0026lt;/div\u0026gt;","departments":[{"id":4023100009,"name":"AI Infrastructure","child_ids":[4028882009,4028883009,4028884009,4028885009,4028886009,4028887009,4028888009],"parent_id":4023099009}],"offices":[{"id":4022544009,"name":"Santa Clara","location":"Santa Clara, California, United States","child_ids":[],"parent_id":null},{"id":4022545009,"name":"Toronto","location":"Toronto, Ontario, Canada","child_ids":[],"parent_id":null},{"id":4022574009,"name":"United States - Remote","location":"United States - Remote","child_ids":[],"parent_id":null}]}],"meta":{"total":8}}