Software Engineer - Search

Engineering, London Office

Samaya AI is building knowledge discovery tools for domain experts using AI and Large Language Models. Our platform enables users to answer complex questions and uncover hidden insights from large sets of knowledge-intensive documents.

Founded by Maithra Raghu (formerly at Google Brain) and Fabio Petroni (formerly at Meta AI), Samaya AI has incredible firepower in terms of researchers, engineers, and developers with a breadth of AI experience, including Stanford, Meta AI, Google, Microsoft, and more. 

We are seeking an experienced Software Engineer to develop scalable backend systems and infrastructure, optimizing data access and system performance. Your role involves designing and maintaining frameworks for diverse data handling, enhancing AI-native data processing and search pipelines, and integrating advanced indexing and search optimization techniques.

Your Opportunity

  • Design and build a distributed system capable of handling fast data ingestion and near-instant retrieval, ensuring data security and integrity.
  • Integrate search infrastructure that meets the needs of large language models and retrieval systems.
  • Monitor, debug, and optimize applications in production to ensure peak performance and reliability.
  • Drive the engineering process improvements, collaborating closely with team members to deliver impactful product features.

Your first 3 months

  • Gain a comprehensive understanding of our infrastructure, architecture, performance concerns, and software development lifecycle.
  • Enhance our data ingestion framework, demonstrating improvements in data collection efficiency and system scalability.
  • Contribute to improving the scalability and performance of our data infrastructure, setting new benchmarks for data quality and accessibility.
  • Actively engage in learning and mentoring within the team, becoming a go-to technical advisor.

What you’ll bring

  • 5+ years of experience working on infrastructure for distributed systems or Search applications.
  • Expertise in building and optimizing search infrastructure, with strong programming skills in Python.
  • Experience with cloud infrastructure (AWS, Google Cloud, Azure) and familiarity with various data storage solutions (SQL, NoSQL, etc.).
  • Proven experience in managing complex search pipelines and search infrastructure, including familiarity with indexing techniques such as BM25, understanding of search optimization metrics, and expertise in embeddings and vector databases.
  • A commitment to implementing secure data practices, with a proactive stance on adopting the latest technologies and methodologies.
  • A track record of scaling data pipelines to manage large volumes of data seamlessly.
  • An independent, problem-solving approach, paired with a passion for data and a proactive attitude towards tackling challenges.
  • BA/BS in computer science, or related degree

What we provide

  • Flexible and hybrid work environment (typically spend a minimum of 2 days in the office per week)
  • Comprehensive health insurance coverage, including medical, dental, vision, and short-term disability
  • Opportunity to rest and recharge with unlimited PTO
  • Support your long-term financial well-being with 401K (US) and Pension (UK)
  • Create your work setup with your office equipment allowance
  • Travel to other locations is encouraged to foster team cohesion and maintain Samaya’s unique culture

If you're motivated to make a difference and possess the drive to succeed but are not sure you meet all the criteria, why not give us a try? 
We are committed to ensuring an equitable selection process for everyone and welcome applicants from varied backgrounds to enrich our team. If you require accommodations or adjustments during our recruitment process, please inform us.

*Samaya AI is an Equal Opportunity Employer. We do not discriminate on the basis of race, color, religion, sex (including pregnancy and gender identity), national origin, political affiliation, sexual orientation, marital status, disability, genetic information, age, membership in an employee organization, retaliation, parental status, military service, or other non-merit factor. 

We pride ourselves on working collaboratively in an inclusive, truth-seeking environment, dedicated to changing the way professionals access and use knowledge through generative AI. Join us in shaping the future of knowledge and AI technology! 

Apply now