Perform

AI Engineer - RAG Platform

Job Location

Buenos Aires, Brazil

Job Description

We’re hiring an AI Engineer to design and build a production-grade RAG platform that powers our test autoscripting agent. This platform ingests our QA codebase and documentation, transforms them into embeddings, and serves relevant context (page objects, fixtures, helpers, examples) via a retrieval API, enabling high-quality LLM-generated tests. You’ll own everything from ingestion to evaluation, including keeping the index fresh via Jenkins and optimizing for token cost and latency.

This role is ideal for someone who thrives at the intersection of LLM tooling, backend engineering, and developer productivity.

What You’ll Do

- Build and maintain a local RAG platform, including:
  - Loaders for Git, Confluence, and Drive.
  - Code-aware chunking (AST/semantic) and embedding pipelines.
  - Vector indexing in ChromaDB with metadata and reranking.
  - A FastAPI (or similar) retrieval service for the autoscripting agent.
- Implement metadata filters (e.g., layer=page-object|fixture|helper|test, Git SHA, feature tags) and import-based neighbor expansion to optimize context.
- Optimize for cost and performance: tune k values, context lengths, and reranker thresholds, and cache frequent retrievals.
- Build retrieval evaluation and telemetry: track recall, faithfulness, token usage, and compile success of generated code, and wire alerts into Jenkins CI.
- Manage access to Claude 4 Sonnet and other model APIs; help deploy self-hosted endpoints if needed (keys, quotas, audit logs).
- Write runbooks and train the SDET team on how to use and troubleshoot the RAG system.
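
As a rough illustration of the metadata filtering and import-based neighbor expansion listed above, the following is a minimal Python sketch. It is not the platform's actual implementation: the `Chunk` fields, the function names, the `max_extra` cap, and the sample paths are all hypothetical. It uses plain in-memory lists rather than ChromaDB so that it runs on its own.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """One indexed code chunk. All field names are hypothetical."""
    path: str
    layer: str       # e.g. "page-object", "fixture", "helper", "test"
    git_sha: str     # commit the chunk was indexed from
    imports: list[str] = field(default_factory=list)  # paths this chunk imports


def filter_by_metadata(chunks, layers, git_sha):
    """Keep chunks whose layer is allowed and whose SHA matches the target commit."""
    return [c for c in chunks if c.layer in layers and c.git_sha == git_sha]


def expand_neighbors(hits, index_by_path, max_extra=2):
    """Append chunks imported by the hits, capped to limit token cost."""
    seen = {c.path for c in hits}
    extra = []
    for hit in hits:
        for dep in hit.imports:
            if dep in index_by_path and dep not in seen and len(extra) < max_extra:
                seen.add(dep)
                extra.append(index_by_path[dep])
    return hits + extra


# Hypothetical sample index.
chunks = [
    Chunk("tests/LoginTest.java", "test", "abc123",
          ["pages/LoginPage.java", "helpers/Waits.java"]),
    Chunk("pages/LoginPage.java", "page-object", "abc123", ["helpers/Waits.java"]),
    Chunk("helpers/Waits.java", "helper", "abc123"),
    Chunk("pages/OldPage.java", "page-object", "old999"),
]
index_by_path = {c.path: c for c in chunks}

hits = filter_by_metadata(chunks, {"page-object"}, "abc123")
context = expand_neighbors(hits, index_by_path)
print([c.path for c in context])
```

In this sketch the filter narrows retrieval to page objects from one commit, and the expansion step pulls in the helper that the page object imports, so the agent sees the code the hit depends on. The cap on added neighbors is one simple way to keep context length, and therefore token cost, bounded.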

Tech Stack (Initial Plan)

- Embeddings: mxbai-embed-large-v1 (text), bge-code-base (code)
- Reranker: mxbai-rerank-base-v2
- Vector store: ChromaDB (local)
- Pipeline orchestration: LangChain (router by MIME/type)
- Retrieval API: FastAPI
- Evaluation: basic telemetry metrics (compile/run, cost, retrieval quality)

What You Bring

- 4 years in ML/AI or platform-oriented backend engineering, including 2 years of LLM development building RAG applications.
- Strong experience with LangChain, vector DBs (ChromaDB, Qdrant, pgvector), and code-aware embeddings (BGE-code or similar).
- Solid Python skills (FastAPI or Flask) and comfort reading Java to inform chunking and context design.
- Experience with Jenkins, secrets management, and basic observability tooling (Grafana, Prometheus, LangSmith, or RAGAS).
- Comfort working with OpenAI/Anthropic APIs or deploying self-hosted endpoints, including handling keys, rate limits, and safety controls.

It is an asset if you have:

- Experience with Claude-specific practices, structured prompting, and cost control techniques.
- Familiarity with retrieval evaluation tools like RAGAS or LangChain Evaluators, plus A/B testing for prompt or routing strategies.
- Understanding of security and compliance for developer-facing AI tools (PII handling, audit logging).

The SDET team focuses on test quality and final review of autoscripted code. The Automation Agent Engineer tunes prompts and retrieval logic. You own the RAG platform: indexing, retrieval quality, LLM orchestration, and CI integration.

Seniority level: Not Applicable
Employment type: Other
Job function: Engineering and Information Technology

Location: Buenos Aires, Espírito Santo, BR

Posted Date: 9/17/2025

Contact Information

Contact Human Resources
Perform

Posted

September 17, 2025
UID: 5348937954
