Job Description

Our client is a leading software solutions organisation, who specialise in empowering businesses with innovative technology solutions by providing SaaS solutions tailored to business needs.


They are currently hiring for an AI Engineer to own the end-to-end lifecycle of AI features, from data ingestion and RAG setup to fine-tuning, evaluation, deployment, and continuous improvement, so they can ship reliable, cost-effective AI products.


Location: On-site (Dubai, UAE)

Type: Full-time


What Youll Do

  • Design and implement RAG pipelines (chunking, embeddings, vector stores, retrieval strategies) using tools like Ollama, LangChain, LlamaIndex, or equivalent.
  • Stand up local and cloud LLM orchestration (prompt routing, tool use, function calling, guards) with strong observability.
  • Run fine-tuning / LoRA / adapters; build data re-entry loops to capture outputs and feedback for secondary retraining.
  • Create robust prompt engineering patterns (templates, guards, evals, versioning) and latency/cost controls.
  • Build evaluation suites (RAGAS, custom golden sets, offline + online A/B tests) and quality dashboards.
  • Productionize models with MLOps best practices (CI/CD, model registries, feature stores, experiment tracking).
  • Ensure privacy, safety, and compliance (PII handling, red-teaming, prompt injection defenses, content filters).
  • Collaborate with PM/Design/Eng to scope features and deliver increments quickly.


Must-Have Qualifications

  • 3+ years shipping ML/AI systems, including at least one production RAG deployment.
  • Hands-on with LangChain/Ollama (or similar), vector DBs (Pinecone, Weaviate, Milvus, pgvector), and embedding models.
  • Experience with fine-tuning (HF Transformers, PEFT/LoRA) and dataset curation/cleaning.
  • Strong Python skills; solid grasp of APIs, microservices, and async patterns.
  • Familiar with LLM evals, metrics (precision@k, faithfulness, groundedness), and cost/perf tuning.
  • Cloud/containerization: Docker, any of AWS/GCP/Azure, basic GPU/accelerator know-how.


Nice to Have

  • Llama 3/4, Mistral, OpenAI/Anthropic APIs; Guardrails/Gandalf; Weights & Biases or MLflow.
  • Feature stores, Kafka, Airflow; security hardening and secret management.
  • Basic front-end to prototype admin/eval tools (React/Next.js).


Success Metrics

  • RAG answer quality (e.g., >85% groundedness on eval set) and unit cost over time.
  • P50 latency within target; model incidence rate (hallucinations, jailbreaks) MoM.
  • Time-to-ship for new datasets/features.



Job Details

Role Level: Associate Work Type: Full-Time
Country: United Arab Emirates City: Dubai
Company Website: http://www.snk.ae Job Function: Information Technology (IT)
Company Industry/
Sector:
Software Development

What We Offer


About the Company

Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.

Report

Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@talentmate.com.


Recent Jobs
View More Jobs
Talentmate Instagram Talentmate Facebook Talentmate YouTube Talentmate LinkedIn