AI Benchmarking Lead Performance Benchmarking Evaluation
Talentmate
India
7th April 2026
2604-1832-7255
Job Description
Description
Join our mission-critical team supporting Seller Assistant, Amazons Gen-AI powered copilot that helps sellers navigate Amazons complex ecosystem and grow their businesses. As a Quality Assurance Specialist, youll play a pivotal role in ensuring the reliability and accuracy of AI model evaluations as we scale from 61% to 90%+ active seller coverage worldwide.
About Seller Assistant
Seller Assistant is a conversational AI copilot that understands the full context of a sellers business. It intelligently orchestrates backend tools to deliver actionable, drilled-down responses and can independently complete complex tasks on behalf of sellers with their permission.
Our Scale And Impact
Expanded to 2.44MM sellers (45x growth vs. Dec 2024)
Currently serving 61% of active sellers worldwide across 9 international stores (CN2XX, IN, UK, DE, JP, BR, MX, AE, SA)
Supporting four languages: English, Chinese, German, and Japanese
2026 Goal: Scale to 90%+ active sellers WW with 5 new store launches (France, Italy, Spain, Canada, Australia)
As a Quality Assurance Specialist/AI Benchmarking Lead, you will benchmark Seller Assistant AI models for relevancy, correctness, and completeness. Your primary responsibilities include: 1) Evaluate audits performed by the core auditing team to increase confidence in evaluation metrics, 2) Improve audit reliability and consistency through systematic measurement of auditor accuracy,3) Conduct targeted calibration to ensure quality standards across the auditing function, 4) Enforce quality standards by quality-checking audits and providing actionable feedback to team members, 5) Drive continuous improvement in audit processes and methodologies.
You conduct quality checks on audits performed by the core auditing team.
You identify rubric gaps and evaluation ambiguities that lead to inconsistent audit outcomes.
You surface high-confidence product issues earlier by validating and categorizing model failures.
You serve as point of contact for annotation tasks across ML data process areas, ensuring quality execution and delivery
You understand dependencies across ML data workflows and articulate customer impact effectively
You modify existing annotation methods and update SOPs.
You document SOP changes, secure approval, share knowledge with the team, and audit adoption and execution
You test new SOPs and tools, providing feedback on quality and improvement recommendations to support onboarding
Key job responsibilities
You structure data collection, analyse results and share inputs for SOP changes.
You collate, track, and report progress on key metrics agreed to with respective stakeholders (e.g., Program managers, Applied Scientist) specific to your functional area.
You identify operational issues related to process and tooling and recommend suggestions to improve key project metrics such as productivity and quality.
Basic Qualifications
Bachelors degree or equivalent
Bachelors degree or equivalent in a related field
Experience in natural language data labeling, data annotation, linguistic annotation or other forms of data markup
Technical Skills: Proficiency in MS Excel; basic understanding of SQL and Python
Experience with Microsoft Office products and applications
Communication Skills: Strong verbal and written communication skills in English
Knowledge about SOA and process that deal with sellers.
Preferred Qualifications
1 to 3 years of equivalent experience
Performed annotation related tasks across ML data process areas.
Strong knowledge of process documentation, analysis knowledge
Technical proficiency in SQL querying and Python programming for data analysis
Strong analytical and problem-solving skills
Ability to work independently and as part of a team
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.
Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together.
Applicants
are
advised to research the bonafides of the prospective employer independently. We do NOT
endorse any
requests for money payments and strictly advice against sharing personal or bank related
information. We
also recommend you visit Security Advice for more information. If you suspect any fraud
or
malpractice,
email us at abuse@talentmate.com.
You have successfully saved for this job. Please check
saved
jobs
list
Applied
You have successfully applied for this job. Please check
applied
jobs list
Do you want to share the
link?
Please click any of the below options to share the job
details.
Report this job
Success
Successfully updated
Success
Successfully updated
Thank you
Reported Successfully.
Copied
This job link has been copied to clipboard!
Apply Job
Upload your Profile Picture
Accepted Formats: jpg, png
Upto 2MB in size
Your application for AI Benchmarking Lead Performance Benchmarking Evaluation
has been successfully submitted!
To increase your chances of getting shortlisted, we recommend completing your profile.
Employers prioritize candidates with full profiles, and a completed profile could set you apart in the
selection process.
Why complete your profile?
Higher Visibility: Complete profiles are more likely to be viewed by employers.
Better Match: Showcase your skills and experience to improve your fit.
Stand Out: Highlight your full potential to make a stronger impression.
Complete your profile now to give your application the best chance!