About The Job
Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Model Evaluator
Type: Contract
Compensation: $50–$75/hour
Commitment: 20 hours/week
Role Responsibilities
• Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance.
• Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
• Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
• Score and rank multiple model responses against structured rubrics that cover several evaluation dimensions.
• Provide written justifications with specific evidence for each evaluation.
Qualifications
Must-Have
• Master’s degree or higher in Finance, Accounting, or a relevant professional field.
• Professional experience applying domain expertise in a practitioner or advisory capacity.
• Familiarity with industry-specific standards, regulations, or clinical guidelines.
• Strong written communication and critical reasoning skills.
Application Process (Takes 20–30 mins to complete)
• Submit your resume to begin.
• Complete the Model Response Evaluation assessment.
Resources & Support
• For details about the interview process and the platform, see: https://talent.docs.mercor.com/welcome/welcome
• For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.