Mercor is seeking native Italian speakers from Switzerland or Italy with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Italian / English prompt–golden answer pairs that train and evaluate advanced language models.
Job Details
• Multilingual Prompt Design & Optimization: Create detailed prompts in Italian and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Italian-speaking users in Switzerland and Italy contexts.
• Define and Document Evaluation Standards: Establish high-level expectations for correct responses in Switzerland and Italy consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.
• Model Testing and Grading (Bilingual): Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed.
• Benchmarking & Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor—maintaining consistency and reliability across Italian-language benchmarks before integration into official evaluations.
Minimum Qualifications
• Native-level fluency in Italian (written), specific to Switzerland or Italy usage, with strong reading/writing ability in English.
• Must be native to Switzerland or Italy and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
• BS or BA from a reputable institution (completed or in progress).
• Strong writing and critical thinking skills.
• Ability to work independently and meet deadlines.
• Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
• Based in Switzerland or Italy (or able to reliably produce Switzerland- or Italy-specific, culturally accurate Italian).
Preferred Qualifications
• Experience in teaching, research, editing, or academic writing.
• Experience creating evaluation criteria, rubrics, or grading guidelines.
• Familiarity with LLMs, prompting, or model evaluation (helpful but not required).
Application & Onboarding Process
• Complete an AI-led interview (about 15 minutes).
• If approved, complete a paid assessment focused on writing and rubric creation.
• Then, if selected, you will be invited to work on the project.
More Details About This Role
• Expect to contribute at least 20 hours per week.
• Expect a commitment of approximately 2–4 months.
• You’ll be working in a structured project environment with clear goals and tools.
• We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Apply tot his job
Apply To this Job