Join Outlier AI as an AI Training Specialist focused on Reinforcement Learning from Human Feedback (RLHF). You'll evaluate and compare AI-generated responses, provide detailed feedback to improve model outputs, and help train next-generation language models. Ideal for writers, educators, and domain experts who want to shape the future of AI.
Responsibilities:
- Compare pairs of AI-generated responses and select the better one
- Write detailed explanations for your rankings
- Provide original creative writing samples for model training
- Identify factual errors, logical inconsistencies, and tone issues
Requirements:
- Strong English writing skills
- Attention to detail and analytical thinking
- Available for 15+ hours per week
- No prior AI experience needed