AwesomeAIJobs
Back to all jobs

AI Training Specialist — RLHF

Outlier AI

$15 – $40 / hourlyRemote — WorldwideFreelancePosted Feb 12, 2026
Apply on Outlier AI

Job Description

Join Outlier AI as an AI Training Specialist focused on Reinforcement Learning from Human Feedback (RLHF). You'll evaluate and compare AI-generated responses, provide detailed feedback to improve model outputs, and help train next-generation language models. Ideal for writers, educators, and domain experts who want to shape the future of AI. Responsibilities: - Compare pairs of AI-generated responses and select the better one - Write detailed explanations for your rankings - Provide original creative writing samples for model training - Identify factual errors, logical inconsistencies, and tone issues Requirements: - Strong English writing skills - Attention to detail and analytical thinking - Available for 15+ hours per week - No prior AI experience needed

More AI Jobs