Note: This is a remote position open to candidates in the USA.
Rex.zone is hiring full-time remote Online Generalists to support AI/ML data operations for large language model training pipelines. The role involves completing online tasks such as data labeling, RLHF preference ranking, prompt evaluation, and QA evaluation across a range of datasets.
Responsibilities
- Perform data labeling for NLP tasks (classification, summarization, named entity recognition) and LLM evaluation workflows
- Create preference data for RLHF by ranking/scoring model outputs and writing clear rationales
- Conduct prompt evaluation and response scoring using web-based evaluation interfaces
- Execute QA evaluation, adjudication, and consistency checks to improve training data quality
- Handle content safety labeling using policy-based decisions and detailed guidelines
- Complete computer vision annotation including image tagging, bounding boxes, and segmentation as needed
- Document edge cases, follow versioned guidelines, and escalate ambiguity appropriately
Skills
- Based in Canada and able to work full-time remotely
- Strong reading comprehension, attention to detail, and consistent decision-making
- Comfortable following detailed annotation guidelines and meeting quality standards
- Experience in data labeling, QA evaluation, RLHF, or LLM evaluation
Company Overview
- RemoExperts is a global platform where skilled professionals and tutors contribute to AI training across video, image, audio, text, code, math, and more. It was founded in 2025 and is headquartered in Palo Alto, California, US, with a workforce of 201-500 employees. Its website is https://www.remoexperts.com.