Data that requires judgment.
If a generic crowd label would be wrong, low-trust, or impossible, this is the work we do. We label the items where domain understanding actually changes the answer.
- Model outputs and responses
- Technical answers and derivations
- Safety cases and policy-sensitive content
- Domain-specific text
- Code and reasoning tasks
- Images and video clips where needed
- Benchmark items