Description:
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Follow established taxonomies, benchmarks, and playbooks to keep testing structured and consistent.
- Document findings reproducibly, producing reports, datasets, and attack cases that customers can act on.
- Review AI outputs on sensitive topics like bias and misinformation, with optional participation in higher-sensitivity projects.
Qualifications
Must-Have
- Native fluency in English and Urdu.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to both technical and non-technical stakeholders.
Preferred
- Experience in adversarial ML, cybersecurity, or socio-technical risk analysis.
- Background in psychology, acting, or writing that supports creative probing and unconventional adversarial thinking.
Application Process (takes 20–30 minutes to complete)
- Upload resume
- Complete an AI interview based on your resume
- Submit form