Applied Methods

Trust & Safety

Specialists in this role develop detection systems and enforcement strategies to identify and mitigate emerging abuse patterns across AI products, working at the intersection of data science, policy, and operations. They balance competing priorities (detecting sophisticated threat actors while keeping the platform usable) by building scalable detection pipelines, conducting rapid investigations, and collaborating with policy and engineering teams to implement mitigations. Unlike policy-focused roles, these positions emphasize technical implementation and quantitative analysis; unlike pure engineering roles, they require deep domain expertise in specific abuse vectors and threat-actor behavior. They typically sit within dedicated Trust & Safety or Safeguards teams that work cross-functionally with research, product, and legal to stay ahead of evolving misuse techniques.

$ titles --canonical
Trust & Safety Operations Analyst
Content Integrity Analyst
Abuse Investigator
AI Safety Manager
Product Safety Manager
AI Safety & Responsibility Manager
Responsible AI Manager

Open Jobs: 37
Companies Hiring: 5
$02

Skills

What companies are looking for in this role.

$ skills --core

Monitoring and investigating content and behavior that violates terms of service: 100%

Detecting, investigating, and disrupting malicious use of AI platforms: 95%

Developing abuse signals and tracking strategies to proactively detect harmful activities: 90%

Providing data labeling, annotations, and inputs for safety protocols: 85%

Designing and implementing enforcement workflows and review processes: 80%

Analyzing large datasets to identify patterns and coordinated networks: 80%

Building and scaling detection systems for fraud and abuse: 80%

Processing appeals and auditing automated systems: 80%

Responding to urgent escalations and participating in on-call rotations: 80%

Conducting threat intelligence analysis and threat actor investigations: 75%

Conducting root cause analyses and deep-dive investigations: 70%

Creating monitoring dashboards, alerts, and internal administrative interfaces: 65%

Leading safety assessments and threat modeling for new products: 60%

Managing vendor relationships and third-party content moderation services: 60%
$ skills --emerging

Training and refining large language models for safety and policy enforcement: 85%

Building multi-layered defenses and real-time safety mechanisms for AI systems: 75%

Conducting safety evaluations and assessments of AI models: 70%

Developing AI-specific detection capabilities and behavioral clustering techniques: 70%

Creating automated enforcement systems that scale with AI platform growth: 65%
$ skills --soft

Collaborating across cross-functional teams including engineering, policy, and legal: 95%

Communicating complex technical concepts to non-technical stakeholders: 85%

Leading and mentoring teams of safety operations analysts: 50%
$03

Technology

The tools and technologies that define this role.

$ tech --language
Python: high
SQL: moderate
$ tech --platform
Claude: low
GPT: low
Grok: low
$ tech --tool
Dark web monitoring: low
$ tech --concept
LLMs: very high
Machine Learning: moderate
A/B testing: low
Embeddings: low
Fine-tuning: low
Graph-based data infrastructure: low
$04

Open Jobs

37 open Trust & Safety jobs across 5 companies.

OpenAI · 5d
Technical Intelligence Analyst
San Francisco · Security

OpenAI · 2w
AI Emerging Risks Analyst
San Francisco · Security

OpenAI · 3w
Abuse Investigator (AI Self-Improvement Risk)
San Francisco · Security

OpenAI · 3w
Strategic Risk Analyst, Behavioral & Psychological Risk
San Francisco · Security

Replit · 3w
Trust & Safety Specialist
Foster City, CA (Hybrid) In office M,W,F · Security

Anthropic · 4w
Safeguards Policy Analyst, Fraud & Scams
Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY · Security

xAI · 1mo
Senior Analyst - Safety Operations (Child Safety)
Palo Alto, CA · Security

xAI · 1mo
Senior Analyst - Safety Operations (Child Safety)
Bastrop, TX · Security

xAI · 1mo
Senior Analyst, Safety Operations
Bastrop, TX · Security

Nscale · 1mo
Staff Engineer, Customer Trust
AMER · Security

Anthropic · 1mo
Technical Policy Manager, Cyber Harms
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security

Anthropic · 1mo
Technical Cyber Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security

Anthropic · 1mo
Technical CBRN-E Threat Investigator
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC · Security

Anthropic · 1mo
Software Engineer, Safeguards Infrastructure
London, UK · Security

Anthropic · 1mo
Software Engineer, Account Abuse
San Francisco, CA | New York City, NY · Security

xAI · 1mo
Manager, Safety Operations
Bastrop, TX · Security

Anthropic · 1mo
Biological Safety Research Scientist
San Francisco, CA · Security

Anthropic · 1mo
Safeguards Analyst, Human Exploitation & Abuse
Remote-Friendly, United States · Security

Anthropic · 1mo
Safeguards Enforcement Analyst, Safety Evaluations
Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC; San Francisco, CA | New York City, NY · Security

Anthropic · 1mo
Safeguards Analyst, Account Abuse
San Francisco, CA | New York City, NY · Security