Applied Methods
~The MetaSecurityTrust & Safety

Trust & Safety

Specialists in this role develop detection systems and enforcement strategies to identify and mitigate emerging abuse patterns across AI products, working at the intersection of data science, policy, and operations. They balance competing priorities—detecting sophisticated threat actors while maintaining platform usability—by building scalable detection pipelines, conducting rapid investigations, and collaborating with policy and engineering teams to implement mitigations. Unlike policy-focused roles, these positions emphasize technical implementation and quantitative analysis; unlike pure engineering roles, they require deep domain expertise in specific abuse vectors and threat actor behavior. These analysts typically sit within dedicated Trust & Safety or Safeguards teams that operate cross-functionally with research, product, and legal to stay ahead of evolving misuse techniques.

$ titles --canonical
Trust & Safety Operations AnalystContent Integrity AnalystAbuse InvestigatorAI Safety ManagerProduct Safety ManagerAI Safety & Responsibility ManagerResponsible AI Manager
Open Jobs39
Companies Hiring9
$02

Skills

What companies are looking for in this role.

$ skills --core

Designing and implementing machine learning-based detection systems for abuse, fraud, and policy violations

95%

Analyzing attack patterns and emerging abuse trends to identify novel threat vectors and behavioral anomalies

92%

Developing and maintaining enforceable policies, rules systems, and classification frameworks for platform safety

88%

Conducting investigations into complex misuse cases involving suspicious user behavior and coordinated harm

87%

Building automated response mechanisms and enforcement workflows that operate without manual intervention

85%

Querying, transforming, and analyzing large datasets using SQL and data manipulation techniques

84%

Writing and maintaining Python code for data analysis, system testing, and detection logic implementation

83%

Translating ambiguous safety risks and complex threat landscapes into measurable, evidence-based problems

81%

Building and operating systems to detect phishing, cryptomining, account takeovers, and financial fraud at scale

80%

Scoping and implementing abuse monitoring systems for new product launches and existing platforms

78%

Building dashboards, monitoring systems, and prevalence estimators for safety metrics and trends

76%

Designing experiments and conducting causal inference analyses to understand safety intervention impacts

75%

Building zero-to-one analytical systems and transforming prototypes into scalable, reusable tools

74%

Designing threat taxonomies and harm classification frameworks for emerging and frontier risks

70%

Performing high-volume content review and data labeling tasks with accuracy and attention to detail

65%

Integrating and tuning security scanning tools in continuous integration pipelines

58%
$ skills --emerging

Designing large-language-model guardrails to detect abuse scenarios in AI-generated content and interactions

82%

Detecting and analyzing agentic and autonomous behavior patterns in AI systems for safety risks

68%

Using large language models as defensive tools to identify malicious patterns and automate threat classification

65%

Conducting horizon scanning, competitive benchmarking, and external narrative analysis for risk sense-making

62%

Applying prompt injection attack detection and mitigation techniques at production scale

58%

Developing compliance programs aligned with global online safety and content moderation regulations

55%

Conducting regulatory risk assessments and translating legal obligations into enforceable safeguards

52%

Conducting behavioral and psychological analysis of user interactions with AI systems in high-risk contexts

48%

Developing domain-specific expertise in chemical, biological, radiological, nuclear, and explosives threat detection

42%
$ skills --soft

Communicating technical findings and risk assessments clearly to both technical and non-technical stakeholders

85%

Coordinating cross-functional teams across Policy, Legal, Engineering, and Communications during high-stakes situations

80%

Operating independently with high ownership in ambiguous, rapidly evolving problem domains

79%

Identifying gaps in existing safety systems and proposing improvements based on investigation findings

77%

Managing escalation procedures and on-call incident response operations for sensitive enforcement decisions

72%
$03

Technology

The tools and technologies that define this role.

$ tech --language
Pythonvery high
SQLvery high
$ tech --platform
BigQueryhigh
Google Suitelow
$ tech --tool
Hexmoderate
SASTmoderate
SCAmoderate
Netwatchlow
Slurperlow
Zoomlow
$ tech --concept
LLMvery high
Machine Learningvery high
Anomaly Detectionhigh
Data Sciencehigh
Statistical Analysishigh
CI/CDmoderate
$04

Open Jobs

39 open Trust & Safety jobs across 9 companies.

Snorkel AI1w
Trust & Safety Associate
Redwood City, CA (Hybrid); San Francisco, CA (Hybrid); United States (Remote)·Security
Abnormal Security1w
Security Analyst
Remote - USA·Security
OpenAI1w
Agentic Risk Analyst
San Francisco·Security
Hark2w
AI Safety Engineer
San Jose·Security
Runway2w
Member of Technical Staff, Trust & Safety Engineer
Remote·Security
OpenAI2w
Abuse Investigator - Child Safety
San Francisco·Security
xAI3w
Member of Technical Staff - Imagine Safety
Palo Alto, CA·Security
Anthropic3w
Data Scientist, Safeguards
New York City, NY; San Francisco, CA; Seattle, WA·Security
OpenAI3w
Product Policy, Biosecurity Policy Manager
San Francisco·Security
Lovable3w
Trust and Safety Support Specialist
Stockholm·Security
Replit4w
Staff Software Engineer, Trust & Safety
Foster City, CA·Security
Replit4w
Senior Software Engineer, Trust & Safety
Foster City, CA·Security
OpenAI1mo
Data Scientist, Safety
London, UK·Security
OpenAI1mo
Data Scientist, Safety
San Francisco·Security
Anthropic1mo
Incident Response Manager, Enforcement
San Francisco, CA | New York City, NY | Washington, DC·Security
OpenAI1mo
Model Policy, Frontier Cyber Risk
San Francisco·Security
OpenAI1mo
Protection Scientist Engineer, Integrity
San Francisco·Security
OpenAI1mo
Technical Intelligence Analyst
San Francisco·Security
OpenAI2mo
AI Emerging Risks Analyst
San Francisco·Security
OpenAI2mo
Abuse Investigator (AI Self-Improvement Risk)
San Francisco·Security