Applied Methods

Infrastructure Engineer

Infrastructure Engineers at AI companies operate the physical and systems-level infrastructure the business depends on—servers, storage arrays, networking equipment, and the Unix/Linux environments hosted on them. The day-to-day is hands-on: diagnosing hardware and firmware faults, managing warranty replacements through vendors, performing root-cause analysis on systemic issues, and maintaining the operational health of data-center and corporate-IT hardware. Cloud and infrastructure-as-code work appears in many of these jobs, but the centre of gravity is closer to traditional systems administration and data-center operations than to cloud platform engineering. These engineers typically sit within IT, infrastructure operations, or data-center teams, partnering with networking, security, and application teams to keep infrastructure running as the business scales.

$ titles --canonical
IT Infrastructure Engineer
Senior Storage Engineer
Cloud Operations Engineer
Tech Ops Engineer
Infrastructure Ops Engineer

Open Jobs: 31
Companies Hiring: 13
$02

Skills

What companies are looking for in this role.

$ skills --core

95%  Diagnosing and resolving complex hardware and firmware issues in server and datacenter environments
92%  Administering and troubleshooting large-scale Linux operating systems and command-line interfaces
90%  Troubleshooting complex multi-component system issues across hardware, software, and networking
88%  Monitoring and analyzing system health, performance metrics, and equipment status
85%  Designing and implementing infrastructure automation and scripting solutions
83%  Implementing observability and monitoring solutions for complex distributed systems
82%  Operating and maintaining high-performance computing clusters and distributed systems
80%  Implementing continuous integration and continuous deployment pipelines
78%  Managing virtual machine and containerized workload orchestration platforms
76%  Designing configuration-as-code environments and infrastructure automation frameworks
75%  Designing and optimizing enterprise storage systems for performance and reliability
74%  Conducting performance benchmarking and capacity planning for infrastructure resources
72%  Integrating multiple systems and platforms through APIs and middleware solutions
70%  Managing asset lifecycle and infrastructure inventory tracking systems
68%  Implementing security hardening and zero-trust principles across infrastructure
$ skills --emerging

72%  Managing GPU cluster operations and troubleshooting GPU-specific hardware issues
70%  Optimizing infrastructure for AI and machine learning workload performance
68%  Configuring and operating high-speed interconnect fabrics for AI workloads
$ skills --soft

88%  Creating and maintaining technical documentation and operational runbooks
82%  Responding to and managing critical infrastructure incidents under pressure
80%  Mentoring junior technicians and escalating technical issues to support teams
78%  Leading cross-functional collaboration with infrastructure, platform, and development teams
75%  Managing vendor relationships and processing warranty and replacement requests
$03

Technology

The tools and technologies that define this role.

$ tech --language
Bash: very high
Python: very high
Go: moderate
PowerShell: moderate
$ tech --framework
Temporal: low
$ tech --platform
Linux: very high
Kubernetes: high
VMware: high
Active Directory: moderate
Docker: moderate
Microsoft 365: moderate
Microsoft Entra ID: moderate
VMware Tanzu: moderate
vSAN: moderate
Ceph: low
ECS: low
Google Workspace: low
Lustre: low
NSX-T: low
Omnissa Horizon: low
Spectrum-X: low
$ tech --tool
Ansible: high
Git: high
Terraform: high
Grafana: moderate
NCCL: moderate
PowerCLI: moderate
Prometheus: moderate
Slurm: moderate
Grafana Loki: low
VMware Aria Operations: low
$ tech --concept
CI/CD: high
DCIM: moderate
ECC: moderate
InfiniBand: moderate
NVLink: moderate
REST APIs: moderate
CMMS: low
Ethernet RoCE: low
SCADA: low
VDI: low
$04

Open Jobs

31 open Infrastructure Engineer jobs across 13 companies.

Lambda · 1w
HPC Operations Engineer
San Francisco Office (Fremont St) · Infrastructure & IT

Nscale · 2w
Infrastructure Support Engineer
Houston; New York; San Francisco; Seattle · Infrastructure & IT

MongoDB · 2w
Cloud Operations Engineer (3rd Shift, Weekend)
United States · Infrastructure & IT

Nebius · 2w
Data Center IT Technician
Oklahoma, United States · Infrastructure & IT

OpenAI · 2w
Audiovisual Design Engineer
San Francisco · Infrastructure & IT

Helsing · 2w
HPC Systems Administrator
Munich · Infrastructure & IT

Graphcore · 2w
Lead Engineering Support Linux Engineer - Bengaluru
Bengaluru, India · Infrastructure & IT

Nebius · 3w
IT Infrastructure Engineer (RMA & Diag)
London, United Kingdom · Infrastructure & IT

Graphcore · 3w
Observability, Staff Infrastructure Engineer
Gdańsk, Pomeranian Voivodeship, Poland · Infrastructure & IT

Nebius · 1mo
IT infrastructure engineer (RMA & Diag)
Lappeenranta, Finland · Infrastructure & IT

fal · 1mo
Operations Engineer, HPC Networking
Remote · Infrastructure & IT

fal · 1mo
Operations Engineer, Fleet Reliability
Remote · Infrastructure & IT

Graphcore · 1mo
Lead Engineer Support Linux Engineer
Bristol, UK · Infrastructure & IT

Graphcore · 1mo
Automation Engineer
Bristol, UK · Infrastructure & IT

Crusoe · 1mo
Senior Engineering Manager, Cloud Storage
San Francisco, CA - US · Infrastructure & IT

Nebius · 1mo
Automations Engineer
Amsterdam, Netherlands · Infrastructure & IT

Nebius · 1mo
Data Center IT Infrastructure Engineer
Paris, France · Infrastructure & IT

CoreWeave · 1mo
Senior Business Systems Engineer - Data Center Systems II
Livingston, NJ / Bellevue, WA / Sunnyvale, CA · Infrastructure & IT

Nebius · 1mo
Data Center IT Infrastructure Engineer
Israel · Infrastructure & IT

Nscale · 2mo
Infrastructure Engineer (Norway)
Norway · Infrastructure & IT