Observability Engineer

Insure Technology · United States
Visa: unknown · Salary: unknown · Work mode: unknown
Skills
AWS • Azure • Bash • OpenTelemetry • Python

Description

IN Tech has been engaged to lead the search for a mission-critical Observability Engineer role for a global, highly regulated enterprise operating across cloud, data center, and production environments.


This is a rare opportunity to own and shape an enterprise observability strategy from the ground up, including leading a strategic migration from SolarWinds to Dynatrace. In this role, you'll build a modern, proactive monitoring capability that directly impacts reliability, performance, and customer experience worldwide.


Why This Role Matters

Our client is making a significant investment in observability as a core capability. This role sits at the center of that transformation and will directly influence how engineering and operations teams detect issues, respond faster, and continuously improve system performance across a complex hybrid infrastructure environment.


Role Overview

As the Observability Engineer, you will design, implement, and operate the enterprise monitoring and observability platform, leading the migration from SolarWinds to Dynatrace. You’ll own the observability roadmap end-to-end: architecture, instrumentation, dashboards, alerting, automation, and operational readiness across a global footprint spanning on-prem, cloud, and production environments with IT/OT convergence.


Key Responsibilities

  • Drive the enterprise transformation from SolarWinds to Dynatrace, including architecture design, OneAgent/ActiveGate deployment, and dashboard development
  • Build comprehensive monitoring for networks, servers, applications, databases, and cloud platforms across a global footprint
  • Develop dashboards, alerts, and automated remediation workflows aligned to operational KPIs
  • Establish baseline metrics and anomaly detection rules for proactive incident identification
  • Integrate Dynatrace with ServiceNow for automated incident creation and enrichment
  • Configure monitoring for IT/OT environments including manufacturing and operational systems
  • Implement synthetic monitoring for critical business applications and digital experience tracking
  • Design log aggregation and correlation strategies in partnership with the security team
  • Create runbooks and standard operating procedures for alert response and escalation
  • Define and support a 24x7 global monitoring strategy (follow-the-sun)
  • Optimize observability costs through data retention strategy and license management
  • Train operations teams on platform usage and alert response best practices


About You

  • 5+ years of experience in infrastructure monitoring and observability
  • Hands-on experience with Dynatrace (OneAgent, Davis AI, dashboards)
  • Strong experience with SolarWinds to support migration planning
  • Experience monitoring enterprise network and infrastructure environments
  • VMware and cloud monitoring experience (Azure and AWS)
  • Strong scripting skills (PowerShell, Python, Bash)
  • Experience with log management and SIEM integration

Preferred Qualifications

  • Dynatrace certification(s)
  • Experience in regulated, industrial, or highly available environments
  • Knowledge of IT/OT monitoring and industrial protocols
  • Experience with AIOps and machine-learning-driven anomaly detection
  • ServiceNow Event Management experience
  • Familiarity with OpenTelemetry and distributed tracing


Tools & Technologies

Dynatrace • SolarWinds • VMware • Azure & AWS • Enterprise Networking • PowerShell / Python • ServiceNow • Log & SIEM Platforms

Success Metrics

  • 100% monitoring coverage of critical assets
  • < 5-minute Mean Time to Detect (MTTD) for P1 incidents
  • < 5% false positive rate
  • 100% Dynatrace migration completion
  • 80% dashboard adoption across operations teams
