Andrei Turcu

Site Reliability Engineer

Summary

Site Reliability Engineer with 5+ years of experience at a leading proprietary trading firm. Strong operational background spanning bare-metal provisioning, trading environment management, and exchange connectivity. Transitioned from heavy day-to-day operations into a development-focused SRE role, driving automation, observability, and infrastructure modernization initiatives. Proven mentor and cross-team collaborator with deep institutional knowledge of trading infrastructure end-to-end.

Skills

Infrastructure Kubernetes, Docker, bare-metal provisioning, systemd, host lifecycle
Automation & Config Ansible, config management, deployment orchestration
Observability Grafana, OpenTelemetry, monitoring & introspection tooling
CI/CD GitHub Actions, build & deploy pipelines, validation workflows
Languages Python, Bash, Go, JavaScript
Trading Ops Exchange connectivity, session management, clearing firm coordination, multicast monitoring
Practices Incident response, on-call, runbooks, mentoring, cross-team collaboration
Networking Multicast, L2 taps/mux, switch configuration, network troubleshooting

Experience

Site Reliability Engineer (Dev Focus) 2024 – Present
Jump Trading
  • Key member of a firm-wide initiative to document, automate, and improve host provisioning workflows and pipelines across hundreds of hosts, reducing end-to-end provisioning time by 30–50%.
  • Built observability into the provisioning process using OpenTelemetry metrics and Grafana dashboards, enabling a data-driven approach to identifying and eliminating bottlenecks.
  • Early contributor in adopting GitHub Actions workflows for org-wide projects, reducing time spent building, deploying, and validating releases.
  • Leading a project to migrate configurations and deployments into a modernized system, contributing to Ansible roles and orchestration to reduce manual workflow overhead.
  • Throughout tenure at Jump Trading, mentored ~10 interns on production-facing projects and helped onboard numerous full-time hires; recognized as a key knowledge resource across teams.
Technical Operations Engineer Jul 2020 – 2024
Jump Trading
  • Initially joined as intern (Jul–Sep 2020), delivering three production-facing projects; converted to full-time in November 2020.
  • Performed 4–6 hours of daily operations across thousands of services, including application setup, on-prem bare-metal provisioning, internal deployments, upgrades, and configuration rollouts.
  • Managed the full trading environment lifecycle — from bare-metal server setup through exchange connectivity and trading session establishment.
  • Served as a key liaison between internal teams and external counterparties including clearing firms and exchanges, ensuring smooth trading operations.
  • Reduced on-call toil by automating recurring tasks — developed Ansible roles for application deployment and built monitoring and introspection tooling for internal systems.
  • Troubleshot multicast and network issues using L2 taps and mux devices; contributed to switch configuration and lightweight automation of network operations.
  • Unified disparate config systems and built troubleshooting interfaces that presented information concisely for rapid incident diagnosis.

Education

B.Sc. Computer Science · GPA 9.41 / 10
University POLITEHNICA of Bucharest
2016 – 2020
  • Diploma thesis: novel image-to-3D-mesh reconstruction using UV maps (research internship at Xperi). Improved inference time by ~40% through vectorization of a key operation.

Achievements & Certifications