Physician / New York / Permanent / DevOps / SRE Job
Tandym Health
New York, NY, 10001, USA
New York, NY, 10001, USA
- IT
- Full-time
- DevOps
- Site Reliability Engineer
- CI/CD
This role involves building and maintaining infrastructure to support trading systems for an asset manager in New York City. The DevOps/SRE professional will collaborate with development teams to automate build and deployment pipelines, manage CI/CD tools, and support high-scalability, resilient distributed systems. Candidates should have strong software engineering, scripting, cloud, and infrastructure automation skills with experience in trading environments and microservices architectures.
An asset manager in New York City is actively seeking a self-motivated and hardworking professional to join their staff as their newDevOps / SRE.
Responsibilities
The DevOps / SRE will:
Responsibilities
The DevOps / SRE will:
- Build and maintain the infrastructure that supports the firm's trading systems
- Collaborate with development teams to design and implement automated build and deployment pipelines
- Drive the rapid adoption of new processes/systems
- Provide hands-on support to the trading team
- BS/MS in Computer Science, Engineering, or related discipline
- 5+ years experience in the Platform, SRE, Production, or Systems Engineering fields
- Excellent knowledge of all aspects of the software engineering process, including Coding, Testing, Deployment, Scalability, Security, and Maintainability
- Ability to set-up andmanage CI/CD activities and tools (e.g. Gitlab, Bitbucket), as well as build you own solutions (e.g. Java/Gradle)
- Track record of working with distributed systems in a trading environment e.g. Aeron, Kafka, and RabbitMQ
- Deep understanding of best practices, design patterns, and principles for highly decoupled and scalable systems
- Good knowledge of Unix systems / Bash / networks
- Experience with infrastructure and application observability tooling e.g. Datadog, Prometheus, and Grafana
- Strong knowledge in coding/scripting (Java, Python, Go, or Bash)
- Experience with automation/configuration frameworks using Terraform, Kustomize, Ansible, Helm, or an equivalent
- Experience with cloud platforms (ideally AWS)
- Experience in API Management (routing, gateways, versioning) with profound understanding of API Development aspects
- Ability to apply strategies for efficient communication, data consistency, and resilience across micro services, including experience with API design, message-based communication, and event-driven architectures
- Experience in defining and enforcing architectural patterns (SOA, CQRS, Event Sourcing etc.)
- Experience in performance/stress test and system tuning




