Junior Cloud/DevOps Engineer at Diameter Health

3 years ago DevOps & System Administration Junior Full-Time

As a Junior Cloud/DevOps Engineer, you will be work to configure, streamline, and improve our overall cloud infrastructure within multiple independent clouds.


Qualifications:

  • Experience as a software engineer implementing, testing and debugging
  • Experience with Azure and/or AWS infrastructure in an engineer capacity.
  • Experience in containerized deployment using Kubernetes.
  • Experience with MongoDB.
  • Experience running Cloud at scale.
  • Experience with Terraform.
  • Experience with Jenkins, Ansible, or other similar technologies.
  • Deep scripting knowledge using Python (preferred), Perl, PowerShell, JavaScript, or similar scripting languages.
  • Bachelor’s Degree in Computer Science, Information Technology, or equivalent experience.

Bonus Points:

  • AWS DevOps Engineer (professional level) certification.
  • ITIL Certification.
  • Experience with hosting government solutions (eg. AWS GovCloud).
  • Experience with DynamoDB and other similar technologies.
  • Experience with healthcare information technology.


Description

Summary: As a Junior Cloud/DevOps Engineer, you will be work to configure, streamline, and improve our overall cloud infrastructure within multiple independent clouds. The ideal candidate is:

  • An expert in 24/7 operations with high performing and scaling systems that meet a high degree of uptime.
  • An expert in all facets of Cloud hosting operations with a focus on lean architecture and cost savings.
  • An expert in Python, or another programming language, with a deep passion for automation.

Essential Functions:

  • Work on infrastructure architecture and planning as it relates to scalability and cost savings.
  • Facilitate meetings with key stakeholders on product improvements.
  • Facilitate meetings with the DevOps team and Engineering teams on infrastructure changes.
  • Create/finalize automation in an effort to enable our infrastructure and IT systems to be more scalable and lower our manual labor overhead.
  • Provide root cause analysis of high visibility production incidents.
  • Create action plans to mitigate total infrastructure growth/footprint where possible.
  • Act as the primary contact for changes to procedures or infrastructure/architecture.
  • Keep up to date documentation of core responsibilities and current projects.
  • Create and maintain SOPs relating to infrastructure and systems responsibilities.
  • Generate and provide recommendations on optimizing usage of Cloud services.
  • Ensure compliance with security best practices including continuous monitoring.
  • Participate in the Cloud Well Architected Framework to support overall operations initiatives.
  • Develop automated reporting that enables teams to leverage best practices for running efficient Cloud solutions.
  • Partner with Engineering, DevOps, and Client Services on CI/CD and automated deployment and event management.
  • Identify key procedures that can be automated and automate them.

Knowledge or Skills:

  • Serverless computing experience with containers (AKS/EKS) and VM based workloads along with a solid understanding of the trade-offs of different serverless implementations emerging in public Cloud.
  • Heavy background in software engineering/scripting with a focus on python development for use in automating deployment procedures.
  • Deep understanding of the key concepts and practices of Cloud observability, coupled with experience implementing robust systems that leverage metrics, logs, and traces to provide holistic state of the Cloud operations.
  • Deep understanding of how to apply best practices around monitoring, alerting, logging and have implementation experience with one or more (Azure Monitor, CloudWatch, AppInsight, Log Analytics, Splunk, Dynatrace, SolarWinds, etc…).
  • Knowledge of monitoring systems for infrastructure monitoring as well as application performance monitoring including SLAs/KPIs and reporting approaches for the multi-cloud platforms.
  • Partner with Engineering team to design key concepts and practices of observability, coupled with experience implementing robust systems that leverage metrics, logs, and traces to provide understanding of system state.
  • Experience with and enthusiasm for operating in an agile DevSecOps oriented organization and culture.
  • Plan and execute Disaster Recovery (DR) and Failover simulations to demonstrate adherence to SLAs.
  • Skill and knowledge in ITIL processes related to Incident Management, Service Requests, Event Management, Access Management, Change Management, Knowledge Management and Escalated Incident Management.
  • Knowledge of Cloud Monitoring Platforms that simplify financial management, help streamline operations, and strengthen security & compliance (eg. CloudHealth, Apptio, CloudCheckr, etc).


🎉 Let Employers Find You!

Employers will see your profile when they are sending a job in your skill.


Create Your Profile   (simple)