Observability Engineer


As an Observability Engineer, you will be instrumental in maintaining and enhancing the stability, scalability, and performance of our rapidly evolving applications and services. You will collaborate with cross-functional teams to build and maintain resilient systems that can handle our growing user base and workload. Your work directly impacts our mission to provide exceptional member experiences.

Responsibilities

  • Develop and maintain systems for effective monitoring, logging, and tracing of software applications.
  • Choose appropriate tools and technologies, set up dashboards, and ensure the scalability and reliability of the observability platform.
  • Develop and integrate tools for logging, monitoring, and alerting to enhance system performance visibility.
  • Ensure compatibility and efficiency across various platforms and services.
  • Work closely with engineering teams to integrate observability practices into their workflows.
  • Provide operational log/event analysis and make recommendations for remediation of system errors/faults and/or performance issues.
  • Analyse system performance regularly and identify areas for improvement.
  • Stay up-to-date with the latest trends in observability, logging, monitoring, and cloud technologies.
  • Introduce innovative solutions and best practices to improve system observability and reliability.
  • Participate in strategic planning for the technology roadmap, including considerations related to scalability, cost-effectiveness, and risk management of observability infrastructure.
  • Create comprehensive documentation for observability systems and processes.
  • Perform work in a manner that complies with relevant regulatory standards including Work Health & Safety (WHS) legislation.
  • Actively participate in all regulatory compliance activities associated with this role including required training, meetings and information sessions.



Essential Skills

  • Must have solid experience in AppDynamics (Application Performance Management)
  • Must have proficiency in instrumenting systems using tools like OpenTelemetry, Prometheus and Grafana, and an understanding of the trade-offs in data ingestion.
  • Must have experience in at least one of the following products: Splunk, Splunk Observability Cloud, ThousandEyes, New Relic, LogicMonitor, Dynatrace, Datadog, SolarWinds, Grafana.
  • Must have experience in integrating with third party APIs.
  • Must have experience with AWS/Azure cloud technologies and text-parsing tools such as sed and regular expressions (regex).
  • Must have experience with at least one of the following scripting languages: PowerShell, Bash, Python, JavaScript.
  • Must possess knowledge of monitoring middleware including WebSphere/Java/.NET application servers.
  • Self-driven and proactive, able to work independently or as part of a team.
  • Critical thinking, curiosity, and the ability to communicate effectively across IT and business stakeholders.
  • Commitment to keeping up to date with the latest trends, tools, and methodologies in observability to continuously enhance systems and processes.


Desirable

  • Infrastructure Knowledge: Skills in infrastructure as code using tools such as Ansible, Terraform, and Kubernetes declarative specifications.
  • System Understanding: An understanding of data producers and consumers, familiarity with monitoring tools, and experience developing ingestion pipelines and occasionally investigating incidents.

About HCF


At HCF, our purpose is to bring our human touch to healthcare. Since 1932 we’ve been putting our members and their health first. As Australia’s largest not-for-profit health fund, we cover 2 million members with health, life, travel and pet insurance. Our vision is to make healthcare understandable, affordable, high quality and member-centric.

We want to be true health partners to our members, easily guiding the healthcare choices that are right for them. At HCF, our values are the way we do things and create the necessary culture to help us realise our purpose and deliver our strategy. Living our values in action, we step forward, walk in their shoes, stay human, make it better and get there together.

Culture & Benefits

Purpose-driven passion
We’re united by a common purpose: to make healthcare affordable, understandable, high quality and member-focused.

Wellness and work-life balance
We’ll empower you with the necessary skills and tools to support your personal wellbeing journey, ensuring you perform at your best. Our offerings include:

  • 50% subsidy on HCF hospital and/or extras cover
  • 18 weeks of parental leave for all new parents
  • Mental health and wellbeing programs, including workshops, fitness classes, flu vaccinations, skin checks and more
  • Discounts on HCF’s products, including life, pet and travel insurance, as well as discounts at Fitness First gyms and on our eyecare products.


Collaboration and inclusivity
We embrace diversity as our strength and are committed to maintaining an inclusive and collaborative work environment. Our workplace is welcoming and safe for all our employees, irrespective of their unique characteristics including age, ethnicity, cultural or spiritual background, gender identity, disability, education and socio-economic status.

Continuous learning and growth
We believe in lifelong learning. HCF provides opportunities for personal and professional development. From workshops to mentorship programs, we encourage your growth and curiosity.

Next steps

If you require any adjustments to assist you in making your application or during the recruitment or onboarding process, please reach out to Talent Acquisition – [email protected] to discuss.

We encourage you to submit your application at your earliest convenience. At HCF, we review applications as they are submitted and may fill the role before the job closing date.

