• Senior ELK Engineer

    Job Locations US-MA-North Reading
    Posted Date 2 months ago(10/23/2018 2:26 PM)
    Job ID
    2018-1350
    Organization
    Engineering - Cloud Ops
  • ­

    Senior Cloud Operations Engineer

     

    TraceLink is seeking an experienced Senior Cloud Operations Engineer with strong background in Elasticsearch, Logstash, Kibana (ELK), Telegraf, InfluxDB, Chronograf, Kapacitor (TICK) and Grafana to join the Cloud Operations team supporting Tracelink’s Life Sciences Cloud products. 

     

    The Senior Cloud Operations Engineer is a key member of the team responsible for the operations, maintenance and monitoring of TraceLink’s cloud-based software services providing track and track services to the pharmaceutical and other industries.  TraceLink is replacing its legacy rsyslog implementation with Fluentd, AWS Elasticsearch and Kibana for the logging of application and AWS ECS container data.  Additionally, we are moving from pnp4nagios to Telegraf, InfluxDB, Chronograf, Kapacitor and Grafana for monitoring and possibly alerting.  We are looking for an experienced ELK engineer to manage and maintain those systems.  This engineer will integrate data sources such as AWS CloudWatch and CloudTrail into the logging and monitoring systems.  They will develop visualizations of the data using both Kibana and Grafana.  Additionally, this engineer will assist software engineers and support technicians in using ELK and Grafana to mine data from the logs.

    ­

    Key Responsibilities:

     

    • Support TraceLink’s J2EE SaaS web applications and AWS Elastic Container Service (docker) containers providing track and trace services to the pharmaceutical and other industries
    • Responsible for operating and maintaining Tracelink’s Elasticsearch, Fluentd, and Kibana (EFK) stacks across all environments from QA to Production
    • Develop, operate and maintain TraceLink’s Telegraf, InfluxDB, Chronograf, Kapacitor (TICK) and Grafana stacks across all environments from QA to Production
    • Develop and document the processes and procedures for using the EFK and TICK stacks to replace TraceLink’s legacy rsyslog implementation
    • Create and document the processes and procedures for using the TICK plus Grafana to replace TraceLink’s legacy pnp4nagios data visualization environments
    • Support Tracelink’s software engineers and support technicians in making the transition to EFK and Grafana
    • Evaluate and implement new data sources for both the EFK and TICK stacks
    • Create Kibana and Grafana dashboards to visualize data across all data sources
    • Participate as a level 2 escalation point in an on-call rotation to respond to and address any system, application, or network issues, including those that occur outside normal business hours.
    • Communicate with all parts of the TraceLink organization, in both verbal and written capacity.

    ­

    Required Skills:

     

    • 5-7 years of direct, hands-on operations experience supporting J2EE web applications in a SaaS environment. Candidates with experience supporting other web application server technologies and 2-3 years of J2EE will be considered.
    • 3 years of experience designing, building and supporting modern logging systems such as ELK or Splunk including integrating data sources and developing dashboard visualizations
    • Possess a solid background in Linux/Unix systems administration, including OS deployment, configuration, updates, and troubleshooting, with particular focus on diagnosing security and performance issues
    • 3 years of experience as a Red Hat / CentOS Linux system administrator
    • 3 years of experience with AWS services
    • Excellent written and oral communications skill in English

    Options

    Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
    Share on your newsfeed