Oct 23, 2019

Staff Site Reliability Engineer
US-FL-Orlando | US-Remote - US | US-IL-Schaumburg
Job ID 2019-3473

  • TravelClick, Inc.
  • US-FL-Orlando | US-Remote - US | US-IL-Schaumburg
Full-Time | Engineering | Hospitality - Hotel | Information Technology (IT) | Other

Job Description

Job Overview

The Staff Site Reliability Engineer is responsible for supporting revenue-generating production systems. The engineer assists with monitoring, maintenance and problem resolution of TravelClick production applications. The candidate must be able to provide prompt technology operations support in a high-energy, fast-paced environment.

The successful candidate will be bright, motivated, detail-oriented and willing to go the extra mile to ensure exceptional results for our customers. This is a great opportunity in technology operations at a growing company, with opportunities for advancement for the right candidate.

Functional Description

  • Provide support related to production systems availability incidents and problems
  • Provide support related to production systems latency incidents and problems
  • Provide support related to production systems performance incidents and problems
  • Provide support related to production operations efficiency issues
  • Support monitoring tools currently in production
  • Provide emergency response to production systems incidents
  • Maintain production ticketing system
  • Maintain the knowledgebase solution platform
  • Create, delete and maintain production automation solutions using automation tools
  • Automate day-to-day tasks
  • Resolve or remove false-positive alerts
  • Configure and update alert dashboards
  • Maintain tasks using task scheduler
  • Become a subject matter expert (SME) on production applications and operations tools
  • Participate in application release implementations
  • Analyze and interpret application logs to determine problem areas
  • Enhance current application and device monitoring systems
  • Help evaluate application performance statistics, including application and system response times

What we are looking for

Basic Qualifications

  • High School Diploma/GED required
  • Certification in Computer Science or a related field required
  • Working knowledge of the Linux and Windows operating systems
  • Ability to troubleshoot web server technologies such as Apache, IIS or Nginx by connecting to those servers and analyzing the application, server and operating system logs to identify the root cause and resolve issues affecting system availability in production
  • Experience supporting middleware such as Tomcat, JBoss or another application server by evaluating the middleware state, analyzing the logs and identifying a solution to be executed
  • Experience supporting monitoring, alerting or pipeline analysis tools such as AppDynamics, Splunk or Nagios, including optimizing their current configuration and maintaining their availability
  • Ability to troubleshoot networks built on Cisco switches, routers and firewalls and F5 load balancers by connecting to those devices and identifying the potential root cause while analyzing network traffic and the performance and state of the devices
  • Ability to write basic Linux shell scripts incorporating grep, sed or awk
  • Ability to troubleshoot Java application servers using the appropriate commands and JVM arguments

Additional Characteristics

  • Bachelor's or Master's degree in Computer Science preferred
  • Fluency in Python, Ruby or another common scripting language
  • Experience in problem solving and troubleshooting network latency and connectivity issues
  • Experience developing operational automation in a distributed environment
  • Ability to perform database queries across database platforms
  • Knowledge of automated and centralized job scheduling
  • Experience in a mixed on-premises and cloud environment
  • Experience with a CDN such as Akamai or Cloudflare
  • Experience with VMware
  • Experience with Docker and Kubernetes or another containerized solution
  • Strong collaboration skills and a team-player attitude
  • Good written and verbal communication ability

Please click the link below to be directed to our website to apply:

https://careers-travelclick.icims.com/jobs/3473/staff-site-reliability-engineer/job?mode=job&iis=Job+board&iisn=HIRE+VETERANS

EEO Statement

“All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.”

States

FL, IL

Security Clearance

NO Security Clearance