A daily email of jobs matching your skills and preferences.Sign Up 👋
Staff Site Reliability Engineer
Job LocationsUS-FL-Orlando | US-Remote - US | US-IL-Schaumburg
Job ID 2019-3473 Category IT
The Staff Site Reliability Engineer is responsible to provide support for revenue generation production systems. The engineer assists with monitoring, maintenance and problem resolution of TravelClick production applications. The candidate must be able to provide prompt technology operations support in a high energy, fast paced environment.
The successful candidate will be bright, motivated, detailed orientated and willing to go the extra mile to ensure exceptional results for our customers. This is a great opportunity in technology operations at a growing company with opportunities for advancement for the right candidate.
Provide support related to production systems availability incidents and problems
Provide support related to production systems latency incidents and problems
Provide support related to production systems performance incidents and problems
Provide support related to production operations efficiency issues
Support monitoring tools currently in production
Provide emergency response to production systems incidents
Maintain production ticketing system
Maintain the knowledgebase solution platform
Create, Delete and maintain production automation solutions using tools
Automate of day to day tasks
Resolve/remove false-positives alerts
Configure and update alert dashboards
Maintain tasks using task scheduler
Become SME of production applications and operations tools
Participate during application releases implementation
Analyze and interpret application logs to determine problem areas
Enhance current application and device monitoring systems
Help to evaluate application performance statistics including application and system response times
What we are looking for
High School Diploma/GED required
Computer Science or a related field certification required
Working knowledge of the Linux and Windows operating systems
Ability to technically troubleshoot web server technologies such as Apache, IIS or NginX by connecting to those servers and analyzing technical problems within the application, server and operating systems logs to identify the root cause and resolving the issue creating an impact to system's availability in production
Experience technically supporting middleware such as Tomcat, Jboss or other application server by evaluating the middleware state while analyzing the logs and identifying a solution to be executed
Experience supporting monitoring, alerting, or pipeline analysis tool such as AppDynamics, Splunk or Nagios while optimizing the current configuration of those monitoring tools and technically maintaining their availability
Ability to technically troubleshoot networks using Cisco switches, routers, firewalls and F5 load balancers technologies by connecting and identify potential root cause while analyzing the network traffic and the performance/state of those network devices
Ability to write basic Linux shell script incorporating Grep, SED or AWK
Ability to troubleshoot Java application servers while using the appropriate commands and JVM arguments
Bachelor's or Master Degree in Computer Science preferred
Fluency in Python, Ruby or other common scripting language
Experience in problem solving and troubleshooting network latency and connectivity issues
Experience developing operational automation in a distributed environment
Ability to perform database queries across database platforms
Knowledge of automated and centralized job scheduling
Experience in a mixed on-premises and cloud environment
Experience with a CDN such as Akamai, Cloudflare or other
Experience with VMware
Experience with Docker and Kubernetes or other containerized solution
Strong collaboration skills and team player
Good written and verbal communication ability
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
Note to Applicants
IMPORTANT: We contact all applicants via email throughout the hiring process. It is recommended that you add iCIMS (@agents.icims.com) to your Approved/Safe Sender list to ensure that our emails are properly delivered to your inbox and not marked as spam. Please click here for instructions on whitelisting iCIMS.
A new window will open to the job source site.
Growing a career that's right for you is a life-changer, but it's undeniable that the job search gets tougher every year. With automated hiring processes, resume filters and questionable interview practices, finding a job that a tech skillset has become seriously challenging.
That's where we step in. Careeriscope can help lighten the stress load by making your search a bit easier. We help you find matches based on the job search criteria you set, then send a summary of the results in a daily email sent every morning for review.