Site Reliability Engineer (SRE) Job at Hirekeyz Inc, Omaha, NE

b201TDUxWXhlT3ZNUS9SVElDOGNPNXV2b3c9PQ==
  • Hirekeyz Inc
  • Omaha, NE

Job Description

Title: Site Reliability Engineer (SRE)

Location: Omaha, NE / Dallas, TX

Job Type: Full Time

Job Summary :

Seasoned Site Reliability Engineer (SRE) with 5+ years of experience in supporting complex, large-scale distributed systems. Highly skilled in managing production failures, conducting root cause analysis, and driving effective remediation. Strong communicator with expertise in ing, monitoring, and release management, complemented by automation proficiency and a keen ability to learn quickly.

This role involves providing 24/7 support as part of the SRE team, ensuring the reliability and performance of mission-critical Java, .NET, and Batch applications deployed across GCP, PCF, and on-premise environments.

Years of experience needed

Candidate experience 5+ Years

Technical Skills:

Expertise in understanding large scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.

Should have solid hands-on experience in troubleshooting and fixing application failures, application Performance degradation, Code issues, cloud platform issues, Batch Failures, Infra failures, DB failures, Network failures.

Hands-on experience in performing Production deployments using CI/CD and exposure to deployment strategies.

Experience in troubleshooting of Linux/Unix.

Monitor the application/Services/batch availability.

Act quickly on the application s(Performance, Availability) and Batch Job failures

Perform the required analysis (Code/Log) and escalate to the Engineering team as required.

Initiate and drive the Techlines in case of outages/major incidents/Batch abends and ensure Service Restoration in the least time possible.

Effectively handle the Incident, Problem, Release and Change management.

Own and deliver the user stories assigned as part of the sprint.

o The user stories range from application code Debugging, Issue analysis, Code fix, Knowledge base creation, documentation of SOP's, Production Deployments, Pre & Post Patching/Maintenance activities, Service Requests.

o Build monitoring solutions using APM tools like Splunk, Appdynamics, Thousand Eyes, ITRS, AppMetrics, MoogSoft, Kafka etc.

o Automate of day-day operational tasks.

o Be part of the Exit reviews to ensure the best practices are followed to have the right code deployed to Production systems

o Provide feedback/recommend improvements to the system which would enable highly stable systems.

Strong understanding of Networking Concepts (TCP/IP, SSL/TLS, IPSec, VPN etc), Firewall and Load Balancers.

Experience in Scripting Shell/Powershell/Python

Strong Experience in working with any Cloud-based infrastructure (PCF, GCP, AWS, Azure Cloud or others)

Certifications Needed:

As per industry standards

Skills

PRIMARY COMPETENCY : Production Support PRIMARY SKILL : Production Support PRIMARY SKILL PERCENTAGE : 51 SECONDARY COMPETENCY : Unix SECONDARY SKILL : Linux Administration SECONDARY SKILL PERCENTAGE : 25 TERTIARY COMPETENCY : Tools TERTIARY SKILL : Splunk TERTIARY SKILL PERCENTAGE : 24

Job Tags

Full time,

Similar Jobs

RCS Staffing

Remote Engineering Recruiter (Work from Home) Job at RCS Staffing

 ...potential! We have nationwide Engineering jobs for you to fill! You work remote 100%! Required Experience 1-2 years Staffing Agency...  ...1-2 years of High Volume Staffing Experience Work From Home Experience Recruiting Engineering related roles is a plus Recruiting... 

The Peter G. Peterson Foundation

Research Associate Job at The Peter G. Peterson Foundation

 ...put America on a sustainable fiscal path. As a non-partisan organization, the Foundation engages in grant-making, partnerships, and research to educate and involve Americans from a variety of perspectives. Department Summary The Research Team is a group of policy... 

Cisco Equipment Rentals LLC

Field Technician - Heavy Equipment Dealership Job at Cisco Equipment Rentals LLC

Cisco Equipment Rentals is seeking a highly skilled and motivated Field Technician to provide exceptional service and maintenance for our fleet of heavy equipment. As a Field Technician, you will work directly at customer job sites to diagnose, repair, and maintain equipment...

Total Quality Logistics

Entry Level Recruiter Job at Total Quality Logistics

 ...Develop and maintain strong relationships with your hiring managers, peers and recruiting leadership What you need: Recruiting experience preferred, but no experience required - we provide paid training and an elite mentoring program Thrive in a metrics-driven... 

International Leadership of Texas

Attendance Clerk Job at International Leadership of Texas

 ...Public Education Information Management System (PEIMS) data, and grades. Qualifications: Education/Certification: High school diploma or GED Special Knowledge/Skills: Ability to use software to develop spreadsheets and databases, and do word processing...