Bizoforce Hiring | L2 SRE Operations Engineer – Onsite | Texas
Bizoforce: Accelerating Digital Innovation
Job Description
Bizoforce is hiring an experienced L2 SRE Operations Engineer for an onsite opportunity with a leading telecommunications company in Texas. This role is ideal for operations engineers who excel in incident resolution, automation, and system reliability. The L2 SRE acts as the bridge between operations and engineering—resolving complex issues, driving automation, and mentoring L1 teams to build scalable and self-healing systems.
Experience
4–9 years in SRE, DevOps, or Systems Engineering at a Senior or Principal Engineer level.
Key Responsibilities
- Resolve escalated incidents across Kubernetes, API Proxy, WAF, databases, and infrastructure platforms.
- Design, maintain, and improve runbooks, automating manual steps wherever possible.
- Develop and enhance self-healing systems and self-service tools for internal users.
- Analyze incident trends, propose monitoring and capacity improvements, and enhance reliability.
- Collaborate with engineering teams on deployments, upgrades, and performance tuning.
- Lead incident management, root cause analysis (RCA), and postmortem documentation.
- Mentor L1 engineers, improving operational maturity and automation coverage.
Required Skills (Must-Have)
- Advanced Incident Troubleshooting & Resolution — Diagnose and resolve multi-layer issues (infrastructure, application, network).
- Kubernetes & Container Orchestration — Skilled in deployments, scaling, and debugging cluster-level issues.
- Automation & Scripting — Proficiency in Python, Go, Bash, Ansible, Terraform for reducing manual toil.
- Observability & Monitoring — Expertise with Prometheus, Grafana, Splunk, and alerting systems.
- CI/CD & Infrastructure as Code (IaC) — Familiarity with GitOps workflows, Jenkins, and cloud provisioning tools.
- Database Troubleshooting — Knowledge of SQL and NoSQL performance tuning and issue resolution.
- Incident Management & RCA — Act as Incident Commander, lead bridge calls, and document learnings.
- Mentorship & Runbook Improvement — Guide L1 engineers and continuously enhance operational runbooks.
Preferred Skills (Nice-to-Have)
- Experience in cloud platforms (AWS, Azure, GCP) for provisioning and scaling workloads.
- Knowledge of security and WAF management, including rule tuning and vulnerability handling.
- Exposure to capacity planning and performance optimization in large-scale environments.
- Experience integrating AIOps or ChatOps for automated anomaly detection and remediation.
- Hands-on experience managing hybrid infrastructure (on-prem + cloud) environments.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Proven hands-on experience with Kubernetes, automation tools, CI/CD pipelines, and monitoring systems.
- Strong understanding of networking, database systems, and infrastructure reliability.
- Excellent analytical, communication, and collaboration skills.
- Must be a US Citizen or Green Card holder.
Assignment Details
- Start Date: Based on availability
- Location: Texas (Onsite)
- End Client: Confidential (Leading Telecommunications Company)
- Visa Preference: GC or USC
- Positions Open: Multiple
Why Join Bizoforce
- Work with a top-tier telecom enterprise on mission-critical systems.
- Be part of an innovative SRE team focused on automation and reliability.
- Access to a collaborative, growth-driven environment with global exposure.
- Competitive compensation and long-term onsite engagement.
Requirements
- 4 years of experience required
Share this job
About the Company
Bizoforce: Accelerating Digital Innovation
Chicago