Site Reliability Engineer Lead

Description

Summary:

We are seeking a highly skilled Digital Site Reliability Engineer (SRE) with expertise in Risk Management and Privileged Access Management (PAM). In this critical role, you will ensure the reliability, scalability, and security of our digital systems while managing and mitigating risks associated with privileged access. Your work will directly impact the security and operational stability of our digital platforms.

Responsibilities:

  • Site Reliability Engineering
    • Design, implement, and manage highly reliable and scalable digital systems.
    • Develop and maintain automation for deployment, monitoring, and incident response to minimize downtime.
    • Collaborate with application and infrastructure teams to improve system performance and reliability.
  • Risk Management
    • Identify, evaluate, and mitigate risks within the digital infrastructure, particularly around privileged access.
    • Develop and enforce policies and procedures to reduce vulnerabilities and operational risks.
    • Conduct regular audits of systems and processes to ensure compliance with organizational and regulatory standards.
  • Privileged Access Management (PAM)
    • Implement and maintain PAM tools and frameworks to safeguard critical systems and data.
    • Design workflows for privileged account onboarding, offboarding, and lifecycle management.
    • Monitor and analyze access patterns to detect anomalies and ensure proper use of privileged credentials.
  • Incident Response and Problem Management
    • Act as a subject matter expert during security and operational incidents related to risk and access management.
    • Drive root cause analysis and develop long-term solutions for systemic issues.
  • Collaboration and Stakeholder Engagement
    • Partner with cross-functional teams, including cybersecurity, application support, and compliance teams, to align on security and operational goals.
    • Communicate technical risks and mitigations effectively to both technical and non-technical stakeholders.
  • Continuous Improvement
    • Continuously optimize systems and processes for better performance, reliability, and security.
    • Stay updated on industry trends, emerging technologies, and best practices related to SRE, risk management, and PAM
  • In addition to this –
    • Coordinate planning and execution with internal software engineering teams, business partners and technical leaders across the division.
    • Own deployment, availability, reliability, performance, and customer escalation targets for these environments
    • Proactive identification and reduction of issues through design, testing, and implementation of software
    • Uphold high organizational standard of great employee and team satisfaction.
    • Optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
    • Provide primary operational support and engineering for multiple large, distributed software applications.
    • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.

Basic Qualifications:

  • Bachelor’s degree in Computer Science, Information Systems, or a related field
  • 5+ years of experience in Site Reliability Engineering, Security Engineering, or a similar role
  • 5+ years experience risk management frameworks, methodologies, and tools.
  • 5+ years experience implementing and managing PAM solutions (e.g., CyberArk, BeyondTrust, etc.).
  • Proficiency in scripting and automation (e.g., Python, PowerShell, Ansible, etc.).
  • 5+ years experience monitoring and observability tools (e.g., Dynatrace, Prometheus, Grafana, Splunk, etc.).
  • 5+ years experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization (e.g., Kubernetes, Docker).

Preferred Qualifications:

  • Masters Degree preferred
  • Microsoft Office experience 
  • Experience working in multi-platform environment
  • Ability to balance both development and support roles
  • Experience in working on projects that involve business segments
  • Strong analytical, strong troubleshooting skills and excellent communication skills
  • Strong interpersonal skills, focus on customer service, and the ability to work well with other IT, vendor, and business groups
  • Strong problem-solving and analytical skills with a focus on security and operational excellence.
  • Excellent communication skills with the ability to engage stakeholders at all levels.


Exempt Status: (Yes = not eligible for overtime pay) (No = eligible for overtime pay)

Yes

Workplace Type:

Office

Huntington is an equal opportunity and affirmative action employer and is committed to providing equal employment opportunities for all regardless of race, color, religion, sex, national origin, age, disability, sexual orientation, veteran status, gender identity and expression, genetic information, or any other basis protected by local, state, or federal law.

Tobacco-Free Hiring Practice: Visit Huntington's Career Web Site for more details.

Agency Statement: Huntington does not accept solicitation from Third Party Recruiters for any position