27242- Systems Engineer
Chicago, IL 60606 US
Job Description
Systems Engineer
#JP27242
Location: Chicago, IL (close to all trains for an easy commute from Chicagoland and NW Indiana)
Work Schedule: Hybrid (only 2 days onsite) – must work remote on Saturdays
Duration: 15-18 Months+ (initially until December 2023)
Description:
- The Systems Engineer - Monitoring role helps to run and support, with supervision, the systems that provide monitoring, visibility and management of the network infrastructure, applications and systems at the client.
- This includes but is not limited to status, health, performance and capacity planning to avoid problems with system reliability and throughput. The incumbent must have a general understanding of systems architecture (OS and hardware), software configuration and networking technologies.
Principal Accountabilities:
- Handle sysadmin responsibilities to several of the monitoring tools, including but not limited to Splunk, BMC ITSM ecosystem (Helix/remedy, TSOM, Discovery), xMatters, WebWatchBot, AppDynamics, etc.
- Understand the architecture of these tools and how they work and integrate.
- Develop and educate end-users with Splunk Searches/Dashboards, managing Splunk Forwarders, indexes, Search Heads, and cluster mgmt., troubleshooting and documentation. Pursue Splunk best practices (apps, add-ons, searches, etc.).
- Responsibilities include upgrades, vulnerability remediation, end-user support, etc.
- Support systems/process in local time with full end to end ownership of issues in time zone.
- Be available and responsible to all high risk / critical issues and proactively establish clear communications
- Ensure our environments are highly available and scalable. Respond quickly to issues and escalations.
- Apply automation standards to improve speed and quality of applications using specific tools for automation such as scripting languages, Chef etc.
- Track and/or find answers to gaps in our automation
- Accurately defines complex problem statements; Gathers and compares data about problems, documents the details and prepares analysis reports, seeking out all feasible alternatives.
- Demonstrates knowledge of skilled systems (Linux/Windows), distributed computing architecture (client server, intranet/internet, networking technologies), hardware platforms and resources - CPU, memory, virtualization, clustering and cloud computing. Configures systems and modifies settings to ensure proper function.
- Develops work breakdown structures.
- Handles all deployments and rectifies gaps in instruction including script deployment and validation, automating where possible.
- Proactively looks for opportunities to harden processes/systems by means of audits, automation, and/or other measures.
- Identifies & defines problem statement and its upstream/downstream impacts and leads resolution; Simplifies/decomposes the problem to smaller problems to reduce complexity; Troubleshoots most known and some new issues, determines the root cause, provides solutions, and takes initiative to see the solution through.
- Documents solution, provides clarification and support to colleagues, and reviews work products of others within the team.