Why Ryan?
Flexible Work Environment
Award-Winning Culture
World-Class Benefits and Compensation
Accelerated Career Path
Community Outreach
Mentorship Opportunities
Wellness-Centric Benefits
Duties and responsibilities:
- System Monitoring: Design, implement, and manage comprehensive monitoring solutions to ensure continuous oversight of servers, networks, and applications.
- Performance Analysis: Regularly analyze system performance metrics to identify bottlenecks, trends, and areas for improvement.
- Incident Management: Respond promptly to system alerts, troubleshoot issues, and coordinate with relevant teams to resolve problems efficiently.
- Capacity Planning: Assess current system capacities and forecast future needs to support business growth, ensuring resources are adequately allocated.
- Collaboration: Work closely with IT, development, and operations teams to communicate performance findings and recommend enhancements.
- Notification and Escalation Tool(s): Maintain the notification and escalation tool to ensure it is properly configured to reduce alert fatigue.
- Documentation: Maintain detailed records of system configurations, performance metrics, and incident resolutions to support knowledge sharing and compliance requirements.
- Continuous Improvement: Stay abreast of industry best practices and emerging technologies to propose and implement improvements to monitoring and performance strategies.
Education and Experience:
- Bachelor’s degree in computer science, Information Technology, or a related field, or equivalent experience.
- 3+ years of experience in IT infrastructure management with a focus on monitoring tools such as SolarWinds, Nagios, Datadog, or other network monitoring solutions.
Required Skills:
- Strong knowledge of monitoring tools (e.g., Nagios, Zabbix, SolarWinds) and performance analysis techniques. Familiarity with scripting languages (e.g., Python, Bash) is advantageous.
- Hands-on experience with SolarWinds products (NPM, SAM, NCM, etc.).
- Experience with network protocols, including TCP/IP, DNS, SNMP, HTTP, and other industry-standard protocols.
- Prior experience in troubleshooting network and server issues, performance tuning, and log analysis.
- Ability to interpret complex data sets to identify issues and recommend actionable solutions.
- Excellent verbal and written communication skills, with the ability to convey technical information to non-technical stakeholders.
- Proven ability to troubleshoot complex system issues methodically and effectively.
- Demonstrated experience working collaboratively in a team-oriented environment.
- Ability to handle multiple priorities and meet deadlines in a fast-paced environment.
- Strong attention to detail and organizational skills to manage complex configurations and monitoring setups.
Preferred Skills:
- Experience with cloud-based infrastructure monitoring solutions (AWS, Azure) is a plus.
- Knowledge of network automation and scripting languages (Python, Ansible) is a plus.
Certificates and Licenses:
- SolarWinds training and certification is desirable. Certifications related to SolarWinds (e.g., SolarWinds Certified Professional) or networking (e.g., CCNA, CompTIA Network+) are a plus.
- Equivalent 3+ years’ experience in a Systems Administrator position can substitute for SolarWinds or Microsoft training and certification.
Supervisory Responsibilities:
• This position does not have direct supervisory responsibilities.
Work Environment:
- Full-time position with occasional on-call responsibilities during emergencies.
- May require travel to secondary data centers or remote sites.
- Collaborative team environment with cross-functional interactions.
Equal Opportunity Employer: disability/veteran