Monitor and manage servers, network devices, and storage systems.
Ensure the availability, reliability, and performance of IT infrastructure.
Perform regular maintenance, upgrades, and patching of servers and network devices.
Create, modify, and delete user accounts and access permissions.
Ensure proper user access controls and security policies are in place.
Manage user authentication, including password policies and multi-factor authentication.
Implement and maintain backup and disaster recovery solutions.
Perform regular backups, verify backup integrity, and test restoration procedures.
Develop and maintain backup and recovery policies and procedures.
Implement and maintain security measures to protect IT systems from unauthorized access, cyber threats, and malware.
Monitor security logs, perform security audits, and investigate security incidents.
Ensure compliance with security policies, standards, and regulations.
Plan, test, and deploy security patches, updates, and hotfixes on servers, network devices, and endpoints.
Maintain patch management policies and procedures to ensure system security and compliance.
Monitor system performance, health, and capacity.
Identify and resolve performance issues, bottlenecks, and resource constraints.
Apply tuning measures to optimize system performance.
Log, prioritize, and manage incidents and problems reported by end-users and monitoring systems.
Investigate, diagnose, and resolve system-related incidents and problems.
Perform root cause analysis and implement preventive measures to avoid recurrence.
Document system configurations, procedures, and policies.
Generate and distribute reports on system performance, incidents, and changes.
Keep system documentation up to date and ensure it’s easily accessible to the team.
Plan, test, and implement changes to IT systems and infrastructure.
Ensure changes are managed and documented according to change management policies.
Coordinate with stakeholders to minimize disruptions during change implementation.
Develop, implement, and maintain disaster recovery plans and procedures.
Conduct regular disaster recovery tests and exercises.
Update and improve disaster recovery plans based on test results and changes in the environment.
Qualifications:
Bachelor’s degree in Computer Science, Information Technology, Business Administration, or a related field.
Certifications such as CompTIA Security+, Microsoft Certified: Azure Administrator Associate, or equivalent.
Relevant vendor-specific certifications such as Microsoft Certified: Windows Server, Cisco Certified Network Associate (CCNA), or Red Hat Certified Engineer (RHCE).
Excellent problem-solving and troubleshooting skills.
Strong knowledge of server and network infrastructure.
Familiarity with cloud platforms and services, such as AWS, Azure, or Google Cloud Platform.
Proficiency in server operating systems, such as Windows Server, Linux, or Unix.
Experience with virtualization technologies, such as VMware or Hyper-V.
Familiarity with networking concepts, protocols, and devices.
Knowledge of security best practices and tools.
Excellent communication and interpersonal skills.
Ability to work well in a team and collaborate with stakeholders.