• *Description
• *The Digital Modernization Sector has an opening for a AWS Administrator/Tivoli Workload Scheduler to support a large healthcare contract.
We are seeking an AWS Administrator/ Tivoli Workload Scheduler engineer to support the day-to-day operations, security, and performance of our AWS cloud infrastructure while managing enterprise-level workload scheduling and automation through Tivoli/IBM Workload Scheduler (TWS/IWS). This role is hands-on and operationally focused, requiring strong troubleshooting skills, disciplined execution, and the ability to support business-critical batch and cloud-native workflows. This role also focuses on The ideal candidate has strong AWS administration experience, a clear grasp of workload automation platforms, and the ability to operate effectively in a production enterprise environment with minimal supervision.
• *Core Purpose**
Maintain the health, security, and performance of AWS environments while ensuring reliable execution of automated job scheduling workflows using TWS.
• *AWS Infrastructure Management
• Provision, configure, and maintain AWS resources including EC2, S3, IAM, and VPCs in alignment with the AWS Well-Architected Framework.
• Perform routine system maintenance, patching, and configuration updates across cloud environments.
• *Workload Automation & Scheduling
• Operate and support the TWS/IWS platform, including job stream monitoring, dependency management, and agent health checks.
• Ensure reliable execution of daily and intraday production workloads.
• Monitor batch workloads and ensure successful completion of processes.
• Implement scheduling solutions for application teams
• Follow enterprise change management processes and procedures
• Troubleshoot and resolve job failures, scheduling conflicts, and dependency issues
• Assist with workload automation platform maintenance activities
• *Security & Compliance
• Enforce least-privilege access controls using AWS IAM.
• Monitor and remediate security findings using tools such as AWS Security Hub.
• Support audit and compliance requirements related to cloud infrastructure and automation platforms.
• *Incident Response & Operations Support
• Triage, diagnose, and resolve incidents involving AWS infrastructure and failed or delayed scheduling batches.
• Escalate complex issues appropriately and participate in root-cause analysis efforts.
• *Performance & Cost Optimization
• Identify performance bottlenecks and inefficiencies related to cloud resource usage and job throughput.
• Implement auto-scaling, scheduling adjustments, or script improvements to improve performance and control costs.
• *Backup & Disaster Recovery
• Maintain, test, and document backup and disaster recovery strategies for AWS resources and TWS databases.
• Participate in disaster recovery exercises and validate recovery procedures.
• *Basic Qualifications
• *Education
• Bachelor’s degree in computer science, Information Technology, or a related field,. Additional years of experience may be substituted in lieu of degree.
• *Experience
• 4–6 years of experience in IT operations or infrastructure support roles.
• At least 1 year of hands-on AWS administration experience with limited supervision.
• An understanding of workload automation concepts including: Job dependencies, workstations, and job streams
• Ability to analyze logs and diagnose issues
• Must be able to obtain and maintain a public trust clearance.
• All candidates supporting the CMS programs must havelived in theUnited Statesatleastthree(3)out ofthelastfive(5)yearsprior in order to be considered.
• *Certifications
• AWS Certified SysOps Administrator – Associate
• *Tivoli / IBM Workload Scheduler (TWS) Knowledge
• *Core Architecture
• Understanding of TWS distributed architecture, including:
• Master Domain Manager (MDM): Central scheduling authority and database.
• Dynamic Workload Console (DWC): Web-based interface for job design and monitoring.
• Agents: Including Fault-Tolerant Agents (FTA) capable of running jobs during temporary MDM communication outages.
• *AWS Integration
• Deploy and manage TWS agents on EC2 instances to execute scripts and applications.
• Support cloud-native workflow triggers using AWS Lambda or Step Functions integrated with TWS.
• Manage file-based dependencies leveraging Amazon S3.
• *Daily Operations & Maintenance
• Create, modify, and maintain job definitions, calendars, and scheduling resources.
• Troubleshoot job failures and delays by analyzing
Apply tot his job
Apply To this Job