AI Operations Engineer
Job Type: Contract (6 months, potential to renew for 12 months)
Location: Remote (LATAM)
Compensation: in USD
At Talentus, we are looking for you!
We are a US company with a strong presence in LATAM and across 20 countries worldwide. Our key near-shore BPO services include smart-sourcing, dedicated or cluster teams, managed IT services, software outsourcing, and top ERP & CRM solutions, driven by our practices across diverse industries.
We are currently seeking an AI Operations Engineer to join one of our US clients. The AI Operations Engineer will play a crucial role in supporting implementations and managing IT infrastructure environments to ensure smooth operations. The ideal candidate will have a strong background in data science, experience with AI/ML, and familiarity with ServiceNow, particularly in the context of automation and incident management.
Responsibilities:
* Implementation & Environment Management:
o Support the implementation and management of AI/ML solutions within the IT infrastructure.
o Ensure operational efficiency across different environments by monitoring performance and troubleshooting issues.
* Monitoring Strategy:
o Assist in defining and managing the enterprise monitoring strategy for observability and event management.
o Develop a comprehensive monitoring approach covering metrics, logs, traces, alerts, dashboards, and reports.
* Automation & AI Integration:
o Leverage AI and machine learning techniques for predictive analytics, incident forecasting, and automating the detection, diagnosis, and resolution of incidents.
o Collaborate with IT teams to optimize monitoring processes and tools, ensuring alignment with business requirements.
* Collaboration & Communication:
o Work closely with end users, production support teams, and stakeholders to gather requirements and deliver effective monitoring solutions.
o Provide guidance and support to users on maximizing the benefits of monitoring capabilities.
* Tool Evaluation & Integration:
o Evaluate and select appropriate monitoring tools and platforms, integrating them with existing IT systems and processes.
o Assist in managing the roadmap for monitoring and automation tools, proposing innovative solutions in line with technology strategy.
* Performance Management:
o Establish and maintain service level agreements (SLAs) and key performance indicators (KPIs) to monitor and report on system performance and quality.
o Collaborate with IT teams to ensure best practices are followed across the enterprise.
Required Skills:
* Background in Data Science or a related field.
* Familiarity with AI/ML concepts and automation, particularly within ServiceNow.
* Experience working in IT operations or on an IT operations team.
* Strong knowledge of observability and event management best practices.
* Familiarity with multi-cloud platforms and ITIL processes, especially in event and incident management.
* Experience with monitoring tools such as SL1 and ServiceNow ITOM.
* Excellent problem-solving skills and the ability to work in a dynamic environment.
* Strong communication skills for collaboration with technical and non-technical stakeholders.
What do we offer?
* Full-time remote role
* Competitive hourly rate paid in $USD
* Opportunities for career growth and professional development
* Chance to collaborate with a global team across diverse industries and countries
* Continuous Learning and Improvement with access to Udemy courses
If you are passionate about leveraging AI and automation to enhance IT operations and are looking for a dynamic opportunity, we want to hear from you!
#J-18808-Ljbffr