United States, United States · Information Technology
Observability Architect-Remote
*No Sponsorship or 3rd Parties at this time
Remote - 100%
1-year contract - 32 hrs a week
Great Company
Unlimited Growth
*Very Specific Skill Set
Scope: We are seeking a skilled Senior Engineer or Architect experienced in AIOps, Observability, and SRE engineering practices to join our team to enhance IT software engineering through Architecture and Governance practices. In this role, you will be responsible for designing, prototyping, testing, and documenting solutions that mature the enterprise observability, performance, resilience and overall reliability of our IT systems and applications.
Principal Responsibilities and Outputs:
Architectural Guidance: Publish technology strategies and supporting architectures to mature business and technology operations to enable AI/MLOps.
Standards and Best Practices: Publish observability standards and best practices for adopting new and existing frameworks or technologies.
Technical Solutions: Translate business goals into technical solution designs that include descriptive and diagnostic capabilities, engineered in at delivery, and that satisfy the non-functional requirements of business solutions.
Delivery Enhancements: Create actionable Observability Driven Development procedures to ensure consistent adoption of open-standard industry frameworks (e.g., OTel, MELTS).
AI Augmented Testing: Deliver strategies that help enable more AI-augmented testing capabilities and empower federated execution and central enterprise governance.
Communication and Education: Develop and routinely publish communication as well as training and education sessions for knowledge transfer and raising awareness of current or future enterprise direction.
Reliability Design: Design and implement full stack applications for reliability and integration patterns to enable more operational predictability and prescriptive disruption response.
Monitoring and Alerting: Establish appropriate monitoring and alerting standards for performance, scalability, availability, and reliability.
Potential Deliverables:
Historical Analytics Architecture (Requirements Documents, Logical, and Technical Designs)
Data Fabric Architecture (Requirements Documents, Logical, and Technical Designs)
Alerting Architecture (Requirements Documents, Logical, and Technical Designs)
AI Ops Strategy
AI Observability Strategy
OTel Standards and Strategies
Logging Standards
Various prototype work
Observability API demonstrations
AIOps (Predictive and Prescriptive activities) demonstrations
Observability Maturity Models and Assessment Structure
Creating Training and Education Materials
Skills
Designing, prototyping, testing, and documenting solutions that mature the enterprise observability, performance, resilience, and overall reliability