Jobs at CruiTek

View all jobs

Observability Architect-Remote

United States, United States ยท Information Technology
Observability Architect-Remote
*No Sponsorship or 3rd Parties at this time

Remote- 100%
1 year contract-32hrs a week
Great Company
Unlimited Growth
*Very Specific Skill Set

Scope:
We are seeking a skilled Senior Engineer or Architect experienced in AIOps, Observability, and SRE engineering practices to join our team to enhance IT software engineering through Architecture and Governance practices. In this role, you will be responsible for designing, prototyping, testing, and documenting solutions that mature the enterprise observability, performance, resilience and overall reliability of our IT systems and applications.

Principal Responsibilities and Outputs:
  • Architectural Guidance: Publish technology strategies and supporting architectures to mature business and technology operations to enable AI/MLOps.
  • Standards and Best Practices: Publish observability standards and best practices for adopting new and existing frameworks or technologies.
  • Technical Solutions: Translate business goals into technical solutions designs to include descriptive and diagnostic capabilities through engineering at delivery satisfying non-functional requirements for business solutions.
  • Delivery Enhancements: Create actionable Observability Driven Development procedures to ensure consistent adoption of open standard (i.e. OTel, MELTS) industry frameworks.
  • AI Augmented Testing: Deliver strategies to help enable more AI-Augmented testing capabilities empower federated execution and central enterprise governance.
  • Communication and Education: Develop and routinely publish communication as well as training and education sessions for knowledge transfer and raising awareness of current or future enterprise direction.
  • Reliability Design: Design and implement full stack applications for reliability and integration patterns to enable more operational predictability and prescriptive disruption response.
  • Monitoring and Alerting: Establish appropriate monitoring and alerting standards for performance, scalability, availability, and reliability.

Potential deliverables :
  • Historical Analytics Architecture (Requirements Documents, Logical, and Technical Designs)
  • Data Fabric Architecture (Requirements Documents, Logical, and Technical Designs)
  • Alerting Architecture (Requirements Documents, Logical, and Technical Designs)
  • AI Ops Strategy
  • AI Observability Strategy
  • OTel Standards and Strategies
  • Logging Standards
  • Various Prototype work
  • Observability API demonstrations
  • AIOps (Predictive and Prescriptive activities) demonstrations
  • Observability Maturity Models and Assessment Structure
  • Creating Training and Education Materials

Skills
  • Designing, prototyping, testing, and documenting solutions that mature the enterprise observability, performance, resilience, and overall reliability

Share This Job

Powered by