Site Reliability Engineer with solid experience in observability and monitoring of critical systems in high-availability enterprise environments. Developing expertise in SRE practices, including golden signals monitoring (latency, traffic, errors, saturation), service health assessment, and collaborative SLO implementation under senior technical guidance. Currently leading my first critical project while completing my Computer Engineering degree.
- 📊 Golden Signals Monitoring (Latency, Traffic, Errors, Saturation)
- 🎯 SLI/SLO Implementation & Management
- 📈 Service Health Threshold Definition
- 🔍 Error Budget Tracking & Analysis
- 🛠️ MTTR Optimization through incident correlation
- 📋 Post-mortem Analysis for continuous improvement
- 📊 Golden signals monitoring using BigPanda, Datadog, and the Grafana stack
- 🔍 SLI/SLO framework development with team collaboration
- 📈 Service health threshold optimization for enterprise infrastructure
- 🛠️ MTTR reduction through improved incident correlation
- 🎓 Completing Computer Engineering degree (4th year)
- Led implementation of an observability architecture for enterprise infrastructure
- Configuring monitoring for Kubernetes environments using Prometheus Operator (test/lab environments)
- Developing Grafana Cloud dashboards for key components: Kubernetes, RabbitMQ, Redis, AWS RDS, S3
- Managing the project with a reduced team while delivering quality results on time
- Root cause analysis and post-mortem investigations using BigPanda correlation platform
- Reviewing incident and change tickets in ServiceNow for pattern identification
- Analyzing existing monitoring in Splunk and supporting migration to Datadog
- Optimizing monitoring correlation logic to reduce alert noise
- Building HA architecture with Grafana and Prometheus for production monitoring
- Configuring monitors to validate availability in critical applications
- Implementing centralized logging with Loki and Network Logs Concentrator
- Computer Engineering - Instituto Profesional Duoc UC (2022-2026) - Currently in 4th year
- Computer Programming Analyst - Instituto Profesional Duoc UC (2022-2024) - Grade: 60/70
- Scrum Foundation Professional - CertiProf
- Oracle Cloud Infrastructure Foundations Associate
- Microsoft Certified: Azure AI Fundamentals
- Getting Started with OpenTofu - The Linux Foundation
- Data Science - FCFM, Universidad de Chile
- AWS Educate - Serverless, Machine Learning, Cloud Ops
- Google Cloud Skill Badges - Multiple badges
Final project for Duoc UC - Grade: 60/70
Tech: React, Bootstrap
Academic project
Tech: React, Bootstrap
Real freelance project
Tech: HTML, CSS, JavaScript
Personal practice project
Tech: React, Bootstrap
- 🔍 Advanced incident correlation techniques with BigPanda
- 📊 Kubernetes monitoring optimization for production environments
- 🎯 SRE best practices and industry standards
- 🎓 Completing my engineering degree (graduating 2026)
- 🏢 Currently employed at Innfinit as a Site Reliability Engineer (Nov 2022 - Present)
- 🎯 Open to discussing new SRE opportunities
- 📍 Located in Santiago, Chile
- 🌍 Open to remote work
- 📧 Email: fabianignaciomv@gmail.com
- 🔗 LinkedIn: linkedin.com/in/fabianimv
- 📱 Phone: +56 9 6414 2352
- 🌐 Portfolio: fabianimv.github.io/portfolio
"I believe in honest professional growth. This profile reflects my real experience and current skills. I'm passionate about learning, transparent about my level, and committed to continuous improvement in the SRE field."