Back to Trellion
DevOps Engineer
Montreal, CACAD110,000-140,000/yearlyDec 10, 2025HYBRIDengineeringMID
About the Role
Trellion is building the infrastructure behind intelligent hiring systems. Our platform runs real-time AI pipelines, large-scale data ingestion, and high-availability matching systems. This role is about making that machinery fast, reliable, and unbreakable.
You will own infrastructure end-to-end, designing systems that keep our AI, APIs, data pipelines, and internal tools running in production. You will build platforms rather than babysit servers.
Responsibilities
- Design and maintain cloud infrastructure for AI and data-heavy workloads
- Build and maintain CI/CD pipelines for backend and ML services
- Develop and operate containerized services with Docker and Kubernetes
- Implement observability, logging, and alerting
- Use Infrastructure as Code to automate environments
- Apply security hardening and access control
- Optimize performance and control costs
- Lead incident response and ensure production reliability
Requirements
Strong in most of the following:
- Linux in production
- Docker and Kubernetes
- CI/CD systems
- Cloud platforms
- Infrastructure as Code
- Networking fundamentals
- Monitoring, logging, and alerting
- Security best practices
- Git workflows
Nice to have:
- Experience supporting ML pipelines
- Distributed systems
- High-traffic API platforms
- Kafka, Redis, PostgreSQL, or similar
What We Care About
- You automate instead of repeating yourself
- You build systems that fail gracefully
- You think in failure modes and recovery paths
- You hate flaky deployments
- You document what matters
- You move fast without being reckless
Compensation & Benefits
- Base salary: $110,000–$140,000 CAD
- Equity participation
- Hybrid work in Montreal
- Ownership over production infrastructure
- Zero babysitting culture
How to Apply
Send your resume, GitHub, and any production infrastructure you've owned or scaled to [email protected]
Requirements
LinuxDockerKubernetesCI/CDCloud platformsInfrastructure as CodeNetworkingMonitoringLoggingAlertingSecurity best practicesGitKafkaRedisPostgreSQL
Ready to apply?
Join Trellion