Site Reliability Engineer
At Bloq.it, we’ve created the world’s leading smart locker solution. Solving online deliveries by enabling everyone to participate easily, reducing delivery costs and making them more sustainable.
We’re quickly expanding, and after growing at 1000% for three years in a row, we’re now the fastest-growing Smart Locker company in the world and one of the fastest growing scale-ups in Europe.
We are in search of a Site Reliability Engineer to join our innovative team as our new #bloqstar. In this role, you'll play a crucial role in maintaining the health, stability, and performance of our production systems. This role is designed for a highly technical engineer who thrives in troubleshooting complex issues, collaborating with cross-functional teams, and building observability and monitoring services. As part of the 3rd level support team, you will be responsible for investigating and resolving escalated issues that affect system availability, performance, and reliability.
What you’ll be doing:
- Provide expert-level troubleshooting and incident management for escalated production issues, including performance degradation, outages, and system anomalies.
- Diagnose and resolve complex issues across infrastructure, applications, and services, working closely with development teams to identify root causes.
- Collaborate with operations, development, and security teams to drive proactive improvements in the reliability, scalability, and availability of systems.
- Maintain and enhance system observability tools, ensuring proper monitoring, alerting, and logging to detect issues early and respond to incidents effectively.
- Contribute to the creation and refinement of runbooks, incident response protocols, and other technical documentation for internal teams.
- Automate repetitive tasks to improve operational efficiency and reduce toil.
- Define and implement incident response processes, including root cause analysis and post-mortems.
What you’ll bring to the table:
- At least 3 years of proven professional experience as Site Reliability Engineer (SRE) or a similar role.
- Strong expertise in monitoring and observability tools (Prometheus, Grafana, Datadog, elasticsearch, kibana, New Relic, OpenTelemetry, etc.).
- Experience with NoSQL Databases (MongoDB or Elasticsearch are a nice to have)
- Deep understanding of incident management, post-mortem analysis, and on-call best practices.
- Experience with AWS cloud platform.
- Experience creating automations and tooling.
- Strong knowledge skills in Python and JavaScript/TypeScript.
- Expertise in Unix/Linux console debugging, using commands and tools such as grep, awk, sed, strace, tcpdump, lsof, journalctl, and others.
- A problem-solving mindset with a data-driven approach to resilience engineering.
It would be great if you would also have:
- Prior experience setting up SRE practices from scratch.
- Experience in products combining both hardware and software.
- Experience in high-growth, product-driven startups.
- Familiarity with ITIL or incident management frameworks.
- Experience implementing error budgets and reliability SLAs.
- Experience with Kubernetes and containerization.
Why join us?
- The opportunity to join ourSoftware team and play a pivotal role in contributing to innovative solutions that redefine Bloq.it's revolution in the smart locker industry ;
- A dynamic and fast-paced work environment with a culture of innovation, collaboration, and continuous learning ;
- Competitive salary and flexible benefits package, tailored to your experience and skills ;
- Eligibility for performance-based bonus, tied to your results and designed to reward your impact ;
- Work how you work best - we offer a remote-friendly policy and flexible hours so you can stay productive and keep life balanced ;
- Portuguese Health Insurance ;
- Unlimited days off (subject to manager approval).
Ready to join the revolution?
- Department
- Software - Software Enablement
- Role
- Site Reliability Engineer
- Locations
- Lisboa, Portugal
- Remote status
- Fully Remote
- Employment type
- Full-time
About Bloq.it
At Bloq.it, we provide end-to-end solutions for Smart Lockers, and our software ecosystem, Bloq.it OS, is the leading tech solution available in the market.
We've had the pleasure of working with some of the biggest names in e-commerce, logistics, and retail in Europe, such as Vinted and DHL.
We have recently become the fastest-growing Smart Locker company in the world. Before Bloq.it the industry had stagnated and lacked innovation and good products. We strive to have the same impact in this industry as Tesla has had on the car industry.
We believe that Smart Lockers are a big part of the future for everyone, and we want to play our part in making sure that Smart Lockers become as mainstream as the mobile phone.