DevOps Engineer
18 hours ago
CommonShare is building infrastructure for trustworthy, transparent supply chains. We help brands, suppliers, and certifiers work from a shared source of truth on sustainability data, so better products can reach the market faster and with more confidence. We're a small, fast-moving team working at the intersection of technology, sustainability, and commerce.
The Role
We're looking for a DevOps / Platform Engineer to own and evolve the infrastructure behind a new set of data-driven, user-facing workflows and conversational experiences on our platform.
You'll design and operate the cloud environment, CI/CD pipelines, and observability stack that power:
- Configurable, multi-step user interactions
- Data-intensive backend services
- Integrations with AI-powered services and chat-based features
This is a hands-on role with a strong focus on reliability, security, and developer experience.
Responsibilities
- Design, build, and maintain cloud infrastructure (e.g., on AWS/GCP) for our core platform and new backend services.
- Implement infrastructure as code (Terraform/Pulumi/CloudFormation or similar) to manage environments in a repeatable way.
- Own CI/CD pipelines (e.g., GitHub Actions, GitLab CI, CircleCI) to ensure fast, reliable deployments across multiple services.
- Containerize and orchestrate services using Docker and (ideally) Kubernetes or a similar orchestrator.
- Set up and improve observability: metrics, logging, tracing, alerting, and dashboards so we can see how our systems behave end-to-end.
- Harden security and compliance: secrets management, access control, network/security groups, TLS, and least-privilege by default.
- Optimize performance and cost of the underlying infrastructure, including databases, caches, and external service integrations.
- Support AI-integrated workloads by ensuring reliable connectivity, rate limiting, and graceful degradation when third-party services are slow or unavailable.
- Partner with backend and frontend engineers to improve developer experience: local dev setups, staging environments, feature flags, and rollout strategies.
- Participate in incident response and postmortems, and help build the tooling and processes to reduce their frequency and impact.
Our Stack
We don't expect you to know every piece, but you should be comfortable in a similar environment:
- Languages / services: Ruby, Python, –based backend services
- Data: PostgreSQL, Redis or similar in-memory store, possibly search technologies
- Cloud: AWS or GCP (VPC, EC2/GKE/EKS, RDS/Cloud SQL, load balancers, object storage, etc.)
- IaC: Terraform, Pulumi, or CloudFormation
- CI/CD: GitHub Actions, GitLab CI, CircleCI, or similar
- Containers: Docker, Kubernetes or another orchestrator
- Observability: Prometheus/Grafana, OpenTelemetry, ELK/EFK stack, or comparable tools
- AI / external services: Integrations with third-party APIs, including LLM/AI providers
Requirements
- Have 4+ years of experience in DevOps, Site Reliability, or Platform Engineering roles.
- Have deep experience with at least one major cloud provider (AWS or GCP preferred).
- Are comfortable designing and operating production systems with high reliability expectations.
- Have strong experience with infrastructure as code and CI/CD practices.
- Have worked with containerized applications and at least one orchestration platform.
- Know your way around PostgreSQL or similar databases from an infrastructure standpoint (backups, monitoring, scaling, etc.).
- Have supported backend teams building data-heavy or workflow-heavy applications.
- Communicate clearly, document well, and enjoy collaborating across engineering and product.
Nice to Have
- Experience operating conversational or AI-augmented services (e.g., systems that rely on LLM APIs or other external ML services).
- Experience with service meshes, API gateways, and rate-limiting for external integrations.
- Background in B2B SaaS, multi-tenant architectures, or systems with complex permission models.
- Interest or experience in sustainability, supply chains, or certifications.
How We Work
- We value ownership and give engineers meaningful responsibility over systems and outcomes.
- We ship iteratively and learn from real usage; you'll help define the right guardrails and tooling to move fast safely.
- We're remote-friendly and rely on asynchronous communication and good documentation.