What You’ll Do
As a Senior DevOps & Solutions Architect, you will be a key leader in designing, building, and scaling the core infrastructure that powers our agentic environment. This isn’t just about maintaining systems; it’s about architecting the future of our enterprise-grade AI solutions. You will be a central partner to product teams and client-facing squads, providing technical leadership and strategic direction to ensure our platform is robust, scalable, and secure.
InteractiveAI runs on two high-performance engines: Product Teams that craft and scale our Agentic IDE, and Implementation Squads that ship high-impact, domain-specific AI solutions. Depending on your craft and ambition, you’ll join the team where you can create outsized value—and you’ll have a transparent, performance-based path to growth and rewards.
- Architect and scale multi-tenant, cloud-agnostic runtimes (Kubernetes/GPU clusters) supporting on-prem, VPC, and hybrid installations.
- Design and implement secure, end-to-end CI/CD pipelines for automating complex ML workflows, from data ingestion and fine-tuning (LoRA/QLoRA) to secure, high-stakes deployments.
- Provide solutions architecture expertise by partnering with product and client performance squads to accelerate the journey of custom agents from sandbox (- 5 days) to production (4–6 weeks), meeting tight SLAs.
- Lead the adoption of infrastructure-as-code best practices using tools like Terraform, Ansible, or similar.
- Define and manage the strategy for our containerized workloads (Docker, Kubernetes, etc.) to optimize for performance, cost, and reliability.
- Establish and enforce security, compliance, and data governance standards, particularly for enterprise clients.
- Mentor junior engineers and provide strategic guidance on infrastructure design, incident response, and system reliability.
What We’re Looking For
We’re seeking a seasoned architect who can lead the design and implementation of a robust, scalable infrastructure for our agentic platform and its ecosystem of solutions. You should have a proven track record of architectural leadership, strong fundamentals, and a deep understanding of operational maturity.
Minimum Requirements:
- 5+ years of experience in DevOps, Site Reliability, or Infrastructure Engineering roles, with at least 2 years in a solutions or systems architect capacity.
- Proven experience deploying and managing complex AI/ML production workloads on at least one major public cloud (e.g., AWS, GCP, or Azure).
- Extensive experience designing, deploying, and managing robust, resilient, and distributed cloud solutions at scale.
- Deep expertise in containerization and orchestration (Docker, Kubernetes).
- Strong track record of building and managing advanced CI/CD pipelines for complex software and ML lifecycles.
- Expert-level proficiency with infrastructure-as-code tools (Terraform, CloudFormation, or Pulumi).
- Strong scripting and automation skills (Python, Bash, or similar).
- Extensive experience with monitoring and logging stacks (e.g., Prometheus, Grafana, ELK).
- Exceptional communication and collaboration skills with a proven ability to lead and influence cross-functional teams.
- Experience with ML/AI-specific infrastructure and MLOps tooling (e.g., MLflow, Weights & Biases).
- Demonstrated experience implementing security practices and compliance frameworks (e.g., GDPR, ISO 27001) in highly regulated environments.
- Previous work in enterprise-grade or highly regulated industries is a significant plus.
What You’ll Get
- Competitive base salary + performance bonuses.
- Future equity opportunity for high performers.
- Health & wellness allowances.
- Private health insurance.
- Flexible work setup + travel when needed (ideally hybrid in Lisbon or Madrid).
- 25 days of holidays/paid time off (excluding local public holidays).
Who You Are
- Proactive & Resourceful: You take initiative to identify gaps and drive solutions without waiting for instructions.
- Accountable & High-Ownership: You treat our codebase and infrastructure as your own, and you honor commitments.
- Entrepreneurial Mindset: You thrive in ambiguity, embrace rapid change, and deliver in a fast-paced startup setting.
- Architectural Leader: You can translate business needs into technical solutions and provide clear architectural vision.
- Team Player: You collaborate effectively across disciplines, give and receive feedback constructively, and mentor others.
Interview Process
We keep our process focused and respectful of your time. Most candidates complete it in 2–3 weeks. Here’s what to expect:
- Intro Call – 30 minutes with our team to align on fit and expectations.
- Take-Home Challenge – A practical, real-world architecture design task.
- Technical Interview – Deep dive into the challenge, technical expertise, and architectural philosophy.
- Cultural and Values Interview – Discussion of your motivation and your alignment with our culture and values.
- Offer – Final conversation and offer.
About Us
InteractiveAI is a fast-growing startup on a mission to empower enterprises with fully managed AI agent lifecycles.
We are building the next generation of enterprise-AI solutions, delivering an end-to-end Agentic IDE alongside an extensible ecosystem of agentic resources and solutions.
Our platform allows companies to orchestrate, monitor, evaluate, deploy, and improve AI agents, and soon to fine-tune and own their models.
We value autonomy, speed, and innovation, and we’re building a world-class team to match. Our squads are lean, focused, and execution-driven.
If you thrive in high-performance environments and want to be part of a company that rewards transformational outcomes, this is for you.