
AI / ML Engineer

Nordcloud, an IBM Company · Germany
Visa: unknown · Salary: unknown · Work mode: unknown
Skills
aws, azure, ci/cd, docker, gcp, github actions, kubernetes, terraform

Description

Join Nordcloud and be part of the European cloud revolution. We supercharge our customers to innovate in hyperscaler cloud, enabling seamless migration, advanced security, and data-driven success.

Currently, we are looking for an AI/ML Engineer to join our team in Germany.

Due to business requirements, this role can only be performed from Germany. If you are not currently based in Germany, you must be willing to relocate before your start date.

We’re seeking candidates with proven practical experience in professional services who are ready for the next step in their careers.

Responsibilities

  • Leading the implementation and composition of tool-using agents that interact with APIs, databases, and knowledge systems.
  • Leading a team of AI/ML Engineers to implement multi-agent systems.
  • Building persistent agent memory systems (short-term, long-term, and episodic memory).
  • Implementing fault-tolerant orchestration for multi-agent pipelines.
  • Building and scaling cloud-native systems (on AWS, Azure, or GCP).
  • Simulating and testing multi-agent interactions for scalability, safety, and emergent behaviours.
  • Building guardrail systems using tools like Guardrails AI, NeMo Guardrails, or custom validators.
  • Embedding compliance and observability hooks in every agent interaction.


Skills

  • Core AI/ML Expertise: deep understanding of transformer architectures, attention mechanisms, and LLM training pipelines.
  • Agentic System Design: understanding of agent architectures (e.g., ReAct, Reflexion, Voyager, AutoGPT, CrewAI, AutoGen).
  • Familiarity with agent orchestration frameworks (LangChain / LangGraph, Semantic Kernel, LlamaIndex, Swarm, etc.).
  • Deep understanding of multi-agent communication protocols (e.g., MCP and A2A), including message passing, coordination, and negotiation strategies.
  • Designing hierarchical agents: planner, executor, verifier, critic, and memory manager roles.
  • Ability to balance autonomy vs. control, implementing “human-in-the-loop” governance mechanisms.
  • AI/ML Engineering, including MLOps/LLMOps: hands-on experience with version control, CI/CD, and containerisation (GitHub Actions, Docker, Kubernetes).
  • CI/CD for ML: model registry, versioning, and promotion (MLflow, Weights & Biases).
  • LLMOps: prompt evaluation, feedback loops, token optimisation, cost monitoring.
  • Understanding of continuous deployment of multi-agent pipelines via Argo CD, GitOps, or Terraform.
  • Observability for AI: telemetry on performance, latency, and behavioural drift.
  • Knowledge, Context & Data Integration: integration of vector databases for memory and retrieval.
  • Designing retrieval-augmented generation (RAG) pipelines with dynamic context injection.
  • Familiarity with document loaders, chunking strategies, and embedding optimisation.
  • AI Safety, Guardrails and Governance: understanding of prompt injection, data exfiltration, and model hallucination vulnerabilities.
  • Experience with safety layers (content filters, moderation, model output evaluation).
  • Familiarity with explainability and interpretability frameworks.
  • Designing ethical and secure agent autonomy frameworks (role constraints, audit trails).


We encourage you to apply, even if you don’t meet all of the requirements. We value your growth potential and enthusiasm!

What we offer:

  • Individual training budget and exam fees for certifications.
  • Flexible working hours and a remote working model.
  • Company laptop and needed equipment.
  • Local benefits package, including a 30-day holiday allowance, pension allowance, Qualitrain card, and more.


Please read our Recruitment Privacy Policy before applying. All applicants must have the right to work in Germany.

About Nordcloud

Nordcloud is a European leader in cloud implementation, application development, managed services and training. It’s a recognised cloud-native pioneer with a proven track record helping organisations leverage public cloud in a way that balances quick wins, immediate savings and sustainable value.

Nordcloud is triple-certified across Amazon Web Services, Microsoft Azure and Google Cloud Platform. It has 10 European hubs and over 1,300 employees, and has delivered over 1,000 successful cloud projects for companies ranging from midsize businesses to large corporates.

Our clients benefit from multi-cloud expertise that guides best practices, preempts pitfalls, provides essential technical support and steers teams through cultural change. From strategy planning to application management, we take our customers through the whole cloud journey to drive real business outcomes from cloud technology.

Learn more at www.nordcloud.com

Nordcloud values diversity and is dedicated to providing equal opportunities for all candidates and employees.
