
AI Security
Giskard Hub
Continuously red-teams LLM, RAG, and agent applications for prompt injection, jailbreaks, and quality failures before production.
Giskard Hub Overview
Giskard is an AI red teaming platform that continuously tests large language model (LLM) applications, retrieval-augmented generation (RAG) systems, and conversational agents for security and quality failures before they reach production. It uses a black-box approach that needs only API access, running a library of more than 50 adversarial probes to surface vulnerabilities a development team would otherwise miss. It is offered as an open-source library and an enterprise Hub.
The platform generates adversarial tests from security taxonomies, external threat resources, and a customer's own knowledge base, probing for prompt injection, data disclosure, hallucinations, and harmful content. Tests run from a user interface or a Python software development kit (SDK) and integrate into CI/CD pipelines, comparing model versions to catch regressions and enriching test datasets as new vulnerabilities are found. Results map to frameworks including the OWASP LLM Top 10 and the EU AI Act for audit-ready reporting.
Giskard is SOC 2 Type II compliant and GDPR-aligned, with European and United States data residency options and a zero-training policy. Founded in 2021 and based in Paris, it is backed by Bpifrance and the European Commission's EIC Accelerator, and counts Michelin, BNP Paribas, and Decathlon among its users.
Key Capabilities
mapped to solution categoriesAutonomously plans and executes multi-step adversarial campaigns against AI systems, emulating real attacker workflows across reconnaissance, exploitation, and escalation rather than running a fixed checklist of tests.
Tests LLMs and AI applications against a library of direct and indirect prompt-injection and jailbreak techniques, reporting which payloads bypass system instructions and safety controls.
Re-runs red-team campaigns continuously and at release gates in the CI/CD pipeline as models, prompts, and configurations change, catching new exploit paths before and after deployment.
Reports validated AI vulnerabilities with reproduction evidence, attacker context, and remediation guidance, mapped to the OWASP LLM Top 10, MITRE ATLAS, EU AI Act, and NIST AI RMF for auditable AI risk reporting.
Attacks deployed guardrails, system prompts, and content filters to measure how reliably they block adversarial inputs, quantifying bypass rates rather than assuming the controls work.
Routes high-value automated findings to specialist AI red teamers for manual exploitation, chaining, and depth beyond automated coverage, blending platform testing with human expertise.
Compliance
certificationsImplementation & support
Info last updated on June 26, 2026