All Knowledge

The Giskard hub

RealPerformance, A Dataset of Language Model Business Compliance Issues

Giskard launches RealPerformance to address the gap between the focus on security and business compliance issues: the first systematic dataset of business performance failures in conversational AI, based on real-world testing across banks, insurers, and other industries.

All Knowledge

RealPerformance, A Dataset of Language Model Business Compliance Issues

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

Giskard announces Phare, a new open & multi-lingual LLM Benchmark

DeepSeek R1: Complete analysis of capabilities and limitations

[Release notes] Giskard integrates with LiteLLM: Simplifying LLM agent testing across foundation models

AI Liability in the EU: Business guide to Product (PLD) and AI Liability Directives (AILD)

Giskard Vision: Enhance Computer Vision models for image classification, object an landmark detection

Evaluating LLM applications: Giskard Integration with NVIDIA NeMo Guardrails

Global AI Treaty: EU, UK, US, and Israel sign landmark AI regulation

The EU AI Act published in the EU Official Journal: Next steps for AI Regulation

Giskard leads GenAI Evaluation in France 2030's ArGiMi Consortium

Partnership announcement: Bringing Giskard LLM evaluation to Databricks

[Release notes] LLM app vulnerability scanner for Mistral, OpenAI, Ollama, and Custom Local LLMs

New course with DeepLearningAI: Red Teaming LLM Applications

LLM Red Teaming: Detect safety & security breaches in your LLM apps

EU AI ACT: 8 Takeaways from the Council's Final Approval

Giskard's retrospective of 2023 and a glimpse into what's next for 2024!

EU AI Act: The EU Strikes a Historic Agreement to Regulate AI

Biden's Executive Order: The Push to Regulate AI in the US

Our LLM Testing solution is launching on Product Hunt 🚀

Towards AI Regulation: How Countries are Shaping the Future of Artificial Intelligence

AI Safety and Security: A Conversation with Giskard's Co-Founder and CPO

OWASP Top 10 for LLM 2023: Understanding the Risks of Large Language Models

White House pledge targets AI regulation with Top Tech companies

1,000 GitHub stars, 3M€, and new LLM scan feature 💫

The Open-Source AI Imperative: Key Takeaways from Hugging Face CEO's Testimony to the US Congress

Giskard’s new beta is out! ⭐ Scan your model to detect hidden vulnerabilities

The EU AI Act: What can you expect from the upcoming European regulation of AI?

Exclusive Interview: How to eliminate risks of AI incidents in production

🔥 The safest way to use ChatGPT... and other LLMs

Giskard 1.4 is out! What's new in this version? ⭐

Giskard mentioned as a significant vendor in Gartner's Market Guide for AI Trust, Risk and Security Management

Exclusive interview: our first television appearance on AI risks & security

Giskard closes its first financing round to expand Enterprise offering

Giskard is coming to your notebook: Python meets Java via gRPC tunnel

Why do Citibeats & Altaroad Test AI Models? The Business Value of Test-Driven Data Science

Does User Experience Matter to ML Engineers? Giskard Latest Release

Why & how we decided to change Giskard's identity

Giskard's new feature: Automated Machine Learning Testing

Who cares about AI Quality? Launching our AI Innovator community

Why & how we decided to make Giskard Open-Source

Wishing y’all a happy & healthy 2022! 🎊