financetom
Technology
financetom
/
Technology
/
KushoAI Launches APIEval-20, the First Open Benchmark for AI API Test Generation
News World Market Environment Technology Personal Finance Politics Retail Business Economy Cryptocurrency Forex Stocks Market Commodities
KushoAI Launches APIEval-20, the First Open Benchmark for AI API Test Generation
Apr 2, 2026 7:11 AM

No existing benchmark measured whether AI agents can find real API bugs from a schema and payload alone100+ downloads in first week by developers and contributors; freely available on HuggingFaceKushoAI has run its own agent against the benchmark; head-to-head comparison report in developmentSAN FRANCISCO, April 2, 2026 /PRNewswire/ -- KushoAI, an AI-native API testing platform used by 30,000+ engineers across 6,000+ enterprises and high-growth technology companies, today released APIEval-20, an open benchmark for evaluating whether AI agents can generate tests that catch real bugs in APIs given only a request schema and sample payload: no source code, no documentation, no additional context.

Analysis of 1.4 million AI-driven test executions across 2,616 organizations shows that authentication failures account for 34% of API outages and 41% of APIs experience undocumented schema changes within 30 days, yet no standard existed for measuring whether AI agents could detect these failures systematically. APIEval-20 extends the benchmark tradition established by HumanEval for code generation and SWE-bench for bug fixing, applying the same rigour to API testing.

Abhishek Saikia, Co-Founder & CEO, KushoAI, said, "Every vendor selling AI-powered API testing uses the same language: schema validation, payload fuzzing, bug detection. There has been no shared reference point for what any of that means in practice. APIEval-20 gives the field a concrete, reproducible measure of whether an AI agent thinks like a QA engineer."

A Head of Engineering at a Fortune 500 financial services company noted in feedback to KushoAI that they had been evaluating AI testing tools for the past year and consistently ran into the challenge of comparing them objectively. They highlighted that APIEval-20 is the first framework they have seen that directly addresses this gap, surfacing shortcomings in agent reasoning that are not visible in demo environments.

Key Benchmark Details

20 scenarios across payments, authentication, e-commerce, scheduling, user management, notifications, and search. Each contains 3 to 8 planted bugs across simple, moderate, and complex tiers.Binary evaluation against live reference implementations. Scoring weights bug detection at 70%, coverage at 20%, and efficiency at 10%.Benchmark Report: resources.kusho.ai/api-eval-20

Dataset: huggingface.co/datasets/kusho-ai/api-eval-20

About KushoAI 

KushoAI is an AI-native API testing and software reliability platform. Used by 30,000+ engineers across 6,000+ organizations, backed by Antler and Blume Ventures. Visit kusho.ai. 

Logo: https://mma.prnewswire.com/media/2948973/KushoAI_Logo.jpg

 

View original content to download multimedia:https://www.prnewswire.com/news-releases/kushoai-launches-apieval-20-the-first-open-benchmark-for-ai-api-test-generation-302732888.html

SOURCE KushoAI

Comments
Welcome to financetom comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
Related Articles >
Forecast update for Bitcoin -31-07-2025
Forecast update for Bitcoin -31-07-2025
Jul 31, 2025
The price of (BTCUSD) settled high in its last intraday trading, supporting its continuous trading above EMA50, which represents a dynamic support that assists the stability of the main bullish trend amid its trading alongside a bias line, besides the emergence of the positive signals on the (RSI), despite reaching overbought levels. BestTradingSignal.com Professional Trading Signals High-accuracy trading signals delivered...
EOS price suffers from negative pressures - Analysis - 31-07-2025
EOS price suffers from negative pressures - Analysis - 31-07-2025
Jul 31, 2025
EOS (EOSUSDT) held steady at a lower level in its recent intraday trading, under the control of a prevailing short-term downtrend and trading along a descending trendline that supports this direction. The price continues to face dynamic resistance from remaining below the 50-day simple moving average. The latest decline followed earlier movement that successfully relieved its clearly oversold Stochastic conditions,...
Forecast update for Ethereum -31-07-2025
Forecast update for Ethereum -31-07-2025
Jul 31, 2025
The price of (ETHUSD) settled high in its last intraday trading, amid the dominance of the main bullish trend on the short-term basis and its trading alongside a supportive bias line for the trend, with the continuation of the positive pressure that comes from its trading above its EMA50, on the other hand, we notice the beginning of negative overlapping...
Cardano price shows more signs of weakness - Analysis - 31-07-2025
Cardano price shows more signs of weakness - Analysis - 31-07-2025
Jul 31, 2025
Cardano (ADAUSD) declined in its recent intraday trading, under the influence of a short-term corrective downtrend and trading along a descending trendline. The asset remains under continued bearish pressure from trading below the 50-day simple moving average. Additionally, a bearish crossover has started to appear in the Stochastic after reaching extremely overbought territory relative to price movement, indicating the formation...
Copyright 2023-2026 - www.financetom.com All Rights Reserved