How and Why Agents Can Identify Bug-Introducing Commits
arXiv:2603.29378v1 Announce Type: new Abstract: \'Sliwerski, Zimmermann, and Zeller (SZZ) just won the 2026 ACM SIGSOFT Impact Award for asking: When do changes induce fixes? Their paper from 2005 served as the foundation for a wide array of approaches aimed at identifying bug-introducing changes (or commits) from fix commits in software repositories. But even after two decades of progress, the best-performing approach from 2025 yields a modest increase of 10 percentage points in F1-score on the most popular Linux kernel dataset. In this paper, we uncover how and why LLM-based agents can substantially advance the state-of-the-art in identifying bug-introducing commits from fix commits. We propose a simple agentic workflow based on searching a set of candidate commits and find that it raise
View PDF
Abstract:Śliwerski, Zimmermann, and Zeller (SZZ) just won the 2026 ACM SIGSOFT Impact Award for asking: When do changes induce fixes? Their paper from 2005 served as the foundation for a wide array of approaches aimed at identifying bug-introducing changes (or commits) from fix commits in software repositories. But even after two decades of progress, the best-performing approach from 2025 yields a modest increase of 10 percentage points in F1-score on the most popular Linux kernel dataset. In this paper, we uncover how and why LLM-based agents can substantially advance the state-of-the-art in identifying bug-introducing commits from fix commits. We propose a simple agentic workflow based on searching a set of candidate commits and find that it raises the F1-score from 0.64 to 0.81 on the most popular Linux kernel dataset, a bigger jump than between the original 2005 method (0.54) and the previous SOTA (0.64). We also uncover why agents are so successful: They derive short greppable patterns from the fix commit diff and message and use them to effectively search and find bug-introducing commits in large candidate sets. Finally, we also discuss how these insights might enable further progress in bug detection, root cause understanding, and repair.
Subjects:
Software Engineering (cs.SE)
Cite as: arXiv:2603.29378 [cs.SE]
(or arXiv:2603.29378v1 [cs.SE] for this version)
https://doi.org/10.48550/arXiv.2603.29378
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Niklas Risse [view email] [v1] Tue, 31 Mar 2026 07:48:27 UTC (83 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
announceinsightagenticMarch 2026 sponsors-only newsletter
<p>I just sent the March edition of my <a href="https://github.com/sponsors/simonw/">sponsors-only monthly newsletter</a>. If you are a sponsor (or if you start a sponsorship now) you can <a href="https://github.com/simonw-private/monthly/blob/main/2026-03-march.md">access it here</a>. In this month's newsletter:</p> <ul> <li>More agentic engineering patterns</li> <li>Streaming experts with MoE models on a Mac</li> <li>Model releases in March</li> <li>Vibe porting</li> <li>Supply chain attacks against PyPI and NPM</li> <li>Stuff I shipped</li> <li>What I'm using, March 2026 edition</li> <li>And a couple of museums</li> </ul> <p>Here's <a href="https://gist.github.com/simonw/8b5fa061937842659dbcd5bd676ce0e8">a copy of the February newsletter</a> as a preview of what you'll get. Pay $10/mont
How to Build Production Ready AgentScope Workflows with ReAct Agents, Custom Tools, Multi-Agent Debate, Structured Output and Concurrent Pipelines
In this tutorial, we build a complete AgentScope workflow from the ground up and run everything in Colab. We start by wiring OpenAI through AgentScope and validating a basic model call to understand how messages and responses are handled. From there, we define custom tool functions, register them in a toolkit, and inspect the auto-generated […] The post How to Build Production Ready AgentScope Workflows with ReAct Agents, Custom Tools, Multi-Agent Debate, Structured Output and Concurrent Pipelines appeared first on MarkTechPost .
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Self-Evolving AI
March 2026 sponsors-only newsletter
<p>I just sent the March edition of my <a href="https://github.com/sponsors/simonw/">sponsors-only monthly newsletter</a>. If you are a sponsor (or if you start a sponsorship now) you can <a href="https://github.com/simonw-private/monthly/blob/main/2026-03-march.md">access it here</a>. In this month's newsletter:</p> <ul> <li>More agentic engineering patterns</li> <li>Streaming experts with MoE models on a Mac</li> <li>Model releases in March</li> <li>Vibe porting</li> <li>Supply chain attacks against PyPI and NPM</li> <li>Stuff I shipped</li> <li>What I'm using, March 2026 edition</li> <li>And a couple of museums</li> </ul> <p>Here's <a href="https://gist.github.com/simonw/8b5fa061937842659dbcd5bd676ce0e8">a copy of the February newsletter</a> as a preview of what you'll get. Pay $10/mont
Ukraine Builds AI-Driven “Agentic State” to Automate Government Services - Odessa Journal
<a href="https://news.google.com/rss/articles/CBMingFBVV95cUxPeVlOYmt5a3NwNDNsM3dTMk1PaFQwN0xDZ29tTkkyZUlEYkc0bVRhUjd3QklZMDJmLXhPdXo0VXZLekpOeXp1NkVpX3puZkNkcXNsUFBWWkNZZWZIWXRKQWZpeFFwYWtST3ViS0VISjRIbEZBSEl3bGI3VnNkMUxxZjRLX0lVYkFqV2RHczc3Skxlelc1akJ1Z1JjcGJhQQ?oc=5" target="_blank">Ukraine Builds AI-Driven “Agentic State” to Automate Government Services</a> <font color="#6f6f6f">Odessa Journal</font>
OutSystems Introduces Agentic Systems Engineering to Power Governed, Open Enterprise AI - Thailand Business News
<a href="https://news.google.com/rss/articles/CBMizgFBVV95cUxNZ2IxMU1kWGk5elYxcUU3ZDc0WHFlYUp1UzlsM2ZXRFhPZU9wMW9rNG5KQVc0dlV3VlRrWGlWMXExY1hfalVaWDJSd1oxMlVuN1dmQ2s1MHcwWGtpSmk0c1MzVUIzaVQxV2plZFNSSHg3bmpXcU9UOXdMZ0t6aWRGbV9TMEhQYWRXSGlkekRFTFdNU2JUU203NWo5cEctdDlQVXJhZVYtaHFLcDdVT19IOUJaRnBvbGgwUHJmX2toeGoyeTFjYmcwSF9JWHo2QQ?oc=5" target="_blank">OutSystems Introduces Agentic Systems Engineering to Power Governed, Open Enterprise AI</a> <font color="#6f6f6f">Thailand Business News</font>

Robust Geospatial Coordination of Multi-Agent Communications Networks Under Attrition
arXiv:2512.02079v2 Announce Type: replace-cross Abstract: Coordinating emergency responses in extreme environments, such as wildfires, requires resilient and high-bandwidth communication backbones. While autonomous aerial swarms can establish ad-hoc networks to provide this connectivity, the high risk of individual node attrition in these settings often leads to network fragmentation and mission-critical downtime. To overcome this challenge, we introduce and formalize the problem of Robust Task Networking Under Attrition (RTNUA), which extends connectivity maintenance in multi-robot systems to explicitly address proactive redundancy and attrition recovery. We then introduce Physics-Informed Robust Employment of Multi-Agent Networks ($\Phi$IREMAN), a topological algorithm leveraging physics
Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!