Nonnegative Matrix Factorization in the Component-Wise L1 Norm for Sparse Data
arXiv:2603.29715v1 Announce Type: cross
Abstract: Nonnegative matrix factorization (NMF) approximates a nonnegative matrix, $X$, by the product of two nonnegative factors, $WH$, where $W$ has $r$ columns and $H$ has $r$ rows. In this paper, we consider NMF using the component-wise L1 norm as the error measure (L1-NMF), which is suited to data corrupted by heavy-tailed noise, such as Laplace noise or salt-and-pepper noise, or to data containing outliers. Our first contribution is an NP-hardness proof for L1-NMF, even when $r=1$, in contrast to standard NMF, which uses least squares. Our second contribution is to show that L1-NMF strongly enforces sparsity in the factors for sparse input matrices, thereby favoring interpretability. However, if the data is affected by false zeros, overly sparse solutions might degrade the model. Our third contribution is a new, more general L1-NMF model for sparse data, dubbed weighted L1-NMF (wL1-NMF), in which the sparsity of the factorization is controlled by adding a penalization parameter to the entries of $WH$ associated with zeros in the data. Our fourth contribution is a new coordinate descent (CD) approach for wL1-NMF, denoted sparse CD (sCD), in which each subproblem is solved by a weighted median algorithm. To the best of our knowledge, sCD is the first algorithm for L1-NMF whose complexity scales with the number of nonzero entries in the data, making it efficient for large-scale, sparse data. We perform extensive numerical experiments on synthetic and real-world data to show the effectiveness of our newly proposed model (wL1-NMF) and algorithm (sCD).
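To illustrate why a weighted median appears in coordinate descent for an L1 error, here is a minimal sketch of one scalar subproblem. It is not the paper's sCD algorithm: it ignores the wL1-NMF penalization parameter and does not exploit sparsity. It relies only on the standard identity that, for fixed $h \geq 0$, $\sum_j |x_j - w h_j| = \sum_{j: h_j > 0} h_j \, |x_j / h_j - w| + \text{const}$, so a minimizer over $w$ is a weighted median of the ratios $x_j / h_j$ with weights $h_j$. The function names `weighted_median` and `l1_rank_one_update` are hypothetical and chosen for this example.

```python
import numpy as np


def weighted_median(values, weights):
    """Return a minimizer of sum_i weights[i] * |values[i] - x| over x.

    Assumes weights are positive. Hypothetical helper, not from the paper.
    """
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(values)
    sorted_values = values[order]
    cumulative = np.cumsum(weights[order])
    # Smallest sorted value whose cumulative weight reaches half the total.
    k = np.searchsorted(cumulative, 0.5 * cumulative[-1])
    return sorted_values[k]


def l1_rank_one_update(x, h):
    """Minimize sum_j |x[j] - w * h[j]| over w >= 0 for a fixed vector h >= 0.

    Illustrative scalar subproblem only; the penalty term of wL1-NMF
    is deliberately left out.
    """
    x = np.asarray(x, dtype=float)
    h = np.asarray(h, dtype=float)
    mask = h > 0  # entries with h[j] == 0 do not depend on w
    if not mask.any():
        return 0.0
    w = weighted_median(x[mask] / h[mask], h[mask])
    # The objective is convex in w, so clipping at 0 gives the constrained minimizer.
    return max(float(w), 0.0)


# One outlier (100) barely moves the L1 solution.
x = [2.0, 4.0, 100.0]
h = [1.0, 2.0, 1.0]
w_l1 = l1_rank_one_update(x, h)
w_ls = float(np.dot(x, h) / np.dot(h, h))  # least-squares solution, for comparison
```

In this toy example `w_l1` equals 2.0, the value that fits the two clean entries exactly, whereas the least-squares value `w_ls` is pulled well above 2 by the outlier. This is the robustness to heavy-tailed noise that the abstract attributes to the L1 error measure.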
Comments: 21 pages before supplementary, code available from this https URL
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as: arXiv:2603.29715 [cs.LG] (or arXiv:2603.29715v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2603.29715
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Nicolas Gillis
[v1] Tue, 31 Mar 2026 13:16:02 UTC (305 KB)