Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessHow We Built an EdTech Platform That Scaled to 250K Daily UsersDEV CommunityRoguelike Devlog: Redesigning a Game UI With an AI 2D Game MakerDEV CommunityI spent days debugging a cron job that was "working fine"DEV CommunityLLM Agents Need a Nervous System, Not Just a BrainDEV CommunityThe 22,000 Token Tax: Why I Killed My MCP ServerDEV CommunityOpenSpec (Spec-Driven Development) Failed My Experiment — Instructions.md Was Simpler and FasterDEV CommunityI Asked AI to Do Agile Sprint Planning (GitHub Copilot Test)DEV Community🌪️ Proof of Work: The To-Do List of Infinite RegretDEV CommunityShaping the UAE’s Digital Destiny: Building Sovereignty, Trust, and Resilience in the Cyber EraEE TimesA new dating app, Sonder, has a deliberately annoying sign-up process (and it’s working)TechCrunchPromoting late-gameplay BG3 composition contracts in the TD2 SDL portDEV CommunityArtificial Intelligence Is Facing a Crisis of Control—and the Industry Knows It - Council on Foreign RelationsGoogle News: AI SafetyBlack Hat USADark ReadingBlack Hat AsiaAI BusinessHow We Built an EdTech Platform That Scaled to 250K Daily UsersDEV CommunityRoguelike Devlog: Redesigning a Game UI With an AI 2D Game MakerDEV CommunityI spent days debugging a cron job that was "working fine"DEV CommunityLLM Agents Need a Nervous System, Not Just a BrainDEV CommunityThe 22,000 Token Tax: Why I Killed My MCP ServerDEV CommunityOpenSpec (Spec-Driven Development) Failed My Experiment — Instructions.md Was Simpler and FasterDEV CommunityI Asked AI to Do Agile Sprint Planning (GitHub Copilot Test)DEV Community🌪️ Proof of Work: The To-Do List of Infinite RegretDEV CommunityShaping the UAE’s Digital Destiny: Building Sovereignty, Trust, and Resilience in the Cyber EraEE TimesA new dating app, Sonder, has a deliberately annoying sign-up process (and it’s working)TechCrunchPromoting late-gameplay BG3 composition contracts in the TD2 SDL portDEV CommunityArtificial Intelligence Is Facing a Crisis of Control—and the Industry Knows It - Council on Foreign RelationsGoogle News: AI Safety

Assessing uncertainty of sequence representations generated by protein language models

Nature Machine Learningby R. PrabakaranYana BrombergApril 1, 20262 min read1 views
Source Quiz

Assessing uncertainty of sequence representations generated by protein language models

Language model-inferred embeddings are replacing structure-derived descriptions of proteins, genes and genomes. We propose a model-agnostic measure to quantify reliability of these new representations.

This is a preview of subscription content, access via your institution

Access options

Access Nature and 54 other Nature Portfolio journals

Get Nature+, our best-value online-access subscription

$32.99 / 30 days

cancel any time

Subscribe to this journal

Receive 12 print issues and online access

$259.00 per year

only $21.58 per issue

Buy this article

  • Purchase on SpringerLink
  • Instant access to the full article PDF.

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: RNS-based assessments of embeddings identify poorly represented proteins across different data sets.

References

  • Vaswani, A. et al. Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017). This work introduces the attention mechanism in transformer architecture.
  • Weissenow, K. & Rost, B. Are protein language models the new universal key? Curr. Opin. Struct. Biol. 91, 102997 (2025). This review article discusses the transition from evolutionary information to machine-learned embeddings for protein prediction.

Article  CAS  PubMed

Google Scholar

  • Dallago, C. et al. Learned embeddings from deep learning to visualize and predict protein sets. Curr. Protoc. 1, e113 (2021). This article introduces ‘Bioembeddings’, a publicly available library of pLM pipelines.

Article  PubMed

Google Scholar

  • Saul, B. N. & Christian, D. W. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48, 443–453 (1970). The earliest work we have identified that illustrates use of random sequences to evaluate significance of protein sequence similarities.

Article

Google Scholar

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Prabakaran, R. & Yana Bromberg, Y. Quantifying uncertainty in protein representations across models and tasks. Nat. Methods https://doi.org/10.1038/s41592-026-03028-7 (2026).

About this article

Cite this article

Assessing uncertainty of sequence representations generated by protein language models. Nat Methods (2026). https://doi.org/10.1038/s41592-026-03027-8

Download citation

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage model

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Assessing u…modellanguage mo…Nature Mach…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 199 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models