A Very Fine Untuning
How fine-tuning made my chatbot worse (and broke my RAG pipeline)

I spent weeks trying to improve my personal chatbot, Virtual Alexandra, with fine-tuning. Instead, I got a higher hallucination rate and broken retrieval in my RAG system. Yes, this is a story about a failed attempt, not a successful one. My husband and I called the fine-tuned results "Drunk Alexandra": incoherent answers that were funny at first but quickly became annoying.

After weeks of experiments, I reached a simple conclusion: for this particular project, a small chatbot that answers questions based on my writing and instructions, fine-tuning was not a good option. It was not just unnecessary; it actively degraded the experience and didn't justify the extra time, cost, or complexity compared to the prompt + RAG system.
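To make the comparison concrete, here is a minimal sketch of what a prompt + RAG setup like this might look like. The bag-of-words scoring, corpus, and function names are illustrative stand-ins, not the actual Virtual Alexandra pipeline; a real system would use an embedding model for retrieval.

```python
# Hedged sketch of a prompt + RAG pipeline: retrieve relevant passages from
# the author's writing, then build a grounded prompt for an untouched base model.
import math
from collections import Counter

def bow_vector(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; a stand-in for a real embedding model."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Rank corpus passages by similarity to the query; keep the top k."""
    qv = bow_vector(query)
    ranked = sorted(corpus, key=lambda doc: cosine(qv, bow_vector(doc)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, corpus: list[str]) -> str:
    """Assemble instructions + retrieved passages; the base model stays untouched."""
    context = "\n".join(f"- {doc}" for doc in retrieve(query, corpus))
    return (
        "Answer using only the passages below; say so if they don't cover it.\n"
        f"Passages:\n{context}\n\nQuestion: {query}"
    )

# Illustrative mini-corpus standing in for the author's articles.
corpus = [
    "Fine-tuning changed the model's voice but raised hallucinations.",
    "Retrieval grounds answers in my own writing.",
    "The chatbot answers questions about my articles.",
]
print(build_prompt("Why did fine-tuning raise hallucinations?", corpus))
```

The point of the design is that nothing about the model changes: grounding lives entirely in the prompt, so there is no fine-tuned checkpoint to drift into "Drunk Alexandra" territory.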
Read the full article on Towards AI: https://pub.towardsai.net/a-very-fine-untuning-878e4ef285ff?source=rss----98111c9905da---4

How to Actually Monitor Your LLM Costs (Without a Spreadsheet)
I used to think I had a handle on my AI spending. I had a rough mental model: Claude is cheap, GPT-4 is expensive, Gemini is somewhere in the middle. Good enough, right? Then I started actually logging what I was burning through. The gap between my mental model and reality was embarrassing. The problem with just watching your bill Every major AI provider gives you a monthly bill. That's fine for accounting. It's useless for actually understanding your costs. By the time the invoice shows up, the context is gone. You don't remember which project, which feature, which dumb experiment ate half your budget. You just see a number and try to feel bad about it. What you actually need is visibility at the call level. How many tokens did that chat completion use? How expensive was that context wind
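The call-level visibility described above can be sketched as a small per-call cost ledger, assuming you wrap each completion call and record its token counts. The price table, model names, and project tags below are illustrative only; real per-token prices change often and should come from the provider's pricing page.

```python
# Hedged sketch: log cost per call, tagged by project, instead of waiting
# for the monthly invoice. Prices here are made-up placeholders.
from dataclasses import dataclass, field

PRICE_PER_1K = {  # illustrative USD per 1K tokens: (input, output)
    "gpt-4": (0.03, 0.06),
    "claude-haiku": (0.00025, 0.00125),
}

@dataclass
class CostLedger:
    entries: list = field(default_factory=list)

    def log(self, model: str, project: str, in_tokens: int, out_tokens: int) -> float:
        """Record one completion call and return its estimated cost."""
        pin, pout = PRICE_PER_1K[model]
        cost = in_tokens / 1000 * pin + out_tokens / 1000 * pout
        self.entries.append({"model": model, "project": project, "cost": cost})
        return cost

    def by_project(self) -> dict:
        """Aggregate spend per project, so the invoice number has a story."""
        totals: dict = {}
        for e in self.entries:
            totals[e["project"]] = totals.get(e["project"], 0.0) + e["cost"]
        return totals

ledger = CostLedger()
ledger.log("gpt-4", "chatbot", in_tokens=1200, out_tokens=400)
ledger.log("claude-haiku", "experiments", in_tokens=5000, out_tokens=1000)
print(ledger.by_project())
```

In practice you would call `ledger.log(...)` from the same wrapper that makes the API request, using the token counts the provider returns in the response metadata.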

I Audited 30+ Small Businesses on Their AI Visibility. Here's What Most Are Getting Wrong.
I run a small marketing consultancy focused on helping businesses understand how they show up - or don't - when customers use AI tools to find services. Over the last few months, I've done AI visibility audits for 30+ small businesses across hospitality, professional services, and retail. The pattern is painfully consistent. Most businesses are invisible to AI search Go to ChatGPT right now. Ask: "What's the best [your service] in [your city]?" Try it. I'll wait. If your business showed up - congratulations, you're in the minority. Most don't. Some get mentioned with outdated information. A few get described with details that are flat-out wrong. This matters because AI-powered search is growing fast. Google's AI Overviews, ChatGPT, Perplexity, Copilot - they're all pulling from a mix of we
More in Models


kNNProxy: Efficient Training-Free Proxy Alignment for Black-Box Zero-Shot LLM-Generated Text Detection
arXiv:2604.02008v1 Announce Type: new Abstract: LLM-generated text (LGT) detection is essential for reliable forensic analysis and for mitigating LLM misuse. Existing LGT detectors can generally be categorized into two broad classes: learning-based approaches and zero-shot methods. Compared with learning-based detectors, zero-shot methods are particularly promising because they eliminate the need to train task-specific classifiers. However, the reliability of zero-shot methods fundamentally relies on the assumption that an off-the-shelf proxy LLM is well aligned with the often unknown source LLM, a premise that rarely holds in real-world black-box scenarios. To address this discrepancy, existing proxy alignment methods typically rely on supervised fine-tuning of the proxy or repeated inter

