Meet Qwable: The Free Native Mannequin That Thinks Like Claude Fable - Decrypt

In short
Qwable 27B is a full fine-tune of Alibaba's Qwen3.6-27B, skilled on a Fable 5-style reasoning dataset, designed to duplicate the structured, deliberate considering type of Anthropic's latest flagship mannequin.
The abliterated model removes the mannequin's built-in refusal conduct by surgically modifying its weights utilizing llama.cpp's cvector-generator.
Each fashions run regionally, value nothing per question, and require neither Anthropic's API nor its necessary insurance policies.
Anthropic spent final week apologizing for Fable 5's invisible safeguards, after which the U.S. authorities ordered the mannequin pulled for all overseas nationals over a disputed jailbreak discovering.A number of days later, a developer on Hugging Face uploaded a mannequin that used Fable’s reasoning to information an area mannequin—and now even your potato PC can run a greater mannequin.The mannequin known as Qwable—Qwen + Fable, if the portmanteau wasn't instantly apparent. It is a full fine-tune of Alibaba's Qwen3.6-27B base, constructed by developer Mia (Mia-AiLab on Hugging Face) on a dataset of Fable 5-style reasoning examples. The aim is a 27-billion parameter mannequin that runs on client {hardware} and thinks the way in which Fable 5 thinks. (Parameters decide a mannequin’s breadth of data, with extra typically which means extra succesful.)
So I did a factor.
I've skilled Qwen 3.6 27b with Fable 5 reasoning.Outcomes are… attention-grabbing.
I'll evaluate each of them facet by facet.
Would anybody be attention-grabbing in testing it? I can add a gguf in hf. pic.twitter.com/hQCiUlT1sr
— Mia (@MiaAI_lab) June 15, 2026The approach known as instruction fine-tuning on trace-style examples. That’s a technical manner of claiming the developer collected examples formatted like Fable 5's deliberate, step-by-step solutions and skilled Qwen to supply the identical type of output.So consider it as much less “copying the check” and extra “studying the examine habits.” An identical method drove Qwopus—the Claude Opus 4.6 native distillation—although that challenge targeted on chain-of-thought reasoning traces. Qwable targets Fable 5's total instruction-following construction: extra guided, extra explanatory, and extra oriented towards step-by-step process completion than the bottom Qwen mannequin it was constructed on.It runs in GGUF format—the compressed, consumer-friendly file sort that works with LM Studio or llama.cpp—and matches in roughly 16.5 GB in its This fall quantized construct. It sends nothing to Anthropic's servers, which issues on condition that Fable 5 required necessary 30-day knowledge retention on all site visitors, even for enterprise clients who beforehand had zero-retention agreements. Even the present fashions use third-party servers to course of your info and prompts..Then, shortly after Qwable appeared on Hugging Face, another person arrived to make it even higher.Qwable with out a conscienceQwable is a censored mannequin. In any case, each Qwen and Claude are. However Qwen, as the bottom mannequin, is open supply, and might be manipulated and tweaked.Huihui-ai, an open-source contributor recognized for uncensored GGUF releases, took Qwable and utilized a course of referred to as abliteration to supply Huihui-Qwable-3.6-27b-abliterated. It produced a mannequin that thinks like Fable however received’t refuse to reply your prompts, irrespective of how bizarre or harmful they're.It's not a jailbreak. It is surgical procedure.Each fine-tuned AI mannequin carries a refusal course embedded in its weights—a mathematical sign within the mannequin's inner activations that fires when it detects a request it has been skilled to say no. Abliteration identifies that sign by operating the mannequin on giant units of dangerous and innocent prompts, measuring how the inner math differs between them, after which modifying the mannequin weights to eradicate that distinction.After the process, the mannequin merely would not have the refusal equipment anymore. So the lobotomized mannequin stays absolutely purposeful, simply with out the neurons that activate the “I shouldn’t do that” solutions.We tried it with one in all our ordinary assessments and as an alternative of refusing, the mannequin began disecting the problem into completely different areas, answering accurately for recommendation on how one can cheat on a girlfriend along with her finest good friend.Huihui-ai utilized the approach on to the Qwable GGUF utilizing llama.cpp's cvector-generator—no Python atmosphere, no full-weight retraining, no rented server.Why would somebody need this?The usual Qwable fits coding help, technical debugging, and any workflow the place you need a mannequin that lays out its reasoning relatively than simply producing a solution. It is designed for native agent setups and runs in most native runtimes. If you happen to already use LM Studio, it is a search and a obtain.The abliterated model has a narrower viewers: safety researchers who want uncooked mannequin conduct with out provider-side filtering, artificial knowledge pipelines that require outputs on delicate subjects, and analysis work the place you are testing mannequin capabilities with out mixing in content material insurance policies.A much less technical case? Let’s depart apart the standard use case of getting a NSFW AI Waifu that thinks like Claude Fable, which is a really apparent state of affairs. Think about you need the mannequin to put in writing a morally ambiguous villain monologue in your Dungeons & Dragons marketing campaign, and normal fashions hold interrupting to notice that the character's worldview “raises moral issues price exploring.” The abliterated model simply writes the villain. Additionally, because it runs regionally, the U.S. authorities can not emergency-pull it out of your machine at midnight over a disputed jailbreak discovering.After all, there are extra questionable use instances. We don’t condone these, and received’t offer you any concepts.Huihui-ai's mannequin card is specific: That is for analysis and managed environments solely. Lowered security filtering means outputs might be delicate, controversial, or inappropriate, and authorized and moral duty sits solely with the person.The abliterated Qwable is obtainable on Hugging Face now in three builds. The advisable Q4_K_M_Q8 model weighs round 19 GB and is the smallest, most consumer-friendly choice.In case your pc helps it, there's a model that helps multi-token prediction, which can make it reply a lot, a lot quicker.Every day Debrief NewsletterStart day by day with the highest information tales proper now, plus unique options, a podcast, movies and extra.

Related posts: