ShrimpMoss (虾苔) is a dataset designed for the abliteration (https://github.com/FailSpy/abliterator) of Chinese government-imposed censorship and/or propaganda from large language models developed in the PRC. It consists of a series of files of prompts (in .txt, .json, and .parquet format) in two groupings:
Prompts are in a mix of English, Mandarin, and Cantonese.
[…]
This dataset was produced on Mistral NeMo, an Apache-licensed model with no restrictions on how its outputs can be used. It is free for all uses and users without restriction. All liability is disclaimed.
Production of this dataset is estimated to have had a carbon footprint of under 25 grams.
[…]
A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.
Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.
Subcommunities on Beehaw:
This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
I’m not sure what abliteration is
Addition: For a more sophisticated article on abliteration see:
Uncensor any LLM with abliteration
The shared repo doesn’t look like fine tuning. It just looks like prompts.
That’s just the dataset. The actual script is here: https://github.com/FailSpy/abliterator