My Writings Are in the LibGen AI Training Corpus
The Atlantic提供了一个搜索工具,允许用户在“LibGen”数据库中查找特定作品。该数据库包含受版权保护的内容,曾被Meta用于训练其AI模型。由于无法确定Meta具体使用了哪些部分,因此存在不确定性。作者搜索自己的名字得到了199个结果。 2025-3-21 18:26:22 Author: www.schneier.com(查看原文) 阅读量:15 收藏

The Atlantic has a search tool that allows you to search for specific works in the “LibGen” database of copyrighted works that Meta used to train its AI models. (The rest of the article is behind a paywall, but not the search tool.)

It’s impossible to know exactly which parts of LibGen Meta used to train its AI, and which parts it might have decided to exclude; this snapshot was taken in January 2025, after Meta is known to have accessed the database, so some titles here would not have been available to download.

Still…interesting.

Searching my name yields 199 results: all of my books in different versions, plus a bunch of shorter items.

Tags:

Posted on March 21, 2025 at 2:26 PM1 Comments

Sidebar photo of Bruce Schneier by Joe MacInnis.


文章来源: https://www.schneier.com/blog/archives/2025/03/my-writings-are-in-the-libgen-ai-training-corpus.html
如有侵权请联系:admin#unsafe.sh