(January 11 2024) TorrentFreak — Meta Latest Tech Company to Admit Use of 'Pirated' Book Dataset to Train AI Models
(January 11 2024) TorrentFreak — Meta Latest Tech Company to Admit Use of 'Pirated' Book Dataset to Train AI Models
torrentfreak.com Meta Admits Use of 'Pirated' Book Dataset to Train AI * TorrentFreak
Meta admits in court that it used portions of the Books3 dataset to train its Llama models. This dataset includes many pirated books.
4
comments
We should blacklist their IPs
3 0 ReplyLooks like they stole shit that was already in an open source dataset (fortune 500 tech companies LOVE THIS) idk
3 0 ReplyAh, piracy is bad now?
1 0 Reply
Pirated AI software sounds cyberpunk as fuck. I love it
2 0 Reply