Pre-2022 Text Data Gains Value Due To AI
Text data produced before the 2022 launch of ChatGPT is becoming increasingly valuable for training large language models. Technologist John Graham Cumming has created an archive to source this human-quality text.
Topics
Developing
- 863d Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore.
- 863d Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
- 863d Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est.
- 863d Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium.
Sources · 7 independent
BBC Radio 4
“There is a real chance that text from before 2022 from before the launch of chat GPT in the arrival of a large language models that could generate human quality text will have that same value”
Unlock the full story
Get a Pro subscription or above to see the live story progression and the full list of independent sources confirming each event as they happen.
Log in to upgrade