Difference between revisions of "Stack Exchange"
Revision as of 19:40, 8 August 2024
Stack Exchange
URL | https://stackexchange.com/ and many more
Status | Online!
Archiving status | Saved by itself
Archiving type | Unknown
IRC channel | #stackunderflow (on hackint)
Stack Exchange is the largest Q&A network of the 2010s. The software is proprietary (although there are clones such as AskBot), and the content is freely licensed (CC BY-SA).
- Dumps were published every few months at https://archive.org/details/stackexchange
  - The dumps were briefly discontinued some time after 2023-03[1][2][3], but were resumed on 2023-06-14 after pressure from various sources[4].
  - On 2024-07-12, it was announced[5] that the dumps would no longer be publicly accessible on archive.org. Instead, they are available for download from the settings page after logging in. Users must also agree to new terms before being permitted to download the dumps:
    "I understand that this file is being provided to me for my own use and for projects that do not include training a large language model (LLM), and that should I distribute this file for the purpose of LLM training, Stack Overflow reserves the right to decline to allow me access to future downloads of this data dump."
- http://www.stackprinter.com/deleted archives deleted questions
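The historical dumps in the archive.org item linked above can be enumerated programmatically. The following is a minimal sketch, not an official tool: it assumes the item identifier is `stackexchange` (taken from the URL above), that the dump files end in `.7z`, and that the Internet Archive's usual `/metadata/<identifier>` and `/download/<identifier>/<file>` URL patterns apply. The function names are illustrative only. It covers only what was uploaded to archive.org, not the newer per-account downloads.

```python
import json
import urllib.parse
import urllib.request

# Identifier assumed from https://archive.org/details/stackexchange
ITEM = "stackexchange"


def metadata_url(identifier):
    """URL of the JSON metadata record for an archive.org item."""
    return "https://archive.org/metadata/" + urllib.parse.quote(identifier)


def download_url(identifier, filename):
    """Direct download URL for one file inside an archive.org item."""
    return "https://archive.org/download/{}/{}".format(
        urllib.parse.quote(identifier), urllib.parse.quote(filename)
    )


def archive_files(metadata, suffix=".7z"):
    """Return sorted (name, size_in_bytes) pairs for files ending in suffix.

    The ".7z" default is an assumption about how the dumps are packaged.
    """
    result = []
    for entry in metadata.get("files", []):
        name = entry.get("name", "")
        if name.endswith(suffix):
            # Sizes are treated as possibly being strings and converted.
            result.append((name, int(entry.get("size", 0))))
    return sorted(result)


def fetch_metadata(identifier=ITEM, timeout=30):
    """Fetch and decode the metadata record (performs a network request)."""
    with urllib.request.urlopen(metadata_url(identifier), timeout=timeout) as resp:
        return json.load(resp)


def main():
    """Print the size and download URL of every matching file in the item."""
    for name, size in archive_files(fetch_metadata()):
        print(size, download_url(ITEM, name))
```

Calling `main()` prints one line per archive file. Only `fetch_metadata` touches the network, so the listing logic can be checked offline against a saved metadata record.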
ZIM format:
- https://download.kiwix.org/zim/stack_exchange/
- https://archive.org/search.php?query=stackoverflow.com%20zim
References
- [1] https://meta.stackexchange.com/questions/389922/june-2023-data-dump-is-missing
- [2] https://meta.stackoverflow.com/questions/424299/stack-overflow-is-no-longer-providing-creative-commons-data-dumps
- [3] https://meta.stackexchange.com/questions/388551/is-se-going-to-be-selling-our-content-for-ai-model-training-and-what-exactly
- [4] https://meta.stackexchange.com/a/390023
- [5] https://meta.stackexchange.com/questions/401324/announcing-a-change-to-the-data-dump-process