|Archiving status||In progress...|
|IRC channel||(on hackint)|
|Data[how to use]||archiveteam_imgur|
Imgur is an image sharing community and former image host.
Seems stable for now. Images submitted by free users are deleted after not being accessed for 6 months.
Imgur serves a massive amount of traffic. In 2012 alone, 42 petabytes of data were transferred. Fortunately, the amount of images uploaded is much less, albeit still a lot. In 2012, around 300,000,000 images were uploaded; assuming an average size of 120KB, that's 36TB in one year. As of 2014, there were 650 million images with 1.5 million being added each day according to one source. An analysis in 2015 based on extrapolation from a sample of random image IDs estimated about 2 billion images with a total raw full-resolution image size of 376 TiB.
Imgur was originally created as a gift to the Reddit community because of problems experienced with other image hosting services. It was used extensively on Reddit for many years until Reddit introduced their own native image host (at http://i.redd.it/) in 2016, causing a significant decrease in submissions to Imgur.
In 2018, there were reports that Imgur had started to enforce their terms of service, specifically that using Imgur for image hosting is not permitted and may result in deletion of those images.
In 2023, in a page titled "Imgur Terms of Service Update [April 19, 2023]": "Our new Terms of Service will go into effect on May 15, 2023. We will be focused on removing old, unused, and inactive content that is not tied to a user account from our platform as well as nudity, pornography, & sexually explicit content. You will need to download/save any images that you wish to save if they no longer adhere to these Terms. Most notably, this would include explicit/pornographic content." Like Tumblr, Imgur will remove porn and similar in 2023-05-15; they will also remove "old, unused, and inactive content", which makes Imgur much less of a persistent image host.
A sitemap from May 2017 onwards can be found at https://imgur.com/imgur-assets/sitemap_gallery/gallery_images.xml. This sitemap only covers galleries, i.e. albums shared publicly on the platform.
While the Wayback Machine is able to save individual Imgur posts as of 2022, the "trending" section and the home page are not saved properly, as indicated by archives from November and December displaying the same trends on the front page.
How to help if you have lists of URLs
This project requires lists of URLs for content on the target website. If you have a source of URLs, please:
- Use the PCRE regular expression
- Note that this regex is intentionally broad to cover many different URL formats. Please do not try to use a more narrow pattern as it may miss valid URLs. We can always filter or transform the results as needed later.
- Enable case-insensitive matching (e.g.
grep) to not miss URLs with capitalised domains or similar.
- If you use
grep, remember to include the
--texton GNU grep) option to ensure it will continue searching for matches when encountering binary data.
- Example command (GNU grep):
grep -Pahoi '\S*imgur\S*' FILENAME FILENAME...
- If the output exceeds a few megabytes, please compress it, preferably using
- Upload the file to https://transfer.archivete.am/.
- Share the resulting URL in the project IRC channel.
- If you would like to keep the list non-public instead, e.g. for privacy reasons or for not wanting to be publicly associated with it, please get in touch with a channel op (e.g. arkiver or JustAnotherArchivist). Note that the items generated from your list would still be processed publicly, of course, but they would be mixed with everything else.
See also Category:Projects requiring URL lists for other ArchiveTeam projects that necessitate URL lists.