Difference between revisions of "Wallhaven"

From Archiveteam
Jump to navigation Jump to search
(information about alpha.wallhaven.cc)
 
(add logo)
Line 1: Line 1:
 
{{Infobox project
 
{{Infobox project
 
| title = Wallhaven (Alpha Phase)
 
| title = Wallhaven (Alpha Phase)
 +
| logo = Wallhaven_logo.png
 
| image = Wallhaven.jpg
 
| image = Wallhaven.jpg
 
| description = wallpaper repository
 
| description = wallpaper repository

Revision as of 12:06, 14 September 2014

Wallhaven (Alpha Phase)
Wallhaven logo
wallpaper repository
wallpaper repository
URL http://alpha.wallhaven.cc
Project status Online!
Archiving status In progress...
Project source Unknown
Project tracker Unknown
IRC channel #archiveteam (on EFnet)
Project lead Unknown

wallhaven.cc is a store of wallpapers and other high-resolution media typically scraped from chans' /hr, /wg, and /w boards.

It seems to be a replacement for wallbase.cc project.

Overview

It is in alpha phase now. Content uploaded to alpha.wallhaven.cc will likely be deleted after that phase is over.

The notice on the page reads:

Alpha Notice: We are expecting to start fresh at the end of the alpha phase. The alpha is only intended as a sneak peak and a quick and dirty bug test.

We should archive the content on alpha.wallhaven.cc.

Work thus far

Some page analysis.

Site Specifics

The structure is very similar to wallbase.cc. Scraping is very easy. Some urls have changed a bit.

Stats:

  • Around 21k wallpapers so far.
  • Per day around 1k new wallpapers are uploaded.

Data:

  • Categories: alpha.wallhaven.cc/tags/id
  • Tags: alpha.wallhaven.cc/tag/id
  • Wallpapers: alpha.wallhaven.cc/wallpaper/id
  • Users: alpha.wallhaven.cc/user/id

Media:

  • Wallpapers: alpha.wallhaven.cc/wallpapers/full/wallhaven-ID(.jpg/.png)

Other notes:

  • Tags can have aliases. This seems to be new. It's kinda cool, I think.
  • The domain implements rate limiting or the infrastructure is a lot slower compared to the wallbase.cc infrastructure.