Difference between revisions of "Weibo"

From Archiveteam
Jump to navigation Jump to search
 
Line 8: Line 8:
| archiving_status = {{nosavedyet}}
| archiving_status = {{nosavedyet}}
}}
}}
'''Weibo''' is the Chinese incumbent microblogging platform, a mix of Twitter and Facebook (which basically don't reach China at all).
'''Weibo''' is the Chinese incumbent microblogging platform.  


== Archives ==
== Archives ==
Line 16: Line 16:
* for {{URL|http://weibo.com}} (before April 2014).  
* for {{URL|http://weibo.com}} (before April 2014).  


Webpages fetched '''after''' April 2014 are mostly n/a on [[IA]], since [https://web.archive.org/web/20140424233224/http://www.weibo.com/ most crawls on IA being redirected to a page on passport.weibo.com, titled "Sina Visitor System". ]
Webpages fetched '''after''' April 2014 are mostly broken on [[IA]], since [https://web.archive.org/web/20140424233224/http://www.weibo.com/ most crawls on IA being redirected to a page on passport.weibo.com, titled "Sina Visitor System". ] This behaviour applied '''unless''' User-Agent is set to "spider"/"googlebot"/etc<ref>{{URL|https://bindog.github.io/blog/2014/10/15/set-the-ua-to-bypass-sina-visitor-system/}}</ref>, so specifying a User-Agent in {{URL|https://weibo.com/robots.txt|robots.txt}} is required for crawling. [[IA]]'s crawls and [[ArchiveBot]] (if no searchbot UA specified) would not work on this site.
 
Changing User-Agent to "googlebot" alike is required for crawing according to some report.<ref>{{URL|https://bindog.github.io/blog/2014/10/15/set-the-ua-to-bypass-sina-visitor-system/}}</ref>


== External links ==
== External links ==

Latest revision as of 08:35, 27 November 2021

Weibo
Weibo logo
An example Weibo account
An example Weibo account
URL weibo.com
Status Online!
Archiving status Not saved yet
Archiving type Unknown
IRC channel #archiveteam-bs (on hackint)

Weibo is the Chinese incumbent microblogging platform.

Archives

Some profiles or posts are available on Internet Archive:

Webpages fetched after April 2014 are mostly broken on IA, since most crawls on IA being redirected to a page on passport.weibo.com, titled "Sina Visitor System". This behaviour applied unless User-Agent is set to "spider"/"googlebot"/etc[1], so specifying a User-Agent in robots.txt[IAWcite.todayMemWeb] is required for crawling. IA's crawls and ArchiveBot (if no searchbot UA specified) would not work on this site.

External links

Current domains

Former doamins

References