Robots Exclusion Standard data for images.videolan.org

Resource Scan

Scan Details

Site Domain images.videolan.org
Base Domain videolan.org
Scan Status Ok
Last Scan 2024-05-21T18:45:47+00:00
Next Scan 2024-06-20T18:45:47+00:00

Last Scan

Scanned 2024-05-21T18:45:47+00:00
URL https://images.videolan.org/robots.txt
Domain IPs 213.36.253.2, 2a01:e0d:1:3:58bf:fa02:c0de:5
Response IP 213.36.253.2
Found Yes
Hash 2b8dbbc4e994c8e1c17af32b5a5d1a0d117e0c80e466c83817b8e0fac37e066f
SimHash 9459e17a6b71
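
The Hash value is 64 hexadecimal digits long, which matches the length of a SHA-256 digest, although the scan does not name the algorithm. As a minimal sketch under that assumption, a digest of a fetched robots.txt body could be computed like this. `sha256_hex` is a hypothetical helper name, and whether the scanner hashes the raw response bytes or a normalized form is not stated, so the result may not reproduce the value above.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Stand-in body for illustration only; this is not the real file content.
    sample = b"User-agent: *\nDisallow: /pub\n"
    digest = sha256_hex(sample)
    print(len(digest), digest)
```

Comparing such a digest between two scans is one simple way to tell whether the file changed at all; a similarity hash such as the SimHash field is presumably meant to show how much it changed.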

Groups

*

Rule Path
Disallow /pub
Disallow /removed
Disallow /doc/logs
Disallow /mirror.php
Disallow /mirror-geo.php
Disallow /mirror-geo-redirect.php
Disallow /vlc/download-skins2-go.php
Disallow /private
Disallow /~videolan/
Disallow /developers/vlc/po
Disallow /developers/vlc-branch/po

*

Rule Path
Disallow CVS
Disallow .svn

turnitinbot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

slysearch

Rule Path
Disallow /
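
To see how the groups above apply to a given crawler, the rules can be rebuilt into robots.txt syntax and checked with Python's standard `urllib.robotparser`. This is a sketch only: the file layout is reconstructed from the listing (the original line order and blank lines are an assumption), `ExampleBot` is a made-up user agent, and `build_parser` and `is_allowed` are hypothetical helper names. Other crawlers may interpret the same file differently.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups listing; the exact original layout is assumed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pub
Disallow: /removed
Disallow: /doc/logs
Disallow: /mirror.php
Disallow: /mirror-geo.php
Disallow: /mirror-geo-redirect.php
Disallow: /vlc/download-skins2-go.php
Disallow: /private
Disallow: /~videolan/
Disallow: /developers/vlc/po
Disallow: /developers/vlc-branch/po

User-agent: *
Disallow: CVS
Disallow: .svn

User-agent: turnitinbot
Disallow: /

User-agent: npbot
Disallow: /

User-agent: slysearch
Disallow: /
"""

BASE_URL = "https://images.videolan.org"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Ask the parser whether `agent` may fetch `path` on the scanned host."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    checks = [
        ("ExampleBot", "/vlc/"),
        ("ExampleBot", "/pub/file.txt"),
        ("turnitinbot", "/vlc/"),
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(parser, agent, path))
```

Python's parser matches rules by path prefix, so `Disallow: /pub` also covers anything below `/pub`. The `CVS` and `.svn` rules have no leading slash, so a prefix matcher like this one will not match them against URL paths, which always begin with `/`.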

Comments

  • $Id$
  • Do not crawl CVS and .svn directories
  • "This robot collects content from the Internet for the sole purpose of helping educational institutions prevent plagiarism. [...] we compare student papers against the content we find on the Internet to see if we can find similarities." (http://www.turnitin.com/robot/crawlerinfo.html)
  • --> fuck off.
  • "NameProtect engages in crawling activity in search of a wide range of brand and other intellectual property violations that may be of interest to our clients." (http://www.nameprotect.com/botinfo.html)
  • --> fuck off.
  • "iThenticate® is a new service we have developed to combat the piracy of intellectual property and ensure the originality of written work for publishers, non-profit agencies, corporations, and newspapers." (http://www.slysearch.com/)
  • --> fuck off.