futureme.org
robots.txt

Robots Exclusion Standard data for futureme.org

Resource Scan

Scan Details

Site Domain futureme.org
Base Domain futureme.org
Scan Status Ok
Last Scan2024-11-08T14:15:05+00:00
Next Scan 2024-11-15T14:15:05+00:00

Last Scan

Scanned2024-11-08T14:15:05+00:00
URL https://futureme.org/robots.txt
Redirect https://www.futureme.org/robots.txt
Redirect Domain www.futureme.org
Redirect Base futureme.org
Domain IPs 18.155.68.126, 18.155.68.33, 18.155.68.4, 18.155.68.88
Redirect IPs 13.248.132.87, 35.71.145.101, 75.2.97.79, 99.83.151.71
Response IP 35.71.145.101
Found Yes
Hash 201d4ecad9a7281f90e4150ee507deed784ffddc3be012b2722a49e23acadd25
SimHash bac469854054

Groups

wget/1.10.2

Rule Path
Disallow /

*

Rule Path
Disallow /user/
Disallow /users/
Disallow /attachments/
Disallow /admin/
Disallow /l/

Other Records

Field Value
crawl-delay 10

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /
  • (like it will really listen...)