futureme.org
robots.txt

Robots Exclusion Standard data for futureme.org

Resource Scan

Scan Details

Site Domain futureme.org
Base Domain futureme.org
Scan Status Ok
Last Scan2024-09-20T13:45:33+00:00
Next Scan 2024-09-27T13:45:33+00:00

Last Scan

Scanned2024-09-20T13:45:33+00:00
URL https://futureme.org/robots.txt
Redirect https://www.futureme.org/robots.txt
Redirect Domain www.futureme.org
Redirect Base futureme.org
Domain IPs 18.155.68.126, 18.155.68.33, 18.155.68.4, 18.155.68.88
Redirect IPs 18.205.36.100, 52.204.242.176, 54.157.58.70, 54.162.128.250
Response IP 54.162.128.250
Found Yes
Hash 201d4ecad9a7281f90e4150ee507deed784ffddc3be012b2722a49e23acadd25
SimHash bac469854054

Groups

wget/1.10.2

Rule Path
Disallow /

*

Rule Path
Disallow /user/
Disallow /users/
Disallow /attachments/
Disallow /admin/
Disallow /l/

Other Records

Field Value
crawl-delay 10

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /
  • (like it will really listen...)