/.well-known/

Log In Sign Up

futureme.org
robots.txt

Robots Exclusion Standard data for futureme.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	futureme.org
Base Domain	futureme.org
Scan Status	Ok
Last Scan	2024-11-08T14:15:05+00:00
Next Scan	2024-11-15T14:15:05+00:00

Last Scan

Scanned	2024-11-08T14:15:05+00:00
URL	https://futureme.org/robots.txt
Redirect	https://www.futureme.org/robots.txt
Redirect Domain	www.futureme.org
Redirect Base	futureme.org
Domain IPs	18.155.68.126, 18.155.68.33, 18.155.68.4, 18.155.68.88
Redirect IPs	13.248.132.87, 35.71.145.101, 75.2.97.79, 99.83.151.71
Response IP	35.71.145.101
Found	Yes
Hash	201d4ecad9a7281f90e4150ee507deed784ffddc3be012b2722a49e23acadd25
SimHash	bac469854054

Groups

wget/1.10.2

Rule

Path

Disallow

/

*

Rule

Path

Disallow

/user/

Disallow

/users/

Disallow

/attachments/

Disallow

/admin/

Disallow

/l/

Other Records

Field

Value

crawl-delay

10

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /
(like it will really listen...)

Back to top