ourconservationlegacy.org
robots.txt

Robots Exclusion Standard data for ourconservationlegacy.org

Resource Scan

Scan Details

Site Domain ourconservationlegacy.org
Base Domain ourconservationlegacy.org
Scan Status Ok
Last Scan2024-09-02T00:39:40+00:00
Next Scan 2024-10-02T00:39:40+00:00

Last Scan

Scanned2024-09-02T00:39:40+00:00
URL https://ourconservationlegacy.org/robots.txt
Domain IPs 104.21.77.35, 172.67.204.3, 2606:4700:3030::ac43:cc03, 2606:4700:3031::6815:4d23
Response IP 104.21.77.35
Found Yes
Hash c4701840e3c7b2bace63e4937a0eb1d42f30956d212d7e553d9ee099069b6566
SimHash c82dc7bf1451

Groups

googlebot

Rule Path
Disallow

apis-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

feedfetcher-google

Rule Path
Disallow

google-read-aloud

Rule Path
Disallow

duplexweb-google

Rule Path
Disallow

googleweblight

Rule Path
Disallow

storebot-google

Rule Path
Disallow

google-site-verification

Rule Path
Disallow