autismsocietypgh.org
robots.txt

Robots Exclusion Standard data for autismsocietypgh.org

Resource Scan

Scan Details

Site Domain autismsocietypgh.org
Base Domain autismsocietypgh.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-14T01:29:40+00:00
Next Scan 2024-11-21T01:29:40+00:00

Last Successful Scan

Scanned2024-10-13T22:33:29+00:00
URL https://autismsocietypgh.org/robots.txt
Redirect https://xoilactvx.me/robots.txt
Redirect Domain xoilactvx.me
Redirect Base xoilactvx.me
Domain IPs 104.21.67.241, 172.67.183.42, 2606:4700:3030::ac43:b72a, 2606:4700:3037::6815:43f1
Redirect IPs 104.18.24.103, 104.18.25.103, 2606:4700::6812:1867, 2606:4700::6812:1967
Response IP 104.18.24.103
Found Yes
Hash 49ef553ccfc93c91a7feb95dba03e97934836d1cb78d6296b68c1808b5de90f3
SimHash 9a4f70c26911

Groups

ia_archiver

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/