1-4a.com
robots.txt

Robots Exclusion Standard data for 1-4a.com

Resource Scan

Scan Details

Site Domain 1-4a.com
Base Domain 1-4a.com
Scan Status Ok
Last Scan2024-06-06T22:47:27+00:00
Next Scan 2024-06-13T22:47:27+00:00

Last Scan

Scanned2024-06-06T22:47:27+00:00
URL https://1-4a.com/robots.txt
Domain IPs 104.21.49.135, 172.67.163.127, 2606:4700:3036::6815:3187, 2606:4700:3037::ac43:a37f
Response IP 104.21.49.135
Found Yes
Hash 9c43cfce1cc9c9757fd8b0027000fe9077e84682870bce79aade33744b900943
SimHash 8115453aa762

Groups

ia_archiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

npbot 1

Rule Path
Disallow /

npbot

Rule Path
Disallow /

npbot*

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow