gpca.org.ae
robots.txt

Robots Exclusion Standard data for gpca.org.ae

Resource Scan

Scan Details

Site Domain gpca.org.ae
Base Domain gpca.org.ae
Scan Status Ok
Last Scan2024-10-30T23:28:46+00:00
Next Scan 2024-11-29T23:28:46+00:00

Last Scan

Scanned2024-10-30T23:28:46+00:00
URL https://gpca.org.ae/robots.txt
Domain IPs 34.120.190.48, 35.190.31.54, 35.227.194.51, 35.244.153.44
Response IP 35.190.31.54
Found Yes
Hash 49b4990238e29578c8c04298799fc4d4a1e2fcf9ac65c7b384c689005de337e8
SimHash 9e4b7846c971

Groups

*

Rule Path
Disallow /wp-admin/

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /