gq.co.za
robots.txt

Robots Exclusion Standard data for gq.co.za

Resource Scan

Scan Details

Site Domain gq.co.za
Base Domain gq.co.za
Scan Status Ok
Last Scan2024-10-31T09:34:53+00:00
Next Scan 2024-11-07T09:34:53+00:00

Last Scan

Scanned2024-10-31T09:34:53+00:00
URL https://gq.co.za/robots.txt
Redirect https://www.gq.co.za/robots.txt
Redirect Domain www.gq.co.za
Redirect Base gq.co.za
Domain IPs 104.21.94.173, 172.67.138.210, 2606:4700:3030::6815:5ead, 2606:4700:3033::ac43:8ad2
Redirect IPs 104.21.94.173, 172.67.138.210, 2606:4700:3030::6815:5ead, 2606:4700:3033::ac43:8ad2
Response IP 172.67.138.210
Found Yes
Hash 0fdfc2c7d8e099ae000f2a6cf0ed35f8ba33140d25a6509cdab2298e2fca9d0b
SimHash 45a05c5152d4

Groups

*

Rule Path
Allow *.js
Allow *.css
Disallow */preview/
Disallow /test$
Disallow *search?q
Disallow /xmlrpc.php
Disallow /profile
Disallow /cgi-bin/
Disallow /tmp/
Disallow /junk/
Disallow /index.php?*
Disallow /trackback/
Disallow /administrator/
Disallow */trackback/
Disallow /license.txt
Disallow /*.php$

Other Records

Field Value
sitemap https://www.gq.co.za/sitemap.xml
sitemap https://www.gq.co.za/sitemap-tags.xml
sitemap https://www.gq.co.za/sitemap-0.xml